Big Data

Collect, process, and store data at a massive scale. Learn horizontally-scalable tools such as, Spark, BigQuery, Redshift, Hadoop etc.

Simplify BigQuery ETL jobs using SQLAlchemy

Simplify BigQuery ETL jobs using SQLAlchemy

Extract and move data between BigQuery and relational databases using a plugin for SQLAlchemy.
Learning Apache Spark with PySpark & Databricks

Learning Apache Spark with PySpark & Databricks

Get started with Apache Spark in part 1 of our series, where we leverage Databricks and PySpark.
Google BigQuery's Python SDK: Creating Tables Programmatically

Google BigQuery's Python SDK: Creating Tables Programmatically

Explore the benefits of Google BigQuery and use the Python SDK to programmatically create tables.
From CSVs to Tables: Infer Data Types From Raw Spreadsheets

From CSVs to Tables: Infer Data Types From Raw Spreadsheets

The quest to never explicitly set a table schema ever again.
Lynx Roundup, July 20th

Lynx Roundup, July 20th

Kafka! New brain cell! Computers made of liquid!
Data Could Save Humanity if it Weren't for Humanity

Data Could Save Humanity if it Weren't for Humanity

A compelling case for robot overlords.
Lynx Roundup, June 18th

Lynx Roundup, June 18th

Daily roundup of Data Science news around the industry, 6/18/2018.