Big Data

Work with vast unstructured data across file types and schemas. Tools such as data warehouses, Spark, BigQuery, Redshift, Hadoop etc.

DataFrame Transformations in PySpark (Continued)

DataFrame Transformations in PySpark (Continued)

Continuing to apply transformations to Spark DataFrames using PySpark.
Executing Basic DataFrame Transformations in PySpark

Executing Basic DataFrame Transformations in PySpark

Using PySpark to apply transformations to real datasets.
Learning Apache Spark with PySpark & Databricks

Learning Apache Spark with PySpark & Databricks

Get started with Apache Spark in part 1 of our series, where we leverage Databricks and PySpark.
Google BigQuery's Python SDK: Creating Tables Programmatically

Google BigQuery's Python SDK: Creating Tables Programmatically

Explore the benefits of Google BigQuery and use the Python SDK to programmatically create tables.
From CSVs to Tables: Infer Data Types From Raw Spreadsheets

From CSVs to Tables: Infer Data Types From Raw Spreadsheets

The quest to never explicitly set a table schema ever again.
Lynx Roundup, July 20th

Lynx Roundup, July 20th

Kafka! New brain cell! Computers made of liquid!
Data Could Save Humanity if it Weren't for Humanity

Data Could Save Humanity if it Weren't for Humanity

A compelling case for robot overlords.
Lynx Roundup, July 16th

Lynx Roundup, July 16th

How likely is likely? Awesome TensorFlow tutorial, and a guide on maffs for Deep Learning
Lynx Roundup, June 18th

Lynx Roundup, June 18th

Daily roundup of Data Science news around the industry, 6/18/2018.