Big Data

Work with vast unstructured data across file types and schemas. Tools such as data warehouses, Spark, BigQuery, Redshift, Hadoop etc.

DataFrame Transformations in PySpark (Continued)

DataFrame Transformations in PySpark (Continued)

Continuing to apply transformations to Spark DataFrames using PySpark.
Spark
8 min read
May 07
Executing Basic DataFrame Transformations in PySpark

Executing Basic DataFrame Transformations in PySpark

Using PySpark to apply transformations to real datasets.
Spark
9 min read
April 29
Learning Apache Spark with PySpark & Databricks

Learning Apache Spark with PySpark & Databricks

Get started with Apache Spark in part 1 of our series, where we leverage Databricks and PySpark.
Spark
13 min read
April 26
Google BigQuery's Python SDK: Creating Tables Programmatically

Google BigQuery's Python SDK: Creating Tables Programmatically

Explore the benefits of Google BigQuery and use the Python SDK to programmatically create tables.
Big Data
8 min read
February 02
From CSVs to Tables: Infer Data Types From Raw Spreadsheets

From CSVs to Tables: Infer Data Types From Raw Spreadsheets

The quest to never explicitly set a table schema ever again.
Big Data
9 min read
January 23
Lynx Roundup, July 20th

Lynx Roundup, July 20th

Kafka! New brain cell! Computers made of liquid!
Data Could Save Humanity if it Weren't for Humanity

Data Could Save Humanity if it Weren't for Humanity

A compelling case for robot overlords.
Data Science
7 min read
July 20
Lynx Roundup, July 16th

Lynx Roundup, July 16th

How likely is likely? Awesome TensorFlow tutorial, and a guide on maffs for Deep Learning