Use Apache Spark to build data pipelines
Performing Macro Operations on PySpark DataFrames
Perform SQL-like joins and aggregations on your PySpark DataFrames.
Working with PySpark RDDs
Working with Spark's original data structure API: Resilient Distributed Datasets.
Structured Streaming in PySpark
Become familiar with building a structured stream in PySpark using the Databricks interface.
DataFrame Transformations in PySpark (Continued)
Continue applying transformations to Spark DataFrames using PySpark.
Executing Basic DataFrame Transformations in PySpark
Use PySpark to apply transformations to real datasets.
Cleaning PySpark DataFrames
Easy DataFrame cleaning techniques, ranging from dropping problematic rows to selecting important columns.
Learning Apache Spark with PySpark & Databricks
Get started with Apache Spark in part 1 of our series, where we leverage Databricks and PySpark.