Building Java Projects with Gradle
Simply your Java projects by automating dependency resolution, testing, and more with Gradle.
Using Amazon Redshift as your Data Warehouse
Get the most out of Redshift by performance tuning your cluster and learning how to query your data optimally.
Constructing Database Queries with SQLAlchemy
Query your data models using SQLAlchemy's query API.
Managing Relationships in SQLAlchemy Data Models
Using the SQLAlchemy ORM to build data models with meaningful relationships.
Performing Macro Operations on PySpark DataFrames
Perform SQL-like joins and aggregations on your PySpark DataFrames.
Side Projects Are A Good Idea
Side projects are a good idea but make sure to do the day job.
Manage Files in Google Cloud Storage With Python
Manage files in your Google Cloud Storage bucket using the google-cloud-storage Python library.
Working with PySpark RDDs
Working with Spark's original data structure API: Resilient Distributed Datasets.
PowerPivot 3: Managing the Data Model
Analyzing ginormous files with Microsoft PowerPivot.
Manage Data Pipelines with Apache Airflow
Use Apache Airflow to build and monitor better data pipelines.
Page 1 of 51