Data Analysis

Learn to clean and manipulate datasets, prepare reports, and draw meaningful conclusions from existing information.

Code Snippet Corner: Splitting Columns With Pandas

Splitting up columns that contain multiple things with Python and Pandas
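
As a rough illustration of the idea, the sketch below splits a single column into two with `Series.str.split`. The `location` column and its "City, State" layout are made-up sample data, not taken from the article.

```python
import pandas as pd

# Hypothetical data: one column holding "City, State" pairs.
df = pd.DataFrame({"location": ["Austin, TX", "Boston, MA", "Denver, CO"]})

# expand=True returns one column per piece; n=1 splits on the first separator only.
df[["city", "state"]] = df["location"].str.split(", ", n=1, expand=True)

print(df)
```

Limiting the split with `n=1` keeps any later commas inside the second piece, which is usually the safer default for free-form text.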
Comparing Rows Between Two Pandas DataFrames

Find which rows are different between two DataFrames, as well as which DataFrame they are unique to.
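
One common way to do this, sketched here on invented sample frames, is an outer merge with `indicator=True`. This may not be the exact approach the article takes.

```python
import pandas as pd

# Hypothetical frames that share the same columns.
left = pd.DataFrame({"id": [1, 2, 3], "value": ["a", "b", "c"]})
right = pd.DataFrame({"id": [2, 3, 4], "value": ["b", "x", "d"]})

# With no `on` argument, merge joins on all shared columns, so rows must match in full.
# indicator=True adds a `_merge` column: "left_only", "right_only", or "both".
merged = left.merge(right, how="outer", indicator=True)

# Keep only the rows that appear in exactly one of the two frames.
diff = merged[merged["_merge"] != "both"]

print(diff)
```

The `_merge` column then tells you which DataFrame each differing row is unique to.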
Recasting Low-Cardinality Columns as Categoricals

Downcast strings in Pandas to their proper data types using HDF5.
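
A minimal sketch of the recasting step alone, on invented data, might look like the following. It does not cover the HDF5 part, and the `status` column is an assumption made for illustration.

```python
import pandas as pd

# Hypothetical low-cardinality column: three distinct strings repeated many times.
df = pd.DataFrame({"status": ["open", "closed", "pending"] * 1000})

before = df["status"].memory_usage(deep=True)

# A categorical stores each distinct value once, plus a small integer code per row.
df["status"] = df["status"].astype("category")

after = df["status"].memory_usage(deep=True)

print(before, after)
```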
Removing Duplicate Columns in Pandas

Dealing with duplicate column names in your Pandas DataFrame.
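
As one possible approach, not necessarily the article's, `Index.duplicated` can mask out repeated column names and keep the first occurrence of each. The frame below is invented sample data.

```python
import pandas as pd

# Hypothetical frame where the name "a" appears twice.
df = pd.DataFrame([[1, 2, 3], [4, 5, 6]], columns=["a", "b", "a"])

# duplicated() marks every repeat of a name after its first occurrence.
deduped = df.loc[:, ~df.columns.duplicated()]

print(deduped)
```

This drops columns by name only. Two columns with different names but identical values are left untouched.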
Using Hierarchical Indexes With Pandas

Use Pandas' MultiIndex to make your data work harder for you.
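
For a quick feel of what a hierarchical index offers, here is a small sketch. The `region`, `year`, and `sales` columns are hypothetical.

```python
import pandas as pd

# Hypothetical sales data keyed by region and year.
df = pd.DataFrame({
    "region": ["east", "east", "west", "west"],
    "year": [2020, 2021, 2020, 2021],
    "sales": [10, 12, 7, 9],
})

# Two index levels: region (outer) and year (inner). Sorting keeps lookups predictable.
indexed = df.set_index(["region", "year"]).sort_index()

# Select everything under one outer key; the result is indexed by year.
east = indexed.loc["east"]

# Take a cross-section on the inner level; the result is indexed by region.
in_2021 = indexed.xs(2021, level="year")

print(east)
print(in_2021)
```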
Reshaping Pandas DataFrames

A guide to DataFrame manipulation using groupby, melt, pivot tables, pivot, transpose, and stack.
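
To give a flavor of two of these operations, the sketch below takes a small invented table from wide to long with `melt` and back again with `pivot`. The column names are assumptions made for illustration.

```python
import pandas as pd

# Hypothetical wide table: one row per student, one column per subject.
wide = pd.DataFrame({"name": ["ann", "bob"], "math": [90, 80], "art": [70, 85]})

# melt: wide -> long, one row per (name, subject) pair.
long = wide.melt(id_vars="name", var_name="subject", value_name="score")

# pivot: long -> wide again, with names as the index and subjects as columns.
back = long.pivot(index="name", columns="subject", values="score")

print(long)
print(back)
```

`pivot` raises an error if an index/column pair occurs more than once. In that case `pivot_table` with an aggregation function is the usual alternative.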
Welcome to SQL: Modifying Databases and Tables

Brush up on SQL fundamentals such as creating tables, schemas, and views.
Geocoding Raw Datasets for Mapbox

Use the Mapbox Python SDK to transform a collection of addresses into lat/long coordinates.