Data Analysis

Prepare reports and draw meaningful conclusions from data. Learn to clean and manipulate datasets to draw meaningful conclusions from existing information.

Recasting Low-Cardinality Columns as Categoricals

Downcast strings in Pandas to their proper data-types using HDF5.
Code Snippet Corner
3 min read
June 03

Removing Duplicate Columns in Pandas

Dealing with duplicate column names in your Pandas DataFrame.
Code Snippet Corner
3 min read
May 28

Using Hierarchical Indexes With Pandas

Use Panda's Multiindex to make your data work harder for you.
Pandas
12 min read
May 28

Reshaping Pandas DataFrames

A guide to DataFrame manipulation using groupby, melt, pivot tables, pivot, transpose, and stack.
Pandas
12 min read
May 21

Welcome to SQL: Modifying Databases and Tables

Brush up on SQL fundamentals such as creating tables, schemas, and views.
SQL
10 min read
February 19

Geocoding Raw Datasets for Mapbox

Use the Mapbox Python SDK to transform a collection of addresses into lat/long coordinates.
Data Vis
8 min read
December 18

Dynamic Tension! Creating and Using Dynamic Named Ranges in Excel

Dynamically load data in smart pivot tables.
Excel
7 min read
September 01

Connecting Pandas to a Database with SQLAlchemy

Easily drop data into Pandas from a SQL database, or upload your DataFrames to a SQL table.
Pandas
5 min read
July 03

Getting Iffy With it: Conditional Statements in Excel

Effectively utilize conditionals such as IF statements in your Excel workflow.
Excel
4 min read
June 10

Taking Out the Trash: Dirty Data in Excel

Dealing With Dirty Data in Excel (continued).
Excel
4 min read
June 05