Data Science

Watch as we attempt to maintain a delicate harmony of math, engineering, and intuition to solve larger-than-life problems.

From CSVs to Tables: Infer Data Types From Raw Spreadsheets

The quest to never explicitly set a table schema ever again.
Using Random Forests for Feature Selection with Categorical Features

Python helper functions for adding feature importances and displaying them as a single variable.
Tuning Random Forests Hyperparameters with Binary Search Part III: min_samples_leaf

Tune the min_samples_leaf parameter for a Random Forests classifier in scikit-learn in Python.
Tuning Random Forests Hyperparameters with Binary Search Part II: max_depth

Tune the max_depth parameter for a Random Forests classifier in scikit-learn in Python.
Tuning Machine Learning Hyperparameters with Binary Search

Tune the n_estimators parameter for a Random Forests classifier in scikit-learn in Python.
Automagically Turn JSON into Pandas DataFrames

Let Pandas do the heavy lifting for you when turning JSON into a DataFrame.
Trash Pandas: Messy, Convenient DB Operations via Pandas

(And a way to clean it up with SQLAlchemy).
Data Could Save Humanity if it Weren't for Humanity

A compelling case for robot overlords.