Hackers and Slackers Logo
AboutSeriesJoinSearchDonate
Search results
No results for 'undefined'
mobile-menu
Search results
No results for ''
Trending Searches
HomeAboutSearch
Tags
PythonSoftwareDevOpsData EngineeringArchitecturePandasData AnalysisData ScienceREST APIsSQLCode Snippet CornerFlaskJavaScriptAWSNodeJS
Series
Build Flask AppsData Analysis with PandasGoogle Cloud Platform ArchitectureLearning Apache SparkCreate a REST API in AWSWorking with MySQLGraphQL TutorialsHacking Tableau ServerMastering Python's SQLAlchemyMongoDB Atlas Cloud ArchitectureWelcome to SQL: Tutorials for NewcomersMapping Data with MapboxMicrosoft PowerPivotGetting Started with DjangoWeb Scraping With Python
Authors
Todd BirchardMax MileafGraham BeckleyDavid AquinoMatthew AlhonteRyan RosadoPaul ArmstrongDylan Castillo
JoinRSS Donate
Data Engineering

Data Engineering (page 1)

Collect and transform data on a large scale. Build data pipelines, work with a horizontally scalable architecture, or simply scrape and collect data.

Scrape Structured Data with Python and Extruct
Python

Scrape Structured Data with Python and Extruct

Supercharge your scraper to extract quality page metadata by parsing JSON-LD data via Python's extruct library.

Todd
Todd
28 June, 2020•13 min read
Simplify BigQuery ETL jobs using SQLAlchemy
Data Warehouses

Simplify BigQuery ETL jobs using SQLAlchemy

Extract and move data between BigQuery and relational databases using PyBigQuery: a connector for SQLAlchemy.

Todd
Todd
15 November, 2019•8 min read
Using Amazon Redshift as your Data Warehouse
Data Warehouses

Using Amazon Redshift as your Data Warehouse

Get the most out of Redshift by performance tuning your cluster and learning how to query your data optimally.

Todd
Todd
09 June, 2019•12 min read
Join and Aggregate PySpark DataFrames
Spark

Join and Aggregate PySpark DataFrames

Perform SQL-like joins and aggregations on your PySpark DataFrames.

Todd
Todd
24 June, 2019•7 min read
Working with PySpark RDDs
Spark

Working with PySpark RDDs

Working with Spark's original data structure API: Resilient Distributed Datasets.

Todd
Todd
06 June, 2019•8 min read
Manage Data Pipelines with Apache Airflow
Apache

Manage Data Pipelines with Apache Airflow

Use Apache Airflow to build and monitor better data pipelines.

Todd
Todd
01 June, 2019•13 min read
Structured Streaming in PySpark
Spark

Structured Streaming in PySpark

Become familiar with building a structured stream in PySpark using the Databricks interface.

Todd
Todd
13 May, 2019•8 min read
Becoming Familiar with Apache Kafka and Message Queues
Apache

Becoming Familiar with Apache Kafka and Message Queues

Getting to know Apache Kafka: a horizontally scalable event streaming platform. Learn what makes Kafka critical to high-volume low-latency data pipelines.

Todd
Todd
01 May, 2019•6 min read
Page 1 of 5
Next
Hackers and Slackers

Community of hackers obsessed with data science, data engineering, and analysis. Openly pushing a pro-robot agenda.

Trending Posts
Extract Nested Data From Complex JSONMake Your First API Calls with JQuery AJAXIntegrate Plotly Dash Into Your Flask AppScraping Data on the Web with BeautifulSoupSSH & SCP in Python with ParamikoConfiguring Python Projects with INI, TOML, YAML, and ENV filesQueries as Python Code with SQLAlchemy's Expression LanguagePackage Python Projects the Proper Way with Poetry

Tags

PythonSoftwareDevOpsData EngineeringArchitecturePandasExcelData AnalysisData ScienceREST APIsSQLCode Snippet CornerFlaskJavaScriptAWSNodeJSGoogle CloudMySQLApacheFrontendData VisNoSQLBIExpressJSGraphQLPostgreSQLSparkETL PipelinesTableauAtlassianBig DataGatsbyJSMachine LearningSQLAlchemy

Newsletter

Donations

Uday Kumar Paturu
1

Thanks alot for spark tutorial

Winfried
5

Thanks for your great Flask tutorial! Although I have been working with Flask for some time now, the tutorial helped me to improve my code enormously.

Brian
1

I appreciate Todd’s Django tutorials. They are informative, funny, and in depth. Thank you.

Björn-Eric
1

The Node Fetch tutorial really helped, reference it A LOT.

squeezer
1

Thx for an amazing post - it saved me a lot time!

Hackers and Slackers Logo

Community of hackers obsessed with data science, data engineering, and analysis. Openly pushing a pro-robot agenda.

PagesAboutSeriesJoinSearchDonateSign up
SeriesBuild Flask AppsData Analysis with PandasLearning Apache SparkWorking with MySQLEmbracing GraphQLMastering SQLAlchemyWelcome to SQLMapping Data with Mapbox
Authors
Todd Birchard
Max Mileaf
Graham Beckley
David Aquino
Matthew Alhonte
Ryan Rosado
Paul Armstrong
Dylan Castillo
©2020 Hackers and Slackers, All Rights Reserved.