Week 3 - Ingesting and Validating Data

Introduction to Data Ingestion

Ingesting from APIs

Production Error Handling

Reading Multiple File Formats

Data Validation with Pydantic

Writing to Databases

Practice

Assignment: Build a Validated Ingestion Pipeline

Gotchas & Pitfalls

Lesson Plan

Week 3 - Ingesting and Validating Data

Welcome to Week 3! Now that you can structure clean pipelines, it's time to connect them to the real world. This week is all about data ingestion: pulling data from APIs, reading various file formats, and validating incoming data before it corrupts your pipeline.

By the end of this week, you will have built a robust ingestion system that can handle messy external data sources, validate their structure and content, and safely load them into databases.

Learning goals


Chapters

  1. Introduction to Data Ingestion
  2. Production Error Handling
  3. Ingesting from APIs
  4. Reading File Formats (CSV, JSON, Parquet)
  5. Data Validation with Pydantic
  6. Writing to Databases
  7. Practice
  8. Assignment: Build a Validated Ingestion Pipeline
  9. Gotchas & Pitfalls

Lesson plan


Back to Data Track


The HackYourFuture curriculum is licensed under CC BY-NC-SA 4.0

CC BY-NC-SA 4.0 Icons

*https://hackyourfuture.net/*

Found a mistake or have a suggestion? Let us know in the feedback form.