Week 3 - Ingesting and Validating Data

Introduction to Data Ingestion

Ingesting from APIs

Reading Multiple File Formats

Pydantic for Data Validation

Writing to Databases

Error Handling and Logging

Practice

Assignment

Gotchas & Pitfalls

Back to Track

Week 3 - Ingesting and Validating Data

Welcome to Week 3! Now that you can structure clean pipelines, it's time to connect them to the real world. This week is all about data ingestion: pulling data from APIs, reading various file formats, and validating incoming data before it corrupts your pipeline.

By the end of this week, you will have built a robust ingestion system that can handle messy external data sources, validate their structure and content, and safely load them into databases.

Learning goals


Chapters

  1. Introduction to Data Ingestion
  2. Production Error Handling
  3. Ingesting from APIs
  4. Reading File Formats (CSV, JSON, Parquet)
  5. Data Validation with Pydantic
  6. Writing to Databases
  7. Practice
  8. Assignment: Build a Validated Ingestion Pipeline
  9. Gotchas & Pitfalls

Lesson plan


Back to Data Track


CC BY-NC-SA 4.0 Icons

*https://hackyourfuture.net/*

Found a mistake or have a suggestion? Let us know in the feedback form.