Week 2 - Structuring Data Pipelines

Introduction to Data Pipelines

Configuration & Secrets (.env)

Separation of Concerns (I/O vs Logic)

Dataclasses for Data Objects

OOP vs Functional Programming

Functional Composition

Testing with Pytest

Practice

Assignment: A Clean Pipeline

Gotchas & Pitfalls

Back to Track

Week 2 - Structuring Data Pipelines

Welcome to Week 2! Now that you know the Python basics, it's time to move from "writing scripts" to "building engineering systems." This week is all about architecture. You will learn how to structure your code so that it is readable, testable, and robust against the messy reality of production data.

By the end of this week, you will have refactored a messy "God Script" into a professional, modular pipeline that separates configuration, data modeling, and business logic.

Learning goals


Chapters

  1. Introduction to Data Pipelines
  2. Configuration & Secrets (.env)
  3. Separation of Concerns (I/O vs Logic)
  4. Dataclasses for Data Objects
  5. OOP vs Functional Programming
  6. Functional Composition
  7. Testing with Pytest
  8. Practice
  9. Assignment: Refactoring to a Clean Pipeline
  10. Gotchas & Pitfalls

Lesson plan


Back to Data Track


CC BY-NC-SA 4.0 Icons

*https://hackyourfuture.net/*

Found a mistake or have a suggestion? Let us know in the feedback form.