Week 4 - Data Processing with Pandas
The HackYourFuture curriculum is licensed under CC BY-NC-SA 4.0

*https://hackyourfuture.net/*
Found a mistake or have a suggestion? Let us know in the feedback form.
Introduction to Pandas and DataFrames
Selecting, Filtering, and Sorting Data
Grouping and Aggregation
Joining and Merging DataFrames
Working with Strings and Dates
Advanced Transformations
Writing Data
Visualizing Data with Pandas
Alternatives to Pandas
Practice
Assignment: MessyCorp Goes Pandas
Gotchas & Pitfalls
Lesson Plan
Week 4 - Data Processing with Pandas
Welcome to Week 4! You have learned how to structure code (Week 2) and ingest and validate data (Week 3). Now it is time to process it at scale. This week introduces Pandas, the industry-standard tool for high-performance data manipulation in Python. You will also learn about modern data architectures and efficient storage formats like Parquet.
By the end of this week, you will be able to load complex datasets, transform them efficiently using vectorized operations, and produce clean, reusable outputs for downstream systems.
Learning goals
- Master the Pandas library for tabular data manipulation (DataFrames and Series)
- Select, filter, and sort data efficiently using
loc, iloc, and boolean indexing
- Perform grouping and aggregation operations to summarize data by categories
- Join and merge multiple DataFrames using different join types (inner, outer, left, right)
- Clean and transform text data using string operations and pattern matching
- Work with datetime data: parsing, extracting components, and time-based calculations
- Apply advanced transformations: pivoting, melting, window functions, and vectorized operations
- Replace slow Python loops with high-performance vectorized operations
- Create quick visualizations with Pandas plotting to validate trends and outliers
- Handle data quality issues (missing values, duplicates) within DataFrames
- Export processed data to CSV, Parquet, and SQLite databases
First lesson: Introduction to Pandas and DataFrames
Lesson plan