Week 4 - Data Processing with Pandas
Introduction to Pandas and DataFrames
Selecting, Filtering, and Sorting Data
Joining and Merging DataFrames
Working with Strings and Dates
Assignment: MessyCorp Goes Pandas
This week is about making data usable. Students should leave with confidence in selecting, grouping, and reshaping data with Pandas, and they should understand how those transformations map to real reporting tasks.
By the end of this lesson, students should be able to:
Inspect* a DataFrame and spot data quality issues quickly.
Filter and sort* data with loc, iloc, and boolean masks.
Group and aggregate* to produce summary metrics.
Join* two datasets without duplicating or losing rows.
Visualize* a quick trend plot for sanity checks.
Export* results to CSV or Parquet.
| Time | Activity | Duration |
|---|---|---|
| 0:00 | Welcome and warm-up | 5 min |
| 0:05 | Live demo: DataFrame basics | 15 min |
| 0:20 | Filtering and sorting mini-lab | 20 min |
| 0:40 | Break | 10 min |
| 0:50 | Groupby and aggregation workshop | 25 min |
| 1:15 | Joins and merges demo | 15 min |
| 1:30 | Writing outputs and Azure context | 10 min |
| 1:40 | Assignment walkthrough | 15 min |
| 1:55 | Q&A and wrap-up | 5 min |
| 2:00 | End | - |
Total: 2 hours
Goal: Show how fast Pandas can make sense of messy data.
info() and describe().Goal: Build confidence with loc, iloc, and boolean masks.
Goal: Produce a real report table.
region and calculate total revenue.count and size.Goal: Show how bad joins create wrong numbers.
indicator=True to find unmatched rows.Goal: Explain why format and storage matter.
Goal: Connect the chapters to the assignment tasks.
The HackYourFuture curriculum is licensed under CC BY-NC-SA 4.0

Found a mistake or have a suggestion? Let us know in the feedback form.