Week 9 - SQL for Analytics


The HackYourFuture curriculum is licensed under CC BY-NC-SA 4.0 *https://hackyourfuture.net/*

Welcome to Week 9! This week shifts focus from Python pipelines to the SQL layer. You will learn how to write analytical queries, validate ingested data with SQL, and understand the architecture of modern data warehouses. By the end of the week, you will deliver a fact view and a dimension view in Azure PostgreSQL.

Dataset

This week uses a curated subset of the NYC Taxi & Limousine Commission (TLC) Trip Record Data. The dataset includes:

Trip data is available as monthly Parquet files at:

https://d37ci6vzurychx.cloudfront.net/trip-data/green_tripdata_YYYY-MM.parquet
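
To make the YYYY-MM placeholder concrete, here is a minimal Python sketch that builds the URL for a given month. The helper name `green_trip_url` is hypothetical and not part of the course material. The underscore-separated file name (`green_tripdata_YYYY-MM.parquet`) is assumed from the public TLC naming convention. The loading script provided for this week may construct URLs differently.

```python
from datetime import date

# Base path taken from the URL pattern above.
BASE_URL = "https://d37ci6vzurychx.cloudfront.net/trip-data"


def green_trip_url(year: int, month: int) -> str:
    """Return the Parquet URL for one month of green taxi trips.

    Hypothetical helper for illustration; the file name format
    green_tripdata_YYYY-MM.parquet is an assumption.
    """
    # date() raises ValueError for an invalid month such as 13,
    # so malformed input fails before any download is attempted.
    first_day = date(year, month, 1)
    return f"{BASE_URL}/green_tripdata_{first_day:%Y-%m}.parquet"


if __name__ == "__main__":
    # Print the URL for January 2023.
    print(green_trip_url(2023, 1))
```

Going through `date` rather than formatting the integers directly zero-pads the month and rejects out-of-range values in one step.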

A loading script will be provided to load a curated subset into your Azure PostgreSQL database. This same dataset continues into Week 10 (dbt) and Week 11 (dashboarding).

Learning goals


Chapters

  1. Introduction to SQL for Analytics
  2. Joins, CTEs, and Aggregations
  3. Data Validation Queries
  4. Data Modeling Concepts
  5. Building Views in Azure PostgreSQL
  6. Practice
  7. Assignment
  8. Gotchas & Pitfalls

Lesson plan