Data
Python Pipelines from Sheets to Warehouses
This course connects spreadsheet pain to small Python services. Participants learn pandas hygiene, environment discipline, and how to document assumptions before anything touches a warehouse. Labs include Git-based homework and code review language that mirrors real teams.
Blended (async + live) · 8 weeks · Advanced beginner
7,800,000 VND
Informational pricing — admissions confirms payment schedule.
Request informationResponsible lead
Duy Khanh Le
Data engineer teaching cohorts in HCMC; previously built ingestion for a fintech pilot.
Curriculum inclusions
- Virtual environments with pinned dependencies
- pandas recipes for wide-to-long reshaping
- Great Expectations-style checks on sample data
- Packaging a weekly job with readable logs
- Pairing with SQL outputs from the analytics SQL track
- Secrets handling overview (no live production keys)
- Retro template for incidents in scheduled jobs
Participant outcomes
- Deliver a scheduled notebook or script with runbook
- List failure modes your stakeholders should watch
- Pair Python transforms with documented SQL upstream
FAQ
We standardize on Python 3.11 for labs; guidance is provided if your laptop needs an upgrade path.
Cohort reviews
I liked how the Python Pipelines cohort forced me to write a runbook before the final demo. The pandas section on messy VAT exports matched my workplace files.