Module 2 Overview#
Theme#
Pipelines, orchestration, and quality
Essential Question#
How do data pipelines fail, and how are failures detected?
Module Components#
Book prose: conceptual framing, domain scenario, methods, and failure modesAssignment: evidence-backed production of a specific artifactSlides: presentation sequence for seminar or lecture deliveryNarration: spoken version of the slide flowInstructor notes: facilitation plan, discussion prompts, and grading cuesRubric: criteria for evaluating the module artifactNotebook: executable lab aligned with the module theme using synthetic pipeline events with freshness, schema drift, lineage completeness, volume, and access-risk indicators
Module Artifact#
AI data platform design review with lineage, quality checks, cost controls, and access model focused on pipelines, orchestration, and quality: Build a small quality-checked pipeline.
Professional Setting#
Students work as if advising a platform team designing data infrastructure for repeatable model training and monitoring. Their work must be intelligible to data engineer, ML engineer, security architect, data steward, and platform owner.