Module 5: Metadata, lineage, and provenance

Module 5: Metadata, lineage, and provenance#

AINS6006 — Big Data Management for AI Applications

Essential Question#

How do we preserve the story of data transformations?

Core Moves#

  • Define the problem boundary

  • Identify evidence and assumptions

  • Build or evaluate the artifact

  • Communicate limits and next actions

Lab & Assignment#

Create a lineage record for a training dataset.

Use the assignment notebook to turn lab evidence into a defensible recommendation.