Syllabus: AINS6006 Big Data Management for AI Applications

Syllabus: AINS6006 Big Data Management for AI Applications#

Catalog Description#

Designs data platforms, pipelines, lineage, cloud integration, and security for AI workflows.

Course Structure#

Each week includes readings, a lecture/slide sequence, an executable lab, and an applied deliverable. Students maintain a reproducible project record and submit work through the LMS or GitHub workflow selected by the instructor.

Weekly Schedule#

Week

Topic

Essential Question

Deliverable

1

Data architectures for AI

What architecture supports trustworthy AI workflows?

Lab notebook + assignment brief

2

Pipelines, orchestration, and quality

How do data pipelines fail, and how are failures detected?

Lab notebook + assignment brief

3

Storage, indexing, and retrieval

How do access patterns shape storage choices?

Lab notebook + assignment brief

4

Distributed processing and scale

When does scale require distributed computation?

Lab notebook + assignment brief

5

Metadata, lineage, and provenance

How do we preserve the story of data transformations?

Lab notebook + assignment brief

6

Cloud integration and cost control

How do cloud choices affect reliability and budget?

Lab notebook + assignment brief

7

Security and access governance

How should sensitive data be protected across AI workflows?

Lab notebook + assignment brief

8

AI data platform readiness review

What proves a data system can support production AI?

Lab notebook + assignment brief

Assessment#

Component

Weight

Weekly labs and notebooks

30%

Applied assignments

35%

Participation and technical critique

15%

Final synthesis portfolio

20%

Graduate Expectations#

Submissions must show technical reasoning, evidence awareness, clear limitations, and responsible use of AI assistance. Code and analysis should be reproducible enough for instructor review.