Lectures

Week

Topic(s)

Additional Resources

Week 01

Introduction [pdf]

Week 02

Tokenization, Word Embeddings and Representational Learning
- Tokenization [pdf]
- Word Emdeddings [pdf]
- Contextual Embeddings[pdf]

- BERT Embedding Example 1 (ipynb)
- BERT Embedding Example 2 (ipynb)
- https://github.com/SumanthRH/tokenization/

Week 03

- LLMs Basics [pdf]

- Simple Neural Networks and Neural Language Models [pdf]
- Large Language Models explained briefly by 3Blue1Brown [video]
- SLP3 Book Chapter 7 [pdf]

Week 04

Attention in Transformers
- Attention [pdf]
- Numerical Example [pdf]

- Attention Is All You Need [pdf]
- The Illustrated Transformer [html]

Week 05

- Transformer Arhitecture [pdf]

- Transformers, the tech behind LLMs [video]
- Attention in transformers, step-by-step [video]
- How might LLMs store facts [video]
- LLM Visualization [html]

- Efficient Attention Mechanisms for Large Language Models: A Survey [pdf]

Week 06

Transformer-Based Architectures and Models [pdf]

Mixture of Experts [pdf]

-BERT: Pre-training of Deep Bidirectional Transformers for Language
-Understanding Contextual Word Representations: A Contextual Introduction
- The Illustrated BERT, ELMo, and co.
- Jurafsky and Martin Chapter 10 (Masked Language Models)
- The Llama 3 Herd of Models

Week 07

LLM Training
- Pre-Training [pdf]
- Post-Training
- Intruction Tuning [pdf]
- Preference Alignment [pdf]

- Post-training: Instruction Tuning, Alignment, and Test-Time Compute
- Datasets for large language models: a comprehensive survey [pdf]

Week 08

Midterm Week

Week 09

- LLM Inferencing [pdf]
- Prompting, ICL, RAG, FT [pdf]
- Prompt Enginneering [pdf]

- https://github.com/dlops-io/llm-rag?tab=readme-ov-file#agents
- https://developers.openai.com/api/docs/guides/prompt-engineering
- https://developers.openai.com/cookbook/articles/related_resources
- https://developers.openai.com/api/docs/guides/prompt-guidance

Week 10

- Information Retrieval and
Retrieval Augmented Generation - RAG [pdf]

- https://web.stanford.edu/~jurafsky/slp3/11.pdf
- GitHub - Tongji-KGLLM/RAG-Survey � GitHub

- PyTorch Tutorial from CS224n [ipynb]

Week 11

- Parameter Efficient Fine Tuning [pdf]
- LMM Model Compression [pdf]

- Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning [pdf]
- https://huggingface.co/blog/samuellimabraz/peft-methods
- https://sumanthrh.com/post/distributed-and-efficient-finetuning/
- https://github.com/stas00/ml-engineering
- https://huggingface.co/docs/transformers/v4.20.1/en/training
- https://huggingface.co/docs/transformers/v4.20.1/en/tasks/token_classification

- Hugging Face Transformers Tutorial from CS224n [pdf, ipynb]

Week 12

- Agentic AI I [pdf]
- Agentic AI II [pdf]
- Agentic AI Frameworks [pdf]

- https://language-agent-tutorial.github.io/
- https://mitsloan.mit.edu/ideas-made-to-matter/agentic-ai-explained
- https://arxiv.org/pdf/2505.10468
- https://www.geeksforgeeks.org/artificial-intelligence/agentic-ai-tutorial/

Week 13

Model Context Protocol (MCP) [pdf]

- https://www.anthropic.com/news/model-context-protocol
- https://github.com/modelcontextprotocol
- https://en.wikipedia.org/wiki/Model_Context_Protocol
- https://www.geeksforgeeks.org/artificial-intelligence/model-context-protocol-mcp/
- https://www.youtube.com/watch?v=c7yl0GS2mJQ
- https://github.com/kmkarakaya/mcp_tutorial

Week 14

Multi-Modal LLMs [pdf]

- https://magazine.sebastianraschka.com/p/understanding-multimodal-llms
- https://www.ibm.com/think/topics/multimodal-llm
- https://arxiv.org/pdf/2408.01319v1

Similar Courses and Resources :

   - CS224N: Natural Language Processing with Deep Learning [html]
   - CME 295 - Transformers & Large Language Model [html]
   - CS336: Language Modeling from Scratch [html]
- COS 484: Natural Language Processing [html]
   - COS 597G (Fall 2022): Understanding Large Language Models [html]
   - CS 194/294-267 Understanding Large Language Models: Foundations and Safety [html]
   - LLM Courses [html]
   - HuggingFace LLM Course [html]
   - M. Labonne-llm-course [html]
   - CSC 2541 Winter 2025: Large Models [html]
   - CSC6203: Large Language Model [html]
   - CMU 11868 Large Language Model Systems [html]
   - CS 886: Recent Advances on Foundation Models - Waterloo [html]
   - AC215 : MLOps & LLMOps: Production AI Systems - AC215 [html]
   - CS294/194-196: Agentic AI [html]

CS 194/294-267 Understanding Large Language Models: Foundations and Safety