LLM Introductory Courses
Introductory Courses
CS224N — Stanford Deep Learning for NLP
Covers NLP fundamentals and includes a miniGPT project.
- Official site: CS224N Stanford
- Video: (2025, bilingual) Stanford CS224N — Deep Learning for Natural Language Processing
- Slides and assignments: Quark Cloud Drive
CMU Advanced NLP
- Course homepage: CMU Advanced NLP Spring 2025
- Code: GitHub Repo
- Assignments:
NanoGPT — GPT from Scratch
- Code: NanoGPT GitHub
- Learning materials: [LLM Training Series] NanoGPT source code walkthrough and Chinese GPT training practice
Stanford CS336 — Language Modeling from Scratch (Spring 2025)
- Course videos: YouTube Playlist
- Course homepage: CS336 Official Site
- Code repository: GitHub Repo
- Video translation: Bilibili
- Translation collection: Zhihu Column
- Private assignment repo: TODO
Assignment Overview
- Assignment 1: Implement a BPE tokenizer, Transformer architecture, and Adam optimizer; train on TinyStories and OpenWebText (PyTorch primitives only).
- Assignment 2: Implement Flash Attention 2 in Triton; distributed data parallelism + optimizer sharding.
- Assignment 3: Scaling Laws. Fit scaling laws using IsoFLOP, simulating experiments under a fixed compute budget.
- Assignment 4: Data pipeline. Convert Common Crawl HTML to text, filter (quality, harmful content, PII), deduplicate.
- Assignment 5: Alignment. Implement supervised fine-tuning, expert iteration, GRPO and variants; run RL on Qwen 2.5 Math 1.5B to improve MATH benchmark performance.
Prerequisites
- Math: MATH 51, CME 100
- Probability: CS 109
Study Notes
- CS336 study notes: TODO
Happy-LLM — Build a 215M LLM from Scratch
- Code repository: Happy-LLM GitHub
贡献者
这篇文章有帮助吗?
最近更新
Involution Hell© 2026 byCommunityunderCC BY-NC-SA 4.0