内卷地狱

LLM Introductory Courses

Edit Me

Introductory Courses

CS224N — Stanford Deep Learning for NLP

Covers NLP fundamentals and includes a miniGPT project.

  • Official site: CS224N Stanford
  • Video: (2025, bilingual) Stanford CS224N — Deep Learning for Natural Language Processing
  • Slides and assignments: Quark Cloud Drive

CMU Advanced NLP


NanoGPT — GPT from Scratch

  • Code: NanoGPT GitHub
  • Learning materials: [LLM Training Series] NanoGPT source code walkthrough and Chinese GPT training practice

Stanford CS336 — Language Modeling from Scratch (Spring 2025)

Assignment Overview

  1. Assignment 1: Implement a BPE tokenizer, Transformer architecture, and Adam optimizer; train on TinyStories and OpenWebText (PyTorch primitives only).
  2. Assignment 2: Implement Flash Attention 2 in Triton; distributed data parallelism + optimizer sharding.
  3. Assignment 3: Scaling Laws. Fit scaling laws using IsoFLOP, simulating experiments under a fixed compute budget.
  4. Assignment 4: Data pipeline. Convert Common Crawl HTML to text, filter (quality, harmful content, PII), deduplicate.
  5. Assignment 5: Alignment. Implement supervised fine-tuning, expert iteration, GRPO and variants; run RL on Qwen 2.5 Math 1.5B to improve MATH benchmark performance.

Prerequisites

  • Math: MATH 51, CME 100
  • Probability: CS 109

Study Notes

  • CS336 study notes: TODO

Happy-LLM — Build a 215M LLM from Scratch


贡献者


这篇文章有帮助吗?

最近更新

Involution Hell© 2026 byCommunityunderCC BY-NC-SA 4.0CCBYNCSA