CS336: Language Modeling from Scratch Spring 2025

This course provides a comprehensive, hands-on introduction to language modeling, guiding students through building language models from scratch. Topics include data collection, transformer architectures, model training, evaluation, and deployment. The course is implementation-heavy and requires strong Python and deep learning skills.

Logistics

Lectures: Tuesday/Thursday 3:00–4:20pm, NVIDIA Auditorium
Office Hours:
- Tatsu Hashimoto (Gates 364): Fridays 3–4pm
- Percy Liang (Gates 350): Fridays 11am–12pm
- Marcel Rød (Gates 415): Mon/Wed 11am–12pm
- Neil Band (Gates 358): Mon 4–5pm, Tues 5–6pm
- Rohith Kuditipudi (Gates 358): Mon/Wed 10–11am
Contact: Use public Slack channels for questions and announcements. For personal matters, email cs336-spr2425-staff@lists.stanford.edu.

Coursework

Basics: Implement and train a standard Transformer language model.
Systems: Profile, optimize, and distribute model training.
Scaling: Analyze and fit scaling laws for model growth.
Data: Process and filter large-scale pretraining data.
Alignment and Reasoning RL: Apply supervised finetuning and RL for reasoning tasks.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
assignment		assignment
lecture		lecture
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CS336: Language Modeling from Scratch Spring 2025

Logistics

Coursework

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CS336: Language Modeling from Scratch Spring 2025

Logistics

Coursework

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages