This repository contains my implementation of the programming assignments for Stanford CS336 (Spring 2025).
- Python: 3.11+
- Environment manager: uv
Install uv and download datasets:
pip install uv
source download_datasets.sh

Each assignment is self-contained.
Enter the assignment folder and install dependencies, for example:
cd assignment1-basics
uv sync

Stanford-CS336-Implementation/
├── README.md # This file
├── download_datasets.sh # Dataset download script
├── datasets/ # Shared datasets
│ ├── TinyStoriesV2-GPT4-train.txt
│ ├── TinyStoriesV2-GPT4-valid.txt
│ ├── owt_train.txt
│ └── owt_valid.txt
├── assignment1-basics/ # Transformer LM from scratch
│ ├── README.md
│ ├── Report.pdf
│ ├── setup.py
│ ├── pyproject.toml
│ ├── cs336_basics/ # Main package source code
│ ├── tests/ # Unit tests
│ └── scripts/ # Training / evaluation scripts
├── assignment2-systems/ # TODO
├── assignment3-scaling/ # TODO
├── assignment4-data/ # TODO
└── assignment5-alignment/ # TODO
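
After running the download script, it can be handy to confirm that the four files listed under `datasets/` in the tree above actually arrived. The following is a minimal sketch, not part of this repository: the helper name `missing_datasets` is hypothetical, and it assumes you run it from the repository root so that `datasets/` resolves relative to the current directory.

```python
from __future__ import annotations

from pathlib import Path

# File names copied from the directory tree above.
EXPECTED_DATASETS = [
    "TinyStoriesV2-GPT4-train.txt",
    "TinyStoriesV2-GPT4-valid.txt",
    "owt_train.txt",
    "owt_valid.txt",
]


def missing_datasets(datasets_dir: str | Path = "datasets") -> list[str]:
    """Return the expected dataset files that are absent or empty.

    Hypothetical helper for illustration; the default directory assumes
    the script is run from the repository root.
    """
    root = Path(datasets_dir)
    missing = []
    for name in EXPECTED_DATASETS:
        path = root / name
        # Treat a zero-byte file as missing, since an interrupted
        # download can leave an empty file behind.
        if not path.is_file() or path.stat().st_size == 0:
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = missing_datasets()
    if missing:
        print("Missing or empty dataset files:", ", ".join(missing))
    else:
        print("All expected dataset files are present.")
```

If any file is reported missing, re-running `download_datasets.sh` is the first thing to try.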