learn-attn

This repository contains docs and code that build a small-scale GPT on the Tiny Shakespeare dataset. It is designed to teach people like me: those who have a basic understanding of deep learning and the related math (namely basic multivariable calculus and linear algebra) and want to dive deeper into the world of attention and transformers.

The repository was generated as follows:

I asked Claude Code to fetch Harvard's The Annotated Transformer and Karpathy's nanoGPT, then summarize and restructure them so that they are easy to understand for someone with my background.
I followed the generated docs and asked Claude Code to make edits as I went along. Rinse and repeat.

Quick Start

uv sync                                                                # install deps
uv run python -m babygpt.dataset                                       # print dataset stats
uv run python -m babygpt.train                                         # train (~14 min on RTX 3080)
uv run python -m babygpt.generate --prompt "ROMEO:" --temperature 0.8  # generate text
./build.sh                                                             # rebuild tutorial PDF
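The `--temperature 0.8` flag above controls how adventurous sampling is. As a rough illustration of the general technique (not the code in `babygpt.generate`, which is not shown here), the sketch below divides made-up logits by a temperature before applying softmax. The function names `softmax_with_temperature` and `sample_index` and the example scores are hypothetical, chosen only for this example.

```python
import math
import random


def softmax_with_temperature(logits, temperature=1.0):
    """Turn raw scores into probabilities after dividing by the temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_index(logits, temperature=1.0, rng=random):
    """Draw one index at random, weighted by the temperature-scaled probabilities."""
    probs = softmax_with_temperature(logits, temperature)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


# Hypothetical scores for three candidate next tokens.
logits = [2.0, 1.0, 0.1]
sharp = softmax_with_temperature(logits, temperature=0.8)  # favors the top score more
flat = softmax_with_temperature(logits, temperature=2.0)   # closer to uniform
print(sharp, flat, sample_index(logits, temperature=0.8))
```

Temperatures below 1 concentrate probability on the highest-scoring token, while temperatures above 1 spread it out, so lower values give more conservative text and higher values give more varied text.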

Credits

I cannot take credit for this repository, which is based heavily on the work of others. I'd like to thank:

The original authors of the paper Attention is All You Need and the many researchers who have built on top of it.
The authors of the Annotated Transformer. (MIT License)
Karpathy, for his tutorials and his nanoGPT codebase. (MIT License)

How is this not AI slop?

I read everything.
