Skip to content

addo561/engine-2-transformer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Engine 02: The Transformer

Transformer decoder built from scratch following Sebastien Raschka's Build a Large Language Model From Scratch. Covers the full architecture from tokenization to autoregressive generation.

Architecture(Right one-> decoder)

attention_research_1

🎯 Objective

Build a standalone Transformer engine to master the architecture.

🛠️ Implementation Progress

Data - setup done ✅, got a short story from Wikipedia ("The Journey" by Edith Wharton)

Attention.py - done, full multi-head attention mechanism.

Transformer.ipynb - contains full decoder.

About

Transformer decoder built from scratch following Sebastien Raschka's approach — tokenization through to autoregressive text generation, no training frameworks.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors