A miniature agentic language model built from scratch using PyTorch — no pre-trained weights, no APIs. Engram departs from traditional LLMs by embedding agentic reasoning and persistent memory directly into the architecture.
Engram separates reasoning (PyTorch) from vocabulary (ChromaDB) and adds three agentic capabilities that make it fundamentally different from standard next-token predictors.
```
[ChromaDB vocabulary: unbounded, learnable word vectors, normalized]
        ↓ look up last N words
[PyTorch AttentionBrain: fixed-size, adaptive pondering]
        ↓ predict next concept vector
[ChromaDB episodic memory: retrieve similar brain states]
        ↓ blend memories with prediction
[ChromaDB vocabulary: nearest-neighbor search → word]
```
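The pipeline above can be sketched end to end. This is a toy stand-in (the mean of the context vectors replaces the real AttentionBrain, and a plain dict replaces ChromaDB); all names are illustrative, not Engram's actual API:

```python
import numpy as np

rng = np.random.default_rng(3)
DIM = 64

def unit(v):
    return v / np.linalg.norm(v)

# Toy vocabulary: each word is an L2-normalized 64-D concept vector.
vocab = {w: unit(rng.standard_normal(DIM)) for w in ["hello", "world", "engram"]}

def generate_next(context_words, memory=None, blend=0.3):
    context = np.stack([vocab[w] for w in context_words])  # 1. look up last N words
    pred = context.mean(axis=0)                            # 2. "brain" predicts a concept vector
    if memory is not None:                                 # 3. blend a retrieved episode
        pred = (1 - blend) * pred + blend * memory
    return max(vocab, key=lambda w: vocab[w] @ pred)       # 4. nearest-neighbor decode

print(generate_next(["hello", "world"]))
```

The key property this illustrates: the brain never sees word identities, only vectors, so the vocabulary can grow without retraining the brain.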
PyTorch brain (AttentionBrain):
- ~137k parameters — fixed size regardless of vocabulary
- 3 stacked attention layers with causal masking
- Adaptive pondering: loops through layers up to 3 times with learned halt gate
- Allocates more compute to difficult inputs (like PonderNet/TRM)
- Knows HOW to think in concept space, not WHAT words mean
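A minimal sketch of the pondering loop, assuming a single linear layer standing in for the attention stack; the names `PonderingBlock`, `halt_gate`, and `max_ponder` are illustrative, not the real Engram identifiers:

```python
import torch
import torch.nn as nn

class PonderingBlock(nn.Module):
    def __init__(self, dim=64, max_ponder=3):
        super().__init__()
        self.layer = nn.Linear(dim, dim)    # stand-in for the attention layers
        self.halt_gate = nn.Linear(dim, 1)  # learned scalar halt signal
        self.max_ponder = max_ponder

    def forward(self, h):
        steps = 0
        for _ in range(self.max_ponder):
            h = torch.tanh(self.layer(h))   # one reasoning pass over the state
            steps += 1
            # Halt early once the learned gate is confident enough.
            if torch.sigmoid(self.halt_gate(h)).mean() > 0.5:
                break
        return h, steps

torch.manual_seed(0)
block = PonderingBlock()
h, steps = block(torch.randn(1, 64))
print(steps)  # between 1 and max_ponder, depending on the gate
```

Because the gate is a trained module, easy inputs tend to halt after one pass while novel ones consume the full loop budget.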
ChromaDB vocabulary (concept space):
- Each word = coordinate in 64D concept space
- Vectors are learnable via gradient descent
- L2-normalized for semantic similarity (not magnitude-based)
- New words can be added anytime without touching brain architecture
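A minimal stand-in for the vocabulary store: each word is an L2-normalized 64-D vector, and decoding a predicted concept vector is a nearest-neighbor search by cosine similarity (a dot product on unit vectors). This uses a plain dict rather than the real ChromaDB collection API:

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 64

def normalize(v):
    return v / np.linalg.norm(v)

# Learnable word vectors, kept on the unit sphere.
vocab = {w: normalize(rng.standard_normal(DIM)) for w in ["cat", "dog", "tree"]}

def nearest_word(query):
    query = normalize(query)
    return max(vocab, key=lambda w: vocab[w] @ query)

# A predicted vector near "cat" decodes back to "cat" despite noise.
decoded = nearest_word(vocab["cat"] + 0.05 * rng.standard_normal(DIM))

# New words can be added at any time without touching the brain.
vocab["engram"] = normalize(rng.standard_normal(DIM))
print(decoded, nearest_word(vocab["engram"]))
```

Normalization is what makes the dot product a pure similarity measure: direction carries meaning, magnitude does not.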
ChromaDB episodic memory (hippocampus):
- Separate collection storing specific interaction moments
- Indexed by brain's internal state, not word identity
- Retrieved during generation and blended with predictions
- Dynamic topic retrieval emerges from embedding geometry — no explicit topic management needed
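A sketch of brain-state-indexed episodic memory: episodes are keyed by the hidden state the brain was in, not by word identity, and a retrieved episode is blended into the current prediction. The names (`store_episode`, `retrieve_and_blend`, `blend_weight`) are illustrative:

```python
import numpy as np

rng = np.random.default_rng(1)
DIM = 64

episodes = []  # list of (brain_state, remembered_prediction)

def store_episode(state, prediction):
    episodes.append((state / np.linalg.norm(state), prediction))

def retrieve_and_blend(state, prediction, blend_weight=0.3):
    if not episodes:
        return prediction
    state = state / np.linalg.norm(state)
    # Nearest stored brain state by cosine similarity.
    _, best_pred = max(episodes, key=lambda e: e[0] @ state)
    return (1 - blend_weight) * prediction + blend_weight * best_pred

s = rng.standard_normal(DIM)
store_episode(s, np.ones(DIM))
blended = retrieve_and_blend(s, np.zeros(DIM))
print(blended[:3])  # pulled 30% toward the remembered prediction
```

Because retrieval keys on internal state, returning to a similar "mental context" surfaces the relevant memory even when the surface words differ.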
Three agentic capabilities:
- Surprise-gated learning (dopamine signal): High prediction error = learn more aggressively (up to 3x gradient). Low error = learn gently. Physical allocation of neural change to novel moments.
- Episodic memory (hippocampus): Remembers specific interactions in a searchable brain-state-indexed collection. Blends retrieved memories during generation.
- Recurrent pondering (adaptive compute): Loops through attention blocks 1-3 times based on learned halt gate. More "thinking" for novel inputs.
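The surprise gate can be sketched as a scale on the loss before backprop: error well above the running average gets up to 3x gradient, error below it gets a gentler update. The clamp bounds and variable names here are illustrative, not Engram's exact values:

```python
import torch

torch.manual_seed(0)
model = torch.nn.Linear(64, 64)
opt = torch.optim.SGD(model.parameters(), lr=0.01)
avg_loss = 1.0  # running average of recent prediction error (assumed)

x, target = torch.randn(8, 64), torch.randn(8, 64)
loss = torch.nn.functional.mse_loss(model(x), target)

# Surprise = error relative to the running average, clamped to [0.3, 3.0]:
# novel inputs learn aggressively, familiar ones learn gently.
surprise_scale = max(0.3, min(3.0, loss.item() / avg_loss))
(surprise_scale * loss).backward()
opt.step()
print(round(surprise_scale, 2))
```

Scaling the loss scales every gradient uniformly, so neural change is physically concentrated on the surprising moments.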
What makes this different from GPT: Standard LLMs treat every token identically, compress everything into weights, and use fixed compute per token. Engram physically allocates more neural change to surprises, remembers specific moments in episodic memory, and adaptively allocates reasoning depth. The vocabulary is an external, persistent, continuously-updatable semantic space.
| File | Purpose |
|---|---|
| `ingest.py` | Trains on `training_data.txt` + `corpus/*.txt`. Auto-detects Q&A pairs, trains with pondering, saves normalized embeddings |
| `test_brain.py` | Interactive chat. Shows surprise scores, episodic memory stores, pondering steps, and subconscious thoughts |
| `training_data.txt` | Training corpus — add text here or drop files in the `corpus/` folder |
| `corpus/` | Additional `.txt` files for training (optional) |
| `engram_weights.pth` | Saved PyTorch brain weights |
| `engram_memory/` | ChromaDB persistence: `engram_vocab` (words) + `engram_episodes` (memories) |
```shell
# 1. Train on the corpus
uv run ingest.py

# 2. Chat with the trained brain
uv run test_brain.py
```

In `ingest.py`:

- `EMBED_DIM` (default 64): embedding size — higher = more capacity, slower training
- `CONTEXT_SIZE` (default 8): how many past words to attend over
- `N_LAYERS` (default 3): attention block depth
- `EPOCHS` (default 5): training passes — increase for better convergence
- `max_ponder` in `AttentionBrain` (default 3): maximum pondering loops
In `test_brain.py`:

- `TEMPERATURE` (default 0.9): higher = more creative/random, lower = more conservative
- `TOP_K` (default 10): sample from the top K candidate words at each step
- `surprise_threshold` for episode storage (default 1.5x average): lower = more memories stored
- `episode_blend_weight` (default 0.3): how much to blend retrieved episodes with the prediction
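The `TEMPERATURE` and `TOP_K` knobs can be sketched as a standard top-k sampler over candidate-word similarity scores; the function name and score values are illustrative:

```python
import numpy as np

rng = np.random.default_rng(2)

def sample_word(words, scores, temperature=0.9, top_k=10):
    order = np.argsort(scores)[::-1][:top_k]   # keep only the top-k candidates
    logits = np.asarray(scores)[order] / temperature
    probs = np.exp(logits - logits.max())      # softmax, numerically stable
    probs /= probs.sum()
    return words[rng.choice(order, p=probs)]

words = ["the", "cat", "sat", "mat", "dog"]
scores = [0.9, 0.8, 0.1, 0.05, 0.02]
print(sample_word(words, scores, temperature=0.9, top_k=3))
```

Lower temperature sharpens the distribution toward the top candidate; a smaller `top_k` hard-limits how far down the candidate list sampling can reach.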
| Capability | Status | What it enables |
|---|---|---|
| Adaptive pondering | ✅ Working | Variable compute allocation (1-3 reasoning loops) |
| Surprise-gated learning | ✅ Working | Up to 3x gradient for novel inputs |
| Episodic memory | ✅ Working | Persistent memory of specific interactions |
| Q&A auto-detection | ✅ Working | Learns conversational turn-taking automatically |
| Paragraph boundaries | ✅ Working | No cross-topic garbage transitions |
| Normalized embeddings | ✅ Working | Semantic similarity (not magnitude-based) |
| Context window + attention | ✅ Working | 8-word memory span |
| Diverse word generation | ✅ Working | Temperature + top-k sampling |
| Coherent short phrases | Partial | Needs more training data (10k+ words) |
| Long-range coherence | Not yet | Needs a larger model, more data, and more epochs |
Key features:
- Training separates paragraphs (blank lines) to avoid cross-topic noise
- Auto-detects Q&A patterns and injects `<USER>`/`<BOT>` markers
- Episodes persist across sessions (not wiped by re-training)
- Pondering depth varies: common words = 1 step, novel concepts = 2-3 steps
The biggest lever for quality is more training data: drop `.txt` files into the `corpus/` folder and re-run `ingest.py`.