Description
This PR upgrades the baseline train_gpt.py with several state-of-the-art techniques used by top leaderboard entries, targeting ~1.14 bits per byte (BPB).
Architectural Improvements
BigramHash Embedding: Adds token-pair hashing for cheap local context.
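A minimal sketch of the idea in NumPy; the hash constant, table size, and function names here are illustrative assumptions, not the PR's actual code:

```python
import numpy as np

def bigram_hash_embed(tokens, tok_emb, bigram_table, mix=1_000_003):
    """Add a hashed (prev_token, token) embedding to each token embedding.

    tokens:       (T,) int token ids
    tok_emb:      (vocab, D) ordinary token embedding table
    bigram_table: (B, D) hashed bigram embedding table
    The mixing constant and table size are illustrative choices.
    """
    B = bigram_table.shape[0]
    x = np.asarray(tok_emb)[tokens]               # (T, D) unigram embeddings
    prev = np.concatenate([[0], tokens[:-1]])     # previous token; 0 at position 0
    idx = (prev * mix + tokens) % B               # cheap pair hash -> bucket id
    return x + bigram_table[idx]                  # blend in local bigram context

# Tiny demo with random tables
rng = np.random.default_rng(0)
tok_emb = rng.normal(size=(10, 4))
bigram_table = rng.normal(size=(32, 4))
out = bigram_hash_embed(np.array([1, 2, 3]), tok_emb, bigram_table)
```

Because the table is indexed by a hash rather than a full vocab×vocab matrix, collisions are possible but the parameter cost stays small.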
SmearGate: Implements learned gating to blend information between adjacent tokens.
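One plausible parameterization, sketched in NumPy; the single-vector gate projection is an assumption for illustration (the PR may gate per-channel):

```python
import numpy as np

def smear_gate(x, w):
    """Blend each token's activation with its left neighbor via a learned gate.

    x: (T, D) token activations; w: (D,) gate projection (learned in training).
    """
    g = 1.0 / (1.0 + np.exp(-(x @ w)))                # (T,) per-token gate in (0, 1)
    prev = np.vstack([np.zeros_like(x[:1]), x[:-1]])  # shift right by one position
    return x + g[:, None] * prev                      # smear neighbor info forward

rng = np.random.default_rng(0)
x = rng.normal(size=(5, 8))
w = rng.normal(size=8)
y = smear_gate(x, w)
```

The first position has no left neighbor, so it passes through unchanged.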
Improved Initialization: Linear layers now use orthogonal initialization.
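In PyTorch this is `torch.nn.init.orthogonal_`; the equivalent construction via QR decomposition, shown here in NumPy as a sketch:

```python
import numpy as np

def orthogonal_init(rows, cols, rng):
    """Orthogonal initialization for a (rows x cols) linear layer weight.

    QR-decompose a random Gaussian matrix and sign-correct Q so the result
    is drawn uniformly from the orthogonal group (Saxe et al. style init).
    """
    a = rng.normal(size=(rows, cols))
    q, r = np.linalg.qr(a)
    q *= np.sign(np.diag(r))   # fix signs so the decomposition is unique
    return q

rng = np.random.default_rng(0)
W = orthogonal_init(6, 4, rng)
```

Orthogonal columns preserve activation norms at initialization, which tends to stabilize early training.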
Training & Optimizer Enhancements
Quantization-Aware Training (QAT): Uses Straight-Through Estimators (STE) to simulate Int8 rounding during training.
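A sketch of the forward-pass fake quantizer in NumPy; the absmax/127 scale is one common convention and an assumption here. In PyTorch the STE is typically written as `x + (quant(x) - x).detach()`, so gradients flow through the rounding as if it were the identity:

```python
import numpy as np

def fake_quant_int8(x):
    """Simulate symmetric per-tensor int8 rounding in the forward pass."""
    scale = np.abs(x).max() / 127.0 + 1e-12       # map range onto [-127, 127]
    q = np.clip(np.round(x / scale), -127, 127)   # integer grid (non-differentiable)
    return q * scale                              # dequantize back to float

x = np.linspace(-1.0, 1.0, 9)
xq = fake_quant_int8(x)
```

Training against the rounded values lets the network adapt to the int8 grid it will be stored in.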
Stochastic Weight Averaging (SWA): Averages weights during the warmdown phase for better generalization.
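The averaging itself is just a running mean over checkpoints; a minimal sketch with a dict of arrays standing in for a real model state dict:

```python
import numpy as np

class SWA:
    """Running equal-weight average of parameters seen during warmdown.

    Call update(params) once per averaging step; `avg` holds the mean.
    """
    def __init__(self):
        self.avg, self.n = {}, 0

    def update(self, params):
        self.n += 1
        for k, v in params.items():
            if k not in self.avg:
                self.avg[k] = np.asarray(v, dtype=np.float64).copy()
            else:
                # incremental mean: avg += (v - avg) / n
                self.avg[k] += (v - self.avg[k]) / self.n

swa = SWA()
swa.update({"w": np.array([1.0, 3.0])})
swa.update({"w": np.array([3.0, 5.0])})
```

The incremental-mean form avoids keeping every checkpoint in memory.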
Muon Upgrade: Adds weight decay support to the Muon optimizer.
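The natural way to add weight decay to Muon is the decoupled (AdamW-style) form, applied directly to the parameter rather than folded into the gradient. A sketch, with Muon's Newton–Schulz orthogonalization omitted and `update` standing in for the orthogonalized momentum direction:

```python
import numpy as np

def muon_step_with_wd(param, update, lr, wd):
    """One Muon-style step with decoupled weight decay (illustrative)."""
    param *= (1.0 - lr * wd)   # decoupled decay: shrink weights toward zero
    param -= lr * update       # then apply the (orthogonalized) update
    return param

p = np.array([1.0, -2.0])
p = muon_step_with_wd(p, update=np.array([0.1, 0.1]), lr=0.1, wd=0.5)
```

Keeping the decay outside the update means its strength is independent of the update's normalization.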
Compression & Evaluation
Magnitude Pruning: Zeroes out the 3% of weights with smallest magnitude after training, so the artifact compresses better.
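A sketch of global magnitude pruning in NumPy; the global (rather than per-layer) threshold is an assumption:

```python
import numpy as np

def magnitude_prune(w, frac=0.03):
    """Zero the smallest-magnitude `frac` of entries in w (in place)."""
    k = int(w.size * frac)
    if k == 0:
        return w
    # k-th smallest absolute value serves as the pruning threshold
    thresh = np.partition(np.abs(w).ravel(), k - 1)[k - 1]
    w[np.abs(w) <= thresh] = 0.0
    return w

rng = np.random.default_rng(0)
w = magnitude_prune(rng.normal(size=1000), frac=0.03)
```

Runs of zeros are highly compressible, which is the point of doing this before the final Zstd pass.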
Zstandard (Zstd-22): Replaces zlib with maximum Zstd compression for the 16MB artifact.
Sliding Window Evaluation: Implements strided evaluation (stride = 64) so that each scored token is evaluated with near-full context.
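The windowing logic amounts to computing which span of tokens each window scores; a sketch (the window size of 1024 is an illustrative assumption, only the stride of 64 comes from the PR):

```python
def strided_eval_spans(n_tokens, window=1024, stride=64):
    """Yield (begin, end, score_from) spans for sliding-window evaluation.

    Each window covers tokens [begin, end); only tokens in [score_from, end)
    are scored, so every scored token after the first window sees at least
    window - stride tokens of left context.
    """
    spans, prev_end = [], 0
    for begin in range(0, n_tokens, stride):
        end = min(begin + window, n_tokens)
        spans.append((begin, end, prev_end))  # score only the new tokens
        prev_end = end
        if end == n_tokens:
            break
    return spans

spans = strided_eval_spans(10, window=8, stride=4)
```

Each token is scored exactly once, and shrinking the stride trades extra forward passes for more context per scored token.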
Verification
Verified syntax correctness with py_compile.
Confirmed environment setup using uv.