Add hypernetwork approach and issue #140 analysis summary#336
Draft
jackopenn wants to merge 3 commits intoopenai:mainfrom
Draft
Add hypernetwork approach and issue #140 analysis summary#336jackopenn wants to merge 3 commits intoopenai:mainfrom
jackopenn wants to merge 3 commits intoopenai:mainfrom
Conversation
jackopenn
commented
Mar 21, 2026
- Issue 140 summary: 7 MD files covering leaderboard, core/advanced techniques, negative results, untried ideas, tier analysis, and technique lineage from the live AI commentary thread.
- Hypernetwork prototype: a shared-trunk MLP that generates full GPT weights from compact conditioning vectors (9.34x compression, 26.5M target params from 2.8M hypernet params, 2.09 MB artifact).
- Artifact size checker with --sweep mode to explore config space.
- Baseline train_gpt.py copied for reference.
- Issue 140 summary: 7 MD files covering leaderboard, core/advanced techniques, negative results, untried ideas, tier analysis, and technique lineage from the live AI commentary thread. - Hypernetwork prototype: a shared-trunk MLP that generates full GPT weights from compact conditioning vectors (9.34x compression, 26.5M target params from 2.8M hypernet params, 2.09 MB artifact). - Artifact size checker with --sweep mode to explore config space. - Baseline train_gpt.py copied for reference. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- train_gpt_hypernet.py: combined single file with full baseline GPT architecture, hypernetwork, HyperGPT wrapper, training loop, and serialization. Target GPT config matches baseline (9L/512d/2xMLP). - Move check_artifact_size.py to attempts/ root. - Update hypernetwork.py defaults to match baseline. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Remove train_gpt_baseline.py (redundant, baseline lives at repo root) - Add README.md explaining hypernetwork architecture, weight generation flow, parameter budget, and configuration Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.