Skip to content

Add hypernetwork approach and issue #140 analysis summary#336

Draft
jackopenn wants to merge 3 commits intoopenai:mainfrom
jackopenn:claude/jolly-ishizaka
Draft

Add hypernetwork approach and issue #140 analysis summary#336
jackopenn wants to merge 3 commits intoopenai:mainfrom
jackopenn:claude/jolly-ishizaka

Conversation

@jackopenn
Copy link

  • Issue 140 summary: 7 MD files covering leaderboard, core/advanced techniques, negative results, untried ideas, tier analysis, and technique lineage from the live AI commentary thread.
  • Hypernetwork prototype: a shared-trunk MLP that generates full GPT weights from compact conditioning vectors (9.34x compression, 26.5M target params from 2.8M hypernet params, 2.09 MB artifact).
  • Artifact size checker with --sweep mode to explore config space.
  • Baseline train_gpt.py copied for reference.

- Issue 140 summary: 7 MD files covering leaderboard, core/advanced
  techniques, negative results, untried ideas, tier analysis, and
  technique lineage from the live AI commentary thread.
- Hypernetwork prototype: a shared-trunk MLP that generates full GPT
  weights from compact conditioning vectors (9.34x compression,
  26.5M target params from 2.8M hypernet params, 2.09 MB artifact).
- Artifact size checker with --sweep mode to explore config space.
- Baseline train_gpt.py copied for reference.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
jackopenn and others added 2 commits March 21, 2026 11:15
- train_gpt_hypernet.py: combined single file with full baseline GPT
  architecture, hypernetwork, HyperGPT wrapper, training loop, and
  serialization. Target GPT config matches baseline (9L/512d/2xMLP).
- Move check_artifact_size.py to attempts/ root.
- Update hypernetwork.py defaults to match baseline.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Remove train_gpt_baseline.py (redundant, baseline lives at repo root)
- Add README.md explaining hypernetwork architecture, weight generation
  flow, parameter budget, and configuration

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant