Fix model-weight distribution so the repo is clone-and-run #1

Open
mb-westover wants to merge 1 commit into main from fix/weights-clone-and-run

@mb-westover

Contributor

Problem

A fresh git clone of GROND could not run the models. The weight caches
(pd_channel_cache, cet_cache, dl_cache, models) are gitignored, but the
README claimed they ship in git, and .gitignore pointed at an empty mirror
path (s3://bdsp-opendata-credentialed/iiic-freq3/data/). So the ~12 MB of
inference weights a clone needs were unreachable from any documented source.

Fix

  • Uploaded the missing gitignored inference weights (~12 MB: pd_channel_cache,
    cet_cache, dl_cache/ridge_freq_model.npz, models/bipd_gbt.pkl) to
    s3://bdsp-opendata-credentialed/grond/data/. Round-trip verified: a fresh
    aws s3 sync fetches all 5 cnn_attn folds and the ridge model, and all of them load.
  • README: replaced the incorrect "weights live in git" claim with the actual
    sync step (aws s3 sync s3://…/grond/data/ data/) and updated the S3 contents.
  • .gitignore: corrected the stale iiic-freq3/data mirror path to
    grond/data/ and documented the sync command.

Verified: all 11 models (UnifiedPDModel, HemiCET, ChannelPD-Net, CET-UNet,
PDProfiler, RDA-Profiler, ridge, BIPD-GBT, …) load and run after
git clone + aws s3 sync s3://…/grond/data/ data/.
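
A post-sync sanity check could look like the sketch below. It is not part of this PR. It assumes the four paths named above land directly under data/ after the sync, and the helper name missing_weights is hypothetical. It only checks that files exist and are non-empty; it does not load any model.

```python
from pathlib import Path

# Assumed layout under data/ after the sync, taken from the paths named in this PR.
EXPECTED = (
    "pd_channel_cache",
    "cet_cache",
    "dl_cache/ridge_freq_model.npz",
    "models/bipd_gbt.pkl",
)


def missing_weights(data_dir="data", expected=EXPECTED):
    """Return the expected entries that are absent or empty under data_dir."""
    root = Path(data_dir)
    missing = []
    for rel in expected:
        path = root / rel
        if path.is_dir():
            # A cache directory counts as present only if it holds at least one file.
            if not any(p.is_file() for p in path.rglob("*")):
                missing.append(rel)
        elif not path.is_file() or path.stat().st_size == 0:
            # A single weight file must exist and be non-empty.
            missing.append(rel)
    return missing


if __name__ == "__main__":
    gaps = missing_weights()
    if gaps:
        raise SystemExit(
            "Missing weights (run the aws s3 sync step first): " + ", ".join(gaps)
        )
    print("All expected weight paths are present.")
```

Run from the repo root after the sync, this exits non-zero and lists what is missing, which makes it usable as a quick pre-flight step before loading any model.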

Notes (not addressed here)

  • e2e folds 3–4 exist only as mid-training checkpoints (experimental, not in
    the published pipeline).
  • code/visualization/visualize_embeddings.py is orphaned (imports a nonexistent
    code/dl/ module) — candidate for removal.

🤖 Generated with Claude Code

The weight caches (pd_channel_cache, cet_cache, dl_cache, models) are
gitignored, but the README claimed they ship in git — so a fresh clone could
not run the models, and the .gitignore pointed at an empty mirror path
(iiic-freq3/data). Upload these ~12 MB inference weights to
s3://bdsp-opendata-credentialed/grond/data/ (verified round-trip), document the
sync step in the README, and correct the .gitignore mirror path.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>