TinyWAM is a tiny DreamZero-style World Action Model reproduction for a 2D tabletop manipulation task. The installable Python package is tinydreamzero. It trains on scripted pixel demonstrations, predicts future visual latents and action chunks jointly with flow matching, and evaluates closed-loop on held-out layouts.
uv venv --python python3.14
uv sync --extra devuv run --extra dev python -m tinydreamzero smoke --run-dir runs/smokeExpected artifacts:
runs/smoke/data/metadata.jsonruns/smoke/checkpoints/autoencoder.ptruns/smoke/checkpoints/wam.ptruns/smoke/metrics_eval_baselines.jsonruns/smoke/report.md
uv run --extra dev python -m tinydreamzero generate --run-dir runs/default
uv run --extra dev python -m tinydreamzero train-ae --run-dir runs/default --data-root runs/default/data --device auto
uv run --extra dev python -m tinydreamzero train-wam --run-dir runs/default --data-root runs/default/data --device auto
uv run --extra dev python -m tinydreamzero eval --run-dir runs/default --policy baselines
uv run --extra dev python -m tinydreamzero eval --run-dir runs/default --policy wam --device auto
uv run --extra dev python -m tinydreamzero demo --run-dir runs/defaultOn an 8GB M1, start with the smoke workflow, then reduce --batch-size before increasing episode counts.
After generating a dataset, play episodes in real time with the Tkinter/Pillow viewer:
uv run --extra dev python scripts/play_dataset.py runs/dev/data --split train --fps 12 --scale 8Use --split all to continue through train, validation, and test episodes in sorted order. Press space to pause, right arrow to skip to the next episode, and q or escape to quit.
After training the autoencoder and WAM checkpoints, inspect closed-loop rollouts and the model's predicted future frames:
uv run --extra dev python scripts/play_wam_rollouts.py --run-dir runs/default --split test --episodes 32 --fps 8 --scale 5The viewer shows the current rendered frame, autoencoder reconstruction, WAM-predicted future frames, and the actual frames produced by executing the predicted action chunk.
MIT License. See LICENSE.