feat: add autoscaling examples and repo cleanup by deanq · Pull Request #32 · runpod/flash-examples

deanq · 2026-02-23T04:57:13Z

Summary

Add autoscaling examples (04_scaling_performance/01_autoscaling/) with 5 scaling strategies: GPU scale-to-zero, always-on, high-throughput, and CPU scale-to-zero, burst-ready
Add load test script for observing scaling behavior with concurrent request bursts
Add .flash/ to gitignores and test block to pipeline.py
Regenerate CLAUDE.md with comprehensive repo analysis

Test plan

Verify flash run discovers all 5 new @remote endpoints in 01_autoscaling/
Run python gpu_worker.py and python cpu_worker.py directly to confirm test blocks work
Run python load_test.py --help to verify CLI args parse correctly
Confirm README links resolve correctly from 04_scaling_performance/README.md
Verify existing examples still work with flash run from repo root

Remove ambiguity with RunPod's {"input": ...} HTTP wrapping by renaming the input_data parameter to payload in 13 Python files (19 functions) and 5 documentation files. Also fix a local variable shadowing issue in cpu_burst_ready by renaming its internal payload variable to serialized.

- Add .flash/ directory to gitignore across getting_started examples - Add __main__ test block to mixed_workers pipeline.py

Output of /analyze-repos: adds module structure, public API surface, cross-repo dependencies, code health assessment, and test strategy.

Five scaling strategies across GPU (scale-to-zero, always-on, high-throughput) and CPU (scale-to-zero, burst-ready) with load test script for observing scaling behavior. Includes cost analysis and configuration reference in README.

All QB endpoint URLs used the wrong path segment. The flash-generated routes use /runsync, not /run_sync.

deanq added 5 commits February 21, 2026 20:34

chore: add .flash/ to gitignores and test block to pipeline

ee0758b

- Add .flash/ directory to gitignore across getting_started examples - Add __main__ test block to mixed_workers pipeline.py

docs: regenerate CLAUDE.md with comprehensive repo analysis

ef20769

Output of /analyze-repos: adds module structure, public API surface, cross-repo dependencies, code health assessment, and test strategy.

feat: add autoscaling examples for GPU and CPU workers

36d79f1

Five scaling strategies across GPU (scale-to-zero, always-on, high-throughput) and CPU (scale-to-zero, burst-ready) with load test script for observing scaling behavior. Includes cost analysis and configuration reference in README.

fix: correct endpoint paths from run_sync to runsync in docs

35e219a

All QB endpoint URLs used the wrong path segment. The flash-generated routes use /runsync, not /run_sync.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

feat: add autoscaling examples and repo cleanup#32

feat: add autoscaling examples and repo cleanup#32
deanq wants to merge 5 commits intomainfrom
deanq/ae-2079-cleanup

deanq commented Feb 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

deanq commented Feb 23, 2026

Summary

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant