Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
28 commits
Select commit Hold shift + click to select a range
3891b3a
feat(http): add Content-Type header to httpx and requests factories
deanq Feb 14, 2026
2c85b2b
feat(http): add centralized aiohttp session factory
deanq Feb 14, 2026
aae56bd
refactor(api): use centralized aiohttp factory in RunPod clients
deanq Feb 14, 2026
ad5b9cb
refactor(app): use centralized requests factory for tarball operations
deanq Feb 14, 2026
65401be
test(http): add comprehensive tests for HTTP client factories
deanq Feb 14, 2026
c99b486
feat: cleanup flash deploy/undeploy/build command output format (#191)
KAJdev Feb 14, 2026
826f169
refactor: remove noisy debug logs from flash (AE-1966) (#204)
deanq Feb 14, 2026
b60df1b
feat(http): add Content-Type header to httpx and requests factories
deanq Feb 14, 2026
afea835
feat(http): add centralized aiohttp session factory
deanq Feb 14, 2026
51f8d2e
refactor(api): use centralized aiohttp factory in RunPod clients
deanq Feb 14, 2026
4543d2d
refactor(app): use centralized requests factory for tarball operations
deanq Feb 14, 2026
04c055d
test(http): add comprehensive tests for HTTP client factories
deanq Feb 14, 2026
90c4aee
Merge branch 'deanq/ae-2106-consolidated-http' of https://github.com/…
deanq Feb 14, 2026
5859f4b
fix(runtime): restore on-demand provisioning for flash run (#206)
deanq Feb 16, 2026
51693c7
fix(docs): change idleTimeout from minutes to seconds (#205)
DeJayDev Feb 17, 2026
84d349c
feat(http): add Content-Type header to httpx and requests factories
deanq Feb 14, 2026
5cebb9c
feat(http): add centralized aiohttp session factory
deanq Feb 14, 2026
8740c9c
refactor(api): use centralized aiohttp factory in RunPod clients
deanq Feb 14, 2026
fb8a6ff
refactor(app): use centralized requests factory for tarball operations
deanq Feb 14, 2026
f9ac3d1
test(http): add comprehensive tests for HTTP client factories
deanq Feb 14, 2026
f6ef93b
Merge branch 'deanq/ae-2106-consolidated-http' of https://github.com/…
deanq Feb 19, 2026
7432693
chore: release 1.2.0 (#194)
runpod-release-please-bot[bot] Feb 19, 2026
6622518
feat(http): add Content-Type header to httpx and requests factories
deanq Feb 14, 2026
94f76a5
feat(http): add centralized aiohttp session factory
deanq Feb 14, 2026
74c0b29
refactor(api): use centralized aiohttp factory in RunPod clients
deanq Feb 14, 2026
6daf9eb
refactor(app): use centralized requests factory for tarball operations
deanq Feb 14, 2026
9ca84b6
test(http): add comprehensive tests for HTTP client factories
deanq Feb 14, 2026
af2d37d
Merge branch 'deanq/ae-2106-consolidated-http' of https://github.com/…
deanq Feb 20, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion .release-please-manifest.json
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
{
".": "1.1.1"
".": "1.2.0"
}
29 changes: 29 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,34 @@
# Changelog

## [1.2.0](https://github.com/runpod/flash/compare/v1.1.1...v1.2.0) (2026-02-17)


### Features

* add API key propagation for cross-endpoint calls ([#193](https://github.com/runpod/flash/issues/193)) ([f87c9c1](https://github.com/runpod/flash/commit/f87c9c1cef7dfd1f427b278ea50fcc03f4e36372))
* add file-based logging for local CLI usage ([#197](https://github.com/runpod/flash/issues/197)) ([665bcfa](https://github.com/runpod/flash/commit/665bcfa108f95ebc040c82d9496cc6c6df484d36))
* add User-Agent header with version, OS, and arch ([#202](https://github.com/runpod/flash/issues/202)) ([5632907](https://github.com/runpod/flash/commit/5632907baae9681658d82ab649cb15c47d5d85b8))
* AE-2089: update sls endpoint template params ([#198](https://github.com/runpod/flash/issues/198)) ([656fa46](https://github.com/runpod/flash/commit/656fa4608ccae1e89e1ac28e6dae6b60e18ca175))
* cleanup flash deploy/undeploy/build command output format ([#191](https://github.com/runpod/flash/issues/191)) ([c99b486](https://github.com/runpod/flash/commit/c99b486d301043e7982b7f995f1754fb89379ff8))
* **logger:** add sensitive data filter to prevent logging API keys and tokens ([#200](https://github.com/runpod/flash/issues/200)) ([10967a4](https://github.com/runpod/flash/commit/10967a43c40ee5c7823c461eb2647b9472dde30b))


### Bug Fixes

* **docs:** change idleTimeout from minutes to seconds ([#205](https://github.com/runpod/flash/issues/205)) ([51693c7](https://github.com/runpod/flash/commit/51693c7e2dd0c9d803f3c49de1d0009ded285d5d))
* prevent false deployment attempts in Flash environments ([#192](https://github.com/runpod/flash/issues/192)) ([f07c9fb](https://github.com/runpod/flash/commit/f07c9fb92003d4603fbf8cdc17b956c368009353))
* **runtime:** restore on-demand provisioning for flash run ([#206](https://github.com/runpod/flash/issues/206)) ([5859f4b](https://github.com/runpod/flash/commit/5859f4b78476a070db2100b689dfd94caf5fc93f))


### Code Refactoring

* remove noisy debug logs from flash (AE-1966) ([#204](https://github.com/runpod/flash/issues/204)) ([826f169](https://github.com/runpod/flash/commit/826f1695ab2bbe620da290783194b8456fbb77cb))


### Documentation

* update CLI documentation for deploy, env, and app commands ([#195](https://github.com/runpod/flash/issues/195)) ([4126b37](https://github.com/runpod/flash/commit/4126b3704e625878d11bdd257fa6cc0fbe6bc709))

## [1.1.1](https://github.com/runpod/flash/compare/v1.1.0...v1.1.1) (2026-02-09)


Expand Down
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -595,7 +595,7 @@ The following parameters can be used with `LiveServerless` (full remote code exe
| `gpuCount` | Number of GPUs per worker | 1 | 1, 2, 4 |
| `workersMin` | Minimum number of workers | 0 | Set to 1 for persistence |
| `workersMax` | Maximum number of workers | 3 | Higher for more concurrency |
| `idleTimeout` | Minutes before scaling down | 5 | 10, 30, 60 |
| `idleTimeout` | Seconds before scaling down | 60 | 300, 600, 1800 |
| `env` | Environment variables | `None` | `{"HF_TOKEN": "xyz"}` |
| `networkVolumeId` | Persistent storage ID | `None` | `"vol_abc123"` |
| `executionTimeoutMs`| Max execution time (ms) | 0 (no limit) | 600000 (10 min) |
Expand Down
2 changes: 1 addition & 1 deletion docs/Flash_SDK_Reference.md
Original file line number Diff line number Diff line change
Expand Up @@ -124,7 +124,7 @@ class ResourceConfig:
# Worker scaling
workersMin: int = 0 # Minimum workers to maintain
workersMax: int = 3 # Maximum workers allowed
idleTimeout: int = 300 # Seconds before idle worker terminates
idleTimeout: int = 60 # Seconds before idle worker terminates

# Networking
networkVolumeId: Optional[str] = None # Mount persistent storage
Expand Down
2 changes: 1 addition & 1 deletion pyproject.toml
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
[project]
name = "runpod-flash"
version = "1.1.1"
version = "1.2.0"
description = "A Python library for distributed inference and serving of machine learning models"
authors = [
{ name = "Runpod", email = "engineer@runpod.io" },
Expand Down
Loading
Loading