Releases: mlsys-io/FlowMesh
v0.1.1
What's Changed
- chore: run release workflows on self-hosted runners by @kaiitunnz in #39
- chore: bump astral-sh/setup-uv from 7.6.0 to 8.1.0 by @dependabot[bot] in #40
- chore: bump actions/download-artifact from 4.3.0 to 8.0.1 by @dependabot[bot] in #41
- chore: bump actions/upload-artifact from 4.6.2 to 7.0.1 by @dependabot[bot] in #42
- fix: send api key on worker node registration by @kaiitunnz in #43
- fix: drop redundant main fetch in release workflows by @kaiitunnz in #45
- chore: prepare v0.1.1 release by @kaiitunnz in #46
Full Changelog: v0.1.0...v0.1.1
FlowMesh container images for v0.1.1
| Image | Platforms | Digest |
|---|---|---|
| ghcr.io/mlsys-io/flowmesh_server:v0.1.1 | linux/amd64,linux/arm64 | sha256:b5185f4c7df71874362c2e8f1d09b6baf08984ca4dfed1b4475aa6194a30d0c9 |
| ghcr.io/mlsys-io/flowmesh_worker:v0.1.1-cpu | linux/amd64,linux/arm64 | sha256:293bf7999a6a0099e7075ef781947cc11b80196791a5f195b62678787591660c |
| ghcr.io/mlsys-io/flowmesh_worker_builder:v0.1.1-gpu | linux/amd64,linux/arm64 | sha256:b342971dce7c8505f0206324938caef578a0fe41b9f54c8fb9f8488b3eb9627b |
| ghcr.io/mlsys-io/flowmesh_worker:v0.1.1-gpu | linux/amd64,linux/arm64 | sha256:1927f92279ec7d645191f916ac44eabfd6d09cc994732900da8bf2f4ae20cbe0 |
| ghcr.io/mlsys-io/flowmesh_ssh:v0.1.1-cpu | linux/amd64,linux/arm64 | sha256:955c871dd7dcb55a18a16e509cfa8e50eb8002d650902c55d0ab72c93fa24b11 |
| ghcr.io/mlsys-io/flowmesh_ssh:v0.1.1-gpu | linux/amd64,linux/arm64 | sha256:ef5d0e041c97ce17ea9f3741ef534d365dd509c4f4e2d211a1a1eaf1e9d9781e |
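The digests in the table above can be used to pin an image to exact content rather than a mutable tag. The following is a minimal sketch, not part of FlowMesh: `pinned_reference` and `IMAGES` are hypothetical names introduced here for illustration, and only two of the six images are listed. It builds references of the form `name:tag@sha256:...`, which container tooling such as `docker pull` accepts.

```python
import string

# Digests copied from the v0.1.1 image table (two of the six images shown).
IMAGES = {
    "ghcr.io/mlsys-io/flowmesh_server:v0.1.1":
        "sha256:b5185f4c7df71874362c2e8f1d09b6baf08984ca4dfed1b4475aa6194a30d0c9",
    "ghcr.io/mlsys-io/flowmesh_worker:v0.1.1-cpu":
        "sha256:293bf7999a6a0099e7075ef781947cc11b80196791a5f195b62678787591660c",
}

_HEX = set(string.hexdigits.lower())


def pinned_reference(image: str, digest: str) -> str:
    """Return a digest-pinned reference (hypothetical helper, not a FlowMesh API)."""
    algo, _, hex_part = digest.partition(":")
    # A sha256 digest is exactly 64 lowercase hexadecimal characters.
    if algo != "sha256" or len(hex_part) != 64 or not set(hex_part) <= _HEX:
        raise ValueError(f"not a sha256 digest: {digest!r}")
    return f"{image}@{digest}"


if __name__ == "__main__":
    for image, digest in IMAGES.items():
        print(pinned_reference(image, digest))
```

Pinning by digest means a later re-push of the same tag cannot silently change what gets deployed.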
v0.1.0
What's Changed
- feat: add ignore_unreachable to NodeClient.destroy_all_workers by @timzsu in #2
- chore: switch CI to self-hosted runners by @timzsu in #4
- chore: speed up CI (cancel superseded runs + pre-built runner image) by @timzsu in #7
- chore: add zizmor security audit; harden existing workflow permissions by @timzsu in #5
- chore: add gitleaks (CI job + pre-commit hook) by @timzsu in #9
- chore: add bandit Python source security audit by @timzsu in #10
- chore: add pip-audit dependency CVE scan by @timzsu in #11
- feat: workflow trace via OTel spans + analyzer by @timzsu in #3
- refactor: in-process torchrun + DeepSpeed launchers; fix multi-GPU SFT/DPO/PPO spawn by @timzsu in #12
- fix: close merged-child result race; rename legacy responses.json by @timzsu in #13
- docs: split AGENTS.md into separate documents by @timzsu in #14
- refactor: remove legacy CLI runtime helpers and fix GPU worker count by @kaiitunnz in #16
- fix: resolve local destination hang and unreachable pigz in worker images by @kaiitunnz in #17
- refactor: scope stack docker object names to avoid collision on shared hosts by @kaiitunnz in #20
- chore: bump astral-sh/setup-uv from 94527f2e458b27549849d47d273a16bec83a01e9 to 37802adc94f370d6bfd71619e3f0bf239e1f3b78 by @dependabot[bot] in #21
- fix: unblock PPO multi-GPU training by @kaiitunnz in #18
- feat: data agent connector for lumid.data by @timzsu in #19
- fix: ignore diffusers GHSA-98h9-4798-4q5v in pip-audit by @timzsu in #24
- feat: support aarch64 stack and worker images by @kaiitunnz in #30
- feat: add flowmesh-hook plugin contract package and server hooks by @kaiitunnz in #22
- refactor: prepare lightweight PyPI package publishing by @kaiitunnz in #25
- fix: support LoRA archive URL outputs by @timzsu in #29
- refactor: move templates under examples by @timzsu in #32
- [BREAKING] refactor: rename GpuPlatformInfo.gpus to .devices by @kaiitunnz in #33
- perf: parallelize supervisor command dispatch by @kaiitunnz in #31
- refactor: clean up legacy template fields and surface remaining knobs by @timzsu in #34
- perf: split stack build/push buildx driver and cache strategy by @kaiitunnz in #36
- feat: wire training.target_kl + training.early_stopping by @timzsu in #37
- fix: make multi-node stack deployment work by @kaiitunnz in #35
- chore: add PyPI release workflow by @kaiitunnz in #26
- feat: add flowmesh stack bundle init by @kaiitunnz in #38
New Contributors
- @timzsu made their first contribution in #2
- @dependabot[bot] made their first contribution in #21
Known issue
v0.1.0 has a bug where worker-node supervisor registration can fail against a root server using an IdentityProvider plugin chain, because the registration request may be sent without the API key authorization header. This was fixed in #43 and is included in v0.1.1. If you use multi-node deployments with auth enabled, use v0.1.1 or later.
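A deployment script could guard against the affected release with a simple version comparison. This is a minimal sketch under stated assumptions: `parse_version` and `has_registration_fix` are hypothetical helpers, not FlowMesh APIs, and the tag is assumed to follow the plain `vMAJOR.MINOR.PATCH` form used on this page.

```python
from typing import Tuple


def parse_version(tag: str) -> Tuple[int, ...]:
    """Turn a tag such as 'v0.1.1' into a comparable tuple like (0, 1, 1)."""
    bare = tag[1:] if tag.startswith("v") else tag
    return tuple(int(part) for part in bare.split("."))


# The registration fix (#43) first shipped in v0.1.1.
FIXED_IN = parse_version("v0.1.1")


def has_registration_fix(tag: str) -> bool:
    """Return True if the given release tag includes the fix from #43."""
    return parse_version(tag) >= FIXED_IN


if __name__ == "__main__":
    for tag in ("v0.1.0", "v0.1.1"):
        print(tag, has_registration_fix(tag))
```

Comparing integer tuples rather than strings keeps the ordering correct for multi-digit components, for example `v0.1.10` sorts after `v0.1.9`.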
Full Changelog: https://github.com/mlsys-io/FlowMesh/commits/v0.1.0