Skip to content

Releases: mlsys-io/FlowMesh

v0.1.1

13 May 12:17
4ca664e

Choose a tag to compare

What's Changed

  • chore: run release workflows on self-hosted runners by @kaiitunnz in #39
  • chore: bump astral-sh/setup-uv from 7.6.0 to 8.1.0 by @dependabot[bot] in #40
  • chore: bump actions/download-artifact from 4.3.0 to 8.0.1 by @dependabot[bot] in #41
  • chore: bump actions/upload-artifact from 4.6.2 to 7.0.1 by @dependabot[bot] in #42
  • fix: send api key on worker node registration by @kaiitunnz in #43
  • fix: drop redundant main fetch in release workflows by @kaiitunnz in #45
  • chore: prepare v0.1.1 release by @kaiitunnz in #46

Full Changelog: v0.1.0...v0.1.1

FlowMesh container images for v0.1.1

Image Platforms Digest
ghcr.io/mlsys-io/flowmesh_server:v0.1.1 linux/amd64,linux/arm64 sha256:b5185f4c7df71874362c2e8f1d09b6baf08984ca4dfed1b4475aa6194a30d0c9
ghcr.io/mlsys-io/flowmesh_worker:v0.1.1-cpu linux/amd64,linux/arm64 sha256:293bf7999a6a0099e7075ef781947cc11b80196791a5f195b62678787591660c
ghcr.io/mlsys-io/flowmesh_worker_builder:v0.1.1-gpu linux/amd64,linux/arm64 sha256:b342971dce7c8505f0206324938caef578a0fe41b9f54c8fb9f8488b3eb9627b
ghcr.io/mlsys-io/flowmesh_worker:v0.1.1-gpu linux/amd64,linux/arm64 sha256:1927f92279ec7d645191f916ac44eabfd6d09cc994732900da8bf2f4ae20cbe0
ghcr.io/mlsys-io/flowmesh_ssh:v0.1.1-cpu linux/amd64,linux/arm64 sha256:955c871dd7dcb55a18a16e509cfa8e50eb8002d650902c55d0ab72c93fa24b11
ghcr.io/mlsys-io/flowmesh_ssh:v0.1.1-gpu linux/amd64,linux/arm64 sha256:ef5d0e041c97ce17ea9f3741ef534d365dd509c4f4e2d211a1a1eaf1e9d9781e

v0.1.0

13 May 03:44
49a8fd2

Choose a tag to compare

What's Changed

  • feat: add ignore_unreachable to NodeClient.destroy_all_workers by @timzsu in #2
  • chore: switch CI to self-hosted runners by @timzsu in #4
  • chore: speed up CI (cancel superseded runs + pre-built runner image) by @timzsu in #7
  • chore: add zizmor security audit; harden existing workflow permissions by @timzsu in #5
  • chore: add gitleaks (CI job + pre-commit hook) by @timzsu in #9
  • chore: add bandit Python source security audit by @timzsu in #10
  • chore: add pip-audit dependency CVE scan by @timzsu in #11
  • feat: workflow trace via OTel spans + analyzer by @timzsu in #3
  • refactor: in-process torchrun + DeepSpeed launchers; fix multi-GPU SFT/DPO/PPO spawn by @timzsu in #12
  • fix: close merged-child result race; rename legacy responses.json by @timzsu in #13
  • docs: split AGENTS.md into separate documents by @timzsu in #14
  • refactor: remove legacy CLI runtime helpers and fix GPU worker count by @kaiitunnz in #16
  • fix: resolve local destination hang and unreachable pigz in worker images by @kaiitunnz in #17
  • refactor: scope stack docker object names to avoid collision on shared hosts by @kaiitunnz in #20
  • chore: bump astral-sh/setup-uv from 94527f2e458b27549849d47d273a16bec83a01e9 to 37802adc94f370d6bfd71619e3f0bf239e1f3b78 by @dependabot[bot] in #21
  • fix: unblock PPO multi-GPU training by @kaiitunnz in #18
  • feat: data agent connector for lumid.data by @timzsu in #19
  • fix: ignore diffusers GHSA-98h9-4798-4q5v in pip-audit by @timzsu in #24
  • feat: support aarch64 stack and worker images by @kaiitunnz in #30
  • feat: add flowmesh-hook plugin contract package and server hooks by @kaiitunnz in #22
  • refactor: prepare lightweight PyPI package publishing by @kaiitunnz in #25
  • fix: support LoRA archive URL outputs by @timzsu in #29
  • refactor: move templates under examples by @timzsu in #32
  • [BREAKING] refactor: rename GpuPlatformInfo.gpus to .devices by @kaiitunnz in #33
  • perf: parallelize supervisor command dispatch by @kaiitunnz in #31
  • refactor: clean up legacy template fields and surface remaining knobs by @timzsu in #34
  • perf: split stack build/push buildx driver and cache strategy by @kaiitunnz in #36
  • feat: wire training.target_kl + training.early_stopping by @timzsu in #37
  • fix: make multi-node stack deployment work by @kaiitunnz in #35
  • chore: add PyPI release workflow by @kaiitunnz in #26
  • feat: add flowmesh stack bundle init by @kaiitunnz in #38

New Contributors

Known issue

v0.1.0 has a bug where worker-node supervisor registration can fail against a root server using an IdentityProvider plugin chain, because the registration request may be sent without the API key authorization header. This was fixed in #43 and is included in v0.1.1. If you use multi-node deployments with auth enabled, use v0.1.1 or later.

Full Changelog: https://github.com/mlsys-io/FlowMesh/commits/v0.1.0