distributed-inferencing

Here are 3 public repositories matching this topic...

hou-yz / pytorch-pruning-2step

Two-stage pruning to favor distributed inference: the local device computes half of the model, then uploads the intermediate features for further computation on stronger devices or in the cloud.

pytorch pruning 2-step distributed-inferencing

Updated May 31, 2018
Python
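
The split described above (the device runs the front of the model and ships the features onward) can be sketched in a few lines. This is only an illustration, not code from the repository: the toy `MODEL`, the helpers `device_forward` and `server_forward`, and the JSON wire format are all assumptions made for the example, and plain Python lists stand in for tensors.

```python
import json
from typing import Callable, List, Sequence

# A "layer" here is any function from a feature vector to a feature vector.
Layer = Callable[[List[float]], List[float]]


def scale(factor: float) -> Layer:
    """Hypothetical stand-in for a learned layer: multiply every feature."""
    return lambda xs: [factor * x for x in xs]


def relu(xs: List[float]) -> List[float]:
    return [max(0.0, x) for x in xs]


def run(layers: Sequence[Layer], xs: List[float]) -> List[float]:
    """Apply the layers in order."""
    for layer in layers:
        xs = layer(xs)
    return xs


def device_forward(layers: Sequence[Layer], split: int, xs: List[float]) -> bytes:
    """Run layers[:split] locally and serialize the intermediate features."""
    features = run(layers[:split], xs)
    return json.dumps(features).encode("utf-8")  # assumed wire format


def server_forward(layers: Sequence[Layer], split: int, payload: bytes) -> List[float]:
    """Deserialize the uploaded features and run the remaining layers."""
    features = json.loads(payload.decode("utf-8"))
    return run(layers[split:], features)


# Toy four-layer model, split in the middle.
MODEL: List[Layer] = [scale(2.0), relu, scale(0.5), relu]

if __name__ == "__main__":
    x = [-1.0, 3.0]
    payload = device_forward(MODEL, 2, x)      # computed on the local device
    result = server_forward(MODEL, 2, payload)  # computed on the stronger node
    print(result == run(MODEL, x))
```

The split result matches a single-machine forward pass; what changes is where each half runs and how many bytes cross the network, which is why shrinking the uploaded features through pruning matters in this setting.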

DJStompZone / ShadowSWARM

ShadowSWARM is a streamlined framework for setting up a multi-node, GPU-accelerated distributed system for PyTorch workloads using Docker Swarm.

docker-swarm multi-gpu distributed-inferencing streamlit llm-inference fdsp

Updated Jan 14, 2025
Shell

Akuien / DNN-partitioning-and-Oflloading-framework-REAP

An adaptive AI task partitioning and offloading framework for distributed DNN inference across Raspberry Pi, laptop, and GPU desktop nodes, optimizing energy consumption and latency in a heterogeneous end-edge-cloud environment.

iot internet-of-things edge-computing computing-continuum distributed-inferencing dnn-partitioning task-oflloading

Updated Jun 10, 2026
Python
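
To make the latency and energy trade-off behind partitioning concrete, here is a minimal sketch of choosing a split point from per-layer profiles. It is not the REAP algorithm: the `LayerProfile` fields, the `best_split` helper, the weighted-sum objective, and every number are hypothetical, and the exhaustive search only works for a single linear chain of layers with one offload hop.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class LayerProfile:
    device_ms: float  # time to run this layer on the end device
    server_ms: float  # time to run this layer on the remote node
    output_kb: float  # size of this layer's output


def best_split(
    layers: List[LayerProfile],
    input_kb: float,
    bandwidth_kb_per_ms: float,
    device_watts: float,
    radio_watts: float,
    alpha: float = 0.5,
) -> Tuple[int, float]:
    """Return (k, cost): run layers[:k] on the device and layers[k:] remotely.

    cost = alpha * latency_ms + (1 - alpha) * device_energy_mj.
    The weighted sum mixes units on purpose; alpha is a tuning knob
    chosen for this example, not a physical constant.
    """
    best_k, best_cost = 0, float("inf")
    for k in range(len(layers) + 1):
        device_ms = sum(layer.device_ms for layer in layers[:k])
        server_ms = sum(layer.server_ms for layer in layers[k:])
        if k == len(layers):
            transfer_ms = 0.0  # everything ran locally, nothing to upload
        else:
            payload_kb = input_kb if k == 0 else layers[k - 1].output_kb
            transfer_ms = payload_kb / bandwidth_kb_per_ms
        latency_ms = device_ms + transfer_ms + server_ms
        # Watts times milliseconds gives millijoules spent on the device.
        energy_mj = device_watts * device_ms + radio_watts * transfer_ms
        cost = alpha * latency_ms + (1 - alpha) * energy_mj
        if cost < best_cost:
            best_k, best_cost = k, cost
    return best_k, best_cost


if __name__ == "__main__":
    profile = [
        LayerProfile(device_ms=10, server_ms=1, output_kb=50),
        LayerProfile(device_ms=20, server_ms=2, output_kb=5),
        LayerProfile(device_ms=40, server_ms=4, output_kb=1),
    ]
    print(best_split(profile, input_kb=100, bandwidth_kb_per_ms=1.0,
                     device_watts=2.0, radio_watts=1.0))
```

With these made-up profiles the cheapest option is to run the first two layers locally: the raw input is expensive to upload, while the last layer is expensive to compute on the device. An adaptive framework would re-evaluate such a choice as bandwidth and load change.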
