2-stage pruning to favor distributed inference (local device compute half of the model, upload the feature for further computing on stronger devices or cloud).
-
Updated
May 31, 2018 - Python
2-stage pruning to favor distributed inference (local device compute half of the model, upload the feature for further computing on stronger devices or cloud).
ShadowSWARM is a streamlined framework for setting up a multi-node, GPU-accelerated, distributed system for PyTorch workloads using Docker Swarm
An adaptive AI task partitioning and offloading framework for distributed DNN inference across Raspberry Pi, laptop, and GPU desktop nodes, optimizing energy consumption and latency in a heterogeneous end-edge-cloud environment.
Add a description, image, and links to the distributed-inferencing topic page so that developers can more easily learn about it.
To associate your repository with the distributed-inferencing topic, visit your repo's landing page and select "manage topics."