BOER: Enhancing Resource Utilization for Deep Learning Inference with Hybrid Spatial GPU Sharing

What is BOER?

BOER an inference-serving system that integrates MPS atop MIG partitions while preserving QoS. Also, BOER employs a robust Bayesian optimization framework to efficiently configure MPS and accelerates the optimization process.

Software

Refer to environment.yml

Run the following commands:

$conda env create -f environment.yml
$python -m spacy download de_core_news_sm
$python -m spacy download en-core-web-sm

Dependencies

Hardware

2 servers, each equipped with 2 * Intel Xeon Gold 5320 CPUs, 4 * NVIDIA A100 80GB GPUs, and

CUDA version 12.6
Ubuntu 22.04

How to run

Assume there are three tasks: model1, model2, and model3, and their loads are load1, load2, load3.

Before the official run, execute the following command on each node:

$python node/worker.py

Then, run the following command on the master node:

$python scheduler.py --task model1,load1,model2,load2,model3,load3

For example, you can run:

$python scheduler.py --task resnet50,500,resnet152,300,bert,100

BOER will try to deploy these services using the least amount of resources.

Citation

If you find this work useful in your research, please cite our paper:

@inproceedings{zhang2025boer,
  author    = {Bowen Zhang and Yuhang Wang and Zhuozhao Li},
  title     = {BOER: Enhancing Resource Utilization for Deep Learning Inference with Hybrid Spatial GPU Sharing},
  booktitle = {Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis (SC '25)},
  year      = {2025},
  location  = {St. Louis, MO, USA},
  doi       = {10.1145/3712285.3759857}
}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
grpc_tool		grpc_tool
jobs		jobs
node		node
script		script
util		util
README.md		README.md
bayesian_pruning.py		bayesian_pruning.py
environment.yml		environment.yml
scheduler.py		scheduler.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BOER: Enhancing Resource Utilization for Deep Learning Inference with Hybrid Spatial GPU Sharing

What is BOER?

Software

Dependencies

Hardware

How to run

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

BOER: Enhancing Resource Utilization for Deep Learning Inference with Hybrid Spatial GPU Sharing

What is BOER?

Software

Dependencies

Hardware

How to run

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages