Skip to content

Css22/BOER

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BOER: Enhancing Resource Utilization for Deep Learning Inference with Hybrid Spatial GPU Sharing

What is BOER?

BOER an inference-serving system that integrates MPS atop MIG partitions while preserving QoS. Also, BOER employs a robust Bayesian optimization framework to efficiently configure MPS and accelerates the optimization process.

Software

Refer to environment.yml

Run the following commands:

$conda env create -f environment.yml
$python -m spacy download de_core_news_sm
$python -m spacy download en-core-web-sm

Dependencies

Hardware

2 servers, each equipped with 2 * Intel Xeon Gold 5320 CPUs, 4 * NVIDIA A100 80GB GPUs, and

  • CUDA version 12.6
  • Ubuntu 22.04

How to run

Assume there are three tasks: model1, model2, and model3, and their loads are load1, load2, load3.

Before the official run, execute the following command on each node:

$python node/worker.py

Then, run the following command on the master node:

$python scheduler.py --task model1,load1,model2,load2,model3,load3

For example, you can run:

$python scheduler.py --task resnet50,500,resnet152,300,bert,100

BOER will try to deploy these services using the least amount of resources.

Citation

If you find this work useful in your research, please cite our paper:

@inproceedings{zhang2025boer,
  author    = {Bowen Zhang and Yuhang Wang and Zhuozhao Li},
  title     = {BOER: Enhancing Resource Utilization for Deep Learning Inference with Hybrid Spatial GPU Sharing},
  booktitle = {Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis (SC '25)},
  year      = {2025},
  location  = {St. Louis, MO, USA},
  doi       = {10.1145/3712285.3759857}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors