This repository contains helper Dockerfiles and build scripts to produce TensorFlow Python wheels built with NVIDIA TensorRT support for specific Python, CUDA, and cuDNN combinations.
This project was born out of my work converting Keras models to TensorRT models on an HPC running Rocky Linux 9 (hence why the Docker images are all Rocky Linux 9). I initially went with Ubuntu images, but the wheels they produced were not compatible with the GLIBC runtime of the HPC. That HPC is the only system I have tested this project with, so the wheels may well not work on other systems. Similarly, there is no ARM support (e.g. Jetson) right now, although I will be working on that!
This is not feature complete, nor is it intended to be. Contributions are welcome. You are largely on your own when fixing issues, but considering that I fixed most of mine with ChatGPT, it's not too hard.
The TensorRT version I opted to use is the latest offered by NVIDIA that is still compatible with TensorFlow (8.6.1). TensorRT versions > 9 would require manual patching. Help wanted with that!
Prerequisites
- Linux host with recent Docker installed (no GPU is needed to build)
- Sufficient disk space and memory for building TensorFlow (tens of GB of disk space and at least 16 GB of RAM)
- Lots of time!!!! (4-8 hours)
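As a rough sanity check before starting a multi-hour build, the host's free disk and RAM can be inspected from Python. The thresholds below are taken from the prerequisites above (50 GB standing in for "tens of GBs"); they are approximate guides, not hard limits:

```python
import os
import shutil

# Approximate thresholds from the prerequisites list above.
MIN_DISK_GB = 50   # "tens of GBs" of free disk space
MIN_RAM_GB = 16    # at least 16 GB of RAM

def check_build_host(path="."):
    """Return a list of warnings if the host looks too small for a TF build."""
    warnings = []
    free_gb = shutil.disk_usage(path).free / 1e9
    if free_gb < MIN_DISK_GB:
        warnings.append(f"only {free_gb:.0f} GB free on {path!r}")
    # os.sysconf is POSIX-only, which matches the Linux-host prerequisite.
    ram_gb = os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES") / 1e9
    if ram_gb < MIN_RAM_GB:
        warnings.append(f"only {ram_gb:.0f} GB of RAM")
    return warnings

print(check_build_host() or "host looks big enough")
```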
Usage
- Choose the configuration directory that matches the TensorFlow/Python/CUDA/cuDNN combination you want. Example:
cd 2.16.2/tf-cuda12.1-cudnn8.9-TRT-8.6.1/
- Build the docker image:
docker build -t build_tf -f Dockerfile .
- Start a container from the image:
docker run --name TFBUILD -it build_tf /bin/bash
- Start the TensorFlow build:
./build_tf.sh
The build_tf.sh script executes inside the Docker container to build TensorFlow. Edit it if you want to change something about the build configuration; you will need to rebuild the Docker image after making changes.
- Export the built wheel:
docker cp TFBUILD:/output ./
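The steps above can be scripted end to end. This is a minimal sketch: the image name, container name, and /output path come from the commands above, while the wrapper functions themselves are hypothetical (and the build is run non-interactively instead of through an interactive shell):

```python
import subprocess

def docker_steps(config_dir):
    """Return the docker commands for one build, in order (not executed here)."""
    return [
        ["docker", "build", "-t", "build_tf", "-f", "Dockerfile", config_dir],
        # Run build_tf.sh directly instead of an interactive /bin/bash session.
        ["docker", "run", "--name", "TFBUILD", "build_tf", "./build_tf.sh"],
        ["docker", "cp", "TFBUILD:/output", "./"],
    ]

def run_all(config_dir):
    for cmd in docker_steps(config_dir):
        subprocess.run(cmd, check=True)  # stop at the first failure

# run_all("2.16.2/tf-cuda12.1-cudnn8.9-TRT-8.6.1/")
```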
- Check the wheel (install it, then run the following in a Python interpreter):
pip install <your built wheel file>
import tensorflow as tf
print(tf.__version__)
print(tf.sysconfig.get_build_info())
print(tf.config.list_physical_devices('GPU'))
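Before even installing, the wheel's filename tags can be compared against the target interpreter; a mismatched Python tag is a common reason pip refuses a locally built wheel. This sketch relies only on the standard wheel naming convention (name-version-pythontag-abitag-platformtag.whl); the helper function is hypothetical:

```python
import sys

def wheel_matches_interpreter(wheel_name):
    """Check a wheel filename's cpXY Python tag against the running interpreter.

    Follows the PEP 427 naming scheme: name-version-pytag-abitag-platform.whl
    """
    parts = wheel_name[:-len(".whl")].split("-")
    py_tag = parts[-3]                      # e.g. "cp311"
    want = f"cp{sys.version_info.major}{sys.version_info.minor}"
    return py_tag == want

# Example with a hypothetical filename:
print(wheel_matches_interpreter(
    "tensorflow-2.16.2-cp311-cp311-linux_x86_64.whl"))
```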
Customization
- To change build flags or add dependencies, edit the Dockerfile and build_tf.sh in the chosen directory.
- Add or change Bazel flags in the build steps to tune performance or enable/disable features.
- These builds are configured for compute capability 8.6. To change this for your GPU, set the TF_CUDA_COMPUTE_CAPABILITIES environment variable in build_tf.sh. NVIDIA publishes a chart of the corresponding compute capability for each GPU.
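For quick reference, here are a few common data-center and consumer GPUs with their compute capabilities. This is a deliberately partial table (the values match NVIDIA's chart, but check it for your exact card), and the helper function is just an illustration of building a multi-GPU value:

```python
# Partial map of GPU model -> CUDA compute capability (see NVIDIA's chart).
COMPUTE_CAPABILITY = {
    "Tesla V100": "7.0",
    "Tesla T4": "7.5",
    "A100": "8.0",
    "RTX 3090": "8.6",   # same as the 8.6 default used by these builds
    "RTX 4090": "8.9",
    "H100": "9.0",
}

def compute_capabilities_value(*gpus):
    """Build a comma-separated capability list covering several target GPUs."""
    return ",".join(sorted({COMPUTE_CAPABILITY[g] for g in gpus}))

print(compute_capabilities_value("A100", "RTX 3090"))  # -> "8.0,8.6"
```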
Troubleshooting
- If Docker build fails with CUDA or driver errors, ensure your host NVIDIA driver is compatible with the CUDA runtime used by the image.
- Out-of-memory errors: increase Docker build resources or swap; building TensorFlow can be memory intensive. Try passing --jobs=x to Bazel, where x is the number of cores to use when building; this dramatically reduces memory usage.
- If TensorRT symbols are missing at runtime (TF-TRT Warning: Could not find TensorRT), run python test-tf.py and check your CUDA and cuDNN versions. If they aren't found, install them. If you see any 'not found' errors when loading .so libraries, make sure they are on your $LD_LIBRARY_PATH.
- Unlike pip install tensorflow[and-cuda], which bundles CUDA and cuDNN internally, these wheels do not include the CUDA or cuDNN libraries; those must be installed from NVIDIA. (CUDA is often already installed, but you may be missing the cuDNN libraries.)
- It goes without saying that you also have to download the TensorRT libraries. Make sure they are the same versions you built TensorFlow against, and that they are on your $LD_LIBRARY_PATH.
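A quick way to see whether the shared objects are actually reachable via $LD_LIBRARY_PATH before launching TensorFlow. The library stems below (libnvinfer for TensorRT, libcudnn, libcudart) are the usual names; adjust them for the versions you built against, and note the helper function itself is hypothetical:

```python
import glob
import os

# Library stems to look for; adjust to the versions you built against.
WANTED = ["libnvinfer.so", "libnvinfer_plugin.so", "libcudnn.so", "libcudart.so"]

def find_missing_libs(env=os.environ):
    """Return the library stems not found in any $LD_LIBRARY_PATH directory."""
    dirs = [d for d in env.get("LD_LIBRARY_PATH", "").split(":") if d]
    missing = []
    for lib in WANTED:
        hits = [p for d in dirs for p in glob.glob(os.path.join(d, lib + "*"))]
        if not hits:
            missing.append(lib)
    return missing

print(find_missing_libs() or "all libraries found on LD_LIBRARY_PATH")
```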
Contributing configurations
- Add a new folder following the existing naming pattern for additional Python/TF/CUDA/cuDNN/TRT combinations and include a Dockerfile and build_tf.sh.