The TensorRT execution provider in ONNX Runtime makes use of NVIDIA's TensorRT deep learning inference engine to accelerate ONNX models on NVIDIA GPUs. Microsoft and NVIDIA worked closely to integrate the TensorRT execution provider with ONNX Runtime.
This execution provider is currently released in preview, but we have validated support for all the ONNX models in the ONNX Model Zoo. With the TensorRT execution provider, ONNX Runtime delivers better inference performance on the same hardware compared to generic GPU acceleration.
Developers can now tap into the power of TensorRT through ONNX Runtime to accelerate inferencing of ONNX models. Instructions to build the TensorRT execution provider from source are available here.
The TensorRT execution provider needs to be registered with ONNX Runtime to enable it in the inference session:
// Assumes session options `so` and the model path `model_file_name` are already set up.
InferenceSession session_object{so};
// Register the TensorRT execution provider before loading the model.
session_object.RegisterExecutionProvider(std::make_unique<::onnxruntime::TensorrtExecutionProvider>());
status = session_object.Load(model_file_name);
The C API details are here.
When using the Python wheel from an ONNX Runtime build that includes the TensorRT execution provider, it will automatically be prioritized over the default GPU or CPU execution providers; there is no need to register the execution provider separately. Python API details are here.
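For example, a plain inference session is enough. The following is a minimal sketch; the model path and input shape are placeholders that depend on your model:

import numpy as np
import onnxruntime as ort

# With a TensorRT-enabled wheel installed, this session runs on the TensorRT
# execution provider without any extra registration code.
sess = ort.InferenceSession("model.onnx")
input_name = sess.get_inputs()[0].name
dummy_input = np.random.rand(1, 3, 224, 224).astype(np.float32)  # placeholder shape/dtype
outputs = sess.run(None, {input_name: dummy_input})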
You can test the performance of your ONNX model with the TensorRT execution provider by passing the flag -e tensorrt to onnxruntime_perf_test.
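For example (a sketch; the exact arguments may vary by version, and the model and result file paths are placeholders):

./onnxruntime_perf_test -e tensorrt /path/to/model.onnx perf_results.txt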