
Upgrade to CUDA10.2 for TensorRT #3084

Merged
merged 10 commits into from
Feb 25, 2020

Conversation

stevenlix
Contributor

  1. Upgrade from CUDA 10.0 to CUDA 10.2, since TensorRT doesn't ship a CUDA 10.1 package
  2. Check whether TensorRT subgraph inputs have shapes specified; if not, remind the user to run offline shape inference first
  3. Enable more unit tests for TensorRT as a result of bug fixes in the parser
  4. Provide optimization profiles for dynamic-shape inputs only
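Item 4 above can be sketched as follows. This is a toy illustration using plain Python data as a stand-in for real input shape metadata, not the actual TensorRT EP code; the helper names are hypothetical.

```python
# Toy sketch: build optimization-profile entries only for inputs whose
# shapes contain dynamic (unknown) dimensions, per item 4 above.
# Stand-in data structures; not the real TensorRT EP implementation.

def is_dynamic(shape):
    """A dimension of -1 (or None) stands in for an unknown/dynamic dim."""
    return any(d is None or d < 0 for d in shape)

def select_profile_inputs(input_shapes):
    """Return only the inputs that need min/opt/max profile shapes."""
    return {name: shape for name, shape in input_shapes.items()
            if is_dynamic(shape)}

inputs = {
    "images": [-1, 3, 224, 224],   # dynamic batch dimension
    "scale":  [1],                 # fully static shape
}
print(select_profile_inputs(inputs))  # only "images" needs a profile
```

Static-shape inputs are skipped entirely, so the profile stays minimal.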

@stevenlix stevenlix requested review from jywu-msft and a team February 24, 2020 23:11
@@ -19,6 +19,8 @@ status = session_object.Load(model_file_name);
```
The C API details are [here](../C_API.md#c-api).

If certain operators in the model are not supported by TensorRT, ONNX Runtime will partition the graph and only send supported subgraphs to the TensorRT execution provider. Because TensorRT requires all inputs of the subgraph to have shapes specified, ONNX Runtime will throw an error if any input lacks a shape. In this case, please run shape inference on the model first using the script [here](https://github.com/microsoft/onnxruntime/blob/master/onnxruntime/core/providers/nuphar/scripts/symbolic_shape_infer.py).
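The shape check described above can be sketched like this. It uses plain dicts as stand-ins for ONNX value_info entries, not the real parser code, and the error text only mirrors the message added in this PR.

```python
# Sketch of the shape check described above: verify every subgraph input
# has a fully specified shape before handing the subgraph to TensorRT.
# Plain dicts stand in for ONNX value_info; not the real TRT EP code.

def has_full_shape(shape):
    """Treat only concrete positive integer dims as 'shape specified'."""
    return shape is not None and all(isinstance(d, int) and d > 0 for d in shape)

def check_inputs(inputs):
    missing = [name for name, shape in inputs.items() if not has_full_shape(shape)]
    if missing:
        raise ValueError(
            "TensorRT input(s) %s have no shape specified. "
            "Please run shape inference on the ONNX model first."
            % ", ".join(missing))

check_inputs({"x": [1, 3, 224, 224]})   # passes silently
try:
    check_inputs({"y": None})           # no shape recorded -> error
except ValueError as e:
    print(e)
```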
Member

Can you add `## Symbolic Shape Inference` at line 21?
This will allow one to create a permalink to the section with the instructions.

```cpp
ORT_THROW_IF_ERROR(ORT_MAKE_STATUS(ONNXRUNTIME, EP_FAIL,
    "TensorRT input: " + input_arg->Name() + " has no shape specified. " +
    "Please run shape inference on the onnx model first. Details can be found in " +
    "https://github.com/microsoft/onnxruntime/blob/master/docs/execution_providers/TensorRT-ExecutionProvider.md"));
```
Member

If you add a `##` header as suggested above, you can provide a direct link to the exact section pertaining to the shape-inference script, rather than to the entire TRT EP document.

Member

We might consider returning FAIL instead of EP_FAIL here to make it a fatal error.
Otherwise, under Python it would retry with the CPU provider and succeed, and the user might not notice this error.
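The fallback concern raised above can be illustrated with a toy sketch. This mimics, but does not reproduce, ONNX Runtime's provider fallback; the class and function names are hypothetical.

```python
# Toy illustration of the concern above: when a provider reports a
# recoverable failure, the runtime silently falls back to the next
# provider, so the user may never see the first provider's error.

class EPFail(Exception):
    """Stands in for a recoverable EP_FAIL status."""

def run_with_fallback(providers, model):
    errors = []
    for name, run in providers:
        try:
            return run(model)
        except EPFail as e:
            errors.append((name, str(e)))   # swallowed; execution continues
    raise RuntimeError("all providers failed: %r" % errors)

def trt(model):
    raise EPFail("input has no shape specified")

def cpu(model):
    return "ok"

# TensorRT's error is hidden because the CPU provider succeeds.
print(run_with_fallback([("TensorRT", trt), ("CPU", cpu)], model=None))
```

A non-recoverable status (the suggested FAIL) would instead surface immediately instead of being appended to the swallowed-errors list.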

@@ -42,7 +42,7 @@ jobs:
displayName: 'Generate cmake config'
inputs:
scriptPath: '$(Build.SourcesDirectory)\tools\ci_build\build.py'
arguments: '--config $(BuildConfig) --build_dir $(Build.BinariesDirectory) --skip_submodule_sync --build_shared_lib --update --cmake_generator "Visual Studio 16 2019" --msvc_toolset 14.16 --build_wheel --enable_onnx_tests --use_tensorrt --tensorrt_home="C:\local\TensorRT-7.0.0.11.cuda-10.0.cudnn7.6\TensorRT-7.0.0.11" --cuda_version=10.0 --cuda_home="C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.0" --cudnn_home="C:\local\cudnn-10.0-windows10-x64-v7.6.5.32\cuda" --cmake_extra_defines CMAKE_SYSTEM_VERSION=10.0.18362.0'
arguments: '--config $(BuildConfig) --build_dir $(Build.BinariesDirectory) --skip_submodule_sync --build_shared_lib --update --cmake_generator "Visual Studio 16 2019" --msvc_toolset 14.16 --build_wheel --enable_onnx_tests --use_tensorrt --tensorrt_home="C:\local\TensorRT-7.0.0.11.cuda-10.2.cudnn7.6\TensorRT-7.0.0.11" --cuda_version=10.2 --cuda_home="C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.2" --cudnn_home="C:\local\cudnn-10.2-windows10-x64-v7.6.5.32\cuda" --cmake_extra_defines CMAKE_SYSTEM_VERSION=10.0.18362.0'
Member

If you update CUDA to 10.2, you'll need to add some code to build.py to update PATH to point to the CUDA 10.2 directory so it takes precedence over any other CUDA versions.
I suggest separating the CUDA 10.2 update into a different PR to undergo more testing.
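The suggested build.py change can be sketched as follows. This is an assumed approach, not the actual build.py code, and the install path shown is just the default Windows location quoted in the CI arguments above.

```python
# Sketch of the suggested change: prepend the CUDA 10.2 bin directory to
# PATH so it takes precedence over any other installed CUDA versions.
# Hypothetical helper; not actual build.py code.
import os

def prepend_cuda_to_path(env, cuda_home):
    cuda_bin = os.path.join(cuda_home, "bin")
    env["PATH"] = cuda_bin + os.pathsep + env.get("PATH", "")
    return env

env = {"PATH": r"C:\Windows\system32"}
prepend_cuda_to_path(
    env, r"C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.2")
# The CUDA 10.2 bin dir now comes first in PATH lookup order.
print(env["PATH"].split(os.pathsep)[0])
```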

@@ -81,7 +81,7 @@ jobs:
del wheel_filename_file
python.exe -m pip install -q --upgrade %WHEEL_FILENAME%
set PATH=$(Build.BinariesDirectory)\$(BuildConfig)\$(BuildConfig);%PATH%
python $(Build.SourcesDirectory)\tools\ci_build\build.py --config $(BuildConfig) --build_dir $(Build.BinariesDirectory) --skip_submodule_sync --build_shared_lib --test --cmake_generator "Visual Studio 16 2019" --msvc_toolset 14.16 --build_wheel --enable_onnx_tests --use_tensorrt --tensorrt_home="C:\local\TensorRT-7.0.0.11.cuda-10.0.cudnn7.6\TensorRT-7.0.0.11" --cuda_version=10.0 --cuda_home="C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.0" --cudnn_home="C:\local\cudnn-10.0-windows10-x64-v7.6.5.32\cuda" --cmake_extra_defines CMAKE_SYSTEM_VERSION=10.0.18362.0
python $(Build.SourcesDirectory)\tools\ci_build\build.py --config $(BuildConfig) --build_dir $(Build.BinariesDirectory) --skip_submodule_sync --build_shared_lib --test --cmake_generator "Visual Studio 16 2019" --msvc_toolset 14.16 --build_wheel --enable_onnx_tests --use_tensorrt --tensorrt_home="C:\local\TensorRT-7.0.0.11.cuda-10.2.cudnn7.6\TensorRT-7.0.0.11" --cuda_version=10.2 --cuda_home="C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.2" --cudnn_home="C:\local\cudnn-10.2-windows10-x64-v7.6.5.32\cuda" --cmake_extra_defines CMAKE_SYSTEM_VERSION=10.0.18362.0
Member

`--msvc_toolset 14.16` is not needed when the CUDA version is >= 10.1.
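The version gate implied by the comment above can be sketched like this. The helper is hypothetical; it only illustrates when the pinned-toolset flag would be emitted.

```python
# Sketch of the toolset logic implied above: only pin the MSVC 14.16
# toolset when building against CUDA older than 10.1.
# Hypothetical helper; not actual build.py code.

def msvc_toolset_args(cuda_version):
    major, minor = (int(p) for p in cuda_version.split(".")[:2])
    if (major, minor) < (10, 1):
        return ["--msvc_toolset", "14.16"]
    return []  # CUDA >= 10.1 builds with the default (newer) toolset

print(msvc_toolset_args("10.0"))  # ['--msvc_toolset', '14.16']
print(msvc_toolset_args("10.2"))  # []
```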

Member

@stevenlix, it looks like using a newer toolset encounters a build issue?
e.g.
Error C2127: 'getMatrixOp': illegal initialization of 'constexpr' entity with a non-constant expression
We/NVIDIA should fix these issues so we don't remain on the 14.16 MSVC toolset.

@faxu faxu added this to the 1.2 milestone Feb 25, 2020
Member

@jywu-msft jywu-msft left a comment

Please address the remaining review comments. Thanks!

@jywu-msft
Member

Seems like a build error with the TensorRT parser update?

jywu-msft previously approved these changes Feb 25, 2020
@jywu-msft jywu-msft merged commit f4a5d17 into master Feb 25, 2020
@jywu-msft jywu-msft deleted the stevenlix/cuda10-2 branch February 25, 2020 13:36
4 participants