Add TensorRT Reformat-Free I/O Support. #942


Conversation

Contributor

@nvpohanh nvpohanh commented Dec 9, 2019

Changes:

  • Fix build errors when gRPC and HTTP are disabled.
  • Add TensorRT Reformat-Free I/O Support.
    • Users need to specify padded dimensions in the model description file, since TRTIS assumes that byte_size equals the product of the dimensions in the model description.
    • Change the validation logic so that it does not throw an error, since the TensorRT engine reports unpadded dimensions.

@@ -166,6 +197,15 @@ CompareDimsSupported(
continue;
}

// Pad channel dimension if necessary.
Contributor

@tanmayv25 tanmayv25 Dec 9, 2019

Shouldn't we query the strides from the engine to get the total buffer size to allocate for the tensor? Does this formulation always return the exact byte size of the reformatted tensor? Also, what about cases with dynamic shapes?

Contributor Author

I think according to the TRT engine APIs, the engine always returns unpadded dimensions. The user needs to pad the C dimension when computing the buffer size. Also, I don't think there is a TRT API for getting strides.

Contributor

Can you point me to the documentation you are referring to? The getStrides() function is available per execution context, as the strides might change with the dimensions that are set. As per my initial investigation and talks with some TensorRT engineers, we are supposed to use the getBindingBytesPerComponent(), getBindingComponentsPerElement() and getStrides() APIs to obtain the correct byte size. If the engine returned padded dimensions, it would be much easier to obtain the byte size for the tensor.

Contributor Author

@tanmayv25 My bad. You are right. Let me fix this and use the more modern APIs.

Contributor Author

What would be your opinion on the dimensions in the TRTIS model description file? Should it be the unpadded or padded dimensions?

Contributor

@tanmayv25 tanmayv25 Dec 10, 2019

When using dynamic shapes, we would need unpadded dimensions to call setBindingDimensions() with, so it makes sense to keep unpadded dimensions in the model config file. If we can internally determine the maximum buffer size to allocate for the inputs within plan_backend, that would be great. Otherwise, a new field for padded dimensions can be added to the model config.
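
The second option could look roughly like this in a model config. This is a purely hypothetical sketch: `padded_dims` is not a real TRTIS model config field; it only illustrates keeping unpadded dims (for setBindingDimensions()) alongside an explicit padded shape for buffer allocation.

```protobuf
# Hypothetical sketch only: "padded_dims" is NOT an existing config field.
input [
  {
    name: "input_0"
    data_type: TYPE_FP32
    dims: [ 3, -1, -1 ]          # unpadded, passed to setBindingDimensions()
    # padded_dims: [ 32, -1, -1 ]  # padded channel dim for buffer sizing
  }
]
```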

Contributor Author

But when I tried using unpadded dimensions in the model config, I ran into some assertion errors in provider.cc that I didn't dare to change.

@GuanLuo I would propose that we close this PR for now, but keep in mind that this is a must-have feature if we end up using TRTIS. What do you think?

Contributor

Yeah, I agree that we should close the PR for now, as it requires some thinking on how to extend the model config so that it fits the different cases. Depending on the progress, I think we can at least support this for fixed input shapes, since the required information seems to be determined once the model is loaded.

Contributor

GuanLuo commented Dec 9, 2019

Can you split it into two separate PRs (build fix, and reformat-free I/O) so that they can be reviewed and merged independently?

@nvpohanh nvpohanh force-pushed the dev-pohanh-trt-format-fix branch from e6964fc to aa1f1e5 on December 9, 2019 at 23:20
@nvpohanh nvpohanh changed the title from "Add TensorRT Reformat-Free I/O Support." to "Fix build errors when gRPC and HTTP are disabled. Add TensorRT Reformat-Free I/O Support." Dec 9, 2019
Contributor Author

nvpohanh commented Dec 9, 2019

Split out the build error part to #943 . Keep this one only for TRT Reformat-free I/O support. Updated title.

@nvpohanh
Contributor Author

Closing this PR. Will file an internal tracker instead.

@nvpohanh nvpohanh closed this Dec 10, 2019