fix: expand dim for scalar numpy when freezing tensors to IConstantLayers #3251
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
When compiling
facebook/bart-base
with Torch-TensorRT, I encountered an error similar to the one in this issue, whereaten_ops.scatter.src
fails withinimpl.elementwise.eq
. Upon investigation, I found that the issue arises when a scalar Numpy value is used in a slice operation as a constant layer. The output of this constant layer, which has a shape of()
(scalar), is passed intoaten_ops.scatter.src
, causing the error. To fix this, I modified the shape to(1,)
to avoid the issue.I confirmed that after the modification, the TensorRT engine is successfully obtained using the Torch-TensorRT backend with the following code:
This code compares the model's outputs before and after applying Torch-TensorRT optimization.
Fixes # (issue)
Type of change
Please delete options that are not relevant and/or add your own.
Checklist: