
Consider adding support for DLA (Deep learning accelerator) modules #26

tehkillerbee opened this issue Nov 22, 2021 · 3 comments
tehkillerbee commented Nov 22, 2021

Some platforms, including the Jetson Xavier AGX and NX, support DLA modules. However, when converting a PyTorch module to TensorRT, the converter never attempts to use the DLA cores and always uses the GPU. This is apparent from the TensorRT log output:

[TensorRT] INFO: 
[TensorRT] INFO: --------------- Layers running on DLA: 
[TensorRT] INFO: 
[TensorRT] INFO: --------------- Layers running on GPU: 
[TensorRT] INFO: (Unnamed Layer* 26) [Convolution] + (Unnamed Layer* 28) [Activation], 
...

The DLA cores must be enabled by changing the IBuilderConfig
https://docs.nvidia.com/deeplearning/tensorrt/api/python_api/infer/Core/NetworkConfig.html#tensorrt.IBuilderConfig

A good example of how to do this is described by jkjung-avt in this issue:
jkjung-avt/tensorrt_demos#463

https://github.com/jkjung-avt/tensorrt_demos/blob/f53b5ae9b004489463a407d8e9b230f39230d051/yolo/onnx_to_tensorrt.py#L165-L170
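
In short (a minimal sketch, not the exact code from that repo; the builder/config names are illustrative), the relevant TensorRT Python API calls look something like this:

import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.INFO)
builder = trt.Builder(TRT_LOGGER)
config = builder.create_builder_config()

# Prefer the DLA for layers that support it, and fall back to the GPU for the rest
config.default_device_type = trt.DeviceType.DLA
config.DLA_core = 0  # Xavier has two DLA cores: 0 and 1
config.set_flag(trt.BuilderFlag.GPU_FALLBACK)
config.set_flag(trt.BuilderFlag.FP16)  # DLA only runs FP16/INT8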

@grimoire (Owner) commented

That sounds like a good idea! I will try it.
But it might take a while, since I need to find where I hid my Jetson Nano...


tehkillerbee commented Nov 22, 2021

@grimoire I think the DLA cores are only supported on the Jetson Xavier NX, the AGX series, and the more recent GPUs listed here.

I have been playing around with it today by adding the following lines here:

# Enable the DLA as the default device, with GPU fallback for unsupported layers
config.default_device_type = trt.DeviceType.DLA
config.DLA_core = 0
config.set_flag(trt.BuilderFlag.GPU_FALLBACK)
config.set_flag(trt.BuilderFlag.STRICT_TYPES)

After doing this, I can see that some layers are now running on the DLA, while other layers are incompatible. I also see a large number of warnings such as the one below, which is odd, since I am already using FP16 mode...

[TensorRT] WARNING: DLA only supports FP16 and Int8 precision type. Switching (Unnamed Layer* 1302) [Shape] device type to GPU.
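
As a side note, one way to check up front which layers TensorRT considers DLA-compatible is something like the sketch below (assuming access to the populated INetworkDefinition and the same config object as above):

for i in range(network.num_layers):
    layer = network.get_layer(i)
    if not config.can_run_on_DLA(layer):
        # These layers end up on the GPU when GPU_FALLBACK is set
        print("GPU fallback:", layer.name, layer.type)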

Unfortunately, the export process never finishes and crashes instead! I suspect it may be caused by the older JetPack version I am currently running, as I see that some DLA support was added in more recent JetPack versions.

python: ../rtSafe/cuda/cudaReformatRunner.cpp:37: nvinfer1::rt::cuda::ReformatRunner::ReformatRunner(nvinfer1::rt::DefaultRunnerParameters, const ReformatParameters&): Assertion `matchValidDims(defaultParams.inputs[0].extent, defaultParams.outputs[0].extent)' failed.
Aborted (core dumped)
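
(For reference, a quick, purely illustrative way to confirm which TensorRT release the installed JetPack provides:)

import tensorrt as trt
print(trt.__version__)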

I'll keep you updated with my progress! :)

@grimoire (Owner) commented

Cool! I will try to find a device that supports DLA.
If you find a way to add this, please share it with me. And of course, a PR is welcome!
