# Converting a model with MMDeploy

To speed up inference of a model trained with the MMDetection framework, you can optimize it with the MMDeploy tool. Keep in mind that this optimization is platform specific: the optimized model is different for each hardware setup (mostly the GPU).

1. Make sure your model can be converted to the ONNX and TensorRT formats.
2. Install the MMDeploy framework.
3. Convert the model (see the conversion sketch after this list).
4. Put the following files into `scripts/checkpoints`:
   1. the TensorRT `.engine` file
   2. the MMDetection config file
   3. the MMDeploy config file
   4. the TensorRT config file
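
A minimal conversion sketch is shown below. It drives MMDeploy's `tools/deploy.py` converter, which writes the ONNX model, the TensorRT `.engine` file and, with `--dump-info`, the SDK json files into the work directory. All paths are placeholders for illustration; pick the MMDeploy deploy config that matches your model type and input resolution.

```python
# Sketch: run MMDeploy's converter to produce an ONNX model and a TensorRT
# .engine file. Every path below is a placeholder -- point them at your own
# MMDetection config, trained checkpoint, and a sample image.
import subprocess

deploy_cfg = "mmdeploy/configs/mmdet/instance-seg/instance-seg_tensorrt_dynamic-320x320-1344x1344.py"
model_cfg = "configs/my_model/my_model_config.py"   # MMDetection config (placeholder)
checkpoint = "work_dirs/my_model/latest.pth"        # trained weights (placeholder)
demo_img = "demo/demo.jpg"                          # sample image used during conversion

subprocess.run(
    [
        "python", "mmdeploy/tools/deploy.py",
        deploy_cfg, model_cfg, checkpoint, demo_img,
        "--work-dir", "scripts/checkpoints",
        "--device", "cuda:0",   # TensorRT conversion must run on the target GPU
        "--dump-info",          # also writes the SDK/pipeline json files
    ],
    check=True,
)
```

After the run, copy the generated `.engine` file together with the config files listed above into `scripts/checkpoints` if they are not already there.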

## Inference

For the TensorRT framework the only file needed is the `.engine` file. In `models/all_model.py`, uncomment the import of `TensorRTWrapper` and change the `self.segm_model` wrapper to the imported one. For the MMDeploy framework all four files are needed.
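
For reference, the sketch below shows how an MMDeploy-backed inference call can look using `mmdeploy.apis.inference_model`. The file names are placeholders, and the project's own wrapper may additionally read its TensorRT config.

```python
# Sketch of running the converted model through MMDeploy's Python API,
# assuming the files listed above are in scripts/checkpoints. File names are
# placeholders; adjust them to the actual config and engine names.
from mmdeploy.apis import inference_model

result = inference_model(
    model_cfg="scripts/checkpoints/my_model_config.py",     # MMDetection config (placeholder)
    deploy_cfg="scripts/checkpoints/my_deploy_config.py",   # MMDeploy config (placeholder)
    backend_files=["scripts/checkpoints/end2end.engine"],   # TensorRT engine
    img="demo/demo.jpg",
    device="cuda:0",
)
```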

Comparing TensorRT and MMDeploy, the former is slightly faster (measured on a GTX 1050 Ti).

MMDeploy automatically postprocesses the result, so the output masks are much finer than with TensorRT, where you need to perform the postprocessing step yourself (this can most likely be fixed with proper processing).
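
A hedged illustration of such a postprocessing step is given below. It assumes the engine returns per-instance mask probabilities at a fixed low resolution (as in a Mask R-CNN head) plus bounding boxes; the exact output layout depends on how your model was exported, so treat this as a sketch only.

```python
# Illustrative postprocessing for raw TensorRT output: upscale a low-resolution
# instance mask to its bounding box and binarize it on a full-size canvas.
import cv2
import numpy as np

def postprocess_mask(mask_prob: np.ndarray, box: np.ndarray,
                     img_h: int, img_w: int, thr: float = 0.5) -> np.ndarray:
    """Upscale a low-resolution instance mask to its box and binarize it."""
    full = np.zeros((img_h, img_w), dtype=np.uint8)
    x1, y1, x2, y2 = box.astype(int)
    x1, y1 = max(x1, 0), max(y1, 0)
    x2, y2 = min(x2, img_w), min(y2, img_h)
    if x2 <= x1 or y2 <= y1:                            # degenerate box, nothing to paste
        return full
    mask = cv2.resize(mask_prob, (x2 - x1, y2 - y1))    # dsize is (width, height)
    full[y1:y2, x1:x2] = (mask > thr).astype(np.uint8)  # threshold into the canvas
    return full
```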