SSD MobileNet v1 quantized

Use case : Object detection

Model description

The mobilenet-ssd model is a Single-Shot multibox Detection (SSD) network intended to perform object detection. Mobilenet-ssd is using MobileNet as a backbone which is a general architecture that can be used for multiple use cases. Depending on the use case, it can use different input layer size and different width factors. This allows different width models to reduce the number of multiply-adds and thereby reduce inference cost on mobile devices.

The model is quantized in int8 using tensorflow lite converter.

Network information

Network information	Value
Framework	TensorFlow Lite
Quantization	int8
Provenance	https://www.tensorflow.org/api_docs/python/tf/keras/applications/mobilenet
Paper	https://arxiv.org/abs/1704.04861, https://arxiv.org/abs/1512.02325

The models are quantized using tensorflow lite converter.

Network inputs / outputs

For an image resolution of NxM and NC classes

Input Shape	Description
(1, N, M, 3)	Single NxM RGB image with UINT8 values between 0 and 255

Output Shape	Description
(1, NA, 8 + NC)	FLOAT values Where NA is thge number of anchors and NC is the number of classes

Recommended Platforms

Platform	Supported	Recommended
STM32L0	[]	[]
STM32L4	[]	[]
STM32U5	[]	[]
STM32H7	[x]	[x]
STM32MP1	[x]	[x]
STM32MP2	[x]	[]

Performances

Training

To train a SSD mobilenetv1 model with transfer learning or from scratch on your own dataset, you need to configure the user_config.yaml file following the tutorial under the training section.

As an example, st_ssd_mobilenet_v1_025_192_config.yaml file is used to train this model on the person dataset, you can copy its content in the user_config.yaml file provided under the training section to reproduce the results presented below

Deployment

To deploy your trained model, you need to configure the user_config.yaml file following the tutorial under the deployment section.

Metrics

Measures are done with default STM32Cube.AI configuration with enabled input / output allocated option.

Reference MCU memory footprint based on COCO Person dataset (see Accuracy for details on dataset)

Model	Format	Resolution	Series	Activation RAM	Runtime RAM	Weights Flash	Code Flash	Total RAM	Total Flash	STM32Cube.AI version
ST SSD Mobilenet v1 0.25	Int8	192x192x3	STM32H7	266.3. KiB	29.93 KiB	438.28 KiB	96.39 KiB	296.23 KiB	534.67 KiB	9.1.0
ST SSD Mobilenet v1 0.25	Int8	224x224x3	STM32H7	379.6 KiB	34.33 KiB	595.66 KiB	107.15 KiB	413.93 KiB	702.81 KiB	9.1.0
ST SSD Mobilenet v1 0.25	Int8	256x256x3	STM32H7	456.1 KiB	33.75 KiB	595.66 KiB	106.31 KiB	489.85 KiB	701.97 KiB	9.1.0

Reference MCU inference time based on COCO Person dataset (see Accuracy for details on dataset)

Model	Format	Resolution	Board	Execution Engine	Frequency	Inference time (ms)	STM32Cube.AI version
ST SSD Mobilenet v1 0.25	Int8	192x192x3	STM32H747I-DISCO	1 CPU	400 MHz	149.22 ms	9.1.0
ST SSD Mobilenet v1 0.25	Int8	224x224x3	STM32H747I-DISCO	1 CPU	400 MHz	218.68 ms	9.1.0
ST SSD Mobilenet v1 0.25	Int8	256x256x3	STM32H747I-DISCO	1 CPU	400 MHz	266.4 ms	9.1.0

Reference MPU inference time based on COCO Person dataset (see Accuracy for details on dataset)

Model	Format	Resolution	Quantization	Board	Execution Engine	Frequency	Inference time (ms)	%NPU	%GPU	%CPU	X-LINUX-AI version	Framework
ST SSD Mobilenet v1 0.25	Int8	192x192x3	per-channel**	STM32MP257F-DK2	NPU/GPU	800 MHz	12.34 ms	15.35	84.65	0	v5.1.0	OpenVX
ST SSD Mobilenet v1 0.25	Int8	224x224x3	per-channel**	STM32MP257F-DK2	NPU/GPU	800 MHz	18.65 ms	14.02	85.98	0	v5.1.0	OpenVX
ST SSD Mobilenet v1 0.25	Int8	256x256x3	per-channel**	STM32MP257F-DK2	NPU/GPU	800 MHz	14.33 ms	14.12	85.88	0	v5.1.0	OpenVX
ST SSD Mobilenet v1 0.25	Int8	192x192x3	per-channel	STM32MP157F-DK2	2 CPU	800 MHz	67.80 ms	NA	NA	100	v5.1.0	TensorFlowLite 2.11.0
ST SSD Mobilenet v1 0.25	Int8	224x224x3	per-channel	STM32MP157F-DK2	2 CPU	800 MHz	100.20 ms	NA	NA	100	v5.1.0	TensorFlowLite 2.11.0
ST SSD Mobilenet v1 0.25	Int8	256x256x3	per-channel	STM32MP157F-DK2	2 CPU	800 MHz	119.00 ms	NA	NA	100	v5.1.0	TensorFlowLite 2.11.0
ST SSD Mobilenet v1 0.25	Int8	192x192x3	per-channel	STM32MP135F-DK2	1 CPU	1000 MHz	95.36 ms	NA	NA	100	v5.1.0	TensorFlowLite 2.11.0
ST SSD Mobilenet v1 0.25	Int8	224x224x3	per-channel	STM32MP135F-DK2	1 CPU	1000 MHz	139.00 ms	NA	NA	100	v5.1.0	TensorFlowLite 2.11.0
ST SSD Mobilenet v1 0.25	Int8	256x256x3	per-channel	STM32MP135F-DK2	1 CPU	1000 MHz	168.80 ms	NA	NA	100	v5.1.0	TensorFlowLite 2.11.0

** To get the most out of MP25 NPU hardware acceleration, please use per-tensor quantization

AP on COCO Person dataset

Dataset details: link , License CC BY 4.0 , Quotation[1] , Number of classes: 80, Number of images: 118,287

Model	Format	Resolution	AP*
ST SSD Mobilenet v1 0.25	Int8	192x192x3	33.70 %
ST SSD Mobilenet v1 0.25	Int8	224x224x3	44.45 %
ST SSD Mobilenet v1 0.25	Int8	256x256x3	46.26 %

* EVAL_IOU = 0.4, NMS_THRESH = 0.5, SCORE_THRESH =0.001

Retraining and code generation

Link to training script: here
Link to STM32Cube.AI generation script: here

Demos

Integration in simple example

coming soon.

References

[1] Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollár, P. and Zitnick, C.L., 2014. "Microsoft coco: Common objects in context". In Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13 (pp. 740-755). Springer International Publishing. [Online]. Available: https://cocodataset.org/#download.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

SSD MobileNet v1 quantized

Use case : Object detection

Model description

Network information

Network inputs / outputs

Recommended Platforms

Performances

Training

Deployment

Metrics

Reference MCU memory footprint based on COCO Person dataset (see Accuracy for details on dataset)

Reference MCU inference time based on COCO Person dataset (see Accuracy for details on dataset)

Reference MPU inference time based on COCO Person dataset (see Accuracy for details on dataset)

AP on COCO Person dataset

Retraining and code generation

Demos

Integration in simple example

References

Files

README.md

Latest commit

History

README.md

File metadata and controls

SSD MobileNet v1 quantized

Use case : Object detection

Model description

Network information

Network inputs / outputs

Recommended Platforms

Performances

Training

Deployment

Metrics

Reference MCU memory footprint based on COCO Person dataset (see Accuracy for details on dataset)

Reference MCU inference time based on COCO Person dataset (see Accuracy for details on dataset)

Reference MPU inference time based on COCO Person dataset (see Accuracy for details on dataset)

AP on COCO Person dataset

Retraining and code generation

Demos

Integration in simple example

References