
# fpgaconvnet-torch

A PyTorch frontend for fpgaConvNet, providing emulated accuracy results for features such as quantization and sparsity.

## Code Structure

- `models/`: general interfaces for model creation, inference, and ONNX export.
- `quantization/`: emulation of fixed-point and block floating-point representations.
- `sparsity/`: post-activation sparsity measurement, plus tunable-threshold ReLU.
- `optimiser_interface/`: Python interface for launching the fpgaConvNet optimiser and collecting prediction results.
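Fixed-point emulation of the kind listed above can be sketched as fake quantization: scale, round to nearest, clamp to the representable range, then rescale. This is a minimal illustration under assumed conventions (the function name, bit-widths, and word/fraction split are not taken from the repository):

```python
def fake_quantize_fixed(x: float, total_bits: int = 16, frac_bits: int = 8) -> float:
    """Emulate signed fixed-point: scale, round to nearest, clamp, rescale.

    Hypothetical helper -- bit-widths and rounding mode are assumptions,
    not the repository's actual configuration.
    """
    scale = 2.0 ** frac_bits
    qmin = -(2 ** (total_bits - 1))      # most negative representable integer
    qmax = 2 ** (total_bits - 1) - 1     # most positive representable integer
    q = min(max(round(x * scale), qmin), qmax)
    return q / scale

# Values snap to the nearest multiple of 2**-frac_bits;
# out-of-range values saturate at the clamping bounds.
print(fake_quantize_fixed(0.1234567))      # nearest multiple of 1/256
print(fake_quantize_fixed(1000.0, 8, 4))   # saturates at 127/16
```

Narrow formats saturate and round aggressively, which is one intuition for the steep Fixed8 accuracy drops in the tables below.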

## Examples

```shell
python quantization_example.py
python activation_sparsity_example.py
python threshold_relu_example.py
```

## Model Zoo

- imagenet: resnet18, resnet50, mobilenet_v2, repvgg_a0
- coco: yolov8n
- camvid: unet
- cityscapes: unet
- llgmri: unet
- ucf101: x3d_s, x3d_m
- brats20: unet3d

## Quantization Results

### imagenet (val, top-1 acc)

| Model | Source | Float32 | Fixed16 | Fixed8 | BFP8 (Layer) | BFP8 (Channel) |
|---|---|---|---|---|---|---|
| resnet18 | torchvision | 69.76 | 69.76 | 1.03 | 68.48 | 69.26 |
| resnet50 | torchvision | 76.13 | 76.10 | 0.36 | 74.38 | 75.75 |
| mobilenet_v2 | torchvision | 71.87 | 71.76 | 0.10 | 53.68 | 69.51 |
| repvgg_a0 | timm | 72.41 | 72.40 | 0.21 | 0.21 | 66.08 |
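The gap between the BFP8 (Layer) and BFP8 (Channel) columns comes from the shared exponent: the larger the block, the more small values get flushed toward zero. A hedged sketch of shared-exponent (block floating-point) quantization, where the function name and mantissa handling are illustrative assumptions rather than the repository's implementation:

```python
import math

def bfp_quantize(block, mantissa_bits=7):
    """Quantize a block of floats with one shared exponent (BFP sketch).

    Hypothetical helper: sign + `mantissa_bits` magnitude bits per value,
    with the exponent taken from the block's largest magnitude.
    """
    max_mag = max(abs(v) for v in block)
    if max_mag == 0.0:
        return list(block)
    shared_exp = math.floor(math.log2(max_mag))
    step = 2.0 ** (shared_exp + 1 - mantissa_bits)  # one step size for the whole block
    max_mant = 2 ** mantissa_bits - 1               # magnitude clamp
    return [math.copysign(min(round(abs(v) / step), max_mant) * step, v)
            for v in block]

# A small value sharing a block with a large one is flushed to zero:
print(bfp_quantize([1.0, 0.003]))   # [1.0, 0.0]
# The same small value in its own block keeps its precision:
print(bfp_quantize([0.003]))
```

With one exponent per layer, small-magnitude channels inherit the exponent of the largest channel and lose precision; per-channel blocks keep the exponent local, which is consistent with the higher BFP8 (Channel) numbers in these tables.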

### coco (val, mAP50-95)

| Model | Source | Float32 | Fixed16 | Fixed8 | BFP8 (Layer) | BFP8 (Channel) |
|---|---|---|---|---|---|---|
| yolov8n | ultralytics | 37.1 | 37.1 | 0.0 | 0.0 | 35.1 |

### camvid (val, mIOU)

| Model | Source | Float32 | Fixed16 | Fixed8 | BFP8 (Layer) | BFP8 (Channel) |
|---|---|---|---|---|---|---|
| unet | nncf | 71.95 | 71.95 | 61.02 | 71.60 | 71.85 |
| unet-bilinear | nncf | 71.67 | 71.67 | 60.62 | 71.40 | 71.75 |

### cityscapes (val, mIOU)

| Model | Source | Float32 | Fixed16 | Fixed8 | BFP8 (Layer) | BFP8 (Channel) |
|---|---|---|---|---|---|---|
| unet | mmsegmentation | 69.10 | 69.10 | 1.98 | 61.74 | 68.43 |

### llgmri (val, Dice coefficient)

| Model | Source | Float32 | Fixed16 | Fixed8 | BFP8 (Layer) | BFP8 (Channel) |
|---|---|---|---|---|---|---|
| unet | brain-segmentation-pytorch | 90.89 | 90.88 | 80.98 | 90.95 | 90.85 |
| unet-bilinear | brain-segmentation-pytorch | 91.05 | 91.05 | 77.51 | 91.04 | 91.03 |

### ucf101 (val-split1, top-1 acc)

| Model | Source | Float32 | Fixed16 | Fixed8 | BFP8 (Layer) | BFP8 (Channel) |
|---|---|---|---|---|---|---|
| x3d_s | mmaction2 | 93.68 | 93.57 | 1.13 | 90.21 | 93.57 |
| x3d_m | mmaction2 | 96.40 | 96.40 | 0.81 | 95.24 | 96.29 |

## Sparsity Results

- Q: Fixed16 quantization
- AS: activation sparsity
- WS: weight sparsity (applying a global pruning threshold)
- All results are post-training, without fine-tuning.
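The two sparsity measures can be sketched as follows; the helper names are illustrative assumptions, not the repository's API:

```python
def activation_sparsity(activations):
    """Fraction of exactly-zero activations (e.g. the output of a ReLU)."""
    return sum(1 for a in activations if a == 0.0) / len(activations)

def prune_weights(weights, threshold):
    """Apply a global magnitude threshold: zero every weight below it."""
    return [0.0 if abs(w) < threshold else w for w in weights]

post_relu = [max(0.0, x) for x in [-1.2, 0.7, -0.3, 2.1]]  # ReLU zeroes the negatives
print(activation_sparsity(post_relu))                      # 0.5
print(prune_weights([0.004, -0.02, 0.001], 0.005))         # [0.0, -0.02, 0.0]
```

The WS(t) rows in the table below correspond to increasing values of such a global threshold: higher thresholds zero more weights (more sparsity) at the cost of accuracy.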

### imagenet

| Model | Experiment | Accuracy (%) | Sparsity (%) |
|---|---|---|---|
| resnet18 | Q+AS | 69.74 | 50.75 |
| resnet18 | Q+AS+WS(0.005) | 69.42 | 56.33 |
| resnet18 | Q+AS+WS(0.010) | 67.36 | 61.47 |
| resnet18 | Q+AS+WS(0.015) | 58.38 | 65.91 |
| resnet18 | Q+AS+WS(0.020) | 27.91 | 69.63 |

## Links to other repos