Skip to content

GhostNet: More Features from Cheap Operations - 75.7% top-1 (better than than MobileNetV3) #4418

Open
@AlexeyAB

Description

@AlexeyAB

GPU GeForce RTX 2070 - Darknet framework (GPU=1 CUDNN=1 CUDNN_HALF=1)
CPU Intel Core i7 6700k - Darknet framework (OPENMP=1 AVX=1)

  • darknet.cfg - GPU 360 FPS - CPU 63 FPS - 0.400 BFlops - 7.3M params - 61.1% Top1
  • darknet19.cfg - GPU 179 FPS - CPU 14 FPS - 2.793 BFlops - 20.8M params - 72.9% Top1
  • GhostNet-1.0 - GPU 61 FPS - CPU 12 FPS - 0.117 BFlops - 5.0M params - xx.x% Top1
  • MixNet-M-GPU - GPU 82 FPS - CPU 4.6 FPS - 0.533 BFlops - 11.9M params - 71.5% Top1
  • EfficientNetB0 - GPU 110 FPS - CPU 6.3 FPS - 0.450 BFlops - 4.9M params - 71.3% Top1
  • darknet53.cfg - GPU 85 FPS - CPU 4.8 FPS- 9.285 BFlops - 41.6M params - 77.2% Top1

  • GhostNet-1.0 - 5.0M params - 0.117 BFlops - xx.x% Top1 - xx.x% Top5 - MY URL
  • GhostNet-1.0 - 5.2M params - 0.141 BFlops - 73.9% Top1 - 91.4% Top5 - Official
  • MobileNetV3 - 5.4M params - 0.219 BFlops - 75.2% Top1 - --- Top5
  • GhostNet-1.3 - 7.3M params - 0.226 BFlops - 75.7% Top1 - 92.7% Top5 - Official
  • EfficientNetB0 - 4.9M params - 0.450 BFlops - 76.3% (71.3%) Top1 - 93.2% (90.4%) Top5 - MY URL
  • MixNet-M - 5.0M params - 0.360 BFlops - 77.0% (71.5%) Top1 - 93.3% ( 90.5%) Top5 - MixNet (Mix_Conv) - 0.360 (0.5) BFlops - 77.0% (71.5%) Top1 #4203

Comparison table: #4203 (comment)


maybe better than mobilenetv3, efficientnet, mixnet..., etc. huawei-noah/Efficient-AI-Backbones#1

We measure the actual inference speed on an ARM-based mobile phone using the TFLite tool, we use single-threaded mode with batch size 1:

image


image


image


image

Metadata

Metadata

Assignees

No one assigned

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions