Convolutional Neural Networks (ConvNets or CNNs) are a class of neural networks that are used for visual recognition tasks.
- AlexNet - Deep Convolutional Neural Networks: implementation, paper
- VGG - Very Deep Convolutional Networks for Large Scale Image Recognition: implementation, paper
- GoogLeNet (Inception v1) - Going Deeper with Convolutions: implementation, paper
- ResNet - Deep Residual Learning for Image Recognition: implementation, annotated paper, paper
- ResNeXt - Aggregated Residual Transformations for Deep Neural Networks: implementation, annotated paper, paper
- Xception - Deep Learning with Depthwise Separable Convolutions: implementation, annotated paper, paper
- DenseNet - Densely Connected Convolutional Networks: implementation, annotated paper, paper
- MobileNetV1 - Efficient Convolutional Neural Networks for Mobile Vision Applications: implementation, annotated paper, paper
- MobileNetV2 - Inverted Residuals and Linear Bottlenecks: implementation, annotated paper, paper
- EfficientNet - Rethinking Model Scaling for Convolutional Neural Networks: implementation, annotated paper, paper. See also EfficientNetV2
- RegNet - Designing Network Design Spaces: implementation, annotated paper, paper. See also this
- ConvMixer - Patches Are All You Need?: implementation, annotated paper, paper
- ConvNeXt - A ConvNet for the 2020s: implementation, annotated paper, paper
The computer vision community is blessed with many vision architectures that work well across a wide range of platforms and hardware. But having many options also means it is not easy to choose an architecture that suits a given problem. How do you choose a CNN architecture for your problem?
The first rule of thumb is that you should not try to design your own architecture from scratch. If you are working on a generic problem, it never hurts to start with ResNet-50. If you are building a mobile visual application where computational resources are limited, try MobileNets (or other mobile-friendly architectures such as ShuffleNetV2 or ESPNetv2).
For a better trade-off between accuracy and computational efficiency, I think EfficientNetV2 or the latest ConvNeXt can be a good fit!
That said, choosing an architecture is a no-free-lunch scenario. There is never going to be a single architecture that works for all datasets and problems. It's all experimentation. It's all trying!
If you are a visionary or like to stay on the bleeding edge of the field, try vision transformers!
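To make the trade-offs above a bit more concrete, here is a minimal sketch that compares the parameter counts of the architectures mentioned above. It assumes PyTorch with a recent torchvision (>= 0.13); the exact model variants chosen here are just illustrative.

```python
# A rough size comparison of the architectures recommended above.
# Assumes PyTorch with torchvision >= 0.13, which provides all four constructors.
from torchvision import models

candidates = {
    "ResNet-50": models.resnet50(weights=None),
    "MobileNetV2": models.mobilenet_v2(weights=None),
    "EfficientNetV2-S": models.efficientnet_v2_s(weights=None),
    "ConvNeXt-Tiny": models.convnext_tiny(weights=None),
}

for name, model in candidates.items():
    n_params = sum(p.numel() for p in model.parameters())
    print(f"{name}: {n_params / 1e6:.1f}M parameters")
```

Parameter count is only a rough proxy; FLOPs and measured latency on your target hardware matter just as much, which is why the no-free-lunch advice above still applies.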
The implementations of ConvNet architectures contained in this repository are not optimized for training, but rather for understanding how those networks were designed, the principal components that make them up, and how they evolved over time. LeNet-5 (LeCun et al., 1998) was only a few layers deep. AlexNet (Krizhevsky et al., 2012) had 5 convolutional layers and 3 fully connected layers. A few years later, Residual Networks (He et al., 2015) set the trend after showing that it is possible to train networks of over 100 layers. In fact, residual networks are still one of the most widely used architectures across a wide range of visual tasks, and they have influenced the design of language architectures. The computer vision research community is very vibrant. Understanding how architectures are designed is not a necessity, but it is one of the good ways to stay on top of this fast-changing field!
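For illustration, here is a minimal sketch of the core idea behind residual networks mentioned above. PyTorch is an assumption here (the repository's own implementations may use a different framework): a block computes F(x) and adds the input x back through a skip connection, which is what makes very deep networks trainable.

```python
# A minimal residual block: the output is F(x) + x.
# The skip connection is the key idea that made 100+ layer networks trainable.
import torch
from torch import nn

class ResidualBlock(nn.Module):
    def __init__(self, channels: int):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        identity = x                          # keep the input for the shortcut
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return self.relu(out + identity)      # add the shortcut, then the nonlinearity

x = torch.randn(1, 64, 56, 56)
print(ResidualBlock(64)(x).shape)             # torch.Size([1, 64, 56, 56])
```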
If you want to use ConvNets for solving a visual recognition task such as image classification or object detection, you can get up and running quickly by getting the models (and their pretrained weights) from tools like Keras, TensorFlow Hub, PyTorch Vision, timm (PyTorch Image Models), GluonCV, and OpenMMLab.
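As a quick example of getting up and running with pretrained weights, here is a minimal sketch using torchvision's weights API (an assumption: it requires torchvision >= 0.13, and "cat.jpg" is a placeholder path for any RGB image).

```python
# Classify an image with a pretrained ResNet-50 from torchvision.
import torch
from torchvision import models
from torchvision.io import read_image

weights = models.ResNet50_Weights.DEFAULT
model = models.resnet50(weights=weights).eval()
preprocess = weights.transforms()        # the resize/crop/normalization these weights expect

img = read_image("cat.jpg")              # uint8 tensor of shape (3, H, W)
batch = preprocess(img).unsqueeze(0)     # add a batch dimension

with torch.no_grad():
    probs = model(batch).softmax(dim=1)[0]

top_id = int(probs.argmax())
print(weights.meta["categories"][top_id], f"({probs[top_id].item():.1%})")
```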
If you find this repository helpful, I would appreciate it if you cite it:
author: Jean de Dieu Nyandwi
title: ConvNets Architectures
year: 2022
publisher: GitHub
url: https://github.com/Nyandwi/convnets-architectures
For any suggestion, comment, or simply anything, you can reach out through email, Twitter, or LinkedIn.