- A Convolutional Neural Network is a type of deep neural network designed for processing strcutred grid data like images.
- Primarily used in Image Recognition,Object Detection, Computer Vision Tasks etc...
-
Convolutional Layer:
- A Convolutional Layer is the core component of a CNN.
- It applies filters to the input images,sliding them across the image ans capture features from the images like edges and textures etc..
- Each filter produced a feature map that detects specific patterns in the data.
-
Pooling Layer:
- Reduces the spatial dimensions of the feature maps helpig to reduce time and calculation complexity
- Types of Pooling:
- Max Pooling: Takes maximum value in a smaller window.
- Average Pooling: Takes average of the values in the smaller window.
- Helps make the model invariant to small translations in the input data.
-
Activation Functions:
- Introduces non-linearity by converting negative values to zero.
- Ensures that the model learns complex patterns and relationships in the data.
-
Fully Connected Layer:
- The final layer(s) of a CNN that connects every node in one layer to every node in the next layer.
- Combines the features extracted by convolutional and pooling layers to produce the final classification or prediction.
-
Input Layer: The input to a CNN is typically an image represented as a matrix of pixel values.
-
Convolution Operation: Filters (small matrices) slide over the input image to detect features like edges, corners, or textures.
-
Feature Maps: The result of the convolution operation produces feature maps that highlight the presence of specific patterns or features in different regions of the input.
-
Pooling: The feature maps are downsampled to reduce the size, keeping the most important information (e.g., using max pooling).
-
Flattening and Fully Connected Layers: The output is flattened into a vector, which is fed into fully connected layers for classification or regression tasks.
-
Output Layer: Produces the final prediction, such as classifying an image into categories.
- Image Classification: Recognizing objects within images
- Object Detection: Locating and identifying multiple objects within an image.
- Facial Recognition: Detecting and recognizing human faces.
- Medical Imaging: Analyzing X-rays, MRIs, and other medical images for diagnostic purposes.