📝 Gujarati Handwriting Recognition

This project was developed as the final year diploma project in Computer Engineering. It involves building a machine learning pipeline to recognize handwritten Gujarati digits or characters using Convolutional Neural Networks (CNN) and a basic Neural Network (NN). The project covers the full pipeline: data preprocessing, augmentation, model training, and evaluation.

🔍 Objective

To build a robust image classification model that can recognize Gujarati handwritten characters or digits with high accuracy using custom image preprocessing and CNN/NN architectures.

⚙️ Technologies Used

Python - for model building and preprocessing
Pillow (PIL) - image loading and manipulation
NumPy - array operations
TensorFlow - for building and training models
Goggle Colab - model training & analysis

🧹 Preprocessing Pipeline

Balancing Classes (get_no_of_imgs.py)
- Counts the number of images in each class
- Calculates how many more images are needed per class to balance the dataset
Resizing & Grayscale Conversion (resize_imgs.py)
- Resizes all images to 50x70 pixels
- Converts to grayscale
- Organizes into class-wise directories
Thresholding & Binarization (absolute_0-1_filtering.py)
- Converts images to grayscale
- Applies a binary threshold (e.g., pixels > 130 → white, else black)
- Saves cleaned images for further use
Augmentation (augment.py)
- Applies slight rotations to existing samples
- Saves them as new images to increase data diversity

🔃 Preprocessing results

=>

🧠 Model Architectures

CNN Model
- Input shape: (50, 70, 1)
- Multiple convolutional + max pooling layers
- Dense layers for classification
- Dropout regularization
Simple Neural Network (NN)
- Input layer flattened from image
- Dense layers with ReLU
- Softmax output

📊 Results

Training Accuracy: ~95% (CNN), ~80% (NN)
Test Accuracy: ~90% (CNN), ~75% (NN)
CNN outperformed NN significantly due to its capability to extract spatial features.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
assets		assets
gujarati digits dataset		gujarati digits dataset
gujarati handwriting recognition notebooks		gujarati handwriting recognition notebooks
python files for preprocessing		python files for preprocessing
report		report
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

📝 Gujarati Handwriting Recognition

🔍 Objective

⚙️ Technologies Used

🧹 Preprocessing Pipeline

🔃 Preprocessing results

🧠 Model Architectures

📊 Results

About

Uh oh!

Releases

Packages

Languages

Uh oh!

Uh oh!

virtualharsh/gujarti-image-recognition

Folders and files

Latest commit

History

Repository files navigation

📝 Gujarati Handwriting Recognition

🔍 Objective

⚙️ Technologies Used

🧹 Preprocessing Pipeline

🔃 Preprocessing results

🧠 Model Architectures

📊 Results

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages