Skip to content

RVL-CDIP could be looked at as the equivalent of ImageNet for the document image community. It’s certainly the largest we’ve seen in the literature. There are 400,000 total document images in the dataset. The dataset contains much noise and variance in composition of each document class. Uncompressed, the dataset size is ~100GB, and comprises 16…

Notifications You must be signed in to change notification settings

sambalshikhar/Document-Image-Classification-with-Intra-Domain-Transfer-Learning-and-Stacked-Generalization-of-Deep

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Document-Image-Classification-with-Intra-Domain-Transfer-Learning-and-Stacked-Generalization-of-Deep

Blog post : https://medium.com/@shikharsambal3/how-i-built-a-document-classification-system-using-deep-convolutional-neural-networks-e1d9a83cbabd

RVL-CDIP could be looked at as the equivalent of ImageNet for the document image community. It’s certainly the largest we’ve seen in the literature. There are 400,000 total document images in the dataset. The dataset contains much noise and variance in composition of each document class. Uncompressed, the dataset size is ~100GB, and comprises 16 classes of document types, with 25,000 samples per classes. Example classes include email, resume, and invoice. Achieved an Accuracy of over 93% which beat the benchmark score of 92% based on https://paperswithcode.com/sota/document-image-classification-on-rvl-cdip

About

RVL-CDIP could be looked at as the equivalent of ImageNet for the document image community. It’s certainly the largest we’ve seen in the literature. There are 400,000 total document images in the dataset. The dataset contains much noise and variance in composition of each document class. Uncompressed, the dataset size is ~100GB, and comprises 16…

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published