Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
-
Updated
Jan 7, 2025 - Python
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
This repository contains demos I made with the Transformers library by HuggingFace.
A Repo For Document AI
The MERIT Dataset is a fully synthetic, labeled dataset created for training and benchmarking LLMs on Visually Rich Document Understanding tasks. It is also designed to help detect biases and improve interpretability in LLMs, where we are actively working. This repository is actively maintained, and new features are continuously being added.
LayoutLMv3 applied to a VQA problem with infographics.
Exploring LayoutLM for Smart OCR Capabilities
All in one package for Document (image, pdf) Classification. Unified Interface for google ocr and tesseract. Train, evaluate, and infer using fasttext, Small language models (NER), Small Vision Language Models (layoutlm), and LLM.
Prototypical Networks for Information Extraction in Visual Documents. Weights can be found at https://drive.google.com/file/d/1Zrp7QaZIf0H_FFRx_LhB0uZTqDUSis2H/view?usp=sharing.
Fine-tuning LayoutLMv3 on the SROIE dataset to build a receipt understanding model
Mini Projects that are developed using Python.
Add a description, image, and links to the layoutlm topic page so that developers can more easily learn about it.
To associate your repository with the layoutlm topic, visit your repo's landing page and select "manage topics."