AI-Powered PDF to Data Converter

Convert PDF files into structured data using AI-powered extraction and generate insights from the extracted data.

Overview

This project provides a solution for converting PDF files into structured data formats, such as JSON, using advanced AI techniques. It offers various functionalities including generalized data extraction, schema-based extraction, and generating insights from the extracted data.

Functionalities

Generalized Data Extraction: Extract all data from PDF files in JSON format.
Schema-Based Extraction: Extract data based on predefined schemas for specific document types.
Generate Insights: Upload the extracted data in CSV format and generate insights by plotting graphs based on user queries.

Dependencies

The project utilizes the following libraries and tools:

Langchain: For AI-powered PDF data extraction.
Gemini-pro: For converting extracted data into JSON format.
Pydantic: For schema-based data extraction.
Streamlit: For building interactive web applications.
PandasAi: For data Visualization.
GROQ with Mixtral: For getting query based insights from .csv data.

Screenshots

Dashboard
Extract Data
Extracted Data Output
Schema-Based Data Extraction
Schema-Based Data Output
Generate Insights
Generated Insights Output
Pie Chart

How to Run

Clone the repository:

git clone https://github.com/maheshsathe07/AI-powered-PDF-to-Data-Converter.git

Navigate to the project directory:
```
cd src
```
Install the required dependencies:
```
pip install -r requirements.txt
```
Run the Streamlit app:
```
streamlit run main.py
```
Access the app in your browser at http://localhost:8501.

Contributors

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
src		src
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AI-Powered PDF to Data Converter

Overview

Functionalities

Dependencies

Screenshots

How to Run

Contributors

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

maheshsathe07/AI-powered-PDF-to-Data-Converter

Folders and files

Latest commit

History

Repository files navigation

AI-Powered PDF to Data Converter

Overview

Functionalities

Dependencies

Screenshots

How to Run

Contributors

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages