PDF Sumzz

PDF Sumzz is a powerful application that allows users to upload PDF documents and ask questions about their content. The AI analyzes the document and provides accurate, contextual answers based on the information in the PDF.

Created by

Malav Patel

Features

PDF Upload: Upload any PDF document for AI analysis
AI-Powered Q&A: Ask questions about the document and receive intelligent answers
Context-Aware Responses: The AI understands the context within the document
Interactive Chat Interface: User-friendly chat interface for seamless interaction
Document Chunking: Efficiently processes large documents by splitting them into manageable pieces
Vector Search: Uses embeddings and vector search to find the most relevant information

Technologies Used

LangChain: For orchestrating the AI workflow and document processing
PyMuPDF (Fitz): For extracting text from PDF documents
Mistral AI: For generating intelligent responses to questions
FAISS: For vector similarity search
Streamlit: For creating the web interface
Sentence Transformers: For generating vector embeddings

Installation

Clone the repository:

git clone https://github.com/Malav2364/PdfSumzz.git
cd PdfSumzz

Create a virtual environment:

python -m venv venv

# Activate the virtual environment
# On Windows:
venv\Scripts\activate
# On macOS/Linux:
source venv/bin/activate

Install dependencies:
```
pip install -r requirements.txt
```
Create a .env file in the project root directory and add your API key:
```
MISTRAL_API_KEY=your_mistral_api_key_here
```

Usage

Start the application:
```
streamlit run app.py
```
Open the application in your web browser (typically at http://localhost:8501)
Upload a PDF document using the file uploader
Once the document is processed, ask questions about its content
The AI will provide answers based on the document's content

Example Questions

What is the main topic of this document?
Can you summarize the content?
What are the key findings in this paper?
Explain the methodology used in this document.
What conclusions are drawn in this document?

Project Structure

PdfSumzz/ ├── .env # For API keys (not included in repository) ├── app.py # Main Streamlit application ├── pdf_processor.py # PDF processing logic ├── requirements.txt # Project dependencies └── README.md # Project documentation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PDF Sumzz

Created by

Features

Technologies Used

Installation

Usage

Example Questions

Project Structure

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
app.py		app.py
pdf_processor.py		pdf_processor.py

Malav2364/PdfSumzz

Folders and files

Latest commit

History

Repository files navigation

PDF Sumzz

Created by

Features

Technologies Used

Installation

Usage

Example Questions

Project Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages