GPT-powered PDF Analyser. Upload Multiple PDFs and chat with them.
PDF-Analyser is a Python project developed to facilitate interactive and semantic search on text extracted from multiple PDFs. It utilizes Streamlit for the user interface, Langchain for text extraction and chunking, and OpenAI GPT-3.5 for generating relevant results based on user queries.

- PDF Upload: Multiple PDFs can be uploaded simultaneously.
- Chunking: Converts extracted text into manageable chunks.
- Vector Database: Stores the text chunks in a vector database (FAISS).
- Chat Interface: Provides a user-friendly chat screen for interacting with the PDF data.
- Semantic Search: Performs a semantic search on the vector store using user queries.
- GPT-3.5 Integration: Passes vector results to OpenAI GPT-3.5 for generating relevant
- Chat History: Preserves chat history to maintain context for generating more coherent responses.
- Clone the Repository
git clone https://github.com/jaisurya27/PDF-Analyser.git
- Navigate to the project directory
- Install the required dependencies
pip install -r requirements.txt
- Add OPENAPI key to .env file
OPENAI_API_KEY=your_secrit_api_key
- Run the streamlit app
streamlit run app.py
- Open your browser and navigate to http://localhost:8501 to access the PDF-Analyser interface
- Upload PDFs and start interacting with the chat interface to retrieve relevant information from the PDF data.
- The chat history will also be preserved for a more contextual conversation.
