Check it out: Streamlit App
Document Master is an advanced multi-file chatbot designed to facilitate interaction with PDF and MS Word documents. Leveraging OpenAI with LangChain and Hugging Face embedding models, this chatbot can handle questions from multiple documents simultaneously.
- Multi-File Support: Upload and process multiple PDF and DOCX files concurrently.
- Advanced Embeddings: Choose from several Hugging Face embedding models (
all-MiniLM-L6-v2
,bert-base-uncased
,bert-large-uncased
,roberta-base
) to vectorize document chunks. - Interactive Chat: Ask questions related to your uploaded documents and receive precise answers.
- Streamlit Integration: A user-friendly interface built with Streamlit for easy interaction.
- Python 3.8+
- An OpenAI API Key
-
Clone the repository:
git clone https://github.com/mrqadeer/multi-file-documentGPT.git cd multi-file-documentGPT
-
Create a virtual environment:
python -m venv venv source venv/bin/activate # On Windows, use `venv\Scripts\activate`
-
Install dependencies:
pip install -r requirements.txt
-
Start the Streamlit app:
streamlit run app.py
-
Interact with the Chatbot:
- Enter your name and OpenAI API key.
- Set the desired temperature for the language model.
- Choose your embedding model.
- Upload your PDF and DOCX files.
- Ask questions and get answers from your documents.
This project is licensed under the MIT License. See the LICENSE file for details.
Special thanks to:
- Sir Irfan Malik for his guidance and support.
- Hope To Skill for providing resources.
- Artificial Intelligence Course for the foundational knowledge.
For any questions or suggestions, please feel free to reach out to me via GitHub Issues.
Happy Document Chatting!