🧠 Multimodal RAG App for Research Paper Exploration

This Streamlit-powered application enables researchers, developers, and AI enthusiasts to explore cutting-edge research papers on LLMs and attention mechanisms using Retrieval-Augmented Generation (RAG) enhanced with web search integration. The system loads academic papers (PDFs), extracts and embeds their content using OpenAI models, and lets users query them through a conversational chat interface.

Features

  • Multimodal RAG: Handles PDF documents with both text and tables
  • Integrated Web Search: Extends RAG context with fresh knowledge from the internet
  • OpenAI Embeddings + LLM: Uses OpenAI models for both embedding and response generation
  • ChromaDB as Vector Store: Fast and efficient local vector database
  • Streamlit UI: Simple and interactive chat-like interface
  • PDF Research Sources: GPT-4, Mistral 7B, Gemini, Attention Is All You Need, InstructGPT
  • Retrieval References: Answers can be extended with document citations and source highlighting
  • Local Deployment Ready

File Structure

├── openai_chromadb_rag_app.py       # Main Streamlit app
├── requirements.txt                 # Python dependencies
├── /data                            # Uploaded PDF research papers
│   ├── attention_paper.pdf
│   ├── gemini_paper.pdf
│   ├── gpt4.pdf
│   ├── instructgpt.pdf
│   └── mistral_paper.pdf
└── /vector_store                    # Persisted ChromaDB index (after first run)
    ├── chroma.sqlite
    └── index/

How It Works

  1. Data Ingestion
    PDFs are loaded, parsed (text and tables), and chunked for embedding.

  2. Embedding + Indexing
    Embeddings are generated using OpenAI models and stored in ChromaDB.

  3. Augmented Retrieval
    Chunks relevant to the user query are retrieved via vector search. Optionally, live web results are also fetched and fused into context.

  4. Response Generation
    Retrieved knowledge (PDF + web) is fed into OpenAI LLMs to generate grounded, accurate responses.

  5. Streamlit UI
    Offers a clean, scrollable chat interface for querying and exploring results.

Each stage is sketched in code below.
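
First, stages 1–2 (load, chunk, embed, index). A minimal sketch, assuming pypdf for parsing and text-embedding-3-small as the embedding model; the app's actual loader, chunking strategy, and model choices may differ:

# ingestion sketch (illustrative, not the app's exact code)
from pypdf import PdfReader
import chromadb
from openai import OpenAI

client = OpenAI()                                        # reads OPENAI_API_KEY from the environment
chroma = chromadb.PersistentClient(path="vector_store")  # persisted index location
collection = chroma.get_or_create_collection("papers")

def chunk(text, size=1000, overlap=200):
    # Naive fixed-size chunking with overlap; the app may use a smarter splitter.
    return [text[i:i + size] for i in range(0, len(text), size - overlap)]

for path in ["data/attention_paper.pdf", "data/gpt4.pdf"]:   # illustrative subset
    text = "\n".join(page.extract_text() or "" for page in PdfReader(path).pages)
    chunks = chunk(text)
    result = client.embeddings.create(model="text-embedding-3-small", input=chunks)
    collection.add(
        ids=[f"{path}-{i}" for i in range(len(chunks))],
        documents=chunks,
        embeddings=[item.embedding for item in result.data],
        metadatas=[{"source": path}] * len(chunks),
    )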
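
Stages 3–4 continue the sketch: embed the question, retrieve the top-k chunks from ChromaDB, optionally fuse in web results, and ask an OpenAI chat model to answer from that context (gpt-4o-mini is an assumed stand-in for whatever model the app configures):

# retrieval + generation sketch (illustrative)
def answer(question, k=4):
    q_emb = client.embeddings.create(
        model="text-embedding-3-small", input=[question]
    ).data[0].embedding
    hits = collection.query(query_embeddings=[q_emb], n_results=k)
    context = "\n\n".join(hits["documents"][0])
    # Optional web-search step: fetch live snippets here and append them to context.
    reply = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[
            {"role": "system",
             "content": "Answer using only the context below.\n\n" + context},
            {"role": "user", "content": question},
        ],
    )
    return reply.choices[0].message.content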
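
Stage 5 wraps answer() in a bare-bones Streamlit chat loop; the real UI likely adds source display and richer history handling:

# Streamlit chat loop sketch (illustrative)
import streamlit as st

st.title("Research Paper RAG")
if "history" not in st.session_state:
    st.session_state.history = []

for role, text in st.session_state.history:   # replay earlier turns
    st.chat_message(role).write(text)

if question := st.chat_input("Ask about the papers..."):
    st.chat_message("user").write(question)
    response = answer(question)
    st.chat_message("assistant").write(response)
    st.session_state.history += [("user", question), ("assistant", response)]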

📦 Setup & Installation

# 1. Clone the repository
git clone https://github.com/<your-username>/Research-Papers-Multi-Modal-RAG.git
cd Research-Papers-Multi-Modal-RAG

# 2. Create and activate a virtual environment
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

# 3. Install dependencies
pip install -r requirements.txt
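
# 4. Provide your OpenAI API key; the app reads it from .env
#    (OPENAI_API_KEY is the assumed variable name)
echo "OPENAI_API_KEY=sk-..." > .env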

# 5. Run the Streamlit app
streamlit run openai_chromadb_rag_app.py

🧠 Example Queries You Can Ask

  • “What architecture was proposed in the Mistral paper?”
  • “Summarize the GPT-4 training methodology.”
  • “Compare Gemini’s retrieval techniques with InstructGPT’s.”
  • “What is the attention mechanism described in the 2017 Transformer paper?”

Limitations

  • Only PDF research papers are supported in the current version
  • Web search is basic and may need rate-limiting or proxy setup
  • An OpenAI API key is required (ensure it is loaded from .env, as in setup step 4)

Future Outlook

  • Add support for images/diagrams (e.g., via BLIP or CLIP)
  • Integrate with LangSmith or WandB for observability
  • Enable user authentication (multi-user chat interface)
  • Deploy on HuggingFace Spaces or Streamlit Cloud

License

MIT License — see LICENSE file for details.