An AI-powered assistant that helps you search for research papers, scrape academic content, summarize it using large language models (LLMs), extract and summarize PDF files, and even generate hypotheses — all in one easy-to-use Streamlit interface.
- 🔍 Literature Search from arXiv via DuckDuckGo
- 🌐 Web Scraping of abstracts and page content
- 📄 PDF Reader with text extraction (via PyMuPDF)
- 🤖 Summarization & Hypothesis Generation using
ibm-granite/granite-3.3-8b-instruct(via Replicate) - 🧪 Ideal for students, researchers, and academics
Install all dependencies using:
pip install -r requirements.txtOr individually:
pip install streamlit replicate requests beautifulsoup4 PyMuPDF- Go to Replicate API Tokens
- Generate a new API token
- Recommended: Set it as an environment variable
export REPLICATE_API_TOKEN=your_token_hereOr, you can hardcode it (not recommended for production):
REPLICATE_API_TOKEN = "your_token_here"
client = replicate.Client(api_token=REPLICATE_API_TOKEN)- Model:
ibm-granite/granite-3.3-8b-instruct - Version ID:
3ff9e6e20ff1f31263bf4f36c242bd9be1acb2025122daeefe2b06e883df0996
You can change the model in app.py if needed.
Start your Streamlit app locally:
streamlit run app.pyThen go to LocalHost.
Find three recent research papers on the ethical implications of using CRISPR technology in humans.
Summarize the uploaded paper and highlight the main results and methodology.
Suggest a hypothesis based on the uploaded PDF.
Draft an abstract for a paper on AI in climate modeling.📦 research-agent/
├── app.py # Main Streamlit app
├── requirements.txt # Required libraries
└── README.md # You're here- This tool uses third-party APIs and AI models. Use it responsibly for academic and research purposes only.
- Always verify LLM-generated content before citing or using it in official research.
- Export to PDF / BibTeX / CSV
- Follow-up Q&A on paper content
- Semantic search with vector embeddings
Samarth Pujari
AI Intern @ IBM