Paper Bold AI is an AI-powered web application designed for academic paper analysis. Built with a Retrieval Augmented Generation (RAG) architecture, it helps researchers and students efficiently extract key insights from complex academic papers in PDF format.
The application addresses the time-consuming challenge of understanding lengthy research papers by providing instant summaries, identifying technical approaches, and enabling conversational Q&A about the paper's content.
Paper Bold AI implements a complete RAG pipeline:
- Content Extraction: Converts PDF documents to text using PyPDF2
- Intelligent Chunking: Dynamically segments text based on document size and structure
- Vector Embedding: Transforms text chunks into semantic vectors using Google's embedding model
- Vector Storage: Organizes vectors in a Chroma database for quick retrieval
- Context Matching: ConversationalRetrievalChain query processing
- Response Generation: Gemini-powered answer synthesis

This application processes PDF files uploaded by users. No pre-prepared dataset is used. Each PDF is processed after upload and converted into a vector database.
- Bilingual Interface: Full support for both Turkish and English throughout the application
- Adaptive Document Processing: Automatically adjusts chunking parameters based on PDF size for optimal performance
- AI-Powered Summarization: Generates concise, focused summaries highlighting key findings and methodologies
- Technical Element Detection: Automatically identifies and extracts models, algorithms, and technical approaches mentioned in papers
- Contextual Question Answering: Allows natural language queries about any aspect of the paper with accurate, citation-based responses
- Responsive Design: Fully functional across desktop and mobile devices
Ideal For:
- Academic researchers conducting literature reviews
- Students navigating complex research papers
- Research teams sharing and discussing technical publications
- Anyone needing to quickly extract specific information from academic papers
- Backend: Python, Flask
- Frontend: HTML, CSS, JavaScript
- GenAI: Gemini 1.5 Flash, Google Embedding, LangChain
- Vector Database: Chroma
- PDF Processing: PyPDF2
-
Clone the repository:
git clone https://github.com/enesmanan/paper-bold.git cd paper-bold
-
Create and activate a virtual environment:
python -m venv venv source venv/bin/activate # Linux/Mac venv\Scripts\activate # Windows
-
Install required packages:
pip install -r requirements.txt
-
Create a
.env
file and add your Google API key:GOOGLE_API_KEY=your_api_key
-
Run the application:
python app.py
-
Go to
http://localhost:5000
in your browser
Please get in touch if you have any questions about the project.
- E-mail: enesmanan768@gmail.com
- GitHub: github.com/enesmanan
- LinkedIn: linkedin.com/in/enesfehmimanan
paper-bold/
├── app.py
├── static/
│ ├── style.css
│ └── images/
├── templates/
│ ├── index.html
│ └── viewer.html
├── uploads/
├── chroma_db/
├── requirements.txt
└── README.md