A web application for managing scientific paper PDFs and chemical structures extracted from them.
- Upload, view, and manage PDF documents with bibliographic information
- Capture chemical structures from PDFs
- Recognize chemical structures and convert them to SMILES and InChI formats
- Validate chemical structures
- Store and organize chemical compounds linked to their source papers
- Bun for running the JavaScript server
- Python 3.10 for the chemical service
- RDKit for chemical structure manipulation
- DECIMER for chemical structure recognition
# Install dependencies
bun install
# Start the server
bun run server.ts

# You will need a working conda environment
conda create -n paperbase python=3.10
conda activate paperbase
pip install -r requirements.txt
# Start the chemical service
python chemical_service.py

During the first run, the chemical service will download the DECIMER model weights and cache them. This may take a few minutes.
- Access the application at http://localhost:8000
- Upload PDFs with bibliographic information
- View PDFs and capture regions containing chemical structures
- Use the "Recognize Structure" feature to extract SMILES from images
- Validate and save chemical structures with metadata
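
The validation step can be pictured with a minimal sketch like the one below. It is not taken from `chemical_service.py`: the function name `validate_smiles` and the shape of the returned dictionary are hypothetical. The RDKit calls themselves (`Chem.MolFromSmiles`, `Chem.MolToSmiles`, `Chem.MolToInchi`) are standard RDKit functions, and the sketch assumes an RDKit build with InChI support.

```python
from typing import Optional


def validate_smiles(smiles: str) -> Optional[dict]:
    """Return canonical SMILES and InChI for a parsable SMILES string, else None.

    Hypothetical helper for illustration; the real service may differ.
    """
    # Reject blank input early: RDKit parses an empty string into an empty molecule.
    if not smiles.strip():
        return None

    # Imported lazily so this module still loads where RDKit is not installed.
    from rdkit import Chem

    mol = Chem.MolFromSmiles(smiles)  # None when the SMILES cannot be parsed
    if mol is None:
        return None

    return {
        "smiles": Chem.MolToSmiles(mol),  # canonical form of the input
        "inchi": Chem.MolToInchi(mol),
    }
```

Returning `None` for anything RDKit cannot parse keeps the caller's logic simple: a structure is only saved when both identifiers could be generated.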
- Frontend: HTML/CSS/JavaScript
- Backend Server: Bun/TypeScript with SQLite database
- Chemical Service: Python Flask API with RDKit and DECIMER integration
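
To make the link between compounds and their source papers concrete, here is a rough sketch of how such a relationship might be modelled in SQLite. The actual backend is written in TypeScript and its schema is not shown here, so every table, column, and function name below is an assumption made for illustration; Python's built-in `sqlite3` module is used only to keep the example self-contained.

```python
import sqlite3

# Hypothetical schema: one row per paper, many compounds per paper.
SCHEMA = """
CREATE TABLE papers (
    id    INTEGER PRIMARY KEY,
    title TEXT NOT NULL,
    doi   TEXT
);
CREATE TABLE compounds (
    id       INTEGER PRIMARY KEY,
    paper_id INTEGER NOT NULL REFERENCES papers(id) ON DELETE CASCADE,
    smiles   TEXT NOT NULL,
    inchi    TEXT,
    page     INTEGER
);
"""


def open_db(path=":memory:"):
    """Open a database and create the tables (in memory by default)."""
    conn = sqlite3.connect(path)
    # SQLite only enforces foreign keys when this pragma is enabled per connection.
    conn.execute("PRAGMA foreign_keys = ON")
    conn.executescript(SCHEMA)
    return conn


def add_paper(conn, title, doi=None):
    """Insert a paper and return its row id."""
    with conn:  # commits on success, rolls back on error
        cur = conn.execute(
            "INSERT INTO papers (title, doi) VALUES (?, ?)", (title, doi)
        )
    return cur.lastrowid


def add_compound(conn, paper_id, smiles, inchi=None, page=None):
    """Insert a compound linked to an existing paper and return its row id."""
    with conn:
        cur = conn.execute(
            "INSERT INTO compounds (paper_id, smiles, inchi, page) VALUES (?, ?, ?, ?)",
            (paper_id, smiles, inchi, page),
        )
    return cur.lastrowid


def compounds_for_paper(conn, paper_id):
    """Return (smiles, inchi, page) tuples for one paper, in insertion order."""
    rows = conn.execute(
        "SELECT smiles, inchi, page FROM compounds WHERE paper_id = ? ORDER BY id",
        (paper_id,),
    )
    return rows.fetchall()
```

With the foreign key in place, a compound cannot be saved against a paper that does not exist, and deleting a paper removes its compounds as well. Whether the real application wants cascading deletes is a design choice this sketch does not settle.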
This project is licensed under the GNU General Public License v3.0 (GPLv3). See the LICENSE file for details.