ResearchNexus

How to run ?

Creating virtualenv

conda create --name venv python=3.10 -y
conda activate venv
pip install -r requirements.txt

To run Data Ingestion Pipeline

Update LLAMAINDEX_API_KEY in .env file
Run:
```
python3 prepare_data.py </path/to/pdf>
```
Cited research papers are fetched from the input pdf and stored in pdf_papers
PDF's are parfed into .txt format using llamaIndex efficiently and stored in txt_papers papers

To retreive most relevant papers for

Implemented BM25 and BERT based retreival methods in retreive_docs.ipynb file
We Qualitatively found ColBERT results to be better.
Update PAPER_NAME = *.txt in .env file

Run

sudo apt install jupyter-nbconvert
jupyter nbconvert --execute --inplace colbert.ipynb

Retrives the most relevant papers from txt_papers using ColBERT model.
Stores most relevant papers in final_input directory in .txt format
final_input dir is used for running on Microsoft Graph RAG.

To run Knowledge GraphRAG

cp final_input/* /path/to/input
/path/to/input directory refers to input directory of MS KG-RAG implementation
Run KG-RAG, CLI app with QnA session

Sample Responses

Sample Responses generated by the Knowledge Graphs Terminal and ResearchNexus Terminal are also attached.
answers_knowledgegraphs.txt contains answers to queries generated by Knowledge Graphs.
answers_researchnexus.txt contains answers to queries generated by ResearchNexus (our proposed pipeline).

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
final_input		final_input
pdf_papers		pdf_papers
txt_papers		txt_papers
.env		.env
.gitignore		.gitignore
1706.03762.pdf		1706.03762.pdf
ColBERT.ipynb		ColBERT.ipynb
Final_IRE_Report.pdf		Final_IRE_Report.pdf
README.md		README.md
answers_knowledgegraphs.txt		answers_knowledgegraphs.txt
answers_researchnexus.txt		answers_researchnexus.txt
prepare_data.py		prepare_data.py
requirements.txt		requirements.txt
retrieve_docs.ipynb		retrieve_docs.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ResearchNexus

How to run ?

Creating virtualenv

To run Data Ingestion Pipeline

To retreive most relevant papers for

To run Knowledge GraphRAG

Sample Responses

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

Pranav-gu/Knowledge-Graphs-and-RAG

Folders and files

Latest commit

History

Repository files navigation

ResearchNexus

How to run ?

Creating virtualenv

To run Data Ingestion Pipeline

To retreive most relevant papers for

To run Knowledge GraphRAG

Sample Responses

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages