3GPP-RAG-chat README

Overview

3GPP-RAG-chat is a weekend project for processing, analyzing, and chatting with ETSI and 3GPP standards documents entirely on a local machine. I used libraries for Retrieval-Augmented Generation (RAG) with Large Language Models (LLMs), namely Ollama and llama-index. The project requires an NVIDIA GPU with at least 12 GB of VRAM.

Quick Demo

demo.mp4

Installation

Ensure you have Python 3.x installed on your system. Then, follow these steps to install the necessary dependencies:

Llama-Index

Install PyMuPDF, ChromaDB, llama-index and its related packages using pip:

pip install llama-index PyMuPDF chromadb
pip install llama-index-core llama-index-readers-file llama-index-llms-ollama llama-index-embeddings-huggingface

Ollama

Install Ollama with its install script, then install the Ollama Python client with pip:

curl -fsSL https://ollama.com/install.sh | sh
pip install ollama
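
To confirm that the packages above ended up in the active Python environment, a small check like the following can help. It is only a sketch: the import names are assumptions (PyMuPDF is normally imported as `fitz`, and the llama-index packages share the `llama_index` namespace), and it only looks the modules up without importing them.

```python
from importlib.util import find_spec

# Import names are assumptions: PyMuPDF is imported as "fitz",
# and the llama-index packages share the "llama_index" namespace.
REQUIRED_MODULES = ["llama_index", "fitz", "chromadb", "ollama"]


def missing_modules(names):
    """Return the top-level module names that cannot be found."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules were found.")
```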

LLM Pull with Ollama

This project uses the 8-bit quantized (q8_0) Mistral 7B Instruct model. Use the ollama_pull.py script to pull it:

python3 ollama_pull.py -m mistral:7b-instruct-v0.2-q8_0
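
For readers curious what a pull script of this kind might look like, here is a minimal sketch. It is not the repository's ollama_pull.py: the function names are hypothetical, and it assumes the `ollama` Python client's `pull` function and an Ollama server running locally.

```python
import argparse


def parse_args(argv=None):
    """Parse the -m/--model option used in the command above."""
    parser = argparse.ArgumentParser(description="Pull a model with Ollama.")
    parser.add_argument("-m", "--model", required=True,
                        help="model tag, e.g. mistral:7b-instruct-v0.2-q8_0")
    return parser.parse_args(argv)


def pull_model(model):
    """Ask the local Ollama server to download the given model tag."""
    import ollama  # imported lazily so the parser works without the package

    ollama.pull(model)


def main(argv=None):
    """Parse the command line, pull the model, and report the tag."""
    args = parse_args(argv)
    pull_model(args.model)
    print(f"Pulled {args.model}")
```

Calling `main(["-m", "mistral:7b-instruct-v0.2-q8_0"])` would request the model used in this README. The download is several gigabytes, so the call can take a while.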

Usage

To start querying right away, download the prebuilt index data 3gpp_db from here and skip to the Querying Documents step. Alternatively, start from scratch and run the full document scraping and processing workflow:

Downloading Documents: Execute the download.py script to initiate the download of all PDF files from the ETSI website. Note that this process may take several hours to complete.
```
python download.py
```
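
As an illustration of the download step, the sketch below collects PDF links from an HTML page and saves each file locally. It is an assumption about how such a scraper could work, not the contents of download.py: the class and function names are hypothetical, the `downloaded_pdfs` folder name is assumed, and the page URL to crawl is left to the caller.

```python
from html.parser import HTMLParser
from pathlib import Path
from urllib.parse import urljoin, urlparse
from urllib.request import urlopen


class PdfLinkParser(HTMLParser):
    """Collect absolute URLs of links that end in .pdf."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if href and href.lower().endswith(".pdf"):
            self.links.append(urljoin(self.base_url, href))


def pdf_links(html, base_url):
    """Return every PDF link found in one HTML page."""
    parser = PdfLinkParser(base_url)
    parser.feed(html)
    return parser.links


def download_pdf(url, dest_dir="downloaded_pdfs"):
    """Save one PDF into dest_dir and return the local path."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    target = dest / Path(urlparse(url).path).name
    if not target.exists():  # skip files fetched on an earlier run
        with urlopen(url, timeout=60) as response:
            target.write_bytes(response.read())
    return target
```

Skipping files that already exist makes the script safe to restart, which matters for a job that runs for hours.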
Cleaning Documents: Run the clean.py script to remove any duplicate files and older versions of documents, ensuring that only the most relevant and up-to-date documents are kept for analysis. To save time, documents and folders are made available here.
```
python clean.py
```
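
The cleaning step can be pictured as two filters: keep only the newest version of each document, and drop byte-identical copies. The sketch below is an assumption, not the logic of clean.py. It presumes ETSI-style file names such as `ts_123501v150200p.pdf`, where the six digits after `v` encode the version, and the function names are hypothetical.

```python
import hashlib
import re
from pathlib import Path

# Assumed ETSI-style name: <document>v<6-digit version>p.pdf,
# for example ts_123501v150200p.pdf.
NAME_PATTERN = re.compile(r"^(?P<doc>.+)v(?P<version>\d{6})p\.pdf$", re.IGNORECASE)


def latest_versions(filenames):
    """Map each document id to the filename with the highest version."""
    latest = {}
    for name in filenames:
        match = NAME_PATTERN.match(name)
        if match is None:
            continue  # leave files with unexpected names out of the result
        doc, version = match["doc"].lower(), match["version"]
        # Fixed-width digit strings compare the same way as numbers.
        if doc not in latest or version > latest[doc][0]:
            latest[doc] = (version, name)
    return {doc: name for doc, (version, name) in latest.items()}


def unique_by_content(paths):
    """Drop byte-identical files, keeping the first path seen for each hash."""
    seen = {}
    for path in paths:
        digest = hashlib.sha256(Path(path).read_bytes()).hexdigest()
        seen.setdefault(digest, path)
    return list(seen.values())
```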
Indexing Documents: Use the index.py script to index all documents. This step is crucial for efficiently searching and retrieving information from the documents later on. To save time, the index is available here.
```
python index.py
```
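
For orientation, indexing with llama-index and ChromaDB could look roughly like the sketch below. It is not the repository's index.py. The folder names, the collection name `3gpp`, and the embedding model `BAAI/bge-small-en-v1.5` are all assumptions, and the Chroma integration additionally needs the `llama-index-vector-stores-chroma` package, which is not in the install list above.

```python
from pathlib import Path


def list_pdfs(folder):
    """Return the PDF files under folder, sorted for a repeatable order."""
    return sorted(
        p for p in Path(folder).rglob("*")
        if p.is_file() and p.suffix.lower() == ".pdf"
    )


def build_index(pdf_dir="unique_pdfs", db_dir="3gpp_db", collection="3gpp"):
    """Embed every PDF in pdf_dir and persist the vectors in a Chroma store."""
    # Third-party imports stay inside the function so list_pdfs works alone.
    import chromadb
    from llama_index.core import SimpleDirectoryReader, StorageContext, VectorStoreIndex
    from llama_index.embeddings.huggingface import HuggingFaceEmbedding
    from llama_index.vector_stores.chroma import ChromaVectorStore

    files = [str(p) for p in list_pdfs(pdf_dir)]
    documents = SimpleDirectoryReader(input_files=files).load_data()

    # Assumed layout: one persistent Chroma collection on local disk.
    client = chromadb.PersistentClient(path=db_dir)
    store = ChromaVectorStore(chroma_collection=client.get_or_create_collection(collection))
    storage = StorageContext.from_defaults(vector_store=store)

    # Assumed embedding model; any Hugging Face embedding model could be used.
    embed_model = HuggingFaceEmbedding(model_name="BAAI/bge-small-en-v1.5")
    return VectorStoreIndex.from_documents(
        documents, storage_context=storage, embed_model=embed_model, show_progress=True
    )
```

Because the vectors live in a persistent Chroma collection, the expensive embedding pass only has to run once; later sessions can reopen the same folder.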
Querying Documents: Finally, execute the query.py script to start querying the indexed documents. This allows you to search for specific information within the vast collection of ETSI and 3GPP documents.
```
python query.py
```
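
A query session over a persisted index might be structured as in the sketch below. Again, this is an assumption rather than the contents of query.py: the collection name `3gpp`, the embedding model, the timeout, and `top_k` are illustrative choices, the function names are hypothetical, and the `llama-index-vector-stores-chroma` package is assumed to be installed. The embedding model must match the one used when the index was built.

```python
def build_query_engine(db_dir="3gpp_db", collection="3gpp",
                       model="mistral:7b-instruct-v0.2-q8_0", top_k=5):
    """Open the persisted Chroma store and wrap it in a query engine."""
    import chromadb
    from llama_index.core import VectorStoreIndex
    from llama_index.embeddings.huggingface import HuggingFaceEmbedding
    from llama_index.llms.ollama import Ollama
    from llama_index.vector_stores.chroma import ChromaVectorStore

    client = chromadb.PersistentClient(path=db_dir)
    store = ChromaVectorStore(chroma_collection=client.get_or_create_collection(collection))
    # Assumed embedding model; it has to be the same one used for indexing.
    index = VectorStoreIndex.from_vector_store(
        store, embed_model=HuggingFaceEmbedding(model_name="BAAI/bge-small-en-v1.5")
    )
    llm = Ollama(model=model, request_timeout=300.0)
    return index.as_query_engine(llm=llm, similarity_top_k=top_k)


def chat_loop(ask, questions):
    """Yield (question, answer) pairs, skipping blanks and stopping at 'exit'."""
    for question in questions:
        question = question.strip()
        if question.lower() == "exit":
            break
        if question:
            yield question, str(ask(question))
```

Keeping the loop separate from the engine means it can be driven by any callable: pass `build_query_engine().query` as `ask` for real answers, or a stub function when testing without a GPU.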

TODO

Wrap the code within a Dockerfile for containerized deployment.
Improve the retrieval process, possibly integrating late interaction mechanisms such as ColBERT for enhanced efficiency and accuracy.
Add UI, possibly with Gradio or Streamlit.

Contribution

Contributions are welcome! If you'd like to improve the project or suggest new features, please feel free to submit a pull request or open an issue.
