RAG application with Contextual Retrieval and GraphRAG, targeted at agriculture needs.
Example of the graph generated by the GraphRAG pipeline.
This solution is built to help small business owners provide customer support for their products.
With this tool, business owners are partially relieved of answering common user questions that could be solved by simply reading the manual.
As is, this solution is not necessarily tied to agriculture, and the code is kept as general as possible.
Still, the project remains framed around agriculture, as it will grow to address the needs of agricultural users as they arise.
- Access through FastAPI (see the request sketch after this list).
- Support for:
  - Addition of new files.
  - Updates of preexisting files.
  - Deletion of deprecated files.
- Support for these file types:
  - PDF and Markdown.
- Two different strategies for retrieving information (see the query sketch after this list):
  - Contextual Retrieval RAG, useful for answering specific questions.
  - GraphRAG retrieval, useful for answering broad questions that require knowledge spread across several documents.
- Focus on leveraging Small Language Models (SLMs), as they are less hardware intensive and can therefore be deployed more widely; being less capable also makes them safer to use.
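
Below is a minimal sketch of how document management could look from a client's perspective. The base URL, the `/documents` routes, and the payload shapes are hypothetical and shown only for illustration (the actual FastAPI routes are defined in this repository's source); it also assumes the `requests` package is installed.

```python
import requests

# Hypothetical base URL of the locally running FastAPI service.
BASE_URL = "http://localhost:8000"

# Add a new file (hypothetical endpoint name).
with open("manuals/irrigation-pump.pdf", "rb") as f:
    resp = requests.post(f"{BASE_URL}/documents", files={"file": f})
    print(resp.status_code, resp.json())

# Update a pre-existing file (hypothetical endpoint name).
with open("manuals/irrigation-pump.pdf", "rb") as f:
    resp = requests.put(f"{BASE_URL}/documents/irrigation-pump.pdf", files={"file": f})
    print(resp.status_code)

# Delete a deprecated file (hypothetical endpoint name).
resp = requests.delete(f"{BASE_URL}/documents/irrigation-pump.pdf")
print(resp.status_code)
```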
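
Similarly, a question could be answered with either retrieval strategy. The `/query` route and the `strategy` field are hypothetical; the sketch only illustrates the two modes described above.

```python
import requests

BASE_URL = "http://localhost:8000"  # hypothetical address of the FastAPI service

# Specific question: contextual retrieval is a good fit.
specific = requests.post(
    f"{BASE_URL}/query",
    json={"question": "What is the recommended watering interval for tomatoes?",
          "strategy": "contextual"},
)
print(specific.json())

# Broad question spanning several documents: GraphRAG is a better fit.
broad = requests.post(
    f"{BASE_URL}/query",
    json={"question": "Summarize the maintenance procedures common to all the equipment manuals.",
          "strategy": "graphrag"},
)
print(broad.json())
```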
- Clone the repository.

  ```bash
  git clone https://github.com/maticas-org/rag-assistant.git
  ```
- Install the dependencies.

  ```bash
  cd rag-assistant
  uv sync
  ```

  This will install the dependencies required for extracting paragraphs from the documents using Tesseract. If you are on a Linux machine, you will also need to install the following system packages:

  ```bash
  sudo apt-get install libleptonica-dev tesseract-ocr tesseract-ocr-eng tesseract-ocr-script-latn
  ```
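
  As an optional sanity check that the Tesseract binary is visible before running the pipeline, you can use a short standard-library snippet like the sketch below (the binary name `tesseract` is the usual default):

  ```python
  import shutil
  import subprocess

  # Locate the tesseract binary on PATH; None means the OCR step will fail.
  tesseract_path = shutil.which("tesseract")
  print("tesseract found at:", tesseract_path)

  if tesseract_path:
      # Print the installed version for reference.
      subprocess.run([tesseract_path, "--version"], check=False)
  ```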
- Get Ollama, the model you want to use, and the embeddings model. For example, to get the `llama3.2:3b` model and the `nomic-embed-text` model, you can run the following commands:

  ```bash
  curl -fsSL https://ollama.com/install.sh | sh
  ollama pull llama3.2:3b
  ollama pull nomic-embed-text
  ```
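
  To confirm that Ollama is serving both models, you can query its local HTTP API (by default on port 11434). This is only a sketch and assumes the `requests` package is available:

  ```python
  import requests

  OLLAMA_URL = "http://localhost:11434"  # Ollama's default local address

  # Ask the chat/completion model for a short answer.
  gen = requests.post(
      f"{OLLAMA_URL}/api/generate",
      json={"model": "llama3.2:3b", "prompt": "Say hello in one word.", "stream": False},
  )
  print(gen.json().get("response"))

  # Ask the embeddings model for a vector and check its dimensionality.
  emb = requests.post(
      f"{OLLAMA_URL}/api/embeddings",
      json={"model": "nomic-embed-text", "prompt": "test sentence"},
  )
  print(len(emb.json().get("embedding", [])))
  ```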
- Change the configuration file if needed. The local configuration is `config-local.yaml`, shown below; there is also a `config-aws.yaml` for the AWS configuration.

  ```yaml
  # config-local.yaml
  backend:
    llm:
      default:
        provider: "ollama"
        model_name: "llama3.2:3b"
        parameters:
          temperature: 0.1
      semantic_grouping:
        provider: "ollama"
        model_name: "llama3.2:1b"
        parameters:
          temperature: 0.1
    embeddings:
      provider: "ollama"
      model_name: "nomic-embed-text"
    vector_db:
      provider: "opensearch"
      host: "localhost"
  ```
  The `llm` provider is the one used to generate the summaries, entity type identification, entity extraction, and relation generation for the documents. Supported providers are:

  - `aws`: AWS through Bedrock.
  - `ollama`: Ollama is an open-source project that serves as a powerful and user-friendly platform for running LLMs on your local machine.
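
  A short sketch of how the configuration could be read from Python, assuming PyYAML is installed (the key paths mirror the layout shown above, so adjust them if your file differs):

  ```python
  import yaml

  # Load the local configuration file.
  with open("config-local.yaml") as f:
      config = yaml.safe_load(f)

  backend = config["backend"]

  # Default LLM used for summaries, entity extraction, and relation generation.
  default_llm = backend["llm"]["default"]
  print(default_llm["provider"], default_llm["model_name"])

  # Embeddings model and vector database settings.
  print(backend["embeddings"]["model_name"])
  print(backend["vector_db"]["provider"], backend["vector_db"]["host"])
  ```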
- The project is currently under development, so you can run the following command to start the pipeline, which will process the files in the `data` folder. Make sure to comment out certain lines in the `main.py` file to avoid reprocessing the files.

  ```bash
  uv run main.py
  ```