Skip to content

Commit

Permalink
Update README to align with branding guidelines (NVIDIA#182)
Browse files Browse the repository at this point in the history
  • Loading branch information
jayrodge authored Sep 2, 2024
1 parent 06f9905 commit 0b9cbd9
Show file tree
Hide file tree
Showing 3 changed files with 4 additions and 4 deletions.
2 changes: 1 addition & 1 deletion community/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -45,7 +45,7 @@ Community examples are sample code and deployments for RAG pipelines that are no

* [NVIDIA Multimodal RAG Assistant](./multimodal_assistant)

This example is able to ingest PDFs, PowerPoint slides, Word and other documents with complex data formats including text, images, slides and tables, orchestrated with Langchain. It allows users to ask questions through a text interface and optionally with an image query, and it can respond with text and reference images, slides and tables in its response, along with source links and downloads. Refer to this [example](./multimodal-rag) for the LlamaIndex version that uses [integration](https://docs.llamaindex.ai/en/stable/examples/llm/nvidia_nim/) with NVIDIA Inference Microservices (NIMs) of the Multimodal RAG Assistant.
This example is able to ingest PDFs, PowerPoint slides, Word and other documents with complex data formats including text, images, slides and tables. It allows users to ask questions through a text interface and optionally with an image query, and it can respond with text and reference images, slides and tables in its response, along with source links and downloads. Refer to this [example](./multimodal-rag) for the LlamaIndex version that uses [integration](https://docs.llamaindex.ai/en/stable/examples/llm/nvidia_nim/) with NIM microservices of the Multimodal RAG Assistant.

* [NVIDIA Developer RAG Chatbot](./rag-developer-chatbot)

Expand Down
4 changes: 2 additions & 2 deletions community/multimodal-rag/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,13 +4,13 @@

This Streamlit application implements a Multimodal Retrieval-Augmented Generation (RAG) system. It processes various types of documents including text files, PDFs, PowerPoint presentations, and images. The app leverages Large Language Models and Vision Language Models to extract and index information from these documents, allowing users to query the processed data through an interactive chat interface.

The system utilizes LlamaIndex for efficient indexing and retrieval of information, NVIDIA Inference Microservices (NIMs) for high-performance inference capabilities, and Milvus as a vector database for efficient storage and retrieval of embedding vectors. This combination of technologies enables the application to handle complex multimodal data, perform advanced queries, and deliver rapid, context-aware responses to user inquiries.
The system utilizes LlamaIndex for efficient indexing and retrieval of information, NIM microservices for high-performance inference capabilities, and Milvus as a vector database for efficient storage and retrieval of embedding vectors. This combination of technologies enables the application to handle complex multimodal data, perform advanced queries, and deliver rapid, context-aware responses to user inquiries.

## Features

- **Multi-format Document Processing**: Handles text files, PDFs, PowerPoint presentations, and images.
- **Advanced Text Extraction**: Extracts text from PDFs and PowerPoint slides, including tables and embedded images.
- **Image Analysis**: Uses a VLM (NeVA) to describe images and Google's DePlot for processing graphs/charts on NVIDIA Inference Microservices (NIMs).
- **Image Analysis**: Uses a VLM (NeVA) to describe images and Google's DePlot for processing graphs/charts on NIM microservices.
- **Vector Store Indexing**: Creates a searchable index of processed documents using Milvus vector store.
- **Interactive Chat Interface**: Allows users to query the processed information through a chat-like interface.

Expand Down
2 changes: 1 addition & 1 deletion community/multimodal-rag/app.py
Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,7 @@

# Initialize settings
def initialize_settings():
Settings.embed_model = NVIDIAEmbedding(model="NV-Embed-QA", truncate="END")
Settings.embed_model = NVIDIAEmbedding(model="nvidia/nv-embedqa-e5-v5", truncate="END")
Settings.llm = NVIDIA(model="meta/llama-3.1-70b-instruct")
Settings.text_splitter = SentenceSplitter(chunk_size=600)

Expand Down

0 comments on commit 0b9cbd9

Please sign in to comment.