# Finance Agent

The Finance Agent example showcases a hierarchical multi-agent system designed to assist users with financial document processing and analysis. It provides three main functionalities: summarizing lengthy financial documents, answering queries related to financial documents, and conducting research to generate investment reports on public companies.
Users interact with the system via a graphical user interface (UI), where requests are managed by a supervisor agent that delegates tasks to worker agents or the summarization microservice. The system supports document uploads through the UI for processing.
## Problem Motivation
Navigating and analyzing extensive financial documents can be challenging and time-consuming. Users often require concise summaries, answers to specific queries, or comprehensive investment reports. The Finance Agent addresses these needs by automating document summarization, query answering, and research tasks, thereby enhancing productivity and decision-making efficiency.
## Architecture
### High-Level Diagram
The Finance Agent system is structured as a hierarchical multi-agent architecture. User interactions are managed by a supervisor agent, which coordinates tasks among worker agents and the summarization microservice. The system supports document uploads and processing through the UI.
The architecture of this Finance Agent example is shown in the figure below. The agent is a hierarchical multi-agent system and has 3 main functions:

1. Summarize long financial documents.
2. Answer questions about financial documents.
3. Conduct research on a public company and generate an investment report.

The user interacts with the supervisor agent through the graphical UI. The supervisor agent delegates tasks to worker agents or to the summarization microservice to fulfill each request.
### Dataprep Microservice

The architectural diagram of the `dataprep` microservice is shown below. We use [docling](https://github.com/docling-project/docling) to extract text from PDFs and URLs into markdown format. Both the full document content and the tables are extracted. We then use an LLM to extract metadata from the document, including the company name, year, quarter, document type, and document title. The full document markdown is then chunked, an LLM summarizes each chunk, and the summaries are embedded and saved to a vector database. Each table is also summarized by the LLM, and the table summaries are embedded and saved to the vector database. The chunks and tables are additionally saved to a KV store. The pipeline is designed this way to improve the retrieval accuracy of the `search_knowledge_base` tool used by the Question Answering worker agent.
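Documents can also be ingested without going through the UI by calling the dataprep microservice directly. The sketch below is illustrative only: it assumes the dataprep service is exposed on port 6007 (as in the Docker Compose deployment later in this document), and the `/v1/dataprep/ingest` route and field names are assumptions borrowed from typical GenAIComps dataprep services; check the dataprep API for the exact path and payload.

```bash
# Illustrative only: the route and field names are assumptions; verify against
# the dataprep microservice's API before use.

# Ingest a local PDF into the knowledge base.
curl -X POST "http://${ip_address}:6007/v1/dataprep/ingest" \
  -F "files=@./2024_annual_report.pdf"

# Ingest a document from a URL.
curl -X POST "http://${ip_address}:6007/v1/dataprep/ingest" \
  -H "Content-Type: application/json" \
  -d '{"link_list": ["https://www.example.com/quarterly-report"]}'
```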
## Deployment Options
This Finance Agent example can be deployed manually on various hardware platforms using Docker Compose or Kubernetes; select the appropriate guide for your target environment. A guide for deploying on Intel Gaudi HPUs with Docker Compose is included below.
_Note: Building custom microservice images can be done using the resources in [GenAIComps](https://github.com/opea-project/GenAIComps)._
## Contribution
We welcome contributions to the OPEA project. Please refer to the contribution guidelines for more information.
# Deploy Finance Agent on Intel Gaudi HPU with Docker Compose
This README provides instructions for deploying the Finance Agent application using Docker Compose on systems equipped with Intel Gaudi HPUs.
## Table of Contents
- [Overview](#overview)
- [Prerequisites](#prerequisites)
- [Start Deployment](#start-deployment)
- [Validate Services](#validate-services)
- [Accessing the User Interface (UI)](#accessing-the-user-interface-ui)
## Overview
This guide focuses on running the pre-configured Finance Agent service using Docker Compose on Intel Gaudi HPUs. It leverages containers optimized for Gaudi for the LLM serving component, along with CPU-based containers for other microservices like embedding, retrieval, data preparation and the UI.
## Prerequisites
- Docker and Docker Compose installed.
- Intel Gaudi HPU(s) with the necessary drivers and software stack installed on the host system. (Refer to Intel Gaudi Documentation).
- Git installed (for cloning the repository).
- Hugging Face Hub API Token (for downloading models).
- Access to the internet (or a private model cache).
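A quick way to confirm the prerequisites above are in place is to run a few checks on the host. This is only a sketch; the exact commands (notably `hl-smi` for the Gaudi stack) and the token variable name depend on your installation and on what `set_env.sh` expects.

```bash
# Sanity checks for the prerequisites (illustrative; adjust to your environment).
docker --version          # Docker installed
docker compose version    # Docker Compose plugin installed
git --version             # Git installed
hl-smi                    # Should list the Gaudi devices if the driver stack is installed

# Make your Hugging Face token available for model downloads
# (the exact variable name expected by set_env.sh may differ).
export HF_TOKEN=<your-huggingface-hub-token>
```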
## Start Deployment

Configure proxy settings if necessary, then source the environment setup script:

```bash
# export no_proxy="localhost,127.0.0.1,${HOST_IP}" # Add other hosts if necessary

source ../../set_env.sh
```
Note: The compose file may read additional variables from `set_env.sh`. Ensure all required variables, such as the service ports (`LLM_SERVICE_PORT`, `TEI_EMBEDDER_PORT`, etc.), are set if you are not using the defaults from the compose file. For instance, you can edit `set_env.sh` to change the LLM model.
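A minimal sketch of such an override (the variable name `LLM_MODEL_ID` and the model shown are assumptions; open `set_env.sh` to see the exact variable it defines):

```bash
# Illustrative only: confirm the actual variable name and default in set_env.sh.
export LLM_MODEL_ID="meta-llama/Llama-3.3-70B-Instruct"
```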
## Validate Services

Wait several minutes for models to download and services to initialize (Gaudi initialization can take time). Check the container logs with `docker compose logs -f <service_name>`, especially for `vllm-gaudi-server`.
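Once the containers report healthy, a simple liveness check is to query the vLLM server's OpenAI-compatible model listing. This is a sketch that assumes the service is reachable on the host at `LLM_SERVICE_PORT` (the port variable mentioned in the note above); adjust the host and port to your deployment.

```bash
# List the models served by vllm-gaudi-server via its OpenAI-compatible API.
# HOST_IP and LLM_SERVICE_PORT are assumed to match the values set in set_env.sh.
curl "http://${HOST_IP}:${LLM_SERVICE_PORT}/v1/models"
```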
## Accessing the User Interface (UI)

The UI microservice is launched in the previous step along with the other microservices.
To access the UI, open a web browser and navigate to `http://${ip_address}:5175`. Note that `ip_address` here is the host IP of the UI microservice.
1. Create an admin account (any random values will do).
2. Enter the endpoints in the `Connections` settings
First, click on the user icon in the upper right corner to open `Settings`. Click on `Admin Settings`. Click on `Connections`.
Then, enter the supervisor agent endpoint in the `OpenAI API` section: `http://${ip_address}:9090/v1`. Enter the API key as "empty". Add an arbitrary model ID in `Model IDs`, for example, "opea_agent". The `ip_address` here should be the host IP of the agent microservice.
Then, enter the dataprep endpoint in the `Icloud File API` section. You first need to enable the `Icloud File API` by clicking the toggle on the right so that it turns green, and then enter the endpoint URL, for example, `http://${ip_address}:6007/v1`. The `ip_address` here should be the host IP of the dataprep microservice.
You should see a screen like the screenshot below when the settings are done.
Click on the `Workplace` icon in the top left corner. Click `Knowledge`. Click on the "+" sign to the right of `Icloud Knowledge`. You can paste a URL on the left-hand side of the pop-up window, or upload a local file by clicking on the cloud icon on the right-hand side of the pop-up window. Then click on the `Upload Confirm` button. Wait until processing is done; the pop-up window will close on its own when the data ingestion is complete. See the screenshot below.
Note: the data ingestion may take a few minutes depending on the length of the document. Please wait patiently and do not close the pop-up window.
After the settings are done and documents are ingested, you can start asking the agent questions. Click on the `New Chat` icon in the top left corner, and type your questions in the text box in the middle of the UI.
The UI will stream the agent's response tokens. Expand the `Thinking` tab to see the agent's reasoning process. After the agent makes tool calls, you will also see the tool output once the tool returns it to the agent. Note: it may take a while for the tool output to come back if the tool execution takes time.
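Because the UI treats the supervisor agent endpoint as an OpenAI-compatible API, you can also query the agent directly from the command line. The sketch below assumes the standard OpenAI-style chat completions route and reuses the example model ID from the `Connections` step; adjust the IP, route, and payload to your deployment.

```bash
# Query the supervisor agent's OpenAI-compatible endpoint directly (illustrative).
# -N disables buffering so streamed tokens are printed as they arrive.
curl -N "http://${ip_address}:9090/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -d '{
        "model": "opea_agent",
        "messages": [{"role": "user", "content": "What was the company revenue growth in the latest quarter?"}],
        "stream": true
      }'
```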