You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CodeTrans/README_miscellaneous.md
+35Lines changed: 35 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -44,3 +44,38 @@ Some HuggingFace resources, such as some models, are only accessible if the deve
44
44
45
45
2. (Docker only) If all microservices work well, check the port ${host_ip}:7777, the port may be allocated by other users, you can modify the `compose.yaml`.
46
46
3. (Docker only) If you get errors like "The container name is in use", change container name in `compose.yaml`.
47
+
48
+
## Monitoring OPEA Services with Prometheus and Grafana Dashboard
49
+
50
+
OPEA microservice deployment can easily be monitored through Grafana dashboards using data collected via Prometheus. Follow the [README](https://github.com/opea-project/GenAIEval/blob/main/evals/benchmark/grafana/README.md) to setup Prometheus and Grafana servers and import dashboards to monitor the OPEA services.
> NOTE: This feature is disabled by default. Please use the compose.telemetry.yaml file to enable this feature.
58
+
59
+
OPEA microservice and [TGI](https://huggingface.co/docs/text-generation-inference/en/index)/[TEI](https://huggingface.co/docs/text-embeddings-inference/en/index) serving can easily be traced through [Jaeger](https://www.jaegertracing.io/) dashboards in conjunction with [OpenTelemetry](https://opentelemetry.io/) Tracing feature. Follow the [README](https://github.com/opea-project/GenAIComps/tree/main/comps/cores/telemetry#tracing) to trace additional functions if needed.
60
+
61
+
Tracing data is exported to http://{EXTERNAL_IP}:4318/v1/traces via Jaeger.
62
+
Users could also get the external IP via below command.
63
+
64
+
```bash
65
+
ip route get 8.8.8.8 | grep -oP 'src \K[^ ]+'
66
+
```
67
+
68
+
Access the Jaeger dashboard UI at http://{EXTERNAL_IP}:16686
69
+
70
+
For TGI serving on Gaudi, users could see different services like opea, TEI and TGI.
71
+

72
+
73
+
Here is a screenshot for one tracing of TGI serving request.
74
+

75
+
76
+
There are also OPEA related tracings. Users could understand the time breakdown of each service request by looking into each opea:schedule operation.
There could be asynchronous function such as `llm/MicroService_asyn_generate` and user needs to check the trace of the asynchronous function in another operation like
0 commit comments