Description
There are a lot of minor improvements that can be made to the JSON output of guidellm. For example:
- Deduplicate the prompt field for each request
- Flatten some field structures (e.g. `"decode_times": { "data": [] }` -> `"decode_times": []`)
- Label percentiles (e.g. `"request_latency_percentiles": [ 1, 2, ... ]` -> `"request_latency_percentiles": { "p01": 1, "p05": 2, ... }`)
- Give max and min with all percentiles
- Drop the concurrency timestamps (possibly replacing them with percentiles/min/max/mean)
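
To make the proposed shapes concrete, here is a rough Python sketch of the flattening, percentile labeling, and min/max/mean summary ideas from the list above. It is only an illustration and not guidellm code: the helper names and the `PERCENTILE_RANKS` tuple are assumptions made up for this example, and the real set of percentiles guidellm reports may differ.

```python
from statistics import mean

# Assumed percentile ranks, for illustration only; guidellm's actual set may differ.
PERCENTILE_RANKS = (1, 5, 10, 25, 50, 75, 90, 95, 99)


def flatten_data_wrappers(obj):
    """Recursively replace {"data": [...]} wrappers with the bare list."""
    if isinstance(obj, dict):
        # A dict whose only key is "data" holding a list is treated as a wrapper.
        if set(obj) == {"data"} and isinstance(obj["data"], list):
            return [flatten_data_wrappers(item) for item in obj["data"]]
        return {key: flatten_data_wrappers(value) for key, value in obj.items()}
    if isinstance(obj, list):
        return [flatten_data_wrappers(item) for item in obj]
    return obj


def label_percentiles(values, ranks=PERCENTILE_RANKS):
    """Turn a positional percentile list into a dict keyed "p01", "p05", ..."""
    if len(values) != len(ranks):
        raise ValueError("expected one value per percentile rank")
    return {f"p{rank:02d}": value for rank, value in zip(ranks, values)}


def summarize(samples):
    """Reduce raw samples to min/max/mean, e.g. in place of per-timestamp entries."""
    return {"min": min(samples), "max": max(samples), "mean": mean(samples)}
```

For example, `flatten_data_wrappers({"decode_times": {"data": [1, 2]}})` yields `{"decode_times": [1, 2]}`, and `label_percentiles([1, 2], ranks=(1, 5))` yields `{"p01": 1, "p05": 2}`. Zero-padded keys keep the labels in sorted order when the JSON is viewed alphabetically.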