Skip to content
This repository has been archived by the owner on Oct 11, 2024. It is now read-only.

Benchmarks : Prune nightly benchmarks #150

Merged
merged 3 commits into from
Mar 26, 2024

Conversation

varun-sundar-rabindranath

Summary:
The 2024-03-25 nightly benchmarks failed due to performance regressions.
We find that this is either due to,

Updates in this PR:

  • Serving case : Remove the 3000 num prompts at 10 qps experiments.
  • Serving case : Mark the p90, p99 statistics as "Observation" metrics so they dont trigger failure.
  • Engine case (benchmark_throughput.py) : Remove the 16 and 32 prefill cases.

Test:
Some local testing

@varun-sundar-rabindranath varun-sundar-rabindranath merged commit 8894487 into main Mar 26, 2024
2 checks passed
@varun-sundar-rabindranath varun-sundar-rabindranath deleted the varun/prune-nightly-benchmarks branch March 26, 2024 23:45
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants