Skip to content

ci: Fix L0_batch related flaky tests #7999

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

yinggeh
Copy link
Contributor

@yinggeh yinggeh commented Feb 10, 2025

What does the PR do?

Set looser time requirement and execution counts to L0_batcher* tests. The actual response time and execution counts are highly dependent to the request arrival time.

Checklist

  • PR title reflects the change and is of format <commit_type>: <Title>
  • Changes are described in the pull request.
  • Related issues are referenced.
  • Populated github labels field
  • Added test plan and verified test passes.
  • Verified that the PR passes existing CI.
  • Verified copyright is correct on all changed files.
  • Added succinct git squash message before merging ref.
  • All template sections are filled out.
  • Optional: Additional screenshots for behavior/output changes with before/after.

Commit Type:

Check the conventional commit type
box here and add the label to the github PR.

  • ci

Related PRs:

Where should the reviewer start?

Test plan:

L0_batcher--base
L0_batcher_shm--base
L0_batcher_cudashm--base

  • CI Pipeline ID:
    23748413

Caveats:

Background

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

  • closes GitHub issue: #xxx

@yinggeh yinggeh added the PR: ci Changes to our CI configuration files and scripts label Feb 10, 2025
@yinggeh yinggeh self-assigned this Feb 10, 2025
Comment on lines -1945 to +1948
# and the remaining requests will form the second batch.
# and the remaining requests will either form the second batch
# or more batches depending on their arrival time.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you explain why the originally intended behavior is not consistent and needs to be relaxed here?

@@ -1895,7 +1897,7 @@ def test_preferred_batch_only_use_no_preferred_size(self):
def test_max_queue_delay_only_non_default(self):
# Send 12 requests with batch size 1. The max_queue_delay is set
# to non-zero. Depending upon the timing of the requests arrival
# there can be either 1 or 2 model executions.
# there can be either 1 or multiple model executions.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you explain why the originally intended behavior is not consistent and needs to be relaxed here? (different for zero and non-zero queue delay scenarios?)

@@ -619,13 +619,15 @@ def test_multi_batch_not_preferred_different_shape(self):
},
)
)
# Add some delay to ensure the first two requests arrive before the third
time.sleep(2)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should this 2 seconds be some function of the _max_queue_delay_ms instead?

@yinggeh yinggeh changed the title ci: Fix flaky L0_batch related tests ci: Fix L0_batch related flaky tests Feb 20, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
PR: ci Changes to our CI configuration files and scripts
Development

Successfully merging this pull request may close these issues.

2 participants