Skip to content

Conversation

@bjhargrave
Copy link
Contributor

Title

Fix handling of non-terminal Replicate prediction states

Relevant issues

Fixes #16630

Pre-Submission checklist

Please complete all items before asking a LiteLLM maintainer to review your PR

  • I have Added testing in the tests/litellm/ directory, Adding at least 1 test is a hard requirement - see details
  • I have added a screenshot of my new test passing locally
  • My PR passes all unit tests on make test-unit
  • My PR's scope is as isolated as possible, it only solves 1 specific problem

Type

🐛 Bug Fix

Changes

The test for "processing" is replaced by a test for not one of the terminal states since there can be other non-terminal states like "starting".

Also, the retry limit is removed since, depending on the specific model and the state (warm/cold) it can take several attempts for the prediction to report a terminal state.

@vercel
Copy link

vercel bot commented Nov 19, 2025

@bjhargrave is attempting to deploy a commit to the CLERKIEAI Team on Vercel.

A member of the Team first needs to authorize it.

@CLAassistant
Copy link

CLAassistant commented Nov 19, 2025

CLA assistant check
All committers have signed the CLA.

Fixes BerriAI#16630

Signed-off-by: BJ Hargrave <hargrave@us.ibm.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug]: Replicate chat handler fails for non-terminal states (e.g., 'starting')

2 participants