Support for async requests or webhooks #348

Open
brian316 opened this issue Oct 29, 2024 · 2 comments
Labels: enhancement (New feature or request)

Comments

brian316 commented Oct 29, 2024


🚀 Feature

Is there currently a way to run inference and get the result by polling or a webhook callback?

Motivation

If an inference takes a long time to run, say 30 minutes, you don't want the connection to drop, because you would have to rerun the inference; this is especially a risk on a shaky connection.

Pitch

A webhook callback or polling method would let us run inference as a background task keyed by a UUID (or an identifier the user supplies), so they can come back later for the result. It would also help integrations, since inference would no longer be a blocking request.

Alternatives

Cog from Replicate implements a webhook callback.

brian316 added the enhancement (New feature or request) label on Oct 29, 2024
@aniketmaurya (Collaborator)

@brian316 this is a great feature! You can already do this with LitServe 😄

Here are the steps:

  1. Send the webhook endpoint in the request body.
  2. Use a callback and define hooks at on_after_decode_request, on_after_predict, and on_after_encode_response.
  3. Send the current status of the request/response from each hook/stage (a sketch follows below).
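
A minimal sketch of steps 1–3, assuming the single `lit_api` hook argument from the LitServe callback docs; the idea of stashing the webhook URL on the API instance, the `MyLitAPI` placeholder, and the exact hook signatures are illustrative assumptions, not an official pattern:

```python
import litserve as ls
import requests  # assumed available for posting status updates


class WebhookCallback(ls.Callback):
    # Hook names come from the steps above; the single `lit_api`
    # argument is an assumption -- check the LitServe callback docs.
    def on_after_decode_request(self, lit_api):
        self._notify(lit_api, "decoded")

    def on_after_predict(self, lit_api):
        self._notify(lit_api, "predicted")

    def on_after_encode_response(self, lit_api):
        self._notify(lit_api, "done")

    def _notify(self, lit_api, stage):
        # Step 1: decode_request stashed the webhook URL that the client
        # sent in the request body onto the API instance (illustrative).
        url = getattr(lit_api, "webhook_url", None)
        if url:
            requests.post(url, json={"status": stage}, timeout=5)


# MyLitAPI is a placeholder for your ls.LitAPI implementation.
server = ls.LitServer(MyLitAPI(), callbacks=[WebhookCallback()])
```

Note the client still holds the HTTP connection open until the response is encoded; the webhook only reports progress along the way.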

@brian316 (Author)

I've looked at the documentation and saw those handy hooks, but wouldn't the initial API request still wait until the server returns something? I was thinking of returning something immediately while the task keeps running in the background: an id the user can poll (for the polling method), or a success message (for the webhook method). I guess something that could work is returning immediately from the predict function and running a thread in the background, but that feels hacky (rough sketch below). Having async tasks is important for my type of workload.
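
For reference, a rough sketch of that workaround. Everything here is illustrative (the in-memory job store, `run_model`), and the big caveat is that the store lives in one worker's memory: with multiple worker processes a poll can land on a worker that has never seen the job, so a shared store (e.g. Redis) would be needed in practice.

```python
import threading
import time
import uuid

import litserve as ls


class AsyncJobAPI(ls.LitAPI):
    def setup(self, device):
        self.jobs = {}  # job_id -> result, or None while still running

    def decode_request(self, request):
        return request  # pass the raw dict through

    def predict(self, request):
        if "job_id" in request:
            # Polling branch: report the status of an existing job.
            job_id = request["job_id"]
            result = self.jobs.get(job_id)
            return {"job_id": job_id, "done": result is not None, "result": result}

        # Submit branch: kick off the long task and return an id immediately.
        job_id = str(uuid.uuid4())
        self.jobs[job_id] = None

        def run():
            self.jobs[job_id] = self.run_model(request["input"])

        threading.Thread(target=run, daemon=True).start()
        return {"job_id": job_id, "done": False}

    def run_model(self, x):
        # Placeholder for the actual long-running inference.
        time.sleep(30)
        return {"prediction": x}

    def encode_response(self, output):
        return output


if __name__ == "__main__":
    ls.LitServer(AsyncJobAPI()).run(port=8000)
```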
