how does `default_response_timeout` work?

### 📚 The doc issue

I set the value of `default_response_timeout` to 4 i.e. 4 seconds. At the start of the model load, this happens after 4 (ish) seconds:
```
org.pytorch.serve.wlm.WorkerInitializationException: Backend worker did not respond in given time
```

My guess is because the model takes a while to load (more than 4 seconds), the worker gets killed.
Is there a way to set a larger initial delay i.e. differentiate these two scenarios:
* account for the initial model load with a number different from `default_response_timeout`
* if model doesn't response in `default_response_timeout` after the initial load, then kill the worker

### Suggest a potential alternative/fix

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

how does `default_response_timeout` work? #2452

📚 The doc issue

Suggest a potential alternative/fix

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

how does default_response_timeout work? #2452

Description

📚 The doc issue

Suggest a potential alternative/fix

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

how does `default_response_timeout` work? #2452