Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add inference request cancellation APIs (#249)
* Fix state transitions for re-running requests (#251) * Add backend/server APIs * Implement the cancellation APIs * Only store the state in response factory * Add unit testing for request cancellation * Add test * Add cancellation status * Add testing for cancelling a request after release * Handle request re-use * Enable request reuse test * Add staged changes * Add temporary fix for the request state bug --------- Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
- Loading branch information