forked from triton-inference-server/server
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add mem_type parameter in SetFixedSizeOutputBuffer. Modify contexts t…
…o use SetFixedSizeOutputBuffer (triton-inference-server#559) * Add 'src_memory_type' parameter in SetFixedSizeOutputBuffer * Apply the change on all backend contexts * Fix provider's AllocateOutputBuffer to use mem_type param * temp * refactor backend context * fix provider AllocateOutputBuffer logic to be the same as alloc_fn * backend_context temp * Fix provider to return error on class output and 0 byte size * Update SetFixedSizeOutputBuffer * Create cuda stream for all backend context if GPU is supported * Address comment on synchronizing CUDA stream once for all outputs * Fix no-GPU build failure * Fix GPU supported build on CPU-only machine * temp input * Fix rebase artifact. Work on TODOs related to this PR * Fix AllocatedSystemMemory size to be 0 if allocated buffer is nullptr * Fix ensemble ResponseAlloc function * Remove plan backend copy check as ... cuda_copy can be false if output is not required
- Loading branch information
Showing
13 changed files
with
400 additions
and
300 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.