-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Babak/upgrade triton to v2.44.0 #5
Babak/upgrade triton to v2.44.0 #5
Commits on Jun 29, 2023
-
Add test for Python BLS model loading API (#5980)
* Add test for Python BLS model loading API * Fix up
Configuration menu - View commit details
-
Copy full SHA for 438ee53 - Browse repository at this point
Copy the full SHA 438ee53View commit details
Commits on Jun 30, 2023
-
Configuration menu - View commit details
-
Copy full SHA for fff1595 - Browse repository at this point
Copy the full SHA fff1595View commit details -
Configuration menu - View commit details
-
Copy full SHA for e47fbca - Browse repository at this point
Copy the full SHA e47fbcaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 00c0fd1 - Browse repository at this point
Copy the full SHA 00c0fd1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3e6ef8d - Browse repository at this point
Copy the full SHA 3e6ef8dView commit details -
Configuration menu - View commit details
-
Copy full SHA for b95366e - Browse repository at this point
Copy the full SHA b95366eView commit details
Commits on Jul 5, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 15dbea1 - Browse repository at this point
Copy the full SHA 15dbea1View commit details
Commits on Jul 6, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 20d6bb2 - Browse repository at this point
Copy the full SHA 20d6bb2View commit details
Commits on Jul 7, 2023
-
Enabling nested spans for trace mode OpenTelemetry (#5928)
* Adding nested spans to OTel tracing + support of ensemble models
Configuration menu - View commit details
-
Copy full SHA for 78d9d82 - Browse repository at this point
Copy the full SHA 78d9d82View commit details -
Move multi-GPU dlpack test to a separate L0 test (#6001)
* Move multi-GPU dlpack test to a separate L0 test * Fix copyright * Fix up
Configuration menu - View commit details
-
Copy full SHA for 4d864a1 - Browse repository at this point
Copy the full SHA 4d864a1View commit details -
* Upgrade OV to 2023.0.0 * Upgrade OV model gen script to 2023.0.0
Configuration menu - View commit details
-
Copy full SHA for fd96f23 - Browse repository at this point
Copy the full SHA fd96f23View commit details -
Add test to check the output memory type for onnx models (#6033)
* Add test to check the output memory type for onnx models * Remove unused import * Address comment
Configuration menu - View commit details
-
Copy full SHA for 0049763 - Browse repository at this point
Copy the full SHA 0049763View commit details -
Add testing for implicit state for PyTorch backend (#6016)
* Add testing for implicit state for PyTorch backend * Add testing for libtorch string implicit models * Fix CodeQL * Mention that libtorch backend supports implicit state * Fix CodeQL * Review edits * Fix output tests for PyTorch backend
Configuration menu - View commit details
-
Copy full SHA for 0edd8a7 - Browse repository at this point
Copy the full SHA 0edd8a7View commit details -
Allow uncompressed conda execution enviroments (#6005)
Add test for uncompressed conda execution enviroments
Configuration menu - View commit details
-
Copy full SHA for bfe467c - Browse repository at this point
Copy the full SHA bfe467cView commit details
Commits on Jul 10, 2023
-
Configuration menu - View commit details
-
Copy full SHA for f431477 - Browse repository at this point
Copy the full SHA f431477View commit details
Commits on Jul 11, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 288c1df - Browse repository at this point
Copy the full SHA 288c1dfView commit details
Commits on Jul 12, 2023
-
Configuration menu - View commit details
-
Copy full SHA for f17f348 - Browse repository at this point
Copy the full SHA f17f348View commit details -
Configuration menu - View commit details
-
Copy full SHA for ee8d048 - Browse repository at this point
Copy the full SHA ee8d048View commit details -
Configuration menu - View commit details
-
Copy full SHA for a8f122d - Browse repository at this point
Copy the full SHA a8f122dView commit details
Commits on Jul 17, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 1e805ae - Browse repository at this point
Copy the full SHA 1e805aeView commit details
Commits on Jul 18, 2023
-
Fix L0_backend_python expected instance name (#6073)
* Fix expected instance name * Copyright year
Configuration menu - View commit details
-
Copy full SHA for 899fd2d - Browse repository at this point
Copy the full SHA 899fd2dView commit details
Commits on Jul 19, 2023
-
Fix L0_sdk: update the search name for the client wheel (#6074)
* Fix name of client wheel to be looked for * Fix up
Configuration menu - View commit details
-
Copy full SHA for 00fee98 - Browse repository at this point
Copy the full SHA 00fee98View commit details
Commits on Jul 20, 2023
-
Add GitHub action to format and lint code (#6022)
* Add pre-commit * Fix typos, exec/shebang, formatting * Remove clang-format * Update contributing md to include pre-commit * Update spacing in CONTRIBUTING * Fix contributing pre-commit link * Link to pre-commit install directions * Wording * Restore clang-format * Fix yaml spacing * Exclude templates folder for check-yaml * Remove unused vars * Normalize spacing * Remove unused variable * Normalize config indentation * Update .clang-format to enforce max line length of 80 * Update copyrights * Update copyrights * Run workflows on every PR * Fix copyright year * Fix grammar * Entrypoint.d files are not executable * Run pre-commit hooks * Mark not executable * Run pre-commit hooks * Remove unused variable * Run pre-commit hooks after rebase * Update copyrights * Fix README.md typo (decoupled) Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> * Run pre-commit hooks * Grammar fix Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> * Redundant word Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> * Revert docker file changes * Executable shebang revert * Make model.py files non-executable * Passin is proper flag * Run pre-commit hooks on init_args/model.py * Fix typo in init_args/model.py * Make copyrights one line --------- Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 2847211 - Browse repository at this point
Copy the full SHA 2847211View commit details
Commits on Jul 21, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 9bc9ad6 - Browse repository at this point
Copy the full SHA 9bc9ad6View commit details
Commits on Jul 24, 2023
-
Add test for sequence model instance update (#5831)
* Add test for sequence model instance update * Add gap for file timestamp update * Update test for non-blocking sequence update * Update documentation * Remove mentioning increase instance count case * Add more documentaion for scheduler update test * Update test for non-blocking batcher removal * Add polling due to async scheduler destruction * Use _ as private * Fix typo * Add docs on instance count decrease * Fix typo * Separate direct and oldest to different test cases * Separate nested tests in a loop into multiple test cases * Refactor scheduler update test * Improve doc on handling future test failures * Address pre-commit * Add best effort to reset model state after a single test case failure * Remove reset model method to make harder for chaining multiple test cases as one * Remove description on model state clean up
Configuration menu - View commit details
-
Copy full SHA for 0f84995 - Browse repository at this point
Copy the full SHA 0f84995View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3db04cc - Browse repository at this point
Copy the full SHA 3db04ccView commit details
Commits on Jul 25, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 2b121fd - Browse repository at this point
Copy the full SHA 2b121fdView commit details
Commits on Jul 28, 2023
-
Update post-23.07 release (#6103)
* Update README and versions for 2.36.0 / 23.07 * Update Dockerfile.win10.min * Fix formating issue * fix formating issue * Fix whitespaces * Fix whitespaces * Fix whitespaces
Configuration menu - View commit details
-
Copy full SHA for 682cc22 - Browse repository at this point
Copy the full SHA 682cc22View commit details -
Configuration menu - View commit details
-
Copy full SHA for 14437dc - Browse repository at this point
Copy the full SHA 14437dcView commit details
Commits on Aug 1, 2023
-
Reduce instance count to 1 for python bls model loading test (#6130)
* Reduce instance count to 1 for python bls model loading test * Add comment when calling unload
Configuration menu - View commit details
-
Copy full SHA for da22fd7 - Browse repository at this point
Copy the full SHA da22fd7View commit details
Commits on Aug 4, 2023
-
Fix queue test to expect exact number of failures (#6133)
* Fix queue test to expect exact number of failures * Increase the execution time to more accurately capture requests
Configuration menu - View commit details
-
Copy full SHA for cfe5f24 - Browse repository at this point
Copy the full SHA cfe5f24View commit details -
Configuration menu - View commit details
-
Copy full SHA for 36d80fe - Browse repository at this point
Copy the full SHA 36d80feView commit details
Commits on Aug 7, 2023
-
Adding the support tracing of child models invoked from a BLS model (#…
…6063) * Adding tests for bls * Added fixme, cleaned previous commit * Removed unused imports * Fixing commit tree: Refactor code, so that OTel tracer provider is initialized only once Added resource cmd option, testig Added docs * Clean up * Update docs/user_guide/trace.md Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> * Revision * Update doc * Clean up * Added ostream exporter to OpenTelemetry for testing purposes; refactored trace tests * Added opentelemetry trace collector set up to tests; refactored otel exporter tests to use OTel collector instead of netcat * Revising according to comments * Added comment regarding 'parent_span_id' * Added permalink * Adjusted test --------- Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for c3a2f6b - Browse repository at this point
Copy the full SHA c3a2f6bView commit details
Commits on Aug 8, 2023
-
Test python environments 3.8-3.11 (#6109)
Add tests for python 3.8-3.11 for L0_python_backends
Configuration menu - View commit details
-
Copy full SHA for d964cd2 - Browse repository at this point
Copy the full SHA d964cd2View commit details -
Improve L0_backend_python debugging (#6157)
* Improve L0_backend_python debugging * Use utils function for artifacts collection
Configuration menu - View commit details
-
Copy full SHA for 09409cf - Browse repository at this point
Copy the full SHA 09409cfView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8a8b691 - Browse repository at this point
Copy the full SHA 8a8b691View commit details -
Configuration menu - View commit details
-
Copy full SHA for 82d4110 - Browse repository at this point
Copy the full SHA 82d4110View commit details
Commits on Aug 9, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 0529476 - Browse repository at this point
Copy the full SHA 0529476View commit details -
Update docs with NVAIE messaging (#6162)
Update docs with NVAIE messaging
Configuration menu - View commit details
-
Copy full SHA for 263dec2 - Browse repository at this point
Copy the full SHA 263dec2View commit details
Commits on Aug 10, 2023
-
Configuration menu - View commit details
-
Copy full SHA for b5fb819 - Browse repository at this point
Copy the full SHA b5fb819View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2265bb6 - Browse repository at this point
Copy the full SHA 2265bb6View commit details -
Remove a test case that sanity checks input value of --shape CLI flag (…
…#6140) * Remove test checking for --shape option * Remove the entire test
Configuration menu - View commit details
-
Copy full SHA for 4a8b716 - Browse repository at this point
Copy the full SHA 4a8b716View commit details
Commits on Aug 14, 2023
-
Add test when unload/load requests for same model is received at the …
…same time (#6150) * Add test when unload/load requests for same model received the same time * Add test_same_model_overlapping_load_unload * Use a load/unload stress test instead * Pre-merge test name update * Address pre-commit error * Revert "Address pre-commit error" This reverts commit 781cab1. * Record number of occurrence of each exception
Configuration menu - View commit details
-
Copy full SHA for ff1398c - Browse repository at this point
Copy the full SHA ff1398cView commit details -
Configuration menu - View commit details
-
Copy full SHA for f953f08 - Browse repository at this point
Copy the full SHA f953f08View commit details
Commits on Aug 15, 2023
-
Add end-to-end CI test for decoupled model support (#6131) (#6184)
* Add end-to-end CI test for decoupled model support * Address feedback
Configuration menu - View commit details
-
Copy full SHA for d0599da - Browse repository at this point
Copy the full SHA d0599daView commit details -
Configuration menu - View commit details
-
Copy full SHA for 20d8c1c - Browse repository at this point
Copy the full SHA 20d8c1cView commit details -
* added debugging guide * Run pre-commit --------- Co-authored-by: David Yastremsky <dyastremsky@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 59ccf5b - Browse repository at this point
Copy the full SHA 59ccf5bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0f478f3 - Browse repository at this point
Copy the full SHA 0f478f3View commit details
Commits on Aug 16, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 81794dd - Browse repository at this point
Copy the full SHA 81794ddView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8a823b2 - Browse repository at this point
Copy the full SHA 8a823b2View commit details
Commits on Aug 17, 2023
-
Add utility functions for array manipulation (#6203)
* Add utility functions for outlier removal * Fix functions * Add newline to end of file
Configuration menu - View commit details
-
Copy full SHA for b02e223 - Browse repository at this point
Copy the full SHA b02e223View commit details -
Add gc collect to make sure gpu tensor is deallocated (#6205)
* Testing: add gc collect to make sure gpu tensor is deallocated * Address comment
Configuration menu - View commit details
-
Copy full SHA for 328614e - Browse repository at this point
Copy the full SHA 328614eView commit details
Commits on Aug 18, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 7fd8516 - Browse repository at this point
Copy the full SHA 7fd8516View commit details
Commits on Aug 22, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 47db080 - Browse repository at this point
Copy the full SHA 47db080View commit details
Commits on Aug 23, 2023
-
Trace Model Name Validation (#6199)
* Initial commit * Cleanup using new standard formatting * QA test restructuring * Add newline to the end of test.sh * HTTP/GRCP protocol changed to pivot on ready status & error status. Log file name changed in qa test. * Fixing unhandled error memory leak * Handle index function memory leak fix
Configuration menu - View commit details
-
Copy full SHA for 1296c40 - Browse repository at this point
Copy the full SHA 1296c40View commit details -
Configuration menu - View commit details
-
Copy full SHA for fbc1f0e - Browse repository at this point
Copy the full SHA fbc1f0eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6497518 - Browse repository at this point
Copy the full SHA 6497518View commit details
Commits on Aug 28, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 639f758 - Browse repository at this point
Copy the full SHA 639f758View commit details
Commits on Aug 30, 2023
-
Update post-23.08 release (#6234)
* CUDA 12.1 > 12.2 * DLIS-5208: onnxruntime+windows - stop treat warnings on compile as errors * Revert "DLIS-5208: onnxruntime+windows - stop treat warnings on compile as errors" This reverts commit 0cecbb7. * Update Dockerfile.win10.min * Update Dockerfile.win10.min * Update README and versions for 23.08 branch * Update Dockerfile.win10 * Fix the versions in docs * Add the note about stabilization of the branch * Update docs with NVAIE messaging (#6162) (#6167) Update docs with NVAIE messaging Co-authored-by: David Zier <42390249+dzier@users.noreply.github.com> * Resolve merge conflict --------- Co-authored-by: tanmayv25 <tanmay2592@gmail.com> Co-authored-by: David Zier <42390249+dzier@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 7fdff54 - Browse repository at this point
Copy the full SHA 7fdff54View commit details -
Configuration menu - View commit details
-
Copy full SHA for 13db22e - Browse repository at this point
Copy the full SHA 13db22eView commit details -
Adding safe string to number conversions (#6173)
* Added catch for out of range error for trace setting update * Added wrapper to safe parse options * Added option names to errors * Adjustments * Quick fix * Fixing option name for Windows * Removed repetitive code * Adjust getopt_long for Windows to use longindex * Moved try catch into ParseOption * Removed unused input * Improved names * Refactoring and clean up * Fixed Windows * Refactored getopt_long for Windows * Refactored trace test, pinned otel's collector version to avoid problems with go requirements
Configuration menu - View commit details
-
Copy full SHA for b2117ab - Browse repository at this point
Copy the full SHA b2117abView commit details
Commits on Sep 2, 2023
-
Test Python execute() to return Triton error code (#6228)
* Add test for Python execute error code * Add all supported error codes into test * Move ErrorCode into TritonError * Expose ErrorCode internal in TritonError
Configuration menu - View commit details
-
Copy full SHA for 7a4ce10 - Browse repository at this point
Copy the full SHA 7a4ce10View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7532b76 - Browse repository at this point
Copy the full SHA 7532b76View commit details
Commits on Sep 5, 2023
-
Add test for TensorRT version-compatible model support (#6255)
* Add tensorrt version-compatibility test * Generate one version-compatible model * Fix copyright year * Remove unnecessary variable * Remove unnecessary line * Generate TRT version-compatible model * Add sample inference to TRT version-compatible test * Clean up utils and model gen for new plan model * Fix startswith capitalization * Remove unused imports * Remove unused imports * Add log check
Configuration menu - View commit details
-
Copy full SHA for 39ebf82 - Browse repository at this point
Copy the full SHA 39ebf82View commit details
Commits on Sep 6, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 6a66d60 - Browse repository at this point
Copy the full SHA 6a66d60View commit details -
Add testing for retrieving shape and datatype in backend API (#6231)
Add testing for retrieving output shape and datatype info from backend API
Configuration menu - View commit details
-
Copy full SHA for 6930d00 - Browse repository at this point
Copy the full SHA 6930d00View commit details
Commits on Sep 8, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 9fe213a - Browse repository at this point
Copy the full SHA 9fe213aView commit details
Commits on Sep 11, 2023
-
Configuration menu - View commit details
-
Copy full SHA for e0dc4aa - Browse repository at this point
Copy the full SHA e0dc4aaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 41c689e - Browse repository at this point
Copy the full SHA 41c689eView commit details -
Exclude extra TRT version-compatible models from tests (#6294)
* Exclude compatible models from tests. * Force model removal, in case it does not exist Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> --------- Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 199de7b - Browse repository at this point
Copy the full SHA 199de7bView commit details
Commits on Sep 12, 2023
-
Adding installation of docker and docker-buildx (#6299)
* Adding installation of docker and docker-buildx * remove whitespace
Configuration menu - View commit details
-
Copy full SHA for 68e116a - Browse repository at this point
Copy the full SHA 68e116aView commit details
Commits on Sep 13, 2023
-
Use targetmodel from header as model name in SageMaker (#6147)
* Use targetmodel from header as model name in SageMaker * Update naming for model hash
Configuration menu - View commit details
-
Copy full SHA for 7624490 - Browse repository at this point
Copy the full SHA 7624490View commit details
Commits on Sep 14, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 6e0a9fb - Browse repository at this point
Copy the full SHA 6e0a9fbView commit details
Commits on Sep 18, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 3de9808 - Browse repository at this point
Copy the full SHA 3de9808View commit details -
Update the request re-use example (#6283)
* Update the request re-use example * Review edit * Review comment
Configuration menu - View commit details
-
Copy full SHA for 4a20c28 - Browse repository at this point
Copy the full SHA 4a20c28View commit details
Commits on Sep 19, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 6527c73 - Browse repository at this point
Copy the full SHA 6527c73View commit details -
Add Python binding build. Add L0_python_api to test Python binding (#…
…6319) * Add L0_python_api to test Python binding * Install Python API in CI image * Fix QA build
Configuration menu - View commit details
-
Copy full SHA for 40c2c45 - Browse repository at this point
Copy the full SHA 40c2c45View commit details -
Configuration menu - View commit details
-
Copy full SHA for bbbcad7 - Browse repository at this point
Copy the full SHA bbbcad7View commit details
Commits on Sep 20, 2023
-
Tests and docs for ability to specify subdirectory to download for Lo…
…calizePath (#6308) * Added custom localization tests for s3 and azure, added docs
Configuration menu - View commit details
-
Copy full SHA for 5dd9398 - Browse repository at this point
Copy the full SHA 5dd9398View commit details
Commits on Sep 21, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 2ccc3ce - Browse repository at this point
Copy the full SHA 2ccc3ceView commit details
Commits on Sep 25, 2023
-
Refactor model generation scripts (#6336)
* Refactor model generation scripts * Fix codeql * Fix relative path import * Fix package structure * Copy the gen_common file * Add missing uint8 * Remove duplicate import
Configuration menu - View commit details
-
Copy full SHA for e0f70aa - Browse repository at this point
Copy the full SHA e0f70aaView commit details
Commits on Sep 26, 2023
-
Add testing for scalar I/O in ORT backend (#6343)
* Add testing for scalar I/O in ORT backend * Review edit * ci
Configuration menu - View commit details
-
Copy full SHA for 1f4af6d - Browse repository at this point
Copy the full SHA 1f4af6dView commit details
Commits on Sep 29, 2023
-
Update post-23.09 release (#6367)
* Update README and versions for 23.09 branch (#6280) * Update `Dockerfile` and `build.py` (#6281) * Update configuration for Windows Dockerfile (#6256) * Adding installation of docker and docker-buildx * Enable '--expt-relaxed-constexpr' flag for custom ops models * Upate Dockerfile version * Disable unit tests for Jetson * Update condition (#6285) * removing Whitespaces (#6293) * removing Whitespaces * removing whitespaces
Configuration menu - View commit details
-
Copy full SHA for f4d4278 - Browse repository at this point
Copy the full SHA f4d4278View commit details
Commits on Oct 3, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 4b4f569 - Browse repository at this point
Copy the full SHA 4b4f569View commit details
Commits on Oct 4, 2023
-
Adding client-side request cancellation support and testing (#6383)
* Add L0_request_cancellation (#6252) * Add L0_request_cancellation * Remove unittest test * Add cancellation to gRPC server error handling * Fix up * Use identity model * Add tests for gRPC client-side cancellation (#6278) * Add tests for gRPC client-side cancellation * Fix CodeQL issues * Formatting * Update qa/L0_client_cancellation/client_cancellation_test.py Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> * Move to L0_request_cancellation * Address review comments * Removing request cancellation support from asyncio version * Format * Update copyright * Remove tests * Handle cancellation notification in gRPC server (#6298) * Handle cancellation notification in gRPC server * Fix the request ptr initialization * Update src/grpc/infer_handler.h Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> * Address review comment * Fix logs * Fix request complete callback by removing reference to state * Improve documentation --------- Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> --------- Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> * Fixes on the gRPC frontend to handle AsyncNotifyWhenDone() API (#6345) * Fix segmentation fault in gRPC frontend * Finalize all states upon completion * Fixes all state cleanups * Handle completed states when cancellation notification is received * Add more documentation steps * Retrieve dormant states to minimize the memory footprint for long streams * Update src/grpc/grpc_utils.h Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> * Use a boolean state instead of raw pointer --------- Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> * Add L0_grpc_state_cleanup test (#6353) * Add L0_grpc_state_cleanup test * Add model file in QA container * Fix spelling * Add remaining subtests * Add failing subtests * Format fixes * Fix model repo * Fix QA docker file * Remove checks for the error message when shutting down server * Fix spelling * Address review comments * Add schedulers request cancellation tests (#6309) * Add schedulers request cancellation tests * Merge gRPC client test * Reduce testing time and covers cancelling other requests as a consequence of request cancellation * Add streaming request cancellation test --------- Co-authored-by: Iman Tabrizian <iman.tabrizian@gmail.com> Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> Co-authored-by: Jacky <18255193+kthui@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for e97ad74 - Browse repository at this point
Copy the full SHA e97ad74View commit details -
Configuration menu - View commit details
-
Copy full SHA for b95736c - Browse repository at this point
Copy the full SHA b95736cView commit details
Commits on Oct 6, 2023
-
Add basic generate endpoints for LLM tasks (#6366)
* PoC of parsing request prompt and converting to Triton infer request * Remove extra trace * Add generate endpoint * Enable streaming version * Fix bug * Fix up * Add basic testing. Cherry pick from #6369 * format * Address comment. Fix build * Minor cleanup * cleanup syntax * Wrap error in SSE format * Fix up * Restrict number of response on non-streaming generate * Address comment on implementation. * Re-enable trace on generate endpoint * Add more comprehensive llm endpoint tests (#6377) * Add security policy (#6376) * Start adding some more comprehensive tests * Fix test case * Add response error testing * Complete test placeholder * Address comment * Address comments * Fix code check --------- Co-authored-by: dyastremsky <58150256+dyastremsky@users.noreply.github.com> Co-authored-by: GuanLuo <gluo@nvidia.com> * Address comment * Address comment * Address comment * Fix typo --------- Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> Co-authored-by: dyastremsky <58150256+dyastremsky@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for bdf227c - Browse repository at this point
Copy the full SHA bdf227cView commit details -
Add Python backend request cancellation test (#6364)
* Add cancelled response status test * Add Python backend request cancellation test * Add Python backend decoupled request cancellation test * Simplified response if cancelled * Test response_sender.send() after closed * Rollback test response_sender.send() after closed * Rollback non-decoupled any response on cancel
Configuration menu - View commit details
-
Copy full SHA for 2bf543b - Browse repository at this point
Copy the full SHA 2bf543bView commit details
Commits on Oct 7, 2023
-
Add TRT-LLM backend build to Triton (#6365) (#6392)
* Add TRT-LLM backend build to Triton (#6365) * Add trtllm backend to build * Temporarily adding version map for 23.07 * Fix build issue * Update comment * Comment out python binding changes * Add post build * Update trtllm backend naming * Update TRTLLM base image * Fix cmake arch * Revert temp changes for python binding PR * Address comment * Move import to the top (#6395) * Move import to the top * pre commit format
Configuration menu - View commit details
-
Copy full SHA for a33f257 - Browse repository at this point
Copy the full SHA a33f257View commit details
Commits on Oct 8, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 2ca380e - Browse repository at this point
Copy the full SHA 2ca380eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 60f14ac - Browse repository at this point
Copy the full SHA 60f14acView commit details
Commits on Oct 9, 2023
-
Configuration menu - View commit details
-
Copy full SHA for b796451 - Browse repository at this point
Copy the full SHA b796451View commit details -
Configuration menu - View commit details
-
Copy full SHA for 135341b - Browse repository at this point
Copy the full SHA 135341bView commit details
Commits on Oct 11, 2023
-
Configuration menu - View commit details
-
Copy full SHA for b1c876d - Browse repository at this point
Copy the full SHA b1c876dView commit details -
Add documentation on request cancellation (#6403)
* Add documentation on request cancellation * Include python backend * Update docs/user_guide/request_cancellation.md Co-authored-by: Iman Tabrizian <iman.tabrizian@gmail.com> * Update docs/user_guide/request_cancellation.md Co-authored-by: Neelay Shah <neelays@nvidia.com> * Update docs/README.md Co-authored-by: Neelay Shah <neelays@nvidia.com> * Update docs/user_guide/request_cancellation.md Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> * Remove inflight term from the main documentation * Address review comments * Fix * Update docs/user_guide/request_cancellation.md Co-authored-by: Jacky <18255193+kthui@users.noreply.github.com> * Fix --------- Co-authored-by: Iman Tabrizian <iman.tabrizian@gmail.com> Co-authored-by: Neelay Shah <neelays@nvidia.com> Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> Co-authored-by: Jacky <18255193+kthui@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 9c707e3 - Browse repository at this point
Copy the full SHA 9c707e3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 85487a1 - Browse repository at this point
Copy the full SHA 85487a1View commit details
Commits on Oct 12, 2023
-
Document generate HTTP endpoint (#6412)
* Document generate HTTP endpoint * Address comment * Fix up * format * Address comment
Configuration menu - View commit details
-
Copy full SHA for 786f48f - Browse repository at this point
Copy the full SHA 786f48fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4929c33 - Browse repository at this point
Copy the full SHA 4929c33View commit details -
Fix missing library in L0_data_compression (#6424)
* Fix missing library in L0_data_compression * Fix up
Configuration menu - View commit details
-
Copy full SHA for 098de8f - Browse repository at this point
Copy the full SHA 098de8fView commit details
Commits on Oct 13, 2023
-
Add Javacpp-presets repo location as env variable in Java tests(#6385)
Simplify testing when upstream (javacpp-presets) build changes. Related to triton-inference-server/client#409
Configuration menu - View commit details
-
Copy full SHA for 6ae3c34 - Browse repository at this point
Copy the full SHA 6ae3c34View commit details -
TRT-LLM backend build changes (#6406)
* Update url * Debugging * Debugging * Update url * Fix build for TRT-LLM backend * Remove TRTLLM TRT and CUDA versions * Fix up unused var * Fix up dir name * FIx cmake patch * Remove previous TRT version * Install required packages for example models * Remove packages that are only needed for testing
Configuration menu - View commit details
-
Copy full SHA for e817431 - Browse repository at this point
Copy the full SHA e817431View commit details -
Add gRPC AsyncIO request cancellation tests (#6408)
* Fix gRPC test failure and refactor * Add gRPC AsyncIO cancellation tests * Better check if a request is cancelled * Use f-string
Configuration menu - View commit details
-
Copy full SHA for 95d2ee7 - Browse repository at this point
Copy the full SHA 95d2ee7View commit details
Commits on Oct 16, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 62db791 - Browse repository at this point
Copy the full SHA 62db791View commit details -
Configuration menu - View commit details
-
Copy full SHA for e11ae14 - Browse repository at this point
Copy the full SHA e11ae14View commit details
Commits on Oct 17, 2023
-
Switch Jetson model TensorRT models generation to container (#6378)
* Switch Jetson model TensorRT models generation to container * Adding missed file * Fix typo * Fix typos * Remove extra spaces * Fix typo
Configuration menu - View commit details
-
Copy full SHA for c8e4dd0 - Browse repository at this point
Copy the full SHA c8e4dd0View commit details
Commits on Oct 18, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 960afe7 - Browse repository at this point
Copy the full SHA 960afe7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 328d519 - Browse repository at this point
Copy the full SHA 328d519View commit details
Commits on Oct 19, 2023
-
Configuration menu - View commit details
-
Copy full SHA for ccdb26b - Browse repository at this point
Copy the full SHA ccdb26bView commit details -
Fix notify state destruction and inflight states tracking (#6451)
* Ensure notify_state_ gets properly destructed * Fix inflight state tracking to properly erase states * Prevent removing the notify_state from being erased * Wrap notify_state_ object within unique_ptr
Configuration menu - View commit details
-
Copy full SHA for e19cfe7 - Browse repository at this point
Copy the full SHA e19cfe7View commit details
Commits on Oct 20, 2023
-
Update TRT-LLM backend url (#6455)
* TRTLLM backend post release * TRTLLM backend post release * Update submodule url for permission issue * Update submodule url * Fix up * Not using postbuild function to workaround submodule url permission issue
Configuration menu - View commit details
-
Copy full SHA for c162205 - Browse repository at this point
Copy the full SHA c162205View commit details -
Added docs on python based backends (#6429)
Co-authored-by: Neelay Shah <neelays@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for b0bff04 - Browse repository at this point
Copy the full SHA b0bff04View commit details
Commits on Oct 24, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 35c54fd - Browse repository at this point
Copy the full SHA 35c54fdView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2b1d267 - Browse repository at this point
Copy the full SHA 2b1d267View commit details
Commits on Oct 25, 2023
-
Test Python BLS with different sizes of CUDA memory pool (#6276)
* Test with different sizes of CUDA memory pool * Check the server log for error message * Improve debugging * Fix syntax
Configuration menu - View commit details
-
Copy full SHA for b5c2e38 - Browse repository at this point
Copy the full SHA b5c2e38View commit details
Commits on Oct 26, 2023
-
Add documentation for K8s-onprem StartupProbe (#5257)
Co-authored-by: dyastremsky <58150256+dyastremsky@users.noreply.github.com> Co-authored-by: Ryan McCormick <mccormick.codes@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 3dfa18f - Browse repository at this point
Copy the full SHA 3dfa18fView commit details
Commits on Oct 27, 2023
-
Update
main
post-23.10 release (#6484)* Update README and versions for 23.10 branch (#6399) * Cherry-picking vLLM backend changes (#6404) * Update build.py to build vLLM backend (#6394) * Add Python backend when vLLM backend built (#6397) --------- Co-authored-by: dyastremsky <58150256+dyastremsky@users.noreply.github.com> * Add documentation on request cancellation (#6403) (#6407) * Add documentation on request cancellation * Include python backend * Update docs/user_guide/request_cancellation.md * Update docs/user_guide/request_cancellation.md * Update docs/README.md * Update docs/user_guide/request_cancellation.md * Remove inflight term from the main documentation * Address review comments * Fix * Update docs/user_guide/request_cancellation.md * Fix --------- Co-authored-by: Iman Tabrizian <iman.tabrizian@gmail.com> Co-authored-by: Neelay Shah <neelays@nvidia.com> Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> Co-authored-by: Jacky <18255193+kthui@users.noreply.github.com> * Fixes in request cancellation doc (#6409) (#6410) * TRT-LLM backend build changes (#6406) (#6430) * Update url * Debugging * Debugging * Update url * Fix build for TRT-LLM backend * Remove TRTLLM TRT and CUDA versions * Fix up unused var * Fix up dir name * FIx cmake patch * Remove previous TRT version * Install required packages for example models * Remove packages that are only needed for testing * Fixing vllm build (#6433) (#6437) * Fixing torch version for vllm Co-authored-by: Olga Andreeva <124622579+oandreeva-nv@users.noreply.github.com> * Update TRT-LLM backend url (#6455) (#6460) * TRTLLM backend post release * TRTLLM backend post release * Update submodule url for permission issue * Update submodule url * Fix up * Not using postbuild function to workaround submodule url permission issue * remove redundant lines * Revert "remove redundant lines" This reverts commit 86be7ad. * restore missed lines * Update build.py Co-authored-by: Olga Andreeva <124622579+oandreeva-nv@users.noreply.github.com> * Update build.py Co-authored-by: Olga Andreeva <124622579+oandreeva-nv@users.noreply.github.com> --------- Co-authored-by: Tanmay Verma <tanmay2592@gmail.com> Co-authored-by: dyastremsky <58150256+dyastremsky@users.noreply.github.com> Co-authored-by: Iman Tabrizian <iman.tabrizian@gmail.com> Co-authored-by: Neelay Shah <neelays@nvidia.com> Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> Co-authored-by: Jacky <18255193+kthui@users.noreply.github.com> Co-authored-by: Kris Hung <krish@nvidia.com> Co-authored-by: Katherine Yang <80359429+jbkyang-nvi@users.noreply.github.com> Co-authored-by: Olga Andreeva <124622579+oandreeva-nv@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 2ba0762 - Browse repository at this point
Copy the full SHA 2ba0762View commit details
Commits on Oct 28, 2023
-
Configuration menu - View commit details
-
Copy full SHA for ed7b0fd - Browse repository at this point
Copy the full SHA ed7b0fdView commit details
Commits on Oct 30, 2023
-
Improve L0_backend_python test stability (ensemble / gpu_tensor_lifec…
…ycle) (#6490) * Test torch allocator gpu memory usage directly rather than global gpu memory for more consistency
Configuration menu - View commit details
-
Copy full SHA for 638fcf9 - Browse repository at this point
Copy the full SHA 638fcf9View commit details
Commits on Nov 1, 2023
-
Add L0_generative_sequence test (#6475)
* Add testing backend and test * Add test to build / CI. Minor fix on L0_http * Format. Update backend documentation * Fix up * Address comment * Add negative testing * Fix up
Configuration menu - View commit details
-
Copy full SHA for f49493f - Browse repository at this point
Copy the full SHA f49493fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 048d5ac - Browse repository at this point
Copy the full SHA 048d5acView commit details
Commits on Nov 2, 2023
-
Collecting sub dir artifacts in GitLab yaml. Removing collect functio…
…n from test script. (#6499)
Configuration menu - View commit details
-
Copy full SHA for d33eaa5 - Browse repository at this point
Copy the full SHA d33eaa5View commit details -
Use post build function for TRT-LLM backend (#6476)
* Use postbuild function * Remove updating submodule url
Configuration menu - View commit details
-
Copy full SHA for b0d4612 - Browse repository at this point
Copy the full SHA b0d4612View commit details -
Enhanced python_backend autocomplete (#6504)
* Added testing for python_backend autocomplete: optional input and model_transaction_policy
Configuration menu - View commit details
-
Copy full SHA for 4985ab9 - Browse repository at this point
Copy the full SHA 4985ab9View commit details -
Parse reuse-grpc-port and reuse-http-port as booleans (#6511)
Co-authored-by: Francesco Petrini <francescogpetrini@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 8336873 - Browse repository at this point
Copy the full SHA 8336873View commit details
Commits on Nov 3, 2023
-
Configuration menu - View commit details
-
Copy full SHA for f15ceec - Browse repository at this point
Copy the full SHA f15ceecView commit details -
Add Python-based backends CI (#6466)
* Bumped vllm version * Add python-bsed backends testing * Add python-based backends CI * Fix errors * Add vllm backend * Fix pre-commit * Modify test.sh * Remove vllm_opt qa model * Remove vLLM ackend tests * Resolve review comments * Fix pre-commit errors * Update qa/L0_backend_python/python_based_backends/python_based_backends_test.py Co-authored-by: Tanmay Verma <tanmay2592@gmail.com> * Remove collect_artifacts_from_subdir function call --------- Co-authored-by: oandreeva-nv <oandreeva@nvidia.com> Co-authored-by: Tanmay Verma <tanmay2592@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for c1b334f - Browse repository at this point
Copy the full SHA c1b334fView commit details -
Enabling option to restrict access to HTTP APIs based on header value…
… pairs (similar to gRPC)
Configuration menu - View commit details
-
Copy full SHA for 4b481a6 - Browse repository at this point
Copy the full SHA 4b481a6View commit details
Commits on Nov 6, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 9da5135 - Browse repository at this point
Copy the full SHA 9da5135View commit details
Commits on Nov 7, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 53b2fac - Browse repository at this point
Copy the full SHA 53b2facView commit details -
Test file override outside of model directory (#6516)
* Add boost-filesystem
Configuration menu - View commit details
-
Copy full SHA for d893ca1 - Browse repository at this point
Copy the full SHA d893ca1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0f1f924 - Browse repository at this point
Copy the full SHA 0f1f924View commit details
Commits on Nov 8, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 0664c5a - Browse repository at this point
Copy the full SHA 0664c5aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5fca835 - Browse repository at this point
Copy the full SHA 5fca835View commit details -
Enhance testing for pending request count (#6532)
* Enhance testing for pending request count * Improve the documentation * Add more documentation
Configuration menu - View commit details
-
Copy full SHA for 05dae41 - Browse repository at this point
Copy the full SHA 05dae41View commit details
Commits on Nov 9, 2023
-
Add testing for Python backend request rescheduling (#6509)
* Add testing * Fix up * Enhance testing * Fix up * Revert test changes * Add grpc endpoint test * Remove unused import * Remove unused import * Update qa/L0_backend_python/request_rescheduling/grpc_endpoint_test.py Co-authored-by: Iman Tabrizian <iman.tabrizian@gmail.com> * Update qa/python_models/bls_request_rescheduling/model.py Co-authored-by: Iman Tabrizian <iman.tabrizian@gmail.com> --------- Co-authored-by: Iman Tabrizian <iman.tabrizian@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 380d739 - Browse repository at this point
Copy the full SHA 380d739View commit details -
Configuration menu - View commit details
-
Copy full SHA for c295fbf - Browse repository at this point
Copy the full SHA c295fbfView commit details
Commits on Nov 10, 2023
-
secure deployment considerations guide (#6533)
* draft document * updates * updates * updated * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * update * updates * updates * Update docs/customization_guide/deploy.md Co-authored-by: Kyle McGill <101670481+nv-kmcgill53@users.noreply.github.com> * Update docs/customization_guide/deploy.md Co-authored-by: Kyle McGill <101670481+nv-kmcgill53@users.noreply.github.com> * fixing typos * updated with clearer warnings * updates to readme and toc --------- Co-authored-by: Kyle McGill <101670481+nv-kmcgill53@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 53685d3 - Browse repository at this point
Copy the full SHA 53685d3View commit details -
Fix typo and change the command line order (#6557)
* Fix typo and change the command line order * Improve visual experience. Add 'clang' package
Configuration menu - View commit details
-
Copy full SHA for a59f3cf - Browse repository at this point
Copy the full SHA a59f3cfView commit details
Commits on Nov 13, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 7982a53 - Browse repository at this point
Copy the full SHA 7982a53View commit details
Commits on Nov 15, 2023
-
Configuration menu - View commit details
-
Copy full SHA for a5452b4 - Browse repository at this point
Copy the full SHA a5452b4View commit details -
Add testing for implicit state enhancements (#6524)
* Add testing for single buffer * Add testing for implicit state with buffer growth * Improve testing * Fix up * Add CUDA virtual address size flag * Add missing test files * Parameter rename * Test fixes * Only build implicit state backend for GPU=ON
Configuration menu - View commit details
-
Copy full SHA for e5cbe4f - Browse repository at this point
Copy the full SHA e5cbe4fView commit details -
Configuration menu - View commit details
-
Copy full SHA for ee68c57 - Browse repository at this point
Copy the full SHA ee68c57View commit details
Commits on Nov 16, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 441d930 - Browse repository at this point
Copy the full SHA 441d930View commit details -
Configuration menu - View commit details
-
Copy full SHA for 859ab91 - Browse repository at this point
Copy the full SHA 859ab91View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4102905 - Browse repository at this point
Copy the full SHA 4102905View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1c20826 - Browse repository at this point
Copy the full SHA 1c20826View commit details -
Configuration menu - View commit details
-
Copy full SHA for fb5bc9f - Browse repository at this point
Copy the full SHA fb5bc9fView commit details
Commits on Nov 19, 2023
-
Change name generative -> iterative (#6601)
* name changes * updated names
Configuration menu - View commit details
-
Copy full SHA for e7bee37 - Browse repository at this point
Copy the full SHA e7bee37View commit details
Commits on Nov 20, 2023
-
Add documentation on generative sequence (#6595)
* Add documentation on generative sequence * Address comment * Reflect the "iterative" change
Configuration menu - View commit details
-
Copy full SHA for daceccf - Browse repository at this point
Copy the full SHA daceccfView commit details -
Configuration menu - View commit details
-
Copy full SHA for a5b7001 - Browse repository at this point
Copy the full SHA a5b7001View commit details -
Restricted HTTP API documentation
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 2f24070 - Browse repository at this point
Copy the full SHA 2f24070View commit details
Commits on Nov 21, 2023
-
Configuration menu - View commit details
-
Copy full SHA for ba44923 - Browse repository at this point
Copy the full SHA ba44923View commit details -
Support for http request cancellation. Includes fix for seg fault in …
…generate_stream endpoint.
Configuration menu - View commit details
-
Copy full SHA for b876a90 - Browse repository at this point
Copy the full SHA b876a90View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9647526 - Browse repository at this point
Copy the full SHA 9647526View commit details -
Configuration menu - View commit details
-
Copy full SHA for 18ee5ac - Browse repository at this point
Copy the full SHA 18ee5acView commit details
Commits on Nov 22, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 92214f7 - Browse repository at this point
Copy the full SHA 92214f7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 738996f - Browse repository at this point
Copy the full SHA 738996fView commit details -
Extend request objects lifetime and fixes possible segmentation fault (…
…#6620) * Extend request objects lifetime * Remove explicit TRITONSERVER_InferenceRequestDelete * Format fix * Include the inference_request_ initialization to cover RequestNew --------- Co-authored-by: Neelay Shah <neelays@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 4b34a48 - Browse repository at this point
Copy the full SHA 4b34a48View commit details
Commits on Nov 29, 2023
-
Update protobuf after python update for testing (#6638)
This fixes the issue where python client has `AttributeError: 'NoneType' object has no attribute 'enum_types_by_name' errors after python version is updated.
Configuration menu - View commit details
-
Copy full SHA for b96ae5f - Browse repository at this point
Copy the full SHA b96ae5fView commit details
Commits on Dec 4, 2023
-
Update post-23.11 release (#6653)
* Update README and versions for 2.40.0 / 23.11 (#6544) * Removing path construction to use SymLink alternatives * Update version for PyTorch * Update windows Dockerfile configuration * Update triton version to 23.11 * Update README and versions for 2.40.0 / 23.11 * Fix typo * Ading 'ldconfig' to configure dynamic linking in container (#6602) * Point to tekit_backend (#6616) * Point to tekit_backend * Update version * Revert tekit changes (#6640) --------- Co-authored-by: Kris Hung <krish@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for b9f69b5 - Browse repository at this point
Copy the full SHA b9f69b5View commit details -
* New testing to confirm large request timeout values can be passed and retrieved within Python BLS models.
Configuration menu - View commit details
-
Copy full SHA for b44ee7c - Browse repository at this point
Copy the full SHA b44ee7cView commit details
Commits on Dec 5, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 4ac7f37 - Browse repository at this point
Copy the full SHA 4ac7f37View commit details
Commits on Dec 6, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 817428a - Browse repository at this point
Copy the full SHA 817428aView commit details
Commits on Dec 8, 2023
-
Add test for optional internal tensor within an ensemble (#6663)
* Add test for optional internal tensor within an ensemble * Fix up
Configuration menu - View commit details
-
Copy full SHA for 8afdad2 - Browse repository at this point
Copy the full SHA 8afdad2View commit details
Commits on Dec 10, 2023
-
Set CMake version to 3.27.7 (#6675)
* Set CMake version to 3.27.7 * Set CMake version to 3.27.7 * Fix double slash typo
Configuration menu - View commit details
-
Copy full SHA for a34770b - Browse repository at this point
Copy the full SHA a34770bView commit details
Commits on Dec 11, 2023
-
Configuration menu - View commit details
-
Copy full SHA for cbe58e7 - Browse repository at this point
Copy the full SHA cbe58e7View commit details
Commits on Dec 12, 2023
-
Configuration menu - View commit details
-
Copy full SHA for f5717c6 - Browse repository at this point
Copy the full SHA f5717c6View commit details -
Configuration menu - View commit details
-
Copy full SHA for e6c300d - Browse repository at this point
Copy the full SHA e6c300dView commit details
Commits on Dec 13, 2023
-
Configuration menu - View commit details
-
Copy full SHA for f2cd999 - Browse repository at this point
Copy the full SHA f2cd999View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8165ca7 - Browse repository at this point
Copy the full SHA 8165ca7View commit details -
Fix iGPU CMakeFile tags (#6695)
* Unify iGPU test build with x86 ARM * adding TRITON_IGPU_BUILD to core build definition; adding logic to skip caffe2plan test if TRITON_IGPU_BUILD=1 * re-organizing some copies in Dockerfile.QA to fix igpu devel build * Pre-commit fix --------- Co-authored-by: kyle <kmcgill@kmcgill-ubuntu.nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 0d2e236 - Browse repository at this point
Copy the full SHA 0d2e236View commit details
Commits on Dec 14, 2023
-
adding default value for TRITON_IGPU_BUILD=OFF (#6705)
* adding default value for TRITON_IGPU_BUILD=OFF * fix newline --------- Co-authored-by: kyle <kmcgill@kmcgill-ubuntu.nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 9c56e19 - Browse repository at this point
Copy the full SHA 9c56e19View commit details
Commits on Dec 15, 2023
-
Add test case for decoupled model raising exception (#6686)
* Add test case for decoupled model raising exception * Remove unused import * Address comment
Configuration menu - View commit details
-
Copy full SHA for d6bd668 - Browse repository at this point
Copy the full SHA d6bd668View commit details -
Configuration menu - View commit details
-
Copy full SHA for 13dd37e - Browse repository at this point
Copy the full SHA 13dd37eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2df7b25 - Browse repository at this point
Copy the full SHA 2df7b25View commit details -
Allow configuring GRPC max connection age and max connection age grace (
#6639) * Add ability to configure GRPC max connection age and max connection age grace * Allow pass GRPC connection age args when they are set from command ---------- Co-authored-by: Katherine Yang <80359429+jbkyang-nvi@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 23ba7a0 - Browse repository at this point
Copy the full SHA 23ba7a0View commit details
Commits on Dec 19, 2023
-
Modify HTTP frontend to return error code reflecting Triton error. Ad…
…d test (#6713) * Modify HTTP frontend to return error code reflecting Triton error * Add test for dedicated HTTP error. Releax existing test on HTTP code * Address comment. Fix copy right
Configuration menu - View commit details
-
Copy full SHA for 7b7ef51 - Browse repository at this point
Copy the full SHA 7b7ef51View commit details
Commits on Dec 26, 2023
-
Update
main
branch post-23.12 release (#6726)* Update README and versions for 23.12 branch * Bring back the README (#6671) * Bring back the README * main -> r23.12 * Remove L0_libtorch_nvfuser (#6674) * iGPU build refactor (#6684) * Fix iGPU CMakeFile tags (#6695) (#6698) * Unify iGPU test build with x86 ARM * adding TRITON_IGPU_BUILD to core build definition; adding logic to skip caffe2plan test if TRITON_IGPU_BUILD=1 * re-organizing some copies in Dockerfile.QA to fix igpu devel build * Pre-commit fix --------- Co-authored-by: kyle <kmcgill@kmcgill-ubuntu.nvidia.com> * Update windows Dockerfile versions (#6672) Changing version to the latest one Co-authored-by: Misha Chornyi <mchornyi@nvidia.com> * Remove README banner (#6719) * Update README --------- Co-authored-by: tanmayv25 <tanmay2592@gmail.com> Co-authored-by: Jacky <18255193+kthui@users.noreply.github.com> Co-authored-by: kyle <kmcgill@kmcgill-ubuntu.nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 121194b - Browse repository at this point
Copy the full SHA 121194bView commit details
Commits on Dec 28, 2023
-
* testing apprroach with pre-built image * Build TensorRT-LLM * Disable Triton Build * Remove file * Update config * Changet PATH variables * Update path * Update configuration for CMake * Getting back TRITON_BUILD flag * REvert missing files creation * Update configuration for the PyTorch installation * Update configuration for docker * Change the location * Update configuration * update config * Set CMake version to 3.27.7 * Fix double slash typo * remove unused strings * restore typo (#6680) * remove old line * fix line indentation * Update LD_LIBRARY_PATH for TensorRT-LLM * Addign TRT llm changes * remove TRT-LLM container from bhte argument list * Update indentation
Configuration menu - View commit details
-
Copy full SHA for 334d3df - Browse repository at this point
Copy the full SHA 334d3dfView commit details
Commits on Jan 2, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 639106e - Browse repository at this point
Copy the full SHA 639106eView commit details
Commits on Jan 3, 2024
-
Update RE2 package location (#6750)
* Update RE2 package location * Use only 1 parallel thread for build * Revert "Use only 1 parallel thread for build" This reverts commit 93eab3a.
Configuration menu - View commit details
-
Copy full SHA for 854bc42 - Browse repository at this point
Copy the full SHA 854bc42View commit details -
Add testing for zero tensors in PyTorch backend (#6760)
* Add testing for zero tensors in PyTorch backend * Fix up * Review edit
Configuration menu - View commit details
-
Copy full SHA for b42a11a - Browse repository at this point
Copy the full SHA b42a11aView commit details
Commits on Jan 4, 2024
-
Fix L0_lifecycle on insufficient hardware concurrency (#6762)
* Do not fail test on insufficient hardware concurrency * Track instead of fail test if cannot replicate load while async unload * Add some TODOs for the sub-test
Configuration menu - View commit details
-
Copy full SHA for 9e3bde3 - Browse repository at this point
Copy the full SHA 9e3bde3View commit details
Commits on Jan 5, 2024
-
Simplify cmake install command (#6725)
* Simplify cmake install command * Fix up * Review comment
Configuration menu - View commit details
-
Copy full SHA for b922b84 - Browse repository at this point
Copy the full SHA b922b84View commit details -
Add cmdline option to set model load retry. Add test (#6764)
* Add cmdline option to set model load retry. Add test * Fix copyright * Minor change on testing model * Remove unused import
Configuration menu - View commit details
-
Copy full SHA for 690fe1a - Browse repository at this point
Copy the full SHA 690fe1aView commit details
Commits on Jan 8, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 634e5ea - Browse repository at this point
Copy the full SHA 634e5eaView commit details -
Configuration menu - View commit details
-
Copy full SHA for bfb6ff0 - Browse repository at this point
Copy the full SHA bfb6ff0View commit details
Commits on Jan 11, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 0299e63 - Browse repository at this point
Copy the full SHA 0299e63View commit details -
Bring back Python backend based PyTorch backend (#6518)
* Patch L0_model_config with runtime * Add L0_pytorch_python_runtime * Update expected runtime field * Add test for escaping runtime * Add comments on unit test imports * Add invalid runtime test * User to build PyTorch env * Update copyright
Configuration menu - View commit details
-
Copy full SHA for 4ffec9f - Browse repository at this point
Copy the full SHA 4ffec9fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0e278c1 - Browse repository at this point
Copy the full SHA 0e278c1View commit details
Commits on Jan 12, 2024
-
Add tests/docs for Pinned Memory metrics (#6754)
* Test case * Update metrics.md * Fix alert * Add copyright * Update test * Improve pinned_memory_metrics_test.py * Update qa/L0_metrics/pinned_memory_metrics_test.py Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> * Update pinned_memory_metrics_test.py --------- Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for e58fa4c - Browse repository at this point
Copy the full SHA e58fa4cView commit details -
Configuration menu - View commit details
-
Copy full SHA for b6a6dce - Browse repository at this point
Copy the full SHA b6a6dceView commit details
Commits on Jan 17, 2024
-
Support for Context Propagation for OTel trace mode (#6785)
Added support for OTel context propagation --------- Co-authored-by: Markus Hennerbichler <markush@speechmatics.com> Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for af2c54e - Browse repository at this point
Copy the full SHA af2c54eView commit details -
Use current time when overwriting model configuration. (#6727)
This validates the change made to ../core wrt how model configuration mtime is handled.
Configuration menu - View commit details
-
Copy full SHA for 7d9f6cd - Browse repository at this point
Copy the full SHA 7d9f6cdView commit details
Commits on Jan 18, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 6779fc5 - Browse repository at this point
Copy the full SHA 6779fc5View commit details -
Configuration menu - View commit details
-
Copy full SHA for e74a353 - Browse repository at this point
Copy the full SHA e74a353View commit details
Commits on Jan 19, 2024
-
Configuration menu - View commit details
-
Copy full SHA for a031177 - Browse repository at this point
Copy the full SHA a031177View commit details -
Improve L0_backend_python on shm reliability (#6803)
* Run all cases wihh shm probe * Warmup test and then run multiple iterations * Log free shared memory on enter/exit of probe * Add shm probe to all tests * Add debug_str to shm_util * Refactor ensemble_io test, modify probe to check for growth rather than inequality * Improve stability of bls_tensor_lifecycle gpu memory tests * Add more visibility into failing model/case in python_unittest helper * [FIXME] Skip probe on certain subtests for now * [FIXME] Remove shm probe from test_restart on unhealthy stub * Start clean server run for each bls test case * Don't exit early on failure so logs can be properly collected * Restore bls test logic * Fix shm size compare * Print region name that leaked * Remove special handling on unittest * Remove debug str * Add enter and exit delay to shm leak probe --------- Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 37cae0c - Browse repository at this point
Copy the full SHA 37cae0cView commit details
Commits on Jan 22, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d1c8507 - Browse repository at this point
Copy the full SHA d1c8507View commit details
Commits on Jan 23, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2b24225 - Browse repository at this point
Copy the full SHA 2b24225View commit details
Commits on Jan 25, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 8fc7b10 - Browse repository at this point
Copy the full SHA 8fc7b10View commit details -
Update trace summary script (#6758)
* Update trace_summery script * Remove GRPC_WAITREAD and Overhead
Configuration menu - View commit details
-
Copy full SHA for 30d64af - Browse repository at this point
Copy the full SHA 30d64afView commit details -
Add gsutil upload retry helper function (#6817)
* Add gsutil cp retry helper function * Add max retry to GCS upload * Use simple sequential upload
Configuration menu - View commit details
-
Copy full SHA for 4bc15c9 - Browse repository at this point
Copy the full SHA 4bc15c9View commit details
Commits on Jan 27, 2024
-
Configuration menu - View commit details
-
Copy full SHA for b0e7e50 - Browse repository at this point
Copy the full SHA b0e7e50View commit details
Commits on Jan 29, 2024
-
Handle 0 dimension output for generate endpoint (#6833)
* Handle empty output * Add test case for 0 dimension output * Fix up number of tests
Configuration menu - View commit details
-
Copy full SHA for f0d788b - Browse repository at this point
Copy the full SHA f0d788bView commit details -
tensorrt-llm benchmarking test (#6771)
* tensorrt-llm benchmarking test
Configuration menu - View commit details
-
Copy full SHA for 62c3a76 - Browse repository at this point
Copy the full SHA 62c3a76View commit details
Commits on Jan 30, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 82c5a77 - Browse repository at this point
Copy the full SHA 82c5a77View commit details -
Use libmamba solver for L0_backend_python env test. Fix pytest not fo…
…und (#6834) * Update miniconda version * Install pytest for different py version * Install pytest
Configuration menu - View commit details
-
Copy full SHA for dfe9dde - Browse repository at this point
Copy the full SHA dfe9ddeView commit details
Commits on Jan 31, 2024
-
Add test for shutdown while loading model (#6837)
* Add test for shutdown while loading * Fix intermittent failure on test_model_config_overwrite
Configuration menu - View commit details
-
Copy full SHA for f345bbb - Browse repository at this point
Copy the full SHA f345bbbView commit details
Commits on Feb 1, 2024
-
Adding OpenTelemetry Batch Span Processor (#6842)
Adding OpenTelemetry Batch Span Processor --------- Co-authored-by: Theo Clark <theoclark101@gmail.com> Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 8f98789 - Browse repository at this point
Copy the full SHA 8f98789View commit details -
Support Double-Type Inference Request/Response Parameters (#6755)
* Support Double-Type Infer/Response Parameters
Configuration menu - View commit details
-
Copy full SHA for 9860f73 - Browse repository at this point
Copy the full SHA 9860f73View commit details
Commits on Feb 7, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2623c7f - Browse repository at this point
Copy the full SHA 2623c7fView commit details
Commits on Feb 8, 2024
-
Python Backend Windows Support (#6830)
* Base Python Backend Support for Windows
Configuration menu - View commit details
-
Copy full SHA for bfbb24c - Browse repository at this point
Copy the full SHA bfbb24cView commit details
Commits on Feb 9, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 3dfb7b1 - Browse repository at this point
Copy the full SHA 3dfb7b1View commit details
Commits on Feb 13, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 4a398b6 - Browse repository at this point
Copy the full SHA 4a398b6View commit details
Commits on Feb 14, 2024
-
Add unit test reports to L0_dlpack_multi_gpu and L0_warmup (#6873)
* Add unit test reports to L0_dlpack_multi_gpu * Add unit test reports to L0_warmup
Configuration menu - View commit details
-
Copy full SHA for c048a9f - Browse repository at this point
Copy the full SHA c048a9fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 44cfb62 - Browse repository at this point
Copy the full SHA 44cfb62View commit details -
Configuration menu - View commit details
-
Copy full SHA for fbe03b3 - Browse repository at this point
Copy the full SHA fbe03b3View commit details
Commits on Feb 16, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 15b7f25 - Browse repository at this point
Copy the full SHA 15b7f25View commit details
Commits on Feb 17, 2024
-
Add response statistics (#6869)
* Add response statistics * Add L0_response_statistics * Enable http vs grpc statistics comparison * Add docs for response statistics protocol * Add more comments for response statistics test * Remove model name from config * Improve docs wordings * [Continue] Improve docs wordings * [Continue] Add more comments for response statistics test * [Continue 2] Improve docs wordings * Fix typo * Remove mentioning decoupled from docs * [Continue 3] Improve docs wordings * [Continue 4] Improve docs wordings Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> --------- Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for d6ab1ef - Browse repository at this point
Copy the full SHA d6ab1efView commit details
Commits on Feb 22, 2024
-
Fix busyop test for L0_memory_growth (#6900)
* Switch to Python model for busyop test * Clean up * Address comment * Remove unused import
Configuration menu - View commit details
-
Copy full SHA for 5732163 - Browse repository at this point
Copy the full SHA 5732163View commit details
Commits on Feb 23, 2024
-
Add cancellation into response statistics (#6904)
* Add cancellation into response statistics * Add test for response statistics cancel * Remove debugging print * Use is None comparison * Fix docs * Use default args None * Refactor RegisterModelStatistics()
Configuration menu - View commit details
-
Copy full SHA for 48cf6b7 - Browse repository at this point
Copy the full SHA 48cf6b7View commit details
Commits on Feb 24, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d0dd553 - Browse repository at this point
Copy the full SHA d0dd553View commit details
Commits on Feb 27, 2024
-
Match forward headers case insensitively. (#6889)
* Modify "header_forward_pattern" to match headers case-insensitively. Add unit tests. * fix indentation * fix pre-comiit errors * Update doc * Update copyright * Add test case for "(?-i)", which disables regex case-insensitive mode. * fix pre-commit * Name each test. Remove support of disabling --http-header-forward-pattern case-insensitive mode on http python client. * Update .md file. * fix typo * Reformat args. * Fix pre-commit * Fix test name issue. * Fix pre-commit. * Update md file and copyright.
Configuration menu - View commit details
-
Copy full SHA for 5630efe - Browse repository at this point
Copy the full SHA 5630efeView commit details
Commits on Mar 1, 2024
-
Configuration menu - View commit details
-
Copy full SHA for ac5ba42 - Browse repository at this point
Copy the full SHA ac5ba42View commit details
Commits on Mar 2, 2024
-
Configuration menu - View commit details
-
Copy full SHA for ad25365 - Browse repository at this point
Copy the full SHA ad25365View commit details -
Update README and versions for 2.43.0 / 24.02 (#6886)
* Update README and versions for 2.43.0 / 24.02 * Update Dockefile to reduce image size. * Update path in patch file for model generation Update README.md post-24.02
Configuration menu - View commit details
-
Copy full SHA for 3bc6863 - Browse repository at this point
Copy the full SHA 3bc6863View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5ba53d8 - Browse repository at this point
Copy the full SHA 5ba53d8View commit details
Commits on Mar 4, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 46f87ff - Browse repository at this point
Copy the full SHA 46f87ffView commit details
Commits on Mar 6, 2024
-
Parameterize git repository (#6934)
* patching git repository parameterization from production branch 1 * Fix go package directory name * pre-commit fixes * pre-commit fixes --------- Co-authored-by: kyle <kmcgill@kmcgill-ubuntu.nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 2255663 - Browse repository at this point
Copy the full SHA 2255663View commit details
Commits on Mar 8, 2024
-
Enhance bound check for shm offset (#6914)
* Enhance bound check for shm offset * Add test for enhance bound check for shm offset * Fix off by 1 on max offset * Improve comments * Improve comment and offset * Separate logic between computation and validation
Configuration menu - View commit details
-
Copy full SHA for 60071e1 - Browse repository at this point
Copy the full SHA 60071e1View commit details -
Allow non-decoupled model to send response and FINAL flag separately (#…
…6017) * Allow non-decoupled model to send response and FINAL flag separately * Update copyright * Defer sending error until FINAL flag is seen to avoid invalid reference * Move timestamp capture location * Delay time-point of response complete timestamp in GPRC and SageMaker endpoint * Move location of RESPONSE_COMPLETE timestamp capture to better align with the meaning.
Configuration menu - View commit details
-
Copy full SHA for 0a8dbaf - Browse repository at this point
Copy the full SHA 0a8dbafView commit details -
Configuration menu - View commit details
-
Copy full SHA for b603024 - Browse repository at this point
Copy the full SHA b603024View commit details
Commits on Mar 9, 2024
-
Test improved input validation errors (#6933)
Added a test case to check for optional/required input params in a request and appropriate response from server. Includes addition of 3 simple models with a combination of required/optional input params
Configuration menu - View commit details
-
Copy full SHA for d89e700 - Browse repository at this point
Copy the full SHA d89e700View commit details
Commits on Mar 11, 2024
-
Update Dockerfile.sdk with OpenAI support (#6941)
Add flag to enable compile of OpenAI support in PA
Configuration menu - View commit details
-
Copy full SHA for a6ce4f6 - Browse repository at this point
Copy the full SHA a6ce4f6View commit details -
Test Correlation Id string support for BLS (#6963)
* Test Correlation Id string support for BLS
Configuration menu - View commit details
-
Copy full SHA for 79a09f2 - Browse repository at this point
Copy the full SHA 79a09f2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4aba07d - Browse repository at this point
Copy the full SHA 4aba07dView commit details
Commits on Mar 13, 2024
-
Add AsyncIO HTTP compression test (#6975)
* Add AsyncIO HTTP compression test * Improve command line option handling
Configuration menu - View commit details
-
Copy full SHA for 55b13f6 - Browse repository at this point
Copy the full SHA 55b13f6View commit details -
Install
genai-pa
into SDK container (#6942)* Update Docerkfile to install genai * Change the installation script * install both build and hatch * Update name --------- Co-authored-by: Elias Bermudez <dbermudez@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 30d086d - Browse repository at this point
Copy the full SHA 30d086dView commit details
Commits on Mar 15, 2024
-
Configuration menu - View commit details
-
Copy full SHA for de473d2 - Browse repository at this point
Copy the full SHA de473d2View commit details -
Exposing trace context to python backend (#6985)
* Added TRITONSERVER_InferenceTraceSetContext logic
Configuration menu - View commit details
-
Copy full SHA for 06b73f3 - Browse repository at this point
Copy the full SHA 06b73f3View commit details
Commits on Mar 19, 2024
-
Add documentation for mapping between Triton Errors and HTTP status c…
…odes (#6992) * Add documentation for mapping between Triton Errors and HTTP status codes * formatting * Update README.md
Configuration menu - View commit details
-
Copy full SHA for 6972963 - Browse repository at this point
Copy the full SHA 6972963View commit details
Commits on Mar 21, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 8fadf21 - Browse repository at this point
Copy the full SHA 8fadf21View commit details -
Configuration menu - View commit details
-
Copy full SHA for a168d51 - Browse repository at this point
Copy the full SHA a168d51View commit details
Commits on Mar 25, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 4949fa2 - Browse repository at this point
Copy the full SHA 4949fa2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8c3156e - Browse repository at this point
Copy the full SHA 8c3156eView commit details
Commits on Mar 27, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 99240f9 - Browse repository at this point
Copy the full SHA 99240f9View commit details
Commits on Apr 1, 2024
-
Update 'main' post-24.03 (#7051)
* Update README and versions for 2.44.0 / 24.03 (#6971) * Update README and versions for 2.44.0 / 24.03 * Mchornyi 24.03 (#6972) * Current location is dropped in 12.4 * Update Dockerfile.win10.min * Change to triton_sample_folder (#6973) --------- Co-authored-by: kyle <kmcgill@kmcgill-ubuntu.nvidia.com> Co-authored-by: Misha Chornyi <99709299+mc-nv@users.noreply.github.com> * Specify path for PyTorch model extension library (#7025) * Update README.md 2.44.0 / 24.03 (#7032) * Update README.md post-24.03 --------- Co-authored-by: Kyle McGill <101670481+nv-kmcgill53@users.noreply.github.com> Co-authored-by: kyle <kmcgill@kmcgill-ubuntu.nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 6cab4bb - Browse repository at this point
Copy the full SHA 6cab4bbView commit details
Commits on Apr 4, 2024
-
Fix incorrect version updates (#7073)
* Fix Otel version * Fix version in CPU metrics * Update metrics.md * Update trace.md
Configuration menu - View commit details
-
Copy full SHA for 3f83727 - Browse repository at this point
Copy the full SHA 3f83727View commit details -
Configuration menu - View commit details
-
Copy full SHA for f57de7f - Browse repository at this point
Copy the full SHA f57de7fView commit details