From c8297d6827b599208cb0c7b94c766856de7abd79 Mon Sep 17 00:00:00 2001 From: Stanley Tsang Date: Thu, 16 Nov 2023 10:31:39 -0700 Subject: [PATCH] 6.0 cherry pick for changelog and version update (#483) * Fix changelog for 6.0 * Fix version --- CHANGELOG.md | 10 +++++++--- CMakeLists.txt | 2 +- 2 files changed, 8 insertions(+), 4 deletions(-) diff --git a/CHANGELOG.md b/CHANGELOG.md index ab972ba62..6e75d46de 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -2,19 +2,23 @@ Full documentation for rocPRIM is available at [https://rocprim.readthedocs.io/en/latest/](https://rocprim.readthedocs.io/en/latest/) -## [Unreleased rocPRIM-2.13.1 for ROCm 5.7.0] +## [rocPRIM-3.0.0 for ROCm 6.0.0] ### Added - `block_sort::sort()` overload for keys and values with a dynamic size, for all block sort algorithms. Additionally, all `block_sort::sort()` overloads with a dynamic size are now supported for `block_sort_algorithm::merge_sort` and `block_sort_algorithm::bitonic_sort`. - New two-way partition primitive `partition_two_way` which can write to two separate iterators. +### Optimizations +- Improved the performance of `partition`. +### Fixed +- Fixed `rocprim::MatchAny` for devices with 64-bit warp size. The function `rocprim::MatchAny` is deprecated and `rocprim::match_any` is preferred instead. + +## [rocPRIM-2.13.1 for ROCm 5.7.0] ### Changed - Deprecated configuration `radix_sort_config` for device-level radix sort as it no longer matches the algorithm's parameters. New configuration `radix_sort_config_v2` is preferred instead. - Removed erroneous implementation of device-level `inclusive_scan` and `exclusive_scan`. The prior default implementation using lookback-scan now is the only available implementation. - The benchmark metric indicating the bytes processed for `exclusive_scan_by_key` and `inclusive_scan_by_key` has been changed to incorporate the key type. Furthermore, the benchmark log has been changed such that these algorithms are reported as `scan` and `scan_by_key` instead of `scan_exclusive` and `scan_inclusive`. - Deprecated configurations `scan_config` and `scan_by_key_config` for device-level scans, as they no longer match the algorithm's parameters. New configurations `scan_config_v2` and `scan_by_key_config_v2` are preferred instead. -- Improved the performance of `partition`. ### Fixed - Fixed build issue caused by missing header in `thread/thread_search.hpp`. -- Fixed `rocprim::MatchAny` for devices with 64-bit warp size. The function `rocprim::MatchAny` is deprecated and `rocprim::match_any` is preferred instead. ## [rocPRIM-2.13.0 for ROCm 5.5.0] ### Added diff --git a/CMakeLists.txt b/CMakeLists.txt index 40c8d8cf9..900e2021c 100644 --- a/CMakeLists.txt +++ b/CMakeLists.txt @@ -114,7 +114,7 @@ if(USE_HIP_CPU) endif() # Setup VERSION -set(VERSION_STRING "2.13.1") +set(VERSION_STRING "3.0.0") rocm_setup_version(VERSION ${VERSION_STRING}) # Print configuration summary