Skip to content

Changes in v2025.03.0 compared to develop #1803

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 6 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion .gitlab/custom-jobs-and-variables.yml
Original file line number Diff line number Diff line change
Expand Up @@ -56,7 +56,7 @@ variables:
# Lassen and Butte use a different job scheduler (spectrum lsf) that does not
# allow pre-allocation the same way slurm does.
# Arguments for job level allocation
LASSEN_JOB_ALLOC: "1 -W 40 -q pci"
LASSEN_JOB_ALLOC: "1 -W 45 -q pci"
# Project specific variants for lassen
PROJECT_LASSEN_VARIANTS: "~shared +openmp +vectorization +tests cuda_arch=70"
# Project specific deps for lassen
Expand Down
4 changes: 2 additions & 2 deletions CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -14,8 +14,8 @@ endif()
include(CMakeDependentOption)

# Set version number
set(RAJA_VERSION_MAJOR 2024)
set(RAJA_VERSION_MINOR 07)
set(RAJA_VERSION_MAJOR 2025)
set(RAJA_VERSION_MINOR 03)
set(RAJA_VERSION_PATCHLEVEL 0)

if (RAJA_LOADED AND (NOT RAJA_LOADED STREQUAL "${RAJA_VERSION_MAJOR}.${RAJA_VERSION_MINOR}.${RAJA_VERSION_PATCHLEVEL}"))
Expand Down
2 changes: 1 addition & 1 deletion RELEASE
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
*******************************************************************************

RAJA: ................................, version 2023.06.1
RAJA: ................................, version 2025.03.0

Copyright (c) 2016-25, Lawrence Livermore National Security, LLC.
Produced at the Lawrence Livermore National Laboratory.
Expand Down
50 changes: 50 additions & 0 deletions RELEASE_NOTES.md
Original file line number Diff line number Diff line change
Expand Up @@ -20,6 +20,56 @@ Notable changes include:
* Bug fixes/improvements:


Version 2025.03.0 -- Release date 2025-03-17
============================================

This release contains new features, bug fixes, and updates to submodule
dependencies.

Notable changes include:

* New features / API changes:
* Added improved support for perfectly nested loops in RAJA::launch.
* Added helper methods to simplify the creation of RAJA View objects
with permutations of stride ordering. Examples and user docs have also
been added.
* Added GPU policies for CUDA and HIP that do not check loop bounds when
they do not need to be checked in a kernel. This can help improve
performance by up to 5%. The new policies are documented in the RAJA
user guide and include `direct_unchecked` in their names.
* Refactored the new (experimental) RAJA reduction interface to have
consistent min/max/loc operator semantics and added type safety to
reduce erroneous usage. Changes are described in the RAJA User Guide.
* Added support for new RAJA reduction interface to RAJA::dynamic_forall
and pulled dynamic_forall out of RAJA `expt` namespace.
* Added `RAJA_HIP_WAVESIZE` CMake option to set the wave size for HIP
builds. It defaults to 64 but can be set to 32, for example, to
build RAJA to run on Radeon gaming cards.

* Build changes/improvements:
* Update BLT to v0.7.0 release.
* Update camp submodule to v2025.03.0 release.
* Update desul submodule to 6114dd25b54782678c555c0c1d2197f13cc8d2a0
commit.
* Added clang-format CI check (clang 14) that must pass before a PR can
be merged -- noted here so external contributors are aware.

* Bug fixes/improvements:
* Resolved undefined behavior related to constructing
uniform_int_distribution with min > max. This was causing some Windows
tests to fail.
* Corrected call to wrong global function when using a fixed CUDA policy
and reductions in RAJA::launch kernel -- potential performance issue.
* Fixed memory leak in RAJA::launch OpenMP back-end.
* Added missing host-device decorations to some math utility functions.
* Fixed MSVC compilation failures with 64-bit intrinsics in x86 Windows
builds.
* Fixed issue so that a kernel will no longer be launched when there is no
work for it to do; i.e., no active iteration space entries.
* Removed invalid C++ usage in implementation of RAJA::kernel `initLocalMem`
statement, which was causing large warning messages during compilation.


Version 2024.07.0 -- Release date 2024-07-24
============================================

Expand Down
4 changes: 2 additions & 2 deletions docs/conf.py
Original file line number Diff line number Diff line change
Expand Up @@ -87,9 +87,9 @@
# built documents.
#
# The short X.Y version.
version = u'2024.07'
version = u'2025.03'
# The full version, including alpha/beta/rc tags.
release = u'2024.07.0'
release = u'2025.03.0'

# The language for content autogenerated by Sphinx. Refer to documentation
# for a list of supported languages.
Expand Down