Conversation

TomAugspurger (Contributor) commented Sep 19, 2025

Description

This adds a new option to cudf-polars' configuration, allowing users to specify the memory resource to create by default.

Currently, users either get the default behavior (typically a managed memory resource) or can pass in a concrete memory resource (see the new docs added in this PR).

In some cases, you might want to pass in a description of the memory resource to use:

  1. In our unit tests, we might want to specify a CudaAsyncMemoryResource with a relatively small initial pool size
  2. In a distributed environment, you can't pass around concrete memory resource objects, since they can't be serialized

This is strictly more flexible than the current option of setting POLARS_GPU_ENABLE_CUDA_MANAGED_MEMORY. Setting POLARS_GPU_ENABLE_CUDA_MANAGED_MEMORY=0 gets you a CudaAsyncMemoryResource with its default initial_pool_size and release_threshold. With this system, you can set

CUDF_POLARS__MEMORY_RESOURCE_CONFIG__QUALNAME="rmm.mr.CudaAsyncMemoryResource"

to get the same thing, or

CUDF_POLARS__MEMORY_RESOURCE_CONFIG__QUALNAME="rmm.mr.CudaAsyncMemoryResource"
CUDF_POLARS__MEMORY_RESOURCE_CONFIG__OPTIONS='{"initial_pool_size": 256, "release_threshold": 256}'

to configure the pool.
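
As a rough sketch (not the PR's actual implementation), the two variables could be consumed along these lines: resolve the dotted qualname with importlib and treat the options string as JSON keyword arguments.

    import importlib
    import json
    import os

    # Read the environment variables shown above.
    qualname = os.environ["CUDF_POLARS__MEMORY_RESOURCE_CONFIG__QUALNAME"]
    options = json.loads(
        os.environ.get("CUDF_POLARS__MEMORY_RESOURCE_CONFIG__OPTIONS", "{}")
    )

    # Resolve "rmm.mr.CudaAsyncMemoryResource" to the class itself...
    module_name, _, cls_name = qualname.rpartition(".")
    mr_cls = getattr(importlib.import_module(module_name), cls_name)

    # ...and instantiate it with the decoded options, e.g.
    # CudaAsyncMemoryResource(initial_pool_size=256, release_threshold=256).
    mr = mr_cls(**options)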

I'd recommend deprecating POLARS_GPU_ENABLE_CUDA_MANAGED_MEMORY to reduce the number of ways this can be configured, though we should take our time with that.

copy-pr-bot commented Sep 19, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Examples
--------
>>> MemoryResourceConfig(
...     qualname="rmm.mr.CudaAsyncMemoryResource",

TomAugspurger (Contributor, Author) commented on this docstring snippet:

Are people OK with "qualname" here? I want to avoid locking us to MRs that happen to be defined in rmm.mr.
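
(Since a qualname is just a dotted import path, resolved with importlib as in the sketch above, it would accept any importable memory resource class, not only those defined in rmm.mr.)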

TomAugspurger (Contributor, Author) commented Sep 24, 2025

The only other feature I might want to add here is expanding the MemoryResourceConfig to handle "nested" memory resource configurations. Right now, you can't describe an RMM memory resource whose inner (upstream) memory resource needs configuring itself. I'm not sure whether that's worth doing here or not.

For example, to express our default memory resource:

    mr = rmm.mr.PrefetchResourceAdaptor(
        rmm.mr.PoolMemoryResource(
            rmm.mr.ManagedMemoryResource(),
            initial_pool_size=free_memory,
        )
    )

Aside from free_memory itself being dynamic, this could be expressed as something like:

{
    "qualname": "rmm.mr.PrefetchResourceAdaptor",
    "options": {
        "upstream_mr": {
            "qualname": "rmm.mr.PoolMemoryResource",
            "options": {
                "upstream_mr": {
                    "qualname": "rmm.mr.ManagedMemoryResource",
                    "options": {}
                },
                "initial_pool_size": 256
            }
        }
    }
}

That relies on pattern matching a dict with {"qualname": ..., "options": ...} to mean "this is a memory resource config", which is probably sufficient.
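
A minimal sketch of that pattern matching, assuming a hypothetical build_mr helper (not the PR's implementation):

    import importlib

    def _resolve(qualname: str):
        # "pkg.module.ClassName" -> the ClassName object from pkg.module.
        module_name, _, attr = qualname.rpartition(".")
        return getattr(importlib.import_module(module_name), attr)

    def build_mr(config: dict):
        # Any option value shaped like {"qualname": ..., "options": ...} is
        # itself a memory resource config: build it recursively, innermost-out.
        kwargs = {
            key: build_mr(value)
            if isinstance(value, dict) and "qualname" in value
            else value
            for key, value in config.get("options", {}).items()
        }
        return _resolve(config["qualname"])(**kwargs)

Applied to the JSON above, build_mr would produce PrefetchResourceAdaptor(PoolMemoryResource(ManagedMemoryResource(), initial_pool_size=256)).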
