Skip to content

[FEA] Add an internal utility API to return an offsets column of a sliced column starting with zero #9256

Open
@ttnghia

Description

For nested sliced columns that have an offsets column child, their offsets columns may contain values that do not start from zero. For example:

offsets = [5, 7, 20, ...]

Many operations on these sliced columns need to generate an output offsets column that starts with zero. For example, with the input column having offsets given above, the output offsets column should be:

offsets = [0, 2, 13, ...]

Such output offsets column is generated simply by subtracting all the values with the first value. Yes, very simple.

I would like to have an internal API implementing this feature. Currently, there are several other APIs using it by implementing private code in their .cu files. For example:

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Assignees

Labels

feature requestNew feature or requestimprovementImprovement / enhancement to an existing functionlibcudfAffects libcudf (C++/CUDA) code.non-breakingNon-breaking change

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions