Skip to content

[FR] Transform Chaining, Lazy Mapping #6012

Open
@NightMachinery

Description

Feature request

Currently using a map call processes and duplicates the whole dataset, which takes both time and disk space.

The solution is to allow lazy mapping, which is essentially a saved chain of transforms that are applied on the fly whenever a slice of the dataset is requested.

The API should look like map, as set_transform changes the current dataset while map returns another dataset.

Motivation

Lazy processing allows lower disk usage and faster experimentation.

Your contribution

_

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions