Skip to content

Provide a syntax to perform row aggregations fast #2439

Closed
@ppalmes

Description

@ppalmes
  • All() operates on collection of columns and typically, you need to operate in many columns so it is easier to process vector or tuple of rows instead of using positional arguments.
  • All() -> ByRow(sum or mean) won't work including other statistical metrics because All returns positional arguments and most functions process either vectors or tuples.
  • All()=>ByRow((x...) -> sum(x)) or AsTable(All()) => (x->sum(x)) is needed but it is messy because why do you have to transform All() into a table because from the user perspective, dataframe is a table already. It is not clear why you need AsTable with All(). This happens because All() returns a positional value instead of a collection like vector or tuple
  • most functions return scalar, or tuple, or vector. it is confusing to find All() returning positional argument for typical user. it is convenient to deal with two or three columns but many use-cases require operating each row across several columns

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions