Skip to content

Consolidate datafusion benchmarks into dfbench #7052

@alamb

Description

@alamb

Is your feature request related to a problem or challenge?

DataFusion currently has four different benchmark runners for comparing different versions of datafusion with itself, which have substantial overlap in their functionality and code

cargo run --release --bin nyctaxi
cargo run --release --bin h2o
cargo run --release --bin parquet
cargo run --release --bin tpch

I would like to add a 5th one (ClickBench) #6994 but would like to avoid copy/pasting yet again mostly the same thing

Describe the solution you'd like

Thus I propose a new consoidated binary dfbench run like this:

cargo run --release --bin benchmark -- nyctaxi
cargo run --release --bin benchmark -- h2o
cargo run --release --bin benchmark -- parquet
cargo run --release --bin benchmark -- tpch
cargo run --release --bin benchmark -- clickbench (I want to add this)

Describe alternatives you've considered

Follow the same pattern

Additional context

Task List

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions