Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: scan row groups in one sst file parallelly #474

Conversation

Rachelint
Copy link
Contributor

Which issue does this PR close?

Closes #

Rationale for this change

Now, the concurrent level while pulling sst data is on file level, we can enhance it to row group level.

What changes are included in this PR?

Are there any user-facing changes?

How does this change test

Test by ut.

@Rachelint Rachelint changed the title feat: make scanning sst file in parallel feat: make scanning sst file in row group parallel Dec 12, 2022
@Rachelint Rachelint force-pushed the origin/make-pulling-single-sst-file-in-row-group-parallel branch from fc26b39 to a4f2a39 Compare December 12, 2022 03:22
@Rachelint Rachelint force-pushed the origin/make-pulling-single-sst-file-in-row-group-parallel branch 5 times, most recently from 27fafff to 33bb9e6 Compare December 13, 2022 14:39
analytic_engine/src/sst/parquet/async_reader.rs Outdated Show resolved Hide resolved
analytic_engine/src/sst/parquet/async_reader.rs Outdated Show resolved Hide resolved
analytic_engine/src/sst/parquet/async_reader.rs Outdated Show resolved Hide resolved
@Rachelint Rachelint force-pushed the origin/make-pulling-single-sst-file-in-row-group-parallel branch 2 times, most recently from 5c75795 to 75f6e5e Compare December 15, 2022 12:52
@Rachelint Rachelint force-pushed the origin/make-pulling-single-sst-file-in-row-group-parallel branch 2 times, most recently from e40ba49 to 67823b4 Compare December 15, 2022 13:32
@Rachelint Rachelint force-pushed the origin/make-pulling-single-sst-file-in-row-group-parallel branch 3 times, most recently from 3f51d14 to a925730 Compare December 15, 2022 13:44
@Rachelint Rachelint force-pushed the origin/make-pulling-single-sst-file-in-row-group-parallel branch from a925730 to 7a81423 Compare December 16, 2022 03:03
@Rachelint Rachelint force-pushed the origin/make-pulling-single-sst-file-in-row-group-parallel branch from 7a81423 to ea76811 Compare December 16, 2022 03:19
Copy link
Contributor

@jiacai2050 jiacai2050 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@jiacai2050 jiacai2050 changed the title feat: make scanning sst file in row group parallel feat: scan row groups in one sst file parallelly Dec 16, 2022
@jiacai2050 jiacai2050 merged commit 5e02068 into apache:main Dec 16, 2022
chunshao90 pushed a commit to chunshao90/ceresdb that referenced this pull request May 15, 2023
* make scanning sst file in parallel.

* rename `read_parallelism` to `background_read_parallelism` in `SstReaderOptions`.

* address CR.

* fix parallelly reading but and add tests.

* add more checks for the `read_parallism`.

* address CR.
@Rachelint Rachelint deleted the origin/make-pulling-single-sst-file-in-row-group-parallel branch May 27, 2023 12:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants