Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement grace hash join when build input is bigger than the available memory #14675

Open
SunRunAway opened this issue Feb 7, 2020 · 1 comment
Assignees
Labels
epic/memory-management sig/execution SIG execution type/enhancement The issue or PR belongs to an enhancement.

Comments

@SunRunAway
Copy link
Contributor

SunRunAway commented Feb 7, 2020

Feature Request

Is your feature request related to a problem? Please describe:

What we already done before is using temporary disk for build side of Hash Join.

  1. But when build input is bigger enough, the memory usage of hash table may still be larger than the available memory.
  2. It spills all build input into disk and abandons the usage of memory for build input. aka, when probing, the reading of each build row is a random I/O.

Describe the feature you'd like:

By implementing GRACE hash join or hybrid hash join, we can spill both build input and probe input when memory limit is exceeded, and for each partition we can join in memory.

Describe alternatives you've considered:

Teachability, Documentation, Adoption, Migration Strategy:

@b41sh
Copy link
Member

b41sh commented Mar 6, 2020

Let me do this issue.

@jianzhiyao
Copy link
Contributor

marked

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
epic/memory-management sig/execution SIG execution type/enhancement The issue or PR belongs to an enhancement.
Projects
None yet
Development

No branches or pull requests

5 participants