Add support of Apache Uniffle for remote shuffle service #796

Merged: richox merged 3 commits into apache:master from zuston:uniffle2 on Feb 5, 2025

@zuston (Member) commented Jan 26, 2025

Which issue does this PR close?

Closes #.

Rationale for this change

Uniffle is a high-performance, general-purpose remote shuffle service for distributed computing engines. It pushes shuffle data to a centralized storage service, changing the shuffle model from a "local file, pull-like" style to a "remote block, push-like" style. This brings several advantages, such as support for disaggregated storage deployments, very large shuffle jobs, and high elasticity. It currently supports Apache Spark, Apache Hadoop MapReduce, and Apache Tez.
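
To make the "remote block, push-like" style concrete, here is a minimal toy sketch in Python. It is not Uniffle's or Blaze's API: `RemoteShuffleStore`, `push_block`, `fetch_partition`, `hash_partition`, and `run_map_task` are hypothetical names, and the "service" is just an in-memory dictionary. It only illustrates the idea that map tasks push per-partition blocks to a central store, and reducers later read a partition from that store instead of pulling files from each mapper's local disk.

```python
from collections import defaultdict


class RemoteShuffleStore:
    """Hypothetical in-memory stand-in for a centralized shuffle service."""

    def __init__(self):
        # partition id -> list of (map_id, records) blocks pushed by map tasks
        self._blocks = defaultdict(list)

    def push_block(self, partition_id, map_id, records):
        # Map side: push one block of records for a single partition.
        self._blocks[partition_id].append((map_id, list(records)))

    def fetch_partition(self, partition_id):
        # Reduce side: read every block of one partition from the store,
        # ordered by map id so the result is deterministic.
        blocks = sorted(self._blocks.get(partition_id, []), key=lambda b: b[0])
        return [record for _map_id, records in blocks for record in records]


def hash_partition(key, num_partitions):
    # Deterministic toy partitioner (assumption: keys are strings).
    return sum(key.encode("utf-8")) % num_partitions


def run_map_task(store, map_id, records, num_partitions):
    # Buffer records per partition, then push each buffer as one block.
    buffers = defaultdict(list)
    for key, value in records:
        buffers[hash_partition(key, num_partitions)].append((key, value))
    for partition_id, block in buffers.items():
        store.push_block(partition_id, map_id, block)
```

In this sketch a reducer for partition `p` only calls `store.fetch_partition(p)`. It never needs to know which executors ran the map tasks, which is the property that allows compute and shuffle storage to be deployed separately.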

Because of these advantages, Uniffle has been adopted by several commercial companies. After integrating with Blaze, users' Spark jobs will benefit greatly from both storage-computation separation and vectorized execution.

What changes are included in this PR?

Implements the Uniffle shuffle writer and reader, following Blaze's RSS shuffle manager design.

Are there any user-facing changes?

Yes.

@merrily01 (Member) commented

Really looking forward to this feature. Thank you for your contribution.

@richox merged commit c1d70b1 into apache:master on Feb 5, 2025
618 checks passed
@richox mentioned this pull request on Apr 27, 2025

3 participants