Skip to content

Conversation

@dongjoon-hyun
Copy link
Member

@dongjoon-hyun dongjoon-hyun commented Jun 11, 2025

What changes were proposed in this pull request?

This PR aims to lower the minimum limit of spark.eventLog.rolling.maxFileSize from 10MiB to 2MiB at Apache Spark 4.1.0 while keeping the default (128MiB).

Why are the changes needed?

spark.eventLog.rolling.maxFileSize has 10MiB as the lower bound limit since Apache Spark 3.0.0.

By reducing the lower bound to 2MiB, we can allow Spark jobs to write small log files more frequently and faster without waiting for 10MiB. This is helpful some slow(large micro-batch period) or low-traffic streaming jobs. The users will set a proper value for their jobs.

Does this PR introduce any user-facing change?

There is no behavior change for the existing jobs. This only extends the range of configuration values for a user who wants to have lower values.

How was this patch tested?

Pass the CIs.

Was this patch authored or co-authored using generative AI tooling?

No.

@github-actions github-actions bot added the CORE label Jun 11, 2025
@dongjoon-hyun dongjoon-hyun changed the title [SPARK-52456][CORE] Lower the maximum size limit of event log files [SPARK-52456][CORE] Lower the minimum limit of spark.eventLog.rolling.maxFileSize Jun 11, 2025
@dongjoon-hyun
Copy link
Member Author

Thank you, @yaooqinn . Merged to master for Apache Spark 4.1.0.

@dongjoon-hyun dongjoon-hyun deleted the SPARK-52456 branch June 12, 2025 03:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants