Improved support for MongoOffsets using hadoop file systems #130

rozza · 2025-01-15T13:58:27Z

Spark expects Hadoop configuration to carry the `spark.hadoop.` prefix. However, documentation on the web often omits this prefix when setting filesystem configuration (see the Azure storage docs).

The problem for the connector is that the `MongoOffset` support just uses the `SparkContext.hadoopConfiguration()` helper method, which omits any non-prefixed configuration. This improvement adds any filesystem configuration prefixed with `fs.` to the Hadoop configuration, so that `MongoOffset`'s use of the Hadoop filesystem includes that configuration.

SPARK-438
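
To illustrate the behaviour described above, here is a minimal sketch that models the two configurations with plain maps. It is not the connector's actual code: the class `HadoopConfSketch`, both method names, and the Azure account name `example` are hypothetical, and letting an un-prefixed `fs.` key override a prefixed one is an arbitrary choice made for this sketch.

```java
import java.util.Map;
import java.util.TreeMap;

// Simplified model using plain maps; NOT the connector's actual implementation.
public class HadoopConfSketch {
    private static final String SPARK_HADOOP_PREFIX = "spark.hadoop.";
    private static final String FS_PREFIX = "fs.";

    // Models the default behaviour: only "spark.hadoop."-prefixed entries
    // reach the Hadoop configuration, with the prefix stripped.
    public static Map<String, String> defaultHadoopConf(Map<String, String> sparkConf) {
        Map<String, String> hadoopConf = new TreeMap<>();
        for (Map.Entry<String, String> e : sparkConf.entrySet()) {
            if (e.getKey().startsWith(SPARK_HADOOP_PREFIX)) {
                String key = e.getKey().substring(SPARK_HADOOP_PREFIX.length());
                hadoopConf.put(key, e.getValue());
            }
        }
        return hadoopConf;
    }

    // Models the improvement: un-prefixed "fs." entries are copied as well.
    // Assumption: an un-prefixed key wins over a prefixed duplicate.
    public static Map<String, String> withFilesystemConf(Map<String, String> sparkConf) {
        Map<String, String> hadoopConf = defaultHadoopConf(sparkConf);
        for (Map.Entry<String, String> e : sparkConf.entrySet()) {
            if (e.getKey().startsWith(FS_PREFIX)) {
                hadoopConf.put(e.getKey(), e.getValue());
            }
        }
        return hadoopConf;
    }

    public static void main(String[] args) {
        Map<String, String> sparkConf = new TreeMap<>();
        // Un-prefixed key, written the way storage documentation often shows it.
        sparkConf.put("fs.azure.account.key.example.dfs.core.windows.net", "<key>");
        // Prefixed key, written the way Spark expects it.
        sparkConf.put("spark.hadoop.fs.defaultFS", "hdfs://namenode:8020");

        System.out.println("default:  " + defaultHadoopConf(sparkConf));
        System.out.println("improved: " + withFilesystemConf(sparkConf));
    }
}
```

In this model the un-prefixed Azure key is missing from the default result and present in the improved one, which is the gap the change is meant to close.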


katcharov

LGTM!

rozza requested a review from katcharov January 16, 2025 09:38

katcharov approved these changes Jan 20, 2025


rozza merged commit 560d495 into mongodb:main Jan 22, 2025
21 of 24 checks passed

rozza deleted the SPARK-438 branch January 22, 2025 10:41
