@parthchandra
Contributor

Replaces the hacked CometScan with CometNativeScan so we can switch between the original and experimental scans without rebuilding.
I tried removing a lot of the functions in CometNativeScan, but too many tests in CometExecSuite failed, so I ended up adding almost everything back.
Three tests in CometExecScan are still failing.

Comment on lines 80 to 87
val COMET_FULL_NATIVE_SCAN_ENABLED: ConfigEntry[Boolean] = conf(
  "spark.comet.native.scan.enabled")
  .doc(
    "Whether to enable the fully native scan. When this is turned on, Spark will use Comet to " +
      "read supported data sources (currently only Parquet is supported natively)." +
      " By default, this config is true.")
  .booleanConf
  .createWithDefault(true)
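
For readers unfamiliar with this kind of flag, here is a minimal standalone sketch of how a string-keyed boolean setting with a default might be resolved. It is an illustration only, not Comet's `ConfigEntry` implementation: `BoolFlag`, `NativeScanFlagDemo`, and `nativeScanEnabled` are hypothetical names, and only the key `spark.comet.native.scan.enabled` and its default of `true` come from the snippet above.

```scala
// Hypothetical stand-in for a boolean config entry; not Comet's ConfigEntry API.
final case class BoolFlag(key: String, default: Boolean) {
  // Look the key up in a plain settings map, falling back to the default.
  def get(settings: Map[String, String]): Boolean =
    settings.get(key).map(_.trim.toLowerCase) match {
      case Some("true")  => true
      case Some("false") => false
      case Some(other)   =>
        throw new IllegalArgumentException(s"$key expects true or false, got: $other")
      case None          => default
    }
}

object NativeScanFlagDemo {
  // Key and default taken from the snippet under review.
  val nativeScanEnabled: BoolFlag =
    BoolFlag("spark.comet.native.scan.enabled", default = true)

  def main(args: Array[String]): Unit = {
    // No explicit setting: the default applies.
    println(nativeScanEnabled.get(Map.empty))
    // Explicit override: the flag is turned off.
    println(nativeScanEnabled.get(Map("spark.comet.native.scan.enabled" -> "false")))
  }
}
```

In a real Spark job the value would typically be supplied as an ordinary Spark setting, for example `--conf spark.comet.native.scan.enabled=false`, which is presumably how one would switch back to the original scan without rebuilding.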
Member

I added a similar config locally. 🙂

Contributor Author

If it was something related, you can merge it here. This is an experimental branch. If it is for something different, then we can rename our config.

Member

@viirya viirya Nov 12, 2024

No, I meant I tried to make a similar native scan change. I was trying to enable the DataFusion native Parquet scan yesterday. I found this branch this morning, so I tried to apply my initial change onto it.

I will see if any of the differences can be merged.

_,
_,
_)
if CometNativeScanExec.isSchemaSupported(requiredSchema)
Member

We could also fall back for bucketed scans and other unsupported cases (this can be a separate PR).
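
One way such a fallback check could look is sketched below, purely as an illustration. `ScanInfo`, `ScanChoice`, and `ScanFallbackSketch` are hypothetical names, not Spark or Comet classes; the idea is simply to collect every reason the native path cannot be used and fall back when any reason is present.

```scala
// Hypothetical, simplified description of a scan; not a Spark or Comet class.
final case class ScanInfo(schemaSupported: Boolean, bucketed: Boolean)

sealed trait ScanChoice
case object UseNativeScan extends ScanChoice
final case class FallBack(reason: String) extends ScanChoice

object ScanFallbackSketch {
  // Collect every reason the native path cannot be used; empty means it can.
  def choose(scan: ScanInfo): ScanChoice = {
    val reasons = Seq(
      (!scan.schemaSupported) -> "unsupported schema",
      scan.bucketed -> "bucketed scan"
    ).collect { case (true, reason) => reason }

    if (reasons.isEmpty) UseNativeScan else FallBack(reasons.mkString(", "))
  }
}
```

Keeping the reasons as strings makes it easy to add further unsupported cases later and to report why a particular scan fell back.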

Member

@andygrove andygrove left a comment

LGTM. Thanks @parthchandra

@andygrove andygrove merged commit 38e32f7 into apache:comet-parquet-exec Nov 12, 2024
17 of 74 checks passed
andygrove added a commit that referenced this pull request Nov 12, 2024

3 participants