Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove hard-coded special treatment for $path and $bucket #11221

Closed
wants to merge 1 commit into from

Conversation

Yuhta
Copy link
Contributor

@Yuhta Yuhta commented Oct 10, 2024

Summary: These special columns are engine-specific and should be handled during engine split generation by setting the corresponding values in split infoColumns. For example in Prestissimo this is done in https://github.com/prestodb/presto/blob/48f0a0c1d380b1155dfd7c99b134a350627c7260/presto-native-execution/presto_cpp/main/types/PrestoToVeloxConnector.cpp#L1112-L1120.

Differential Revision: D64180231

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 10, 2024
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D64180231

Copy link

netlify bot commented Oct 10, 2024

Deploy Preview for meta-velox canceled.

Name Link
🔨 Latest commit 48fe9a9
🔍 Latest deploy log https://app.netlify.com/sites/meta-velox/deploys/6708174b6489390008cf19d0

Copy link
Contributor

@xiaoxmeng xiaoxmeng left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Yuhta LGTM. I assume the query system backend already handle these two special columns? Thanks!

@@ -226,7 +226,9 @@ class HiveConnectorTestBase : public OperatorTestBase {
class HiveConnectorSplitBuilder {
public:
HiveConnectorSplitBuilder(std::string filePath)
: filePath_{std::move(filePath)} {}
: filePath_{filePath.find("/") == 0 ? "file:" + filePath : filePath} {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why we need this prefix to ensure it goes to local fs? IIRC, the default goes to local fs.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't know either, maybe some legacy workaround, let me remove it to see if it still works

@Yuhta
Copy link
Contributor Author

Yuhta commented Oct 10, 2024

@xiaoxmeng Yes Prestissimo handles it in split conversion code I mentioned in the PR description. Other usages also have the special columns they need set up properly in their code.

Yuhta added a commit to Yuhta/velox that referenced this pull request Oct 10, 2024
…cubator#11221)

Summary:

These special columns are engine-specific and should be handled during engine split generation by setting the corresponding values in split `infoColumns`.  For example in Prestissimo this is done in https://github.com/prestodb/presto/blob/48f0a0c1d380b1155dfd7c99b134a350627c7260/presto-native-execution/presto_cpp/main/types/PrestoToVeloxConnector.cpp#L1112-L1120.

Reviewed By: xiaoxmeng

Differential Revision: D64180231
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D64180231

Yuhta added a commit to Yuhta/velox that referenced this pull request Oct 10, 2024
…cubator#11221)

Summary:

These special columns are engine-specific and should be handled during engine split generation by setting the corresponding values in split `infoColumns`.  For example in Prestissimo this is done in https://github.com/prestodb/presto/blob/48f0a0c1d380b1155dfd7c99b134a350627c7260/presto-native-execution/presto_cpp/main/types/PrestoToVeloxConnector.cpp#L1112-L1120.

Reviewed By: xiaoxmeng

Differential Revision: D64180231
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D64180231

…cubator#11221)

Summary:

These special columns are engine-specific and should be handled during engine split generation by setting the corresponding values in split `infoColumns`.  For example in Prestissimo this is done in https://github.com/prestodb/presto/blob/48f0a0c1d380b1155dfd7c99b134a350627c7260/presto-native-execution/presto_cpp/main/types/PrestoToVeloxConnector.cpp#L1112-L1120.

Reviewed By: xiaoxmeng

Differential Revision: D64180231
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D64180231

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in 0758d04.

Copy link

Conbench analyzed the 1 benchmark run on commit 0758d04d.

There were no benchmark performance regressions. 🎉

The full Conbench report has more details.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported Merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants