Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update to apache/arrow-datafusion#2578 #48

Merged
merged 3 commits into from
May 31, 2022
Merged

Conversation

tustvold
Copy link
Contributor

@tustvold tustvold commented May 29, 2022

Which issue does this PR close?

Rationale for this change

apache/datafusion#2578 contains breaking changes (introduces ListingTableUri)

What changes are included in this PR?

Updates Ballista with breaking changes

Are there any user-facing changes?

There are breaking changes to DataFusion

@tustvold tustvold changed the title Update to https://github.com/apache/arrow-datafusion/pull/2578 Update to apache/arrow-datafusion#2578 May 29, 2022
let df = ctx.read_csv(table_name, options).await?;

let uri = format!("file:///{}.csv", table_name);
ctx.register_csv(table_name, &uri, options).await?;
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Changing to using this method is necessary as otherwise it infers the ?table? as the table name, which seems to result in an additional projection appearing, not entirely sure why though...

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@tustvold tustvold marked this pull request as ready for review May 30, 2022 15:55
@tustvold tustvold requested a review from andygrove May 30, 2022 15:56
Copy link
Contributor

@alamb alamb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The changes make sense to me

@@ -39,8 +39,8 @@ arrow-flight = { version = "14.0.0" }
async-trait = "0.1.41"
chrono = { version = "0.4", default-features = false }
clap = { version = "3", features = ["derive", "cargo"] }
datafusion = { git = "https://github.com/apache/arrow-datafusion", rev = "b6fb0dd52c2abd0f8e134aa46cc1571cc6a0971b" }
datafusion-proto = { git = "https://github.com/apache/arrow-datafusion", rev = "b6fb0dd52c2abd0f8e134aa46cc1571cc6a0971b" }
datafusion = { git = "https://github.com/apache/arrow-datafusion", rev = "fdb8fecf0ab475ba07dc0d15f7b53e25ccf30ee7" }
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@andygrove / @thinkharderdev -- what would you think about writing some script that automatically updates this pin rather than requiring datafusion developers manually make PRs?

We basically upgrade the DataFusion pin manually for IOx and while it takes times it seems to work for us -- for example https://github.com/influxdata/influxdb_iox/pull/4619

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Makes sense to me

let df = ctx.read_csv(table_name, options).await?;

let uri = format!("file:///{}.csv", table_name);
ctx.register_csv(table_name, &uri, options).await?;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@alamb
Copy link
Contributor

alamb commented May 31, 2022

I am going to merge this in because:

  1. I don't think it contains anything potentially contentious
  2. It is blocking the datafusion upgrade to arrow: Upgrade to arrow 15 datafusion#2631

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants