-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Write csv not save all lines of dataframe #3783
Comments
@Miyake-Diogo So I tried running this code but it has dependencies that are not here:
Do you have this code in GitHub somewhere? I am happy to help debug if you have a public repro case. |
Here is a smaller repro case: use datafusion::prelude::*;
#[tokio::main]
async fn main() -> datafusion::error::Result<()> {
let ctx: SessionContext = SessionContext::new();
let raw_fato_path: &str = "/mnt/bigdata/census/Data8277.csv";
let stage_fato_path: &str = "/tmp/stage";
let fato_census_df = ctx.read_csv(raw_fato_path, CsvReadOptions::new()).await?;
fato_census_df.write_csv(stage_fato_path).await?;
Ok(())
}
I tested with DataFusion 11, 12, and 13, and all have the same issue |
@Miyake-Diogo The issue is that this error is happening:
I recommend specifying the schema for the file since it contains mixed types for this column. You did not see the error due to a bug with the error being ignored and the fix for that issue is in #3801 |
Hi @andygrove , all codes are in this repo: https://gitlab.com/miyake-diogo/rust-big-data-playground |
@Miyake-Diogo Apologies for the late reply, but schema can be set in The root issue of not writing all results was fixed in #3801 |
Don't worry @andygrove thanks for answering me. |
Describe the bug
When I try to save dataframe as csv, only around 400K of lines are saved.. data has more than 1M of lines.
To Reproduce
My code:
Dataset:
Age and sex by ethnic group (grouped total responses), for census usually resident population counts, 2006, 2013, and 2018 Censuses (RC, TA, SA2, DHB)
Expected behavior
See all lines saved:
But only this quantity are saved.
The text was updated successfully, but these errors were encountered: