[SPARK-10310] [SQL] Fixes script transformation field/line delimiters #8860
Test build #42803 has finished for PR 8860 at commit
@liancheng I guess there's still some issue like:
Force-pushed from f86b5bc to 8d36775.
Force-pushed from 8d36775 to 387ac72.
@zhichao-li I further special cased
test this please
Test build #42846 has finished for PR 8860 at commit
Test build #42848 has finished for PR 8860 at commit
Test build #42855 has finished for PR 8860 at commit
@zhichao-li Can you try this PR?
LGTM
Thanks! Merging to master and branch 1.5.
**Please attribute this PR to `Zhichao Li <zhichao.li@intel.com>`.**

This PR is based on PR #8476 authored by zhichao-li. It fixes SPARK-10310 by adding the field delimiter SerDe property to the default `LazySimpleSerDe` and enabling the default record reader/writer classes.

Currently, we only support `LazySimpleSerDe`, used together with `TextRecordReader` and `TextRecordWriter`, and don't support customizing the record reader/writer using `RECORDREADER`/`RECORDWRITER` clauses. This should be addressed in separate PR(s).

Author: Cheng Lian <lian@databricks.com>

Closes #8860 from liancheng/spark-10310/fix-script-trans-delimiters.

(cherry picked from commit 84f81e0)
Signed-off-by: Yin Huai <yhuai@databricks.com>
**Please attribute this PR to `Zhichao Li <zhichao.li@intel.com>`.**

This PR is based on PR #8476 authored by @zhichao-li. It fixes SPARK-10310 by adding field delimiter SerDe property to the default `LazySimpleSerDe`, and enabling default record reader/writer classes.

Currently, we only support `LazySimpleSerDe`, used together with `TextRecordReader` and `TextRecordWriter`, and don't support customizing record reader/writer using `RECORDREADER`/`RECORDWRITER` clauses. This should be addressed in separate PR(s).
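For readers unfamiliar with script transformation, here is a minimal Python sketch of why the field and line delimiters have to agree on both sides of the child process. It is not Spark's implementation and does not use any Spark or Hive API. The names `transform` and `ECHO_SCRIPT` and the parameters `in_field_delim`, `out_field_delim` and `line_delim` are hypothetical and exist only for this illustration. The sketch assumes rows are written to the script as delimited text, one record per line, and read back the same way, which is roughly the role the text record writer and reader play in the description above.

```python
import subprocess
import sys

# Hypothetical stand-in for the user script: copies stdin to stdout, like `cat`.
ECHO_SCRIPT = "import sys; sys.stdout.buffer.write(sys.stdin.buffer.read())"


def transform(rows, in_field_delim="\t", out_field_delim="\t", line_delim="\n"):
    """Pipe rows through a child process as delimited text and parse the result."""
    # Writer side: one record per line, fields joined by the input delimiter.
    payload = "".join(
        in_field_delim.join(str(col) for col in row) + line_delim for row in rows
    )
    result = subprocess.run(
        [sys.executable, "-c", ECHO_SCRIPT],
        input=payload.encode("utf-8"),
        stdout=subprocess.PIPE,
        check=True,
    )
    # Reader side: split records on the line delimiter,
    # then split fields on the output delimiter.
    records = result.stdout.decode("utf-8").split(line_delim)
    return [tuple(rec.split(out_field_delim)) for rec in records if rec]


rows = [(1, "a"), (2, "b")]
# Both sides agree on "," so every column survives the round trip.
matched = transform(rows, in_field_delim=",", out_field_delim=",")
# The reader expects tabs but the script emits commas: each row collapses to one field.
mismatched = transform(rows, in_field_delim=",", out_field_delim="\t")
```

In this sketch `matched` keeps two string columns per row, while `mismatched` yields a single column per row because the reader never finds its delimiter. That is the general class of problem a delimiter setting has to prevent. How Spark itself wires the delimiter into `LazySimpleSerDe` is not shown here.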