[KYUUBI #5377] Spark engine query result save to file #5591

Closed
lsm1 wants to merge 7 commits into apache:master from lsm1:branch-kyuubi-5377

Conversation

@lsm1 (Contributor) commented Oct 31, 2023

Why are the changes needed?

close #5377

How was this patch tested?

  • Add some test cases that check the changes thoroughly including negative and positive cases if possible

  • Add screenshots for manual tests if appropriate

  • Run test locally before making a pull request

Was this patch authored or co-authored using generative AI tooling?

NO

@codecov-commenter commented Oct 31, 2023

Codecov Report

Attention: 99 lines in your changes are missing coverage. Please review.

Comparison is base (4463cc8) 61.51% compared to head (9d1a18c) 61.34%.
Report is 3 commits behind head on master.

| Files | Patch % | Lines |
|---|---|---|
| ...ubi/engine/spark/operation/FetchOrcStatement.scala | 0.00% | 57 Missing ⚠️ |
| ...uubi/engine/spark/operation/ExecuteStatement.scala | 24.00% | 18 Missing and 1 partial ⚠️ |
| ...rg/apache/kyuubi/engine/spark/SparkSQLEngine.scala | 18.18% | 8 Missing and 1 partial ⚠️ |
| ...g/apache/spark/sql/kyuubi/SparkDatasetHelper.scala | 0.00% | 8 Missing ⚠️ |
| .../engine/spark/session/SparkSQLSessionManager.scala | 0.00% | 4 Missing and 1 partial ⚠️ |
| ...in/scala/org/apache/kyuubi/config/KyuubiConf.scala | 93.75% | 0 Missing and 1 partial ⚠️ |
Additional details and impacted files
@@             Coverage Diff              @@
##             master    #5591      +/-   ##
============================================
- Coverage     61.51%   61.34%   -0.18%     
  Complexity       23       23              
============================================
  Files           608      609       +1     
  Lines         36091    36252     +161     
  Branches       4952     4993      +41     
============================================
+ Hits          22201    22237      +36     
- Misses        11506    11616     +110     
- Partials       2384     2399      +15     

☔ View full report in Codecov by Sentry.

val fileName = s"$savePath/$engineId/$sessionId/$statementId"
val colName = range(0, result.schema.size).map(x => "col" + x)
if (resultMaxRows > 0) {
result.toDF(colName: _*).limit(resultMaxRows).write
Member commented:

Use `resultDF` instead of `result`. Also, is `toDF(colName: _*)` necessary?

Contributor (author, @lsm1) replied:

If the result has duplicate columns, we cannot write it to a file, so we rename all columns to avoid this case:

spark.sql("select 1 as a, 2 as a").write.save("/filepath")

org.apache.spark.sql.AnalysisException: Found duplicate column(s) when inserting into hdfs
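To make the duplicate-column point concrete, here is a minimal sketch of the rename workaround discussed above; the path and the DataFrame are placeholders, not the patch's actual code:

```scala
// Sketch: writing a DataFrame with duplicate column names fails, so rename
// every column to a positional name (col0, col1, ...) before writing.
// "/tmp/result" is an illustrative path only.
val df = spark.sql("select 1 as a, 2 as a")

// df.write.parquet("/tmp/result")   // AnalysisException: duplicate column `a`

val renamed = df.toDF(df.schema.indices.map(i => s"col$i"): _*)
renamed.write.parquet("/tmp/result") // succeeds: columns are col0, col1
```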

Member replied:
@lsm1 let's add such information to the comments

Contributor commented:

There is another known issue, as I mentioned in the issue comment:

directly calling df.write will introduce an extra shuffle for the outermost limit and hurt performance

I think we should also add this known issue to the comment and create a new ticket to track it.
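A rough way to see the concern, sketched with a hypothetical table and path (the exact physical operators depend on the Spark version, so treat this as an illustration rather than the patch's behavior):

```scala
// Sketch: an outermost LIMIT that is only collect()-ed can be planned as
// CollectLimit (no extra shuffle), while feeding the same query into a file
// write forces the limit to be materialized, typically adding an Exchange.
// `src` and "/tmp/out" are placeholders.
val limited = spark.sql("select * from src limit 100")

limited.explain()                                   // usually shows CollectLimit
limited.write.mode("overwrite").parquet("/tmp/out")
// compare the write job's plan in the Spark UI: look for the extra Exchange
```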

}

override def next(): OrcStruct = {
if (iters(idx).hasNext) {
Member commented:

`iters(idx).hasNext` has already been called in the `hasNext` method.
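The pattern this review comment points at, sketched as a plain Scala iterator over several underlying iterators (illustrative names; OrcStruct is replaced by a type parameter so the sketch stays self-contained):

```scala
// Sketch: let hasNext advance past exhausted iterators; next() just delegates,
// so the per-iterator hasNext check is not repeated there.
class ConcatIterator[T](iters: IndexedSeq[Iterator[T]]) extends Iterator[T] {
  private var idx = 0

  override def hasNext: Boolean = {
    while (idx < iters.length && !iters(idx).hasNext) idx += 1
    idx < iters.length
  }

  override def next(): T = {
    if (!hasNext) throw new NoSuchElementException("no more elements")
    iters(idx).next()
  }
}
```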

.getOrElse(session.sessionManager.getConf.get(OPERATION_RESULT_SAVE_TO_FILE))
lazy val threshold =
session.sessionManager.getConf.get(OPERATION_RESULT_SAVE_TO_FILE_THRESHOLD)
if (hasResultSet && sparkSave && shouldSaveResultToHdfs(resultMaxRows, threshold, result)) {
Member commented:

`shouldSaveResultToHdfs` is inferred from the execution plan and may not be accurate. Should we change it to `sparkSave || shouldSaveResultToHdfs`?

Member replied:

I prefer to keep it as-is; we need a configuration to disable this feature globally.

@wForget (Member) commented Nov 9, 2023

It might be simpler for us to make changes in the executeStatement method, like:

change

result = spark.sql(statement)

to

if (saveResultToPath) {
  spark.sql(statement).write.format(format).save(resultPath)
  result = spark.read.load(resultPath)
} else {
  result = spark.sql(statement)
}

WDYT? cc @pan3793 @cxzl25

@cxzl25 (Contributor) commented Nov 14, 2023

It might be simpler for us to make changes in the executeStatement method, like:

result = spark.read.load(resultPath)

This may lose the ordering of the query data, e.g. order by xx limit 100

@wForget (Member) commented Nov 15, 2023

This may lose the ordering of the query data, e.g. order by xx limit 100

I did a simple test and the results were as expected. (Test Env: Kyuubi 1.8.0 + Spark 3.5.0)

create table wangzhen_test_20231115_t1(id bigint, name string) stored as parquet;
insert into wangzhen_test_20231115_t1 values (1, 'a');
insert into wangzhen_test_20231115_t1 values (2, 'b');
insert into wangzhen_test_20231115_t1 values (3, 'c');

set kyuubi.operation.language=scala;

val df = spark.sql("select * from wangzhen_test_20231115_t1 order by id limit 2");
df.write.format("parquet").save("hdfs://XXX/result.parquet");

spark.sql("set kyuubi.operation.language=sql");
select * from `parquet`.`hdfs://XXX/result.parquet`;
== Physical Plan ==
Execute InsertIntoHadoopFsRelationCommand (5)
+- WriteFiles (4)
   +- TakeOrderedAndProject (3)
      +- * ColumnarToRow (2)
         +- Scan parquet spark_catalog.XXX.wangzhen_test_20231115_t1 (1)


@lsm1 (Contributor, author) commented Nov 15, 2023

if (saveResultToPath) {
  spark.sql(statement).write.format(format).save(resultPath)
  result = spark.read.load(resultPath)
} else {
  result = spark.sql(statement)
}

We still call result.collect() later; if the result is too large, we cannot avoid driver OOM.

@wForget (Member) commented Nov 15, 2023

We still call result.collect() later; if the result is too large, we cannot avoid driver OOM.

Can we combine this with incremental collection mode?

@cxzl25 (Contributor) commented Nov 15, 2023

I did a simple test and the results were as expected

Maybe you can test the scenario of generating multiple files

@wForget (Member) commented Nov 15, 2023

Maybe you can test the scenario of generating multiple files

Do you mean a large data set or multiple tasks?

@wForget (Member) commented Nov 15, 2023

Maybe you can test the scenario of generating multiple files

The output seems to be in order even when outputting multiple files.

set spark.sql.files.maxRecordsPerFile=10000;

set kyuubi.operation.language=scala;

spark.range(0, 1000000, 1, numPartitions = 10)
  .selectExpr("id", "cast(id as string) as name")
  .createOrReplaceTempView("wangzhen_test_20231115_tmp1")

val df = spark.sql("select * from wangzhen_test_20231115_tmp1 order by id limit 100000");
df.write.format("parquet").save("hdfs://XXX/result");

spark.sql("set kyuubi.operation.language=sql");
select * from `parquet`.`hdfs://XXX/result`;


@lsm1 (Contributor, author) commented Nov 23, 2023

We still call result.collect() later; if the result is too large, we cannot avoid driver OOM.

Can we combine this with incremental collection mode?

When we use incremental collection mode, it may significantly impact performance.
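For context, incremental collection essentially means pulling the result one partition at a time instead of collect()-ing it all at once; a minimal sketch of that trade-off (the table name is a placeholder, and this is not the engine's actual code):

```scala
import org.apache.spark.sql.Row

// Sketch: toLocalIterator() fetches one partition at a time to the driver,
// bounding driver memory by the largest partition rather than the full result,
// at the cost of running the partitions sequentially (hence the performance hit).
val result = spark.sql("select * from big_table")   // placeholder query

val rows: java.util.Iterator[Row] = result.toLocalIterator()
while (rows.hasNext) {
  val row = rows.next()
  // stream `row` to the client instead of buffering the whole result
}
```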

@lsm1 force-pushed the branch-kyuubi-5377 branch 2 times, most recently from 59add37 to 9b7ce8d on December 1, 2023 08:23
@cxzl25 (Contributor) commented Dec 4, 2023

The output seems to be in order even when outputting multiple files.

When Spark reads a datasource, the input files are sorted by file length, so there is no ordering guarantee.

org.apache.spark.sql.execution.datasources.v2.FileScan#partitions

      partition.files.flatMap { file =>
        PartitionedFileUtil.splitFiles(
          file = file,
          isSplitable = isSplitable(file.getPath),
          maxSplitBytes = maxSplitBytes,
          partitionValues = partitionValues
        )
      }.toArray.sortBy(_.length)(implicitly[Ordering[Long]].reverse)

https://github.com/apache/spark/blob/4398bb5d37328e2f3594302d98f98803a379a2e9/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/FileScan.scala#L146-L160

@@ -184,6 +186,12 @@ class SparkSQLSessionManager private (name: String, spark: SparkSession)
info("Session stopped due to shared level is Connection.")
stopSession()
}
if (conf.get(OPERATION_RESULT_SAVE_TO_FILE)) {
Contributor commented:

nit: only clean up for the ExecuteStatement operation?

Contributor (author, @lsm1) replied:

There is no simple way to determine whether the session has executed an ExecuteStatement.
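For reference, the session-level cleanup under discussion amounts to deleting the per-session result directory when the session closes; a hedged sketch using the `$savePath/$engineId/$sessionId` layout quoted earlier (the method and variable names are illustrative, not the patch's):

```scala
import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.fs.{FileSystem, Path}

// Sketch: remove the session's result directory on session close, regardless
// of which operations were actually executed in that session.
def cleanupSessionResultDir(
    savePath: String, engineId: String, sessionId: String,
    hadoopConf: Configuration): Unit = {
  val dir = new Path(s"$savePath/$engineId/$sessionId")
  val fs = dir.getFileSystem(hadoopConf)
  if (fs.exists(dir)) {
    fs.delete(dir, true) // recursive delete
  }
}
```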

@turboFei (Member) commented Dec 13, 2023

A question:

If the result needs to be ordered, and we save it into files and then read the files back when the client fetches the result, will the result returned to users still be ordered as expected?

@pan3793 (Member) commented Dec 13, 2023

@turboFei from the context, I think the implementation already preserves the global order. @cxzl25 could you please clarify?

@lsm1 (Contributor, author) commented Dec 13, 2023

A question:

If the result needs to be ordered, and we save it into files and then read the files back when the client fetches the result, will the result returned to users still be ordered as expected?

  1. Spark saves an ordered result to multiple part-X files in the filesystem, in order of the keys; see org.apache.spark.rdd.OrderedRDDFunctions#sortByKey:
     https://github.com/lsm1/spark/blob/master/core/src/main/scala/org/apache/spark/rdd/OrderedRDDFunctions.scala#L47

     /**
      * Sort the RDD by key, so that each partition contains a sorted range of the elements. Calling
      * `collect` or `save` on the resulting RDD will return or output an ordered list of records
      * (in the `save` case, they will be written to multiple `part-X` files in the filesystem, in
      * order of the keys).
      */

  2. When FetchOrcStatement reads the files back, it sorts them by file name, so it returns the result in order (see the sketch below).
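A minimal sketch of point 2, listing the part files of a result directory and processing them in file-name order (Hadoop FileSystem API; the helper name is illustrative and this is not the exact FetchOrcStatement code):

```scala
import org.apache.hadoop.fs.{FileSystem, Path}

// Sketch: part-00000, part-00001, ... were written in key order, so reading
// them back sorted by file name preserves the global ordering of the result.
def orderedPartFiles(resultDir: Path, fs: FileSystem): Seq[Path] = {
  fs.listStatus(resultDir)
    .map(_.getPath)
    .filter(_.getName.startsWith("part-"))
    .sortBy(_.getName)
    .toSeq
}
```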

@cfmcgrady (Contributor) commented:

Thanks all. Merging to master (v1.9.0).

@cfmcgrady added this to the v1.9.0 milestone Dec 13, 2023
@cfmcgrady closed this in 4c029f9 Dec 13, 2023
turboFei added a commit that referenced this pull request Dec 21, 2023
# 🔍 Description
## Issue References 🔗

#5591 (comment)

Closes #5895 from lsm1/branch-kyuubi-5377-followup.

Closes #5377

4219d28 [Fei Wang] nit
31d4fc1 [senmiaoliu] use zlib when SPARK version less than 3.2

Lead-authored-by: senmiaoliu <senmiaoliu@trip.com>
Co-authored-by: Fei Wang <fwang12@ebay.com>
Signed-off-by: Fei Wang <fwang12@ebay.com>
turboFei added a commit that referenced this pull request Feb 4, 2024
…sult max rows

# 🔍 Description
Followup #5591
Support to get existing limit from more plan and regard the result max rows.

Closes #5963 from turboFei/incremental_save.

Closes #5377

223d510 [Fei Wang] use optimized plan
ecefc2a [Fei Wang] use spark plan
57091e5 [Fei Wang] minor
2096144 [Fei Wang] for logical plan
0f734ee [Fei Wang] ut
fdc1155 [Fei Wang] save
f8e405a [Fei Wang] math.min

Authored-by: Fei Wang <fwang12@ebay.com>
Signed-off-by: Fei Wang <fwang12@ebay.com>
zhaohehuhu pushed a commit to zhaohehuhu/incubator-kyuubi that referenced this pull request Feb 5, 2024

…ard result max rows

Closes apache#5963 from turboFei/incremental_save.

Closes apache#5377
zhaohehuhu pushed a commit to zhaohehuhu/incubator-kyuubi that referenced this pull request Mar 21, 2024

Closes apache#5591 from lsm1/branch-kyuubi-5377.

Closes apache#5377

9d1a18c [senmiaoliu] ignore empty file
3c70a1e [LSM] fix doc
73d3c3a [senmiaoliu] fix style and add some comment
80e1f0d [senmiaoliu] Close orc fetchOrcStatement and remove result save file when ExecuteStatement close
42634a1 [senmiaoliu] fix style
979125d [senmiaoliu] fix style
1dc07a5 [senmiaoliu] spark engine save into hdfs file

Lead-authored-by: senmiaoliu <senmiaoliu@trip.com>
Co-authored-by: LSM <senmiaoliu@trip.com>
Signed-off-by: Fu Chen <cfmcgrady@gmail.com>
zhaohehuhu pushed a commit to zhaohehuhu/incubator-kyuubi that referenced this pull request Mar 21, 2024

Closes apache#5895 from lsm1/branch-kyuubi-5377-followup.

Closes apache#5377
zhaohehuhu pushed a commit to zhaohehuhu/incubator-kyuubi that referenced this pull request Mar 21, 2024

…ard result max rows

Closes apache#5963 from turboFei/incremental_save.

Closes apache#5377
pan3793 pushed a commit that referenced this pull request Jun 4, 2024
# 🔍 Description
## Issue References 🔗

This pull request fixes #6437

## Describe Your Solution 🔧

Use `org.apache.hadoop.fs.Path` instead of `java.nio.file.Paths` to avoid `OPERATION_RESULT_SAVE_TO_FILE_DIR` scheme unexpected change.

## Types of changes 🔖

- [x] Bugfix (non-breaking change which fixes an issue)
- [ ] New feature (non-breaking change which adds functionality)
- [ ] Breaking change (fix or feature that would cause existing functionality to change)

## Test Plan 🧪

#### Behavior Without This Pull Request ⚰️

Spark Job failed to start with error: `java.io.IOException: JuiceFS initialized failed for jfs:///` with conf `kyuubi.operation.result.saveToFile.dir=jfs://datalake/tmp`.

`hdfs://xxx:port/tmp` may encounter similar errors

#### Behavior With This Pull Request 🎉

Users can use an HDFS dir as `kyuubi.operation.result.saveToFile.dir` without error.

#### Related Unit Tests

Seems no test suites added in #5591 and #5986, I'll try to build a dist and test with our internal cluster.

---

# Checklist 📝

- [x] This patch was not authored or co-authored using [Generative Tooling](https://www.apache.org/legal/generative-tooling.html)

**Be nice. Be informative.**

Closes #6444 from camper42/save-to-hdfs.

Closes #6437

990f0a7 [camper42] [Kyuubi #6437] Fix Spark engine query result save to HDFS

Authored-by: camper42 <camper.xlii@gmail.com>
Signed-off-by: Cheng Pan <chengpan@apache.org>
pan3793 pushed a commit that referenced this pull request Jun 4, 2024

990f0a7 [camper42] [Kyuubi #6437] Fix Spark engine query result save to HDFS

Authored-by: camper42 <camper.xlii@gmail.com>
Signed-off-by: Cheng Pan <chengpan@apache.org>
(cherry picked from commit 71649da)
Signed-off-by: Cheng Pan <chengpan@apache.org>
Successfully merging this pull request may close these issues.

[TASK][MEDIUM] Spark engine query results support reading from HDFS