SPARK-3025 [SQL]: Allow JDBC clients to set a fair scheduler pool #1937

Closed
wants to merge 1 commit into from

Conversation

pwendell
Contributor

This definitely needs review as I am not familiar with this part of Spark.
I tested this locally and it did seem to work.

@SparkQA

SparkQA commented Aug 14, 2014

QA tests have started for PR 1937. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18517/consoleFull

@SparkQA

SparkQA commented Aug 14, 2014

QA results for PR 1937:
- This patch FAILED unit tests.
- This patch merges cleanly
- This patch adds no public classes

For more information see test output:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18517/consoleFull

@liancheng
Contributor

retest this please

@SparkQA

SparkQA commented Aug 14, 2014

QA tests have started for PR 1937. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18541/consoleFull

@SparkQA

SparkQA commented Aug 14, 2014

QA results for PR 1937:
- This patch PASSES unit tests.
- This patch merges cleanly
- This patch adds no public classes

For more information see test output:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18541/consoleFull

@pwendell pwendell changed the title SPARK-3025: Allow JDBC clients to set a fair scheduler pool SPARK-3025 [SQL]: Allow JDBC clients to set a fair scheduler pool Aug 14, 2014
@tianyi
Contributor

tianyi commented Aug 15, 2014

Hi @pwendell, I have two questions:
1. Why not handle this set command in "SetCommand"?
2. Do we have to save this pool name in LocalProperty?

@pwendell
Contributor Author

@tianyi it might make more sense to handle this in the physical execution of SetCommand, I'll see if I can do that. The local property is necessary I think - do you know of an alternative mechanism? IIRC this is the main way we expect people to allocate jobs to specific pools.

@pwendell
Contributor Author

@tianyi after looking some more, I believe this needs to be handled in the thriftserver explicitly and cannot be handled in the SetCommand physical operator. The reason is that we need to track the association with the specific Hive session. I'm going to rename this setting to include the word "thriftserver" so it's clear that it is only relevant to the thriftserver. In the case where a user is referencing a SQLContext directly, they can simply set the local property themselves.
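For readers following along, here is a minimal sketch of the mechanism described above, written as a plain Python model rather than Spark code. It assumes the setting ends up named `spark.sql.thriftserver.scheduler.pool` (the comment only says the name will include "thriftserver") and that jobs are routed to pools through a per-thread local property called `spark.scheduler.pool`. The `OperationManager` class, its `execute` method, and the helper functions are hypothetical stand-ins, not the classes touched by this patch.

```python
import re
import threading

# Assumed names; the thread does not spell out the final key.
POOL_SETTING = "spark.sql.thriftserver.scheduler.pool"
POOL_LOCAL_PROPERTY = "spark.scheduler.pool"

# Stand-in for per-thread "local properties" on the execution context.
_local = threading.local()


def set_local_property(key, value):
    """Store a property visible only to the current thread."""
    props = getattr(_local, "props", None)
    if props is None:
        props = _local.props = {}
    props[key] = value


def get_local_property(key):
    """Read a property set earlier on the current thread, or None."""
    return getattr(_local, "props", {}).get(key)


# Matches statements of the form: SET <key>=<value>;
SET_PATTERN = re.compile(r"^\s*SET\s+([^=\s]+)\s*=\s*(.+?)\s*;?\s*$", re.IGNORECASE)


class OperationManager:
    """Hypothetical server-side component that runs statements per session."""

    def __init__(self):
        # The association a generic SET operator cannot see: session -> pool.
        self.session_to_pool = {}

    def execute(self, session_id, statement):
        match = SET_PATTERN.match(statement)
        if match and match.group(1) == POOL_SETTING:
            # Remember the pool for this session only; run no job.
            self.session_to_pool[session_id] = match.group(2)
            return None
        # Before running the statement, route this thread's jobs to the
        # session's pool (None here means "no pool chosen").
        set_local_property(POOL_LOCAL_PROPERTY, self.session_to_pool.get(session_id))
        # A real server would submit the query here; the sketch just reports
        # which pool the job would be submitted under.
        return get_local_property(POOL_LOCAL_PROPERTY)
```

In this model a JDBC client would send `SET spark.sql.thriftserver.scheduler.pool=accounting;` once, and later statements from that session would run with the local property set to `accounting`, while other sessions are unaffected. A user holding the context directly skips the session map and sets the local property on their own thread.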

@pwendell
Contributor Author

Jenkins, test this please.

@SparkQA

SparkQA commented Aug 17, 2014

QA tests have started for PR 1937 at commit 2c647e0.

  • This patch merges cleanly.

@SparkQA

SparkQA commented Aug 17, 2014

QA tests have finished for PR 1937 at commit 2c647e0.

  • This patch fails unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented Aug 17, 2014

QA tests have started for PR 1937 at commit b858e33.

  • This patch merges cleanly.

@SparkQA

SparkQA commented Aug 18, 2014

QA tests have finished for PR 1937 at commit b858e33.

  • This patch fails unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@tianyi
Contributor

tianyi commented Aug 18, 2014

@pwendell thanks for the explanation. I think there should be some test cases for this feature.

@tianyi
Contributor

tianyi commented Aug 18, 2014

@pwendell I think we could save the current session's pool setting in the hiveConf of parentSession instead of in a map in SparkSQLOperationManager. That would resolve the unbounded map size problem.
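To make the trade-off concrete, here is a small Python model of the two storage choices, under stated assumptions. It assumes the map's growth comes from entries not being removed when a session ends, which the comment implies but does not spell out. `Session`, `MapBackedPools`, and `ConfBackedPools` are hypothetical names, the `conf` dict stands in for a per-session HiveConf, and the setting key is an assumed name.

```python
POOL_SETTING = "spark.sql.thriftserver.scheduler.pool"  # assumed key name


class Session:
    """Hypothetical session object; `conf` stands in for its HiveConf."""

    def __init__(self, session_id):
        self.session_id = session_id
        self.conf = {}


class MapBackedPools:
    """Pool names kept in a manager-level map keyed by session id."""

    def __init__(self):
        self.session_to_pool = {}

    def set_pool(self, session, pool):
        self.session_to_pool[session.session_id] = pool

    def pool_for(self, session):
        return self.session_to_pool.get(session.session_id)


class ConfBackedPools:
    """Pool names kept in each session's own configuration."""

    def set_pool(self, session, pool):
        session.conf[POOL_SETTING] = pool

    def pool_for(self, session):
        return session.conf.get(POOL_SETTING)


def simulate(manager, n_sessions):
    """Open n sessions, set a pool on each, then let each session go away."""
    for i in range(n_sessions):
        session = Session(f"session-{i}")
        manager.set_pool(session, "accounting")
        assert manager.pool_for(session) == "accounting"
        # The session object is dropped here, as if the client disconnected.


map_backed = MapBackedPools()
simulate(map_backed, 1000)

conf_backed = ConfBackedPools()
simulate(conf_backed, 1000)
```

After the simulation, the map-backed manager still holds one entry per departed session unless something removes it, while the conf-backed variant holds no per-session state of its own, so the setting disappears together with the session.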

@marmbrus
Contributor

As far as I know we don't actually do much with the HiveConf that is stored in parentSession, though that does seem like a reasonable place to store such information. There is already a JIRA open for cleaning up multi-user semantics in the JDBC server. Given that it'd be nice to have something here for 1.1, I'm going to go ahead and merge this, and leave a note in that JIRA to clean this up when we have a better solution for session state.

@asfgit asfgit closed this in 6bca889 Aug 18, 2014
asfgit pushed a commit that referenced this pull request Aug 18, 2014
This definitely needs review as I am not familiar with this part of Spark.
I tested this locally and it did seem to work.

Author: Patrick Wendell <pwendell@gmail.com>

Closes #1937 from pwendell/scheduler and squashes the following commits:

b858e33 [Patrick Wendell] SPARK-3025: Allow JDBC clients to set a fair scheduler pool

(cherry picked from commit 6bca889)
Signed-off-by: Michael Armbrust <michael@databricks.com>
xiliu82 pushed a commit to xiliu82/spark that referenced this pull request Sep 4, 2014
This definitely needs review as I am not familiar with this part of Spark.
I tested this locally and it did seem to work.

Author: Patrick Wendell <pwendell@gmail.com>

Closes apache#1937 from pwendell/scheduler and squashes the following commits:

b858e33 [Patrick Wendell] SPARK-3025: Allow JDBC clients to set a fair scheduler pool
szehon-ho pushed a commit to szehon-ho/spark that referenced this pull request Aug 7, 2024