[SPARK-2087] [SQL] Multiple thriftserver sessions with single HiveContext instance #4885

chenghao-intel · 2015-03-04T05:59:59Z

Still, we keep only a single HiveContext within ThriftServer, and we also create a object called SQLSession for isolating the different user states.

Developers can obtain/release a new user session via openSession and closeSession, and SQLContext and HiveContext will also provide a default session if no openSession called, for backward-compatibility.

SparkQA · 2015-03-04T06:03:07Z

Test build #28253 has started for PR 4885 at commit 5fea724.

This patch merges cleanly.

chenghao-intel · 2015-03-04T06:04:48Z

cc @liancheng @tianyi @guowei2
We have 2 implementations for supporting the multiple sessions in thriftserver, can you review the code for me?

SparkQA · 2015-03-04T06:07:16Z

Test build #28253 has finished for PR 4885 at commit 5fea724.

This patch fails to build.
This patch merges cleanly.
This patch adds no public classes.

AmplabJenkins · 2015-03-04T06:07:17Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28253/
Test FAILed.

SparkQA · 2015-03-04T06:58:05Z

Test build #28256 has started for PR 4885 at commit 0ca4bbd.

This patch merges cleanly.

guowei2 · 2015-03-04T07:30:02Z

sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveContext.scala

I think there's no need to overwrite SQLSession and createSession here, for SessionState self is ThreadLocal. we just need to set SessionState when openSession in SparkSQLSessionManager.

@guowei2 I think either way is OK for now. Putting all session-specific stuff into a central place (SQLSession) seems cleaner to me. Making SQLSession a thread-local does look a little ugly, however, right now it's not used anywhere other than the Thrift server. When we do decide to move Hive into a separate data source and make our own data source neutral Spark SQL server, we can handle the session problem in a cleaner way (e.g., using an actor for each session and keep all session-specific stuff in the actor instance).

SparkQA · 2015-03-04T08:17:34Z

Test build #28256 has finished for PR 4885 at commit 0ca4bbd.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

AmplabJenkins · 2015-03-04T08:17:37Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28256/
Test PASSed.

liancheng · 2015-03-13T07:44:59Z

...ftserver/src/test/scala/org/apache/spark/sql/hive/thriftserver/HiveThriftServer2Suites.scala

�Indentations are off in this test case.

liancheng · 2015-03-13T08:01:08Z

Hey @chenghao-intel, terribly sorry for the delay. In general this LGTM. Left some comments, mostly on styling issues. Thanks!

SparkQA · 2015-03-16T03:18:10Z

Test build #28636 has started for PR 4885 at commit 815b27a.

This patch merges cleanly.

chenghao-intel · 2015-03-16T03:20:29Z

Thank you @liancheng @guowei2 for the review, I've updated the code as suggested.

Still, I am thinking how to handle the temporal function and table which isolated by SQLSession, maybe life would be easier if we have the design along this PR(we can do those in a separated PR). Any suggestions @liancheng @marmbrus @guowei2 ?

SparkQA · 2015-03-16T03:21:35Z

Test build #28636 has finished for PR 4885 at commit 815b27a.

This patch fails to build.
This patch merges cleanly.
This patch adds no public classes.

AmplabJenkins · 2015-03-16T03:21:37Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28636/
Test FAILed.

SparkQA · 2015-03-16T04:28:01Z

Test build #28638 has started for PR 4885 at commit 1c47b2a.

This patch merges cleanly.

SparkQA · 2015-03-16T05:45:05Z

Test build #28638 has finished for PR 4885 at commit 1c47b2a.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

AmplabJenkins · 2015-03-16T05:45:08Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28638/
Test PASSed.

liancheng · 2015-03-16T17:02:57Z

...ftserver/src/test/scala/org/apache/spark/sql/hive/thriftserver/HiveThriftServer2Suites.scala

Would be nice to add comment to indicate that the expected value should be "<undefined>". I was quite confused at first as 200 should be the default value of "spark.sql.shuffle.partitions" :)

liancheng · 2015-03-16T17:07:01Z

Hey @chenghao-intel, left another 3 minor comments. But I'm gonna merge this. Please fix them in another PR. Also verified locally that both session isolation and cache sharing work as expected. Thanks for the efforts!!

chenghao-intel · 2015-03-17T02:04:36Z

Thank you very much @liancheng, I will create another PR for the requirements that we discussed above, and also the minor issues.

guowei2 reviewed Mar 4, 2015
View reviewed changes

liancheng reviewed Mar 13, 2015
View reviewed changes

chenghao-intel added 3 commits March 15, 2015 18:37

thriftservice with single context

4665b0d

openSession is not compatible between Hive0.12 & 0.13.1

57e3fa0

code style issue

815b27a

chenghao-intel force-pushed the multisessions_singlecontext branch from 0ca4bbd to 815b27a Compare March 16, 2015 03:15

rename the tss => tlSession

1c47b2a

chenghao-intel changed the title ~~[SPARK-2087] [SQL] [WIP] Multiple thriftserver sessions with single HiveContext instance~~ [SPARK-2087] [SQL] Multiple thriftserver sessions with single HiveContext instance Mar 16, 2015

liancheng reviewed Mar 16, 2015
View reviewed changes

asfgit closed this in 12a345a Mar 16, 2015

chenghao-intel mentioned this pull request Mar 17, 2015

[SPARK-2087] [SQL] Multiple thriftserver sessions with different HiveContext instances #4382

Closed

chenghao-intel deleted the multisessions_singlecontext branch July 2, 2015 08:39

[SPARK-2087] [SQL] Multiple thriftserver sessions with single HiveContext instance #4885

[SPARK-2087] [SQL] Multiple thriftserver sessions with single HiveContext instance #4885

Uh oh!

Conversation

chenghao-intel commented Mar 4, 2015

Uh oh!

SparkQA commented Mar 4, 2015

Uh oh!

chenghao-intel commented Mar 4, 2015

Uh oh!

SparkQA commented Mar 4, 2015

Uh oh!

AmplabJenkins commented Mar 4, 2015

Uh oh!

SparkQA commented Mar 4, 2015

Uh oh!

guowei2 Mar 4, 2015

Choose a reason for hiding this comment

Uh oh!

liancheng Mar 13, 2015

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Mar 4, 2015

Uh oh!

AmplabJenkins commented Mar 4, 2015

Uh oh!

liancheng Mar 13, 2015

Choose a reason for hiding this comment

Uh oh!

liancheng commented Mar 13, 2015

Uh oh!

SparkQA commented Mar 16, 2015

Uh oh!

chenghao-intel commented Mar 16, 2015

Uh oh!

SparkQA commented Mar 16, 2015

Uh oh!

AmplabJenkins commented Mar 16, 2015

Uh oh!

SparkQA commented Mar 16, 2015

Uh oh!

SparkQA commented Mar 16, 2015

Uh oh!

AmplabJenkins commented Mar 16, 2015

Uh oh!

liancheng Mar 16, 2015

Choose a reason for hiding this comment

Uh oh!

liancheng commented Mar 16, 2015

Uh oh!

chenghao-intel commented Mar 17, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants