[SPARK-24166][SQL] InMemoryTableScanExec should not access SQLConf at executor side #21223

cloud-fan · 2018-05-03T05:07:50Z

What changes were proposed in this pull request?

This PR is extracted from #21190 , to make it easier to backport.

InMemoryTableScanExec#createAndDecompressColumn is executed inside rdd.map, we can't access conf.offHeapColumnVectorEnabled there.

How was this patch tested?

it's tested in #21190

cloud-fan · 2018-05-03T05:08:03Z

cc @kiszk @viirya

kiszk · 2018-05-03T05:20:19Z

LGTM

dongjoon-hyun

+1, LGTM.

SparkQA · 2018-05-03T07:04:38Z

Test build #90095 has finished for PR 21223 at commit d900b4c.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

cloud-fan · 2018-05-03T07:07:56Z

retest this please

viirya

LGTM

SparkQA · 2018-05-03T11:11:52Z

Test build #90107 has finished for PR 21223 at commit d900b4c.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

adrian-wang

+1 LGTM

… executor side ## What changes were proposed in this pull request? This PR is extracted from #21190 , to make it easier to backport. `InMemoryTableScanExec#createAndDecompressColumn` is executed inside `rdd.map`, we can't access `conf.offHeapColumnVectorEnabled` there. ## How was this patch tested? it's tested in #21190 Author: Wenchen Fan <[email protected]> Closes #21223 from cloud-fan/minor1. (cherry picked from commit 991b526) Signed-off-by: Wenchen Fan <[email protected]>

cloud-fan · 2018-05-03T12:11:03Z

thanks, merging to master/2.3!

… on the driver ## What changes were proposed in this pull request? This is a followup of apache#20136 . apache#20136 didn't really work because in the test, we are using local backend, which shares the driver side `SparkEnv`, so `SparkEnv.get.executorId == SparkContext.DRIVER_IDENTIFIER` doesn't work. This PR changes the check to `TaskContext.get != null`, and move the check to `SQLConf.get`, and fix all the places that violate this check: * `InMemoryTableScanExec#createAndDecompressColumn` is executed inside `rdd.map`, we can't access `conf.offHeapColumnVectorEnabled` there. apache#21223 merged * `DataType#sameType` may be executed in executor side, for things like json schema inference, so we can't call `conf.caseSensitiveAnalysis` there. This contributes to most of the code changes, as we need to add `caseSensitive` parameter to a lot of methods. * `ParquetFilters` is used in the file scan function, which is executed in executor side, so we can't can't call `conf.parquetFilterPushDownDate` there. apache#21224 merged * `WindowExec#createBoundOrdering` is called on executor side, so we can't use `conf.sessionLocalTimezone` there. apache#21225 merged * `JsonToStructs` can be serialized to executors and evaluate, we should not call `SQLConf.get.getConf(SQLConf.FROM_JSON_FORCE_NULLABLE_SCHEMA)` in the body. apache#21226 merged ## How was this patch tested? existing test Author: Wenchen Fan <[email protected]> Closes apache#21190 from cloud-fan/minor.

InMemoryTableScanExec should not access SQLConf at executor side

d900b4c

cloud-fan mentioned this pull request May 3, 2018

[SPARK-22938][SQL][followup] Assert that SQLConf.get is accessed only on the driver #21190

Closed

dongjoon-hyun approved these changes May 3, 2018

View reviewed changes

viirya approved these changes May 3, 2018

View reviewed changes

adrian-wang approved these changes May 3, 2018

View reviewed changes

asfgit closed this in 991b526 May 3, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-24166][SQL] InMemoryTableScanExec should not access SQLConf at executor side #21223

[SPARK-24166][SQL] InMemoryTableScanExec should not access SQLConf at executor side #21223

Uh oh!

cloud-fan commented May 3, 2018

Uh oh!

cloud-fan commented May 3, 2018

Uh oh!

kiszk commented May 3, 2018

Uh oh!

dongjoon-hyun left a comment

Uh oh!

SparkQA commented May 3, 2018

Uh oh!

cloud-fan commented May 3, 2018

Uh oh!

viirya left a comment

Uh oh!

SparkQA commented May 3, 2018

Uh oh!

adrian-wang left a comment

Uh oh!

cloud-fan commented May 3, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

[SPARK-24166][SQL] InMemoryTableScanExec should not access SQLConf at executor side #21223

[SPARK-24166][SQL] InMemoryTableScanExec should not access SQLConf at executor side #21223

Uh oh!

Conversation

cloud-fan commented May 3, 2018

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

cloud-fan commented May 3, 2018

Uh oh!

kiszk commented May 3, 2018

Uh oh!

dongjoon-hyun left a comment

Choose a reason for hiding this comment

Uh oh!

SparkQA commented May 3, 2018

Uh oh!

cloud-fan commented May 3, 2018

Uh oh!

viirya left a comment

Choose a reason for hiding this comment

Uh oh!

SparkQA commented May 3, 2018

Uh oh!

adrian-wang left a comment

Choose a reason for hiding this comment

Uh oh!

cloud-fan commented May 3, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants