[SPARK-15954][SQL] Disable loading test tables in Python tests #14005

rxin · 2016-06-30T22:58:44Z

What changes were proposed in this pull request?

This patch introduces a flag to disable loading test tables in TestHiveSparkSession and disables that in Python. This fixes an issue in which python/run-tests would fail due to failure to load test tables.

Note that these test tables are not used outside of HiveCompatibilitySuite. In the long run we should probably decouple the loading of test tables from the test Hive setup.

How was this patch tested?

This is a test only change.

rxin · 2016-06-30T22:59:14Z

The diff is a lot smaller when ignoring whitespaces: https://github.com/apache/spark/pull/14005/files?w=1

Most of the changes are just some indentation change.

holdenk · 2016-06-30T23:24:19Z

Great approach, I mentioned I had a similar approach available in the other PR #13737 (comment) to fix this (adding a flag to disable loading the tables) (although mine used a lazy val and only had the if around register) but looks functional equivalent.

Note: I ran the scala hive tests locally and they failed, I think that the default should be to load the tables. You can fix it here or I can push my version. But other than that LGTM pending tests passing.

rxin · 2016-06-30T23:51:49Z

ah yes default should definitely be true. let me fix that.

SparkQA · 2016-06-30T23:54:53Z

Test build #61579 has finished for PR 14005 at commit e090304.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

holdenk · 2016-06-30T23:57:36Z

LGTM pending tests. I'll go ahead and close my original PR. cc @MLnick and @sameeragarwal .

SparkQA · 2016-07-01T01:53:54Z

Test build #61582 has finished for PR 14005 at commit 5982972.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

rxin · 2016-07-01T02:02:16Z

Merging in master/2.0.

## What changes were proposed in this pull request? This patch introduces a flag to disable loading test tables in TestHiveSparkSession and disables that in Python. This fixes an issue in which python/run-tests would fail due to failure to load test tables. Note that these test tables are not used outside of HiveCompatibilitySuite. In the long run we should probably decouple the loading of test tables from the test Hive setup. ## How was this patch tested? This is a test only change. Author: Reynold Xin <[email protected]> Closes #14005 from rxin/SPARK-15954. (cherry picked from commit 38f4d6f) Signed-off-by: Reynold Xin <[email protected]>

…skipped always ## What changes were proposed in this pull request? Currently, `HiveContext` in SparkR is not being tested and always skipped. This is because the initiation of `TestHiveContext` is being failed due to trying to load non-existing data paths (test tables). This is introduced from #14005 This enables the tests with SparkR. ## How was this patch tested? Manually, **Before** (on Mac OS) ``` ... Skipped ------------------------------------------------------------------------ 1. create DataFrame from RDD (test_sparkSQL.R#200) - Hive is not build with SparkSQL, skipped 2. test HiveContext (test_sparkSQL.R#1041) - Hive is not build with SparkSQL, skipped 3. read/write ORC files (test_sparkSQL.R#1748) - Hive is not build with SparkSQL, skipped 4. enableHiveSupport on SparkSession (test_sparkSQL.R#2480) - Hive is not build with SparkSQL, skipped 5. sparkJars tag in SparkContext (test_Windows.R#21) - This test is only for Windows, skipped ... ``` **After** (on Mac OS) ``` ... Skipped ------------------------------------------------------------------------ 1. sparkJars tag in SparkContext (test_Windows.R#21) - This test is only for Windows, skipped ... ``` Please refer the tests below (on Windows) - Before: https://ci.appveyor.com/project/HyukjinKwon/spark/build/45-test123 - After: https://ci.appveyor.com/project/HyukjinKwon/spark/build/46-test123 Author: hyukjinkwon <[email protected]> Closes #14889 from HyukjinKwon/SPARK-17326. (cherry picked from commit 50bb142) Signed-off-by: Shivaram Venkataraman <[email protected]>

…skipped always ## What changes were proposed in this pull request? Currently, `HiveContext` in SparkR is not being tested and always skipped. This is because the initiation of `TestHiveContext` is being failed due to trying to load non-existing data paths (test tables). This is introduced from #14005 This enables the tests with SparkR. ## How was this patch tested? Manually, **Before** (on Mac OS) ``` ... Skipped ------------------------------------------------------------------------ 1. create DataFrame from RDD (test_sparkSQL.R#200) - Hive is not build with SparkSQL, skipped 2. test HiveContext (test_sparkSQL.R#1041) - Hive is not build with SparkSQL, skipped 3. read/write ORC files (test_sparkSQL.R#1748) - Hive is not build with SparkSQL, skipped 4. enableHiveSupport on SparkSession (test_sparkSQL.R#2480) - Hive is not build with SparkSQL, skipped 5. sparkJars tag in SparkContext (test_Windows.R#21) - This test is only for Windows, skipped ... ``` **After** (on Mac OS) ``` ... Skipped ------------------------------------------------------------------------ 1. sparkJars tag in SparkContext (test_Windows.R#21) - This test is only for Windows, skipped ... ``` Please refer the tests below (on Windows) - Before: https://ci.appveyor.com/project/HyukjinKwon/spark/build/45-test123 - After: https://ci.appveyor.com/project/HyukjinKwon/spark/build/46-test123 Author: hyukjinkwon <[email protected]> Closes #14889 from HyukjinKwon/SPARK-17326.

[SPARK-15954][SQL] TestHive has issues being used in PySpark

e090304

rxin mentioned this pull request Jun 30, 2016

[SPARK-15954][SQL][PySpark][TEST] Fix TestHiveContext interaction with PySpark issue #13737

Closed

Change default

5982972

asfgit closed this in 38f4d6f Jul 1, 2016

liancheng mentioned this pull request Jul 15, 2016

Migrate to Spark 2.0 databricks/spark-redshift#221

Closed

HyukjinKwon mentioned this pull request Aug 31, 2016

[SPARK-17326][SPARKR] Fix tests with HiveContext in SparkR not to be skipped always #14889

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-15954][SQL] Disable loading test tables in Python tests #14005

[SPARK-15954][SQL] Disable loading test tables in Python tests #14005

Uh oh!

rxin commented Jun 30, 2016

Uh oh!

rxin commented Jun 30, 2016

Uh oh!

holdenk commented Jun 30, 2016

Uh oh!

rxin commented Jun 30, 2016

Uh oh!

SparkQA commented Jun 30, 2016

Uh oh!

holdenk commented Jun 30, 2016

Uh oh!

SparkQA commented Jul 1, 2016

Uh oh!

rxin commented Jul 1, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[SPARK-15954][SQL] Disable loading test tables in Python tests #14005

[SPARK-15954][SQL] Disable loading test tables in Python tests #14005

Uh oh!

Conversation

rxin commented Jun 30, 2016

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

rxin commented Jun 30, 2016

Uh oh!

holdenk commented Jun 30, 2016

Uh oh!

rxin commented Jun 30, 2016

Uh oh!

SparkQA commented Jun 30, 2016

Uh oh!

holdenk commented Jun 30, 2016

Uh oh!

SparkQA commented Jul 1, 2016

Uh oh!

rxin commented Jul 1, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants