@HyukjinKwon
Member

@HyukjinKwon HyukjinKwon commented Nov 29, 2019

What changes were proposed in this pull request?

Currently, the Apache Spark PR Builder uses hive-1.2 for hadoop-2.7 and hive-2.3 for hadoop-3.2. This PR aims to support:

  • [test-hive1.2] in PR builder
  • [test-hive2.3] in PR builder to be consistent and independent of the default profile
  • After this PR, all PR builders will use Hive 2.3 by default (because Spark uses Hive 2.3 by default as of c98e5eb)
  • Use default profile in AppVeyor build.
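
The tag handling described above can be sketched roughly as follows. This is a minimal illustration, not the actual PR builder script: the function name `select_hive_profile` and its `default` parameter are hypothetical, and only the tag strings and the Hive 2.3 default come from this description.

```python
def select_hive_profile(pull_title, default="hive2.3"):
    """Return the Hive profile requested by tags in a PR title.

    Hypothetical helper for illustration; not part of Spark's dev scripts.
    """
    profile = default
    # The tags are checked in a fixed order, so if a title carries both,
    # the later check (test-hive2.3) wins.
    if "test-hive1.2" in pull_title:
        profile = "hive1.2"
    if "test-hive2.3" in pull_title:
        profile = "hive2.3"
    return profile


print(select_hive_profile("[SPARK-29991][INFRA] Some change"))
print(select_hive_profile("[SPARK-29991][INFRA][test-hive1.2] Some change"))
```

With no tag in the title, the sketch falls back to Hive 2.3, matching the default stated above.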

Note that this was previously reverted due to an unexpected test failure in ThriftServerPageSuite, which was investigated in #26706. This PR fixes it by making the suite run in its own forked JVM. I have no explicit evidence for why this fix works; it was speculation on my part, but thankfully it did fix the failure.

Why are the changes needed?

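These new tags give us more flexibility: the Hive profile used by the PR builder can be chosen explicitly, independent of the default profile.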

Does this PR introduce any user-facing change?

No. (This is a dev-only change.)

How was this patch tested?

Check the Jenkins triggers in this PR.

Default:

========================================================================
Building Spark
========================================================================
[info] Building Spark using SBT with these arguments:  -Phadoop-2.7 -Phive-2.3 -Phive-thriftserver -Pmesos -Pspark-ganglia-lgpl -Phadoop-cloud -Phive -Pkubernetes -Pkinesis-asl -Pyarn test:package streaming-kinesis-asl-assembly/assembly

[test-hive1.2][test-hadoop3.2]:

========================================================================
Building Spark
========================================================================
[info] Building Spark using SBT with these arguments:  -Phadoop-3.2 -Phive-1.2 -Phadoop-cloud -Pyarn -Pspark-ganglia-lgpl -Phive -Phive-thriftserver -Pmesos -Pkubernetes -Pkinesis-asl test:package streaming-kinesis-asl-assembly/assembly

[test-maven][test-hive2.3]:

========================================================================
Building Spark
========================================================================
[info] Building Spark using Maven with these arguments:  -Phadoop-2.7 -Phive-2.3 -Pspark-ganglia-lgpl -Pyarn -Phive -Phadoop-cloud -Pkinesis-asl -Pmesos -Pkubernetes -Phive-thriftserver clean package -DskipTests

@HyukjinKwon HyukjinKwon changed the title [SPARK-29991][INFRA] Support test-hive1.2 and test-hive2.3 (default) in PR builder [SPARK-29991][INFRA][test-hive2.3] Support test-hive1.2 and test-hive2.3 (default) in PR builder Nov 29, 2019
@HyukjinKwon
Member Author

cc @dongjoon-hyun, @wangyum, and @srowen

@HyukjinKwon HyukjinKwon changed the title [SPARK-29991][INFRA][test-hive2.3] Support test-hive1.2 and test-hive2.3 (default) in PR builder [SPARK-29991][INFRA][test-hive1.2] Support test-hive1.2 and test-hive2.3 (default) in PR builder Nov 29, 2019
      "org.apache.spark.sql.hive.thriftserver.ThriftServerQueryTestSuite",
-     "org.apache.spark.sql.hive.thriftserver.SparkSQLEnvSuite"
+     "org.apache.spark.sql.hive.thriftserver.SparkSQLEnvSuite",
+     "org.apache.spark.sql.hive.thriftserver.ui.ThriftServerPageSuite"
Member Author

This fixes the test failure against Hive 2.3 (tested in #26706); however, I have no explicit evidence for why. It was just speculation on my part, and it worked.

Member

I'm still seeing failures like in https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/4953/testReport/ - should we make all the thriftserver tests single-threaded?

Member Author

Ah yeah. Maybe that's a better idea. I'll monitor a bit more and make a PR soon

@HyukjinKwon HyukjinKwon changed the title [SPARK-29991][INFRA][test-hive1.2] Support test-hive1.2 and test-hive2.3 (default) in PR builder [SPARK-29991][INFRA][test-hive1.2][test-hadoop3.2] Support test-hive1.2 and test-hive2.3 (default) in PR builder Nov 29, 2019
@HyukjinKwon HyukjinKwon changed the title [SPARK-29991][INFRA][test-hive1.2][test-hadoop3.2] Support test-hive1.2 and test-hive2.3 (default) in PR builder [WIP][SPARK-29991][INFRA][test-hive1.2][test-hadoop3.2] Support test-hive1.2 and test-hive2.3 (default) in PR builder Nov 29, 2019
@HyukjinKwon HyukjinKwon changed the title [WIP][SPARK-29991][INFRA][test-hive1.2][test-hadoop3.2] Support test-hive1.2 and test-hive2.3 (default) in PR builder [WIP][SPARK-29991][INFRA][test-hive1.2][test-hadoop3.2] Support Hive 1.3 and Hive 2.3 (default) in PR builder Nov 29, 2019
@HyukjinKwon HyukjinKwon changed the title [WIP][SPARK-29991][INFRA][test-hive1.2][test-hadoop3.2] Support Hive 1.3 and Hive 2.3 (default) in PR builder [SPARK-29991][INFRA][test-hive1.2][test-hadoop3.2] Support Hive 1.3 and Hive 2.3 (default) in PR builder Nov 29, 2019
@HyukjinKwon HyukjinKwon changed the title [SPARK-29991][INFRA][test-hive1.2][test-hadoop3.2] Support Hive 1.3 and Hive 2.3 (default) in PR builder [SPARK-29991][INFRA] Support Hive 1.3 and Hive 2.3 (default) in PR builder Nov 29, 2019
if "test-hive1.2" in ghprb_pull_title:
    os.environ["AMPLAB_JENKINS_BUILD_HIVE_PROFILE"] = "hive1.2"
if "test-hive2.3" in ghprb_pull_title:
    os.environ["AMPLAB_JENKINS_BUILD_HIVE_PROFILE"] = "hive2.3"
Member Author

cc @shaneknapp, maybe we can use this environment variable later when we consider setting up the Jenkins jobs for Hive 1.2/2.3 + Hadoop 2.7/3.2 + JDK 8/11 combinations.

The env name looks a bit odd given AMPLAB_JENKINS_BUILD_PROFILE. We might have to rename it later.
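
For context, a script that consumes this variable might translate it into a build profile flag along these lines. This is a sketch under assumptions: the helper `hive_profile_args` and the lookup table are hypothetical, and only the variable name, the values `hive1.2`/`hive2.3`, and the `-Phive-1.2`/`-Phive-2.3` flags appear in this PR.

```python
import os

# Hypothetical lookup table; the flags match those in this PR's build logs.
HIVE_PROFILE_FLAGS = {
    "hive1.2": ["-Phive-1.2"],
    "hive2.3": ["-Phive-2.3"],
}


def hive_profile_args(environ=os.environ, default="hive2.3"):
    """Map AMPLAB_JENKINS_BUILD_HIVE_PROFILE to a list of profile flags."""
    name = environ.get("AMPLAB_JENKINS_BUILD_HIVE_PROFILE", default)
    if name not in HIVE_PROFILE_FLAGS:
        # Fail loudly on a value the table does not know about.
        raise ValueError("Unknown Hive profile: %r" % name)
    return HIVE_PROFILE_FLAGS[name]


print(hive_profile_args({"AMPLAB_JENKINS_BUILD_HIVE_PROFILE": "hive1.2"}))
```

Keeping the mapping in one table would also make a later rename of the variable a one-line change.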

Contributor

sorry, i missed this over the november holiday. yes, i've had a desire for years to rename the AMPLAB_* variables as the amplab project ended in december 2016. :)

i'll do this after the 3.0 cut. no time like the present!

@HyukjinKwon HyukjinKwon changed the title [SPARK-29991][INFRA] Support Hive 1.3 and Hive 2.3 (default) in PR builder [SPARK-29991][INFRA][test-maven][hive-2.3] Support Hive 1.3 and Hive 2.3 (default) in PR builder Nov 29, 2019
@HyukjinKwon HyukjinKwon changed the title [SPARK-29991][INFRA][test-maven][hive-2.3] Support Hive 1.3 and Hive 2.3 (default) in PR builder [SPARK-29991][INFRA] Support Hive 1.3 and Hive 2.3 (default) in PR builder Nov 29, 2019
@SparkQA

SparkQA commented Nov 29, 2019

Test build #114614 has finished for PR 26710 at commit 2135955.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented Nov 29, 2019

Test build #114618 has finished for PR 26710 at commit 2135955.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

Member

@srowen srowen left a comment

Looks reasonable, esp. the last part that may fix the test problem

@SparkQA

SparkQA commented Nov 29, 2019

Test build #114621 has finished for PR 26710 at commit 2135955.

  • This patch fails from timeout after a configured wait of 400m.
  • This patch merges cleanly.
  • This patch adds no public classes.

@HyukjinKwon
Member Author

retest this please

@HyukjinKwon
Member Author

Yeah, it seems to be fixed. Let me merge this tomorrow if it hasn't been merged by then.

@dongjoon-hyun dongjoon-hyun changed the title [SPARK-29991][INFRA] Support Hive 1.3 and Hive 2.3 (default) in PR builder [SPARK-29991][INFRA] Support Hive 1.2 and Hive 2.3 (default) in PR builder Nov 29, 2019
Member

@dongjoon-hyun dongjoon-hyun left a comment

Thank you for the fix!

@SparkQA

SparkQA commented Nov 29, 2019

Test build #114635 has finished for PR 26710 at commit 2135955.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@wangyum
Member

wangyum commented Nov 30, 2019

Do we need to support testing [test-hive1.2][test-hadoop3.2]?

@HyukjinKwon
Member Author

No, because it doesn't work, but let's leave it since our build itself already allows such profile combinations.

@HyukjinKwon
Member Author

If you feel strongly, I will make a followup to disallow that specific matrix. Let me know @wangyum.

Merged to master.
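
If such a follow-up were made, the check could look something like the sketch below. Everything here is hypothetical (the function name, the string values, and the error message); the only fact taken from this thread is that Hive 1.2 with Hadoop 3.2 does not work.

```python
# Combinations reported as not working in this thread (hypothetical encoding).
UNSUPPORTED_COMBINATIONS = {("hive1.2", "hadoop3.2")}


def check_profile_combination(hive_profile, hadoop_profile):
    """Raise ValueError for a Hive/Hadoop pair that is known not to work."""
    if (hive_profile, hadoop_profile) in UNSUPPORTED_COMBINATIONS:
        raise ValueError(
            "Unsupported profile combination: %s with %s"
            % (hive_profile, hadoop_profile)
        )


# A supported pair passes silently.
check_profile_combination("hive2.3", "hadoop3.2")
```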

@wangyum
Member

wangyum commented Nov 30, 2019

OK. Thank you @HyukjinKwon LGTM

@shaneknapp
Contributor

shaneknapp commented Nov 30, 2019 via email

attilapiros pushed a commit to attilapiros/spark that referenced this pull request Dec 6, 2019
…ilder

### What changes were proposed in this pull request?

Currently, the Apache Spark PR Builder uses `hive-1.2` for `hadoop-2.7` and `hive-2.3` for `hadoop-3.2`. This PR aims to support:

- `[test-hive1.2]`  in PR builder
- `[test-hive2.3]` in PR builder to be consistent and independent of the default profile
- After this PR, all PR builders will use Hive 2.3 by default (because Spark uses Hive 2.3 by default as of apache@c98e5eb)
- Use default profile in AppVeyor build.

Note that this was previously reverted due to an unexpected test failure in `ThriftServerPageSuite`, which was investigated in apache#26706. This PR fixes it by making the suite run in its own forked JVM. I have no explicit evidence for why this fix works; it was speculation on my part, but thankfully it did fix the failure.

### Why are the changes needed?
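These new tags give us more flexibility: the Hive profile used by the PR builder can be chosen explicitly, independent of the default profile.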

### Does this PR introduce any user-facing change?
No. (This is a dev-only change.)

### How was this patch tested?
Check the Jenkins triggers in this PR.

Default:

```
========================================================================
Building Spark
========================================================================
[info] Building Spark using SBT with these arguments:  -Phadoop-2.7 -Phive-2.3 -Phive-thriftserver -Pmesos -Pspark-ganglia-lgpl -Phadoop-cloud -Phive -Pkubernetes -Pkinesis-asl -Pyarn test:package streaming-kinesis-asl-assembly/assembly
```

`[test-hive1.2][test-hadoop3.2]`:

```
========================================================================
Building Spark
========================================================================
[info] Building Spark using SBT with these arguments:  -Phadoop-3.2 -Phive-1.2 -Phadoop-cloud -Pyarn -Pspark-ganglia-lgpl -Phive -Phive-thriftserver -Pmesos -Pkubernetes -Pkinesis-asl test:package streaming-kinesis-asl-assembly/assembly
```

`[test-maven][test-hive2.3]`:

```
========================================================================
Building Spark
========================================================================
[info] Building Spark using Maven with these arguments:  -Phadoop-2.7 -Phive-2.3 -Pspark-ganglia-lgpl -Pyarn -Phive -Phadoop-cloud -Pkinesis-asl -Pmesos -Pkubernetes -Phive-thriftserver clean package -DskipTests
```

Closes apache#26710 from HyukjinKwon/SPARK-29991.

Authored-by: HyukjinKwon <[email protected]>
Signed-off-by: HyukjinKwon <[email protected]>
Member

@gatorsmile gatorsmile left a comment

@wangyum @HyukjinKwon could you help update the website https://spark.apache.org/developer-tools.html to include these new tags?

@dongjoon-hyun
Member

+1 for @gatorsmile's advice.

@HyukjinKwon
Member Author

Sure, will do.

HyukjinKwon added a commit that referenced this pull request Feb 11, 2020
…ranch-3.0-test-sbt-hadoop-2.7-hive-2.3

### What changes were proposed in this pull request?

This PR tries the approach suggested in #26710 (comment) to fix the test.

### Why are the changes needed?

To make the tests pass.

### Does this PR introduce any user-facing change?

No.

### How was this patch tested?

Jenkins will test first, and then `spark-branch-3.0-test-sbt-hadoop-2.7-hive-2.3` will test it out.

Closes #27513 from HyukjinKwon/test-SPARK-30756.

Authored-by: HyukjinKwon <[email protected]>
Signed-off-by: HyukjinKwon <[email protected]>
HyukjinKwon added a commit that referenced this pull request Feb 11, 2020
…ranch-3.0-test-sbt-hadoop-2.7-hive-2.3
(cherry picked from commit 8efe367)
Signed-off-by: HyukjinKwon <[email protected]>
@HyukjinKwon HyukjinKwon deleted the SPARK-29991 branch March 3, 2020 01:17
sjincho pushed a commit to sjincho/spark that referenced this pull request Apr 15, 2020
…ranch-3.0-test-sbt-hadoop-2.7-hive-2.3
(cherry picked from commit 8efe367)
Signed-off-by: HyukjinKwon <[email protected]>