-
Notifications
You must be signed in to change notification settings - Fork 28.9k
[SPARK-30015][BUILD] Move hive-storage-api dependency from hive-2.3 to sql/core
#26658
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
hive-2.3 to sql/corehive-2.3 to sql/core
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
hive-2.3 to sql/corehive-2.3 to sql/core
|
Retest this please. |
This comment has been minimized.
This comment has been minimized.
|
Test build #114395 has finished for PR 26658 at commit
|
tgravescs
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm still seeing build failures even with this pr:
[ERROR] [Error] /home/tgraves/workspace/tgravescs-spark/sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveExternalCatalog.scala:32: object serde is not a member of package org.apache.hadoop.hive
[ERROR] [Error] /home/tgraves/workspace/tgravescs-spark/sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveExternalCatalog.scala:1343: not found: value SERIALIZATION_FORMAT
build/mvn -Phadoop-2.7 -Phive -Pyarn -Pkinesis-asl -Pkubernetes -Pmesos -Phadoop-cloud -Pspark-ganglia-lgpl package -DskipTests
Maybe I still need the -Phive-2.3?
|
Hi, @srowen and @tgravescs . Thank you for review.
BTW, although it's orthogonal to this PR, we had better remove |
|
so what are the rest of the jiras to fix the build? Its not very user friendly for us to have broke the default build and it to not be clear at what options actually work. I tried building 4 different ways before finding one that worked. |
|
@tgravescs . After merging this, I'd like to change the following. BTW, what you want when you use
I'm wondering your decision on |
|
Thank you for approval. Sorry for the inconvenience. |
|
I'm not sure I saw an answer on the mailing thread, if we are considering hive 1.2.1 deprecated perhaps we should have hive 2.3 the default. Also for hadoop 3.2 profile the only one that works is hive 2.3 right? |
|
Thanks for confirming. Yes. Right. |
|
Test build #114416 has finished for PR 26658 at commit
|
|
Test build #114417 has finished for PR 26658 at commit
|
What changes were proposed in this pull request?
This PR aims to relocate the following internal dependencies to compile
sql/corewithout-Phive-2.3profile.hive-storage-apitosql/corewhich is usinghive-storage-apireally.BEFORE (sql/core compilation)
AFTER (sql/core compilation)
commons-lang:commons-langtest dependency tospark-coremodule to manage the dependency explicitly. Without this,coremodule fails to build the test classes.BEFORE (commons-lang:commons-lang)
The following is the previous
coremodule'scommons-lang:commons-langdependency.AFTER (commons-lang:commons-lang)
Since we wanted to verify that this PR doesn't change
hive-1.2profile, we mergedSPARK-30005 Update
test-dependencies.shto checkhive-1.2/2.3profile before this PR.Why are the changes needed?
Apache Spark 2.4's
sql/coreis usingApache ORC (nohive)jars including shadedhive-storage-apito access ORC data sources.Apache Spark 3.0's
sql/coreis usingApache Hivejars directly. Previously,-Phadoop-3.2hid thishive-storage-apidependency. Now, we are using-Phive-2.3instead. As I mentioned previously, this PR is required to compilesql/coremodule without-Phive-2.3.For
sql/hiveandsql/hive-thriftserver, it's natural that we need-Phive-1.2or-Phive-2.3.Does this PR introduce any user-facing change?
No.
How was this patch tested?
This will pass the Jenkins (with the dependency check and unit tests).
We need to check manually with
./build/mvn -DskipTests --pl sql/core --am compile.This closes #26657 .