
Conversation

@viirya
Member

@viirya viirya commented Sep 4, 2021

What changes were proposed in this pull request?

This patch proposes to use non-shaded Hadoop client libraries.

Why are the changes needed?

Currently we use Hadoop 3.3.1's shaded client libraries. lz4 is a provided dependency in Hadoop Common 3.3.1, used by Lz4Codec, but it isn't excluded from relocation in these libraries. So when using lz4 as the Parquet codec, we hit the exception below even if we include lz4 as a dependency.

[info]   Cause: java.lang.NoClassDefFoundError: org/apache/hadoop/shaded/net/jpountz/lz4/LZ4Factory                                                                                            
[info]   at org.apache.hadoop.io.compress.lz4.Lz4Compressor.<init>(Lz4Compressor.java:66)
[info]   at org.apache.hadoop.io.compress.Lz4Codec.createCompressor(Lz4Codec.java:119)                                                                                                         
[info]   at org.apache.hadoop.io.compress.CodecPool.getCompressor(CodecPool.java:152)                                                                                                          
[info]   at org.apache.hadoop.io.compress.CodecPool.getCompressor(CodecPool.java:168)                                                                                                          
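For reference, the failure can be hit with something like the following (a minimal sketch; the class name and output path are made up for illustration, and it assumes a local master):

import org.apache.spark.sql.SparkSession;

public class Lz4ParquetRepro {
  public static void main(String[] args) {
    SparkSession spark = SparkSession.builder().master("local[1]").getOrCreate();
    // Writing Parquet with the lz4 codec triggers the NoClassDefFoundError
    // above when the shaded Hadoop 3.3.1 client libraries are on the classpath.
    spark.range(10).write().option("compression", "lz4").parquet("/tmp/lz4-parquet-repro");
    spark.stop();
  }
}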

I already submitted a PR (HADOOP-17891) to Hadoop to fix it. Before that fix is released, on the Spark side we can either downgrade to 3.3.0 or revert to the non-shaded Hadoop client libraries.

Does this PR introduce any user-facing change?

No

How was this patch tested?

Manually tested.

@github-actions github-actions bot added the BUILD label Sep 4, 2021
@viirya viirya changed the title [SPARK-36669][SQL] Revert to non-shaded Hadoop client library [SPARK-36669][SQL][BUILD] Revert to non-shaded Hadoop client library Sep 4, 2021
@viirya viirya changed the title [SPARK-36669][SQL][BUILD] Revert to non-shaded Hadoop client library [SPARK-36669][BUILD] Revert to non-shaded Hadoop client library Sep 4, 2021
@viirya
Member Author

viirya commented Sep 4, 2021

when the Hadoop profile is hadoop-2.7, because these are only available in 3.x. Note that,
as result we have to include the same hadoop-client dependency multiple times in hadoop-2.7.
-->
<hadoop-client-api.artifact>hadoop-client-api</hadoop-client-api.artifact>
Member Author


This is open for feedback from the reviewers, so I haven't added a comment here yet.

@SparkQA

SparkQA commented Sep 4, 2021

Test build #142993 has finished for PR 33913 at commit 117a07c.

  • This patch fails to build.
  • This patch merges cleanly.
  • This patch adds no public classes.

@dongjoon-hyun
Member

Can we have test coverage for your example, @viirya?

@SparkQA

SparkQA commented Sep 4, 2021

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/47495/

@viirya
Member Author

viirya commented Sep 4, 2021

Yeah, I found this issue while adding codec tests in #33912.

@SparkQA

SparkQA commented Sep 4, 2021

Kubernetes integration test status failure
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/47495/

@sunchao
Member

sunchao commented Sep 5, 2021

Well, this is a bummer. We can't go back to 3.3.0 since it supports neither the shaded client nor the non-shaded client. The only option is 3.2.2.

I think in theory we can use the non-shaded client for 3.3.1, but I haven't tried it. You may need to revert more PRs, for instance #33053.

@viirya
Member Author

viirya commented Sep 5, 2021

Hmm, yeah, it looks like more trouble than I thought... I only made this change to run the codec tests (SQL). For the whole of Spark, it seems more would need to be reverted.

@gengliangwang
Member

+1 for a new test case for the issue

@viirya
Member Author

viirya commented Sep 5, 2021

@gengliangwang if we run the lz4 codec test from #33912 against the current master branch, it throws the exception shown in the description.

@HyukjinKwon
Member

cc @bozhang2820, who's also interested in this, FYI

@pan3793
Member

pan3793 commented Sep 6, 2021

Moving to the Hadoop shaded client is a big improvement for Spark 3.2. How about implementing an org.apache.hadoop.shaded.net.jpountz.lz4.LZ4Factory that delegates to net.jpountz.lz4.LZ4Factory as a workaround?
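A minimal sketch of that delegate idea (hypothetical; the actual wrapper classes were added in #33912, and only the entry points that Hadoop's Lz4Compressor/Lz4Decompressor call need mirroring):

// Lives at the shaded name Hadoop's codec classes expect, but forwards to
// the unshaded lz4-java that Spark already ships. LZ4Compressor and
// LZ4SafeDecompressor wrappers (not shown) get the same treatment in their
// own files.
package org.apache.hadoop.shaded.net.jpountz.lz4;

public final class LZ4Factory {

  private final net.jpountz.lz4.LZ4Factory delegate;

  private LZ4Factory(net.jpountz.lz4.LZ4Factory delegate) {
    this.delegate = delegate;
  }

  // Hadoop's Lz4Compressor/Lz4Decompressor obtain the factory this way.
  public static LZ4Factory fastestInstance() {
    return new LZ4Factory(net.jpountz.lz4.LZ4Factory.fastestInstance());
  }

  public LZ4Compressor fastCompressor() {
    return new LZ4Compressor(delegate.fastCompressor());
  }

  public LZ4Compressor highCompressor() {
    return new LZ4Compressor(delegate.highCompressor());
  }

  public LZ4SafeDecompressor safeDecompressor() {
    return new LZ4SafeDecompressor(delegate.safeDecompressor());
  }
}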

@viirya
Member Author

viirya commented Sep 6, 2021

Thanks @pan3793.

I tried adding lz4 wrapper classes in #33912. Fortunately, only a few lz4-java APIs are used internally by the Hadoop lz4 codec, so the wrapper classes are simple.

It passes the tests locally. Let me know what you think about this idea. @cloud-fan @sunchao @dongjoon-hyun

Basically it sounds good, as we don't need to revert the shaded Hadoop client related changes.

@dbtsai
Member

dbtsai commented Sep 6, 2021

+1 with the workaround.

@viirya does it mean that, for snappy, we will have two copies of snappy-java? One from Spark, and another in the shaded Hadoop lib?

@dbtsai
Member

dbtsai commented Sep 6, 2021

> Thanks @pan3793.
>
> I tried adding lz4 wrapper classes in #33912. Fortunately, only a few lz4-java APIs are used internally by the Hadoop lz4 codec, so the wrapper classes are simple.

It works for Parquet, but does it work for lz4-compressed Hadoop sequence files, which need full Lz4Codec support?

Thanks,

@viirya
Member Author

viirya commented Sep 6, 2021

> +1 with the workaround.
>
> @viirya does it mean that, for snappy, we will have two copies of snappy-java? One from Spark, and another in the shaded Hadoop lib?

On the Hadoop side, snappy-java is not a provided but a compile dependency, so it is relocated and included in the shaded client libraries. Spark includes its own snappy-java, yes. But they don't conflict, I think, since Hadoop relocates its copy.
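As a quick sanity check, the two copies live under different fully-qualified names, so they can coexist on one classpath (a hypothetical sketch; it assumes both Spark's snappy-java and hadoop-client-runtime are present):

public class SnappyCoexistCheck {
  public static void main(String[] args) throws ClassNotFoundException {
    // Spark's unshaded copy:
    Class.forName("org.xerial.snappy.Snappy");
    // Hadoop's relocated copy inside hadoop-client-runtime:
    Class.forName("org.apache.hadoop.shaded.org.xerial.snappy.Snappy");
    System.out.println("Both copies load without conflict.");
  }
}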

@viirya
Member Author

viirya commented Sep 6, 2021

> It works for Parquet, but does it work for lz4-compressed Hadoop sequence files, which need full Lz4Codec support?
>
> Thanks,

lz4-java APIs are used only internally in Hadoop's Lz4Compressor and Lz4Decompressor, not by Lz4Codec directly. The added wrapper classes already implement all the lz4-java APIs used there, so Hadoop usage should be fine for both Parquet and sequence files. I will also run a test to verify it.

Actually, maybe we also need to add some e2e tests for sequence files in Spark too, along the lines of the sketch below.
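(A hypothetical sketch; the class name and output path are made up, and it assumes a local master.)

import java.util.Arrays;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.io.compress.Lz4Codec;
import org.apache.hadoop.mapred.SequenceFileOutputFormat;
import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaSparkContext;
import scala.Tuple2;

public class Lz4SeqFileCheck {
  public static void main(String[] args) {
    JavaSparkContext jsc = new JavaSparkContext(
        new SparkConf().setMaster("local[1]").setAppName("lz4-seq-check"));
    String path = "/tmp/lz4-seq-check";
    // Round-trip an lz4-compressed sequence file; with the broken relocation
    // this fails the same way as the Parquet path in the description.
    jsc.parallelize(Arrays.asList("a", "bb", "ccc"))
       .mapToPair(s -> new Tuple2<>(new Text(s), new IntWritable(s.length())))
       .saveAsHadoopFile(path, Text.class, IntWritable.class,
           SequenceFileOutputFormat.class, Lz4Codec.class);
    System.out.println(jsc.sequenceFile(path, Text.class, IntWritable.class).count());
    jsc.stop();
  }
}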

@sunchao
Member

sunchao commented Sep 6, 2021

+1 on adding the wrapper as a workaround

@pan3793
Member

pan3793 commented Sep 8, 2021

Does this help for snappy-java?

> Fixed the pure-java Snappy fallback logic when no native library for your platform is found.

https://github.com/xerial/snappy-java/releases/tag/1.1.8.2

@viirya
Member Author

viirya commented Sep 8, 2021

> Does this help for snappy-java?
>
> Fixed the pure-java Snappy fallback logic when no native library for your platform is found.
>
> https://github.com/xerial/snappy-java/releases/tag/1.1.8.2

I think it doesn't. Actually, the relocated snappy classes can still find and load the native library. But when JNI resolves a native method, it looks up a symbol mangled from the class's package name (e.g. Java_org_xerial_snappy_SnappyNative_<method>), and relocation rewrites the Java packages without touching the symbols exported by the native library, so the defined native methods cannot be resolved.

BTW, Hadoop 3.3.1 already uses snappy-java 1.1.8.2.

@pan3793
Member

pan3793 commented Sep 8, 2021

What if we force-set org.apache.hadoop.shaded.org.xerial.snappy.purejava = true? The loading logic in the relocated snappy-java falls back to the pure-Java implementation:

try {
    if (Boolean.parseBoolean(System.getProperty("org.apache.hadoop.shaded.org.xerial.snappy.purejava", "false"))) {
        setSnappyApi(new PureJavaSnappy());
    } else {
        loadNativeLibrary();
        setSnappyApi(new SnappyNative());
    }
} catch (Throwable var1) {
    setSnappyApi(new PureJavaSnappy());
}
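In code, the suggestion amounts to setting the property before the codec is first used, something like this (a sketch; it has to run before the relocated SnappyLoader initializes):

// Force the relocated snappy-java onto its pure-Java fallback (hypothetical usage).
System.setProperty("org.apache.hadoop.shaded.org.xerial.snappy.purejava", "true");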

@viirya viirya closed this Sep 10, 2021
@viirya viirya deleted the SPARK-36669 branch December 27, 2023 18:25