
Conversation

@jiangxb1987 (Contributor) commented Oct 16, 2017

What changes were proposed in this pull request?

Rename the config spark.files.ignoreEmptySplits and make it internal.

This is a follow-up of #19464.
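
For reference, a minimal sketch of how the renamed, internal config could be declared with Spark's ConfigBuilder API. The exact key spark.hadoopRDD.ignoreEmptySplits, the doc string, and the default value are assumptions inferred from the HADOOP_RDD_IGNORE_EMPTY_SPLITS constant used in the test diff below, not confirmed by this thread:

// Hypothetical declaration in org.apache.spark.internal.config;
// key, doc string, and default are assumptions for illustration only.
private[spark] val HADOOP_RDD_IGNORE_EMPTY_SPLITS =
  ConfigBuilder("spark.hadoopRDD.ignoreEmptySplits")
    .internal()
    .doc("When true, HadoopRDD/NewHadoopRDD will not create partitions for empty input splits.")
    .booleanConf
    .createWithDefault(false)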

How was this patch tested?

Existing tests.


@HyukjinKwon (Member) left a comment:

LGTM too

@liutang123 (Contributor) commented:

It looks better.

@SparkQA commented Oct 16, 2017

Test build #82787 has finished for PR 19504 at commit bcb3dbd.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

.set(HADOOP_RDD_IGNORE_EMPTY_SPLITS, true)
sc = new SparkContext(conf)

def testIgnoreEmptySplits(

A contributor commented on this diff:

testIgnoreEmptySplits(
       data = Array.empty[Tuple2[String, String]],
       actualPartitionNum = 1,
       expectedPartitionNum = 0)

=>

testIgnoreEmptySplits(
       data = Array.empty[(String, String)],
       actualPartitionNum = 1,
       expectedPartitionNum = 0)
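
The two forms are equivalent: (String, String) is Scala's syntactic sugar for Tuple2[String, String], so the suggestion is purely stylistic. A small hypothetical illustration of the equivalence:

// Both declarations denote the same element type; the sugared form is the
// idiomatic spelling and is what the review suggests.
val verbose: Array[Tuple2[String, String]] = Array.empty[(String, String)]
val sugared: Array[(String, String)] = Array.empty[Tuple2[String, String]]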

@cloud-fan (Contributor) commented:

LGTM, merging to master!
