Skip to content

Conversation

@zsxwing
Copy link
Member

@zsxwing zsxwing commented Sep 28, 2018

What changes were proposed in this pull request?

Since we don't fail a job when AccumulatorV2.merge fails, we should try to update the remaining accumulators so that they can still report correct values.

How was this patch tested?

The new unit test.

@SparkQA
Copy link

SparkQA commented Sep 29, 2018

Test build #96777 has finished for PR 22586 at commit e33788e.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Sep 29, 2018

Test build #96776 has finished for PR 22586 at commit badb471.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Sep 29, 2018

Test build #96778 has finished for PR 22586 at commit a3d4a0d.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Sep 29, 2018

Test build #96779 has finished for PR 22586 at commit f582c34.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

assert(sc.parallelize(1 to 10, 2).count() === 10)
}

test("misbehaved accumulator should not impact other accumulators") {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also verify the log message?

That's not in the core project.

@gatorsmile
Copy link
Member

LGTM

cc @cloud-fan

@gatorsmile
Copy link
Member

This was introduced by AccumulatorV2. It might be a blocker issue for Spark 2.4, since this could return an incorrect result.

Copy link
Contributor

@HeartSaVioR HeartSaVioR left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM for this patch, and follow-up issue (SPARK-25569) totally makes sense to me.

asfgit pushed a commit that referenced this pull request Sep 30, 2018
…n failing to update one accumulator

## What changes were proposed in this pull request?

Since we don't fail a job when `AccumulatorV2.merge` fails, we should try to update the remaining accumulators so that they can still report correct values.

## How was this patch tested?

The new unit test.

Closes #22586 from zsxwing/SPARK-25568.

Authored-by: Shixiong Zhu <[email protected]>
Signed-off-by: gatorsmile <[email protected]>
(cherry picked from commit b6b8a66)
Signed-off-by: gatorsmile <[email protected]>
asfgit pushed a commit that referenced this pull request Sep 30, 2018
…n failing to update one accumulator

## What changes were proposed in this pull request?

Since we don't fail a job when `AccumulatorV2.merge` fails, we should try to update the remaining accumulators so that they can still report correct values.

## How was this patch tested?

The new unit test.

Closes #22586 from zsxwing/SPARK-25568.

Authored-by: Shixiong Zhu <[email protected]>
Signed-off-by: gatorsmile <[email protected]>
(cherry picked from commit b6b8a66)
Signed-off-by: gatorsmile <[email protected]>
asfgit pushed a commit that referenced this pull request Sep 30, 2018
…n failing to update one accumulator

## What changes were proposed in this pull request?

Since we don't fail a job when `AccumulatorV2.merge` fails, we should try to update the remaining accumulators so that they can still report correct values.

## How was this patch tested?

The new unit test.

Closes #22586 from zsxwing/SPARK-25568.

Authored-by: Shixiong Zhu <[email protected]>
Signed-off-by: gatorsmile <[email protected]>
(cherry picked from commit b6b8a66)
Signed-off-by: gatorsmile <[email protected]>
@asfgit asfgit closed this in b6b8a66 Sep 30, 2018
@gatorsmile
Copy link
Member

Thanks! Merged to master/2.4/2.3/2.2

@zsxwing zsxwing deleted the SPARK-25568 branch October 1, 2018 18:01
jackylee-ch pushed a commit to jackylee-ch/spark that referenced this pull request Feb 18, 2019
…n failing to update one accumulator

## What changes were proposed in this pull request?

Since we don't fail a job when `AccumulatorV2.merge` fails, we should try to update the remaining accumulators so that they can still report correct values.

## How was this patch tested?

The new unit test.

Closes apache#22586 from zsxwing/SPARK-25568.

Authored-by: Shixiong Zhu <[email protected]>
Signed-off-by: gatorsmile <[email protected]>
Willymontaz pushed a commit to criteo-forks/spark that referenced this pull request Sep 26, 2019
…n failing to update one accumulator

## What changes were proposed in this pull request?

Since we don't fail a job when `AccumulatorV2.merge` fails, we should try to update the remaining accumulators so that they can still report correct values.

## How was this patch tested?

The new unit test.

Closes apache#22586 from zsxwing/SPARK-25568.

Authored-by: Shixiong Zhu <[email protected]>
Signed-off-by: gatorsmile <[email protected]>
(cherry picked from commit b6b8a66)
Signed-off-by: gatorsmile <[email protected]>
Willymontaz pushed a commit to criteo-forks/spark that referenced this pull request Sep 27, 2019
…n failing to update one accumulator

## What changes were proposed in this pull request?

Since we don't fail a job when `AccumulatorV2.merge` fails, we should try to update the remaining accumulators so that they can still report correct values.

## How was this patch tested?

The new unit test.

Closes apache#22586 from zsxwing/SPARK-25568.

Authored-by: Shixiong Zhu <[email protected]>
Signed-off-by: gatorsmile <[email protected]>
(cherry picked from commit b6b8a66)
Signed-off-by: gatorsmile <[email protected]>
otterc pushed a commit to linkedin/spark that referenced this pull request Mar 22, 2023
…n failing to update one accumulator

Ref: LIHADOOP-42001

Since we don't fail a job when `AccumulatorV2.merge` fails, we should try to update the remaining accumulators so that they can still report correct values.

The new unit test.

Closes apache#22586 from zsxwing/SPARK-25568.

Authored-by: Shixiong Zhu <[email protected]>
Signed-off-by: gatorsmile <[email protected]>
(cherry picked from commit b6b8a66)

RB=1489248
BUG=LIHADOOP-42001
G=superfriends-reviewers
R=fli,mshen,yezhou,edlu
A=yezhou
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants