[SPARK-2412] CoalescedRDD throws exception with certain pref locs #1337

aarondav · 2014-07-09T00:44:35Z

If the first pass of CoalescedRDD does not find the target number of locations AND the second pass finds new locations, an exception is thrown, because "groupHash.get(nxt_replica).get" is called for a replica that has no entry in groupHash yet.

The fix is just to add an ArrayBuffer to groupHash for that replica if it didn't already exist.
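The failure and the fix described above can be sketched in isolation. This is a minimal illustration, not the code from CoalescedRDD.scala: the `Group` class, the `GroupHashSketch` object and its method names are hypothetical, and the map is assumed to go from a host name to a buffer of groups.

```scala
import scala.collection.mutable

// Hypothetical stand-in for the partition-group type; not the real Spark class.
final case class Group(id: Int)

object GroupHashSketch {
  type GroupHash = mutable.Map[String, mutable.ArrayBuffer[Group]]

  // The failing pattern: Option.get throws NoSuchElementException
  // when the host has no entry in the map yet.
  def unsafeAdd(groupHash: GroupHash, host: String, g: Group): Unit = {
    groupHash.get(host).get += g
  }

  // The fix as described: create an empty buffer for a host seen for
  // the first time, then append to it.
  def safeAdd(groupHash: GroupHash, host: String, g: Group): Unit = {
    if (!groupHash.contains(host)) {
      groupHash.put(host, mutable.ArrayBuffer.empty[Group])
    }
    groupHash(host) += g
  }
}
```

Calling `unsafeAdd` with a host that was never registered throws, while `safeAdd` registers the host on demand and then appends.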

aarondav · 2014-07-09T00:44:43Z

@alig

AmplabJenkins · 2014-07-09T00:46:08Z

Merged build triggered.

AmplabJenkins · 2014-07-09T00:46:16Z

Merged build started.

AmplabJenkins · 2014-07-09T01:33:19Z

Merged build finished. All automated tests passed.

AmplabJenkins · 2014-07-09T01:33:19Z

All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16434/

andrewor14 · 2014-07-10T20:22:53Z

core/src/main/scala/org/apache/spark/rdd/CoalescedRDD.scala

getOrElseUpdate?
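For context, `getOrElseUpdate` is a standard method on `scala.collection.mutable.Map` that returns the existing value for a key or inserts and returns a default. A minimal sketch of how it collapses the check-then-insert step into one call, assuming a map from host name to a buffer (the object name, method name and `Int` element type are hypothetical simplifications):

```scala
import scala.collection.mutable

object GetOrElseUpdateSketch {
  // Hypothetical simplification: groups are plain Ints here.
  def add(groupHash: mutable.Map[String, mutable.ArrayBuffer[Int]],
          host: String,
          group: Int): Unit = {
    // Returns the buffer already stored for `host`, or stores and returns
    // a fresh empty one; the append then always has a buffer to work on.
    groupHash.getOrElseUpdate(host, mutable.ArrayBuffer.empty[Int]) += group
  }
}
```

The default argument is passed by name, so the empty buffer is only allocated when the host is actually missing.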

pwendell · 2014-07-10T21:25:01Z

Jenkins, test this please (testing something).

SparkQA · 2014-07-10T21:27:48Z

QA tests have started for PR 1337. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16523/consoleFull

aarondav · 2014-07-10T22:36:09Z

By the way, I'd like to point out that this is the 1337 PR. Naturally.

SparkQA · 2014-07-10T22:36:47Z

QA tests have started for PR 1337. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16529/consoleFull

SparkQA · 2014-07-10T23:12:44Z

QA results for PR 1337:
- This patch PASSES unit tests.
- This patch merges cleanly
- This patch adds no public classes

For more information see test output:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16523/consoleFull

AmplabJenkins · 2014-07-10T23:12:47Z

All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16523/

SparkQA · 2014-07-11T00:13:27Z

QA results for PR 1337:
- This patch PASSES unit tests.
- This patch merges cleanly
- This patch adds no public classes

For more information see test output:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16529/consoleFull

pwendell · 2014-07-15T06:57:45Z

core/src/main/scala/org/apache/spark/rdd/CoalescedRDD.scala

Is this just a stylistic change or does this operator somehow have different semantics?

aarondav (Jul 15, 2014): Strictly stylistic. It made more sense when I was using put below; now there's no reason for it.

pwendell · 2014-07-15T06:58:21Z

LGTM pending one small question

pwendell · 2014-07-15T20:53:51Z

okay LGTM

pwendell · 2014-07-17T08:01:33Z

Okay I merged this.

If the first pass of CoalescedRDD does not find the target number of locations AND the second pass finds new locations, an exception is thrown, as "groupHash.get(nxt_replica).get" is not valid.

The fix is just to add an ArrayBuffer to groupHash for that replica if it didn't already exist.

Author: Aaron Davidson <[email protected]>

Closes #1337 from aarondav/2412 and squashes the following commits:

f587b5d [Aaron Davidson] getOrElseUpdate
3ad8a3c [Aaron Davidson] [SPARK-2412] CoalescedRDD throws exception with certain pref locs

(cherry picked from commit 7c23c0d)
Signed-off-by: Patrick Wendell <[email protected]>

asfgit closed this in 7c23c0d Jul 17, 2014
