@sarutak
Member

@sarutak sarutak commented May 1, 2014

I think that when an Executor holding block(s) to be fetched is lost, the fetch from that Executor should fail and a re-fetch from another Executor should occur.
When the Executor process is dead, the re-fetch succeeds. But when the Executor process is alive and can't read its own block(s), no re-fetch occurs.
I don't think this behavior is expected, so I tried to fix it.

This pull request is my first modification. How does it look?
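
To illustrate the behavior described above, here is a minimal sketch of a fetch that falls back to another Executor when one cannot serve a block. It is not Spark's actual fetch code: `RefetchSketch`, `fetchWithFallback`, and the `fetch` callback are hypothetical names, and the sketch assumes a failed read is reported as `None` whether the Executor process is dead or alive but unable to read its local file.

```scala
// Hypothetical names throughout; this is an illustration, not Spark's implementation.
object RefetchSketch {
  type ExecutorId = String
  type BlockId = String

  /** Try each Executor that claims to hold the block and return the first successful read.
    * `fetch` is assumed to return None when the Executor cannot serve the block. */
  def fetchWithFallback(
      blockId: BlockId,
      locations: Seq[ExecutorId],
      fetch: (ExecutorId, BlockId) => Option[Array[Byte]]): Option[Array[Byte]] = {
    // The iterator is lazy, so later Executors are contacted only after earlier ones fail.
    locations.iterator
      .map(loc => fetch(loc, blockId))
      .collectFirst { case Some(bytes) => bytes }
  }

  def main(args: Array[String]): Unit = {
    // exec-1 is alive but has lost its block file; exec-2 still has the data.
    val store = Map("exec-2" -> Map("rdd_0_0" -> "payload".getBytes("UTF-8")))
    val fetch = (e: ExecutorId, b: BlockId) => store.get(e).flatMap(_.get(b))
    val result = fetchWithFallback("rdd_0_0", Seq("exec-1", "exec-2"), fetch)
    println(result.map(bytes => new String(bytes, "UTF-8")))
  }
}
```

The key point is that an Executor that is alive but cannot read its block has to surface the failure to the caller; otherwise the fallback never runs.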

Modified DiskBlockManager to create sub directories when they are absent
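
The idea in this commit message can be sketched roughly as follows. This is a simplified illustration, not the actual `DiskBlockManager` code: the object `SubDirSketch`, the `getFile` signature, and the bucket naming are assumptions made for the example.

```scala
import java.io.{File, IOException}
import java.nio.file.Files

// Hypothetical, simplified stand-in; not the real DiskBlockManager.
object SubDirSketch {
  /** Resolve the file for `name` under `root`, recreating the sub directory if it has gone missing. */
  def getFile(root: File, subDirsPerRoot: Int, name: String): File = {
    // Non-negative bucket index derived from the file name (a simplification).
    val subDirId = java.lang.Math.floorMod(name.hashCode, subDirsPerRoot)
    val subDir = new File(root, "%02x".format(subDirId))
    // mkdirs() returns false when the directory already exists,
    // so re-check isDirectory before treating the result as a failure.
    if (!subDir.isDirectory && !subDir.mkdirs() && !subDir.isDirectory) {
      throw new IOException(s"Failed to create sub directory $subDir")
    }
    new File(subDir, name)
  }

  def main(args: Array[String]): Unit = {
    val root = Files.createTempDirectory("spark-local-sketch").toFile
    val first = getFile(root, 64, "shuffle_0_0_0")
    // Simulate the sub directory being deleted out from under the process.
    first.getParentFile.delete()
    val second = getFile(root, 64, "shuffle_0_0_0")
    println(second.getParentFile.isDirectory)
  }
}
```

Checking the directory on every lookup, rather than trusting an in-memory record that it was created once, is what lets the process recover after the directory is removed externally.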
@sarutak sarutak changed the title Modified BlockFetchIterator to handle fetch failure re-fetch fail occasionally May 1, 2014
@sarutak sarutak changed the title re-fetch fail occasionally re-fetch fails occasionally May 1, 2014
@sarutak sarutak changed the title re-fetch fails occasionally SPARK-1667 re-fetch fails occasionally May 1, 2014
@AmplabJenkins

Can one of the admins verify this patch?

Contributor

Please don't catch Throwable.
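
As an illustration of this review point, two narrower alternatives in Scala are sketched below. The object and method names (`CatchSketch`, `readBlock`, `readBlockLenient`) are hypothetical and do not come from the patch; only `java.nio.file.Files.readAllBytes` and `scala.util.control.NonFatal` are real library APIs.

```scala
import java.io.{File, IOException}
import java.nio.file.Files
import scala.util.control.NonFatal

// Hypothetical helpers for illustration only.
object CatchSketch {
  /** Narrowest option: handle only the failure that is expected here. */
  def readBlock(file: File): Option[Array[Byte]] =
    try {
      Some(Files.readAllBytes(file.toPath))
    } catch {
      // A missing file raises NoSuchFileException, which is an IOException.
      case _: IOException => None
    }

  /** Broader option: handle any non-fatal exception. */
  def readBlockLenient(file: File): Option[Array[Byte]] =
    try {
      Some(Files.readAllBytes(file.toPath))
    } catch {
      // NonFatal does not match OutOfMemoryError, InterruptedException,
      // and similar fatal conditions, so those still propagate to the caller.
      case NonFatal(_) => None
    }
}
```

Catching `Throwable` would also swallow errors such as `OutOfMemoryError`, which the code is in no position to handle.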

@mridulm
Contributor

mridulm commented May 3, 2014

Can you add a test case to verify this, one where None is returned and validated?

I suspect we have observed this in the past too, but I never got around to nailing it down. Thanks for the PR!
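
A test along these lines might look roughly like the sketch below. It does not use Spark's test harness or its real block store: `MissingBlockCheck` and `getBlockData` are hypothetical stand-ins, and the check simply deletes a block file on local disk and validates that the lookup then returns `None`.

```scala
import java.io.File
import java.nio.file.Files

// Hypothetical stand-in for the lookup under test; not Spark's API.
object MissingBlockCheck {
  def getBlockData(dir: File, blockId: String): Option[Array[Byte]] = {
    val f = new File(dir, blockId)
    if (f.isFile) Some(Files.readAllBytes(f.toPath)) else None
  }

  def main(args: Array[String]): Unit = {
    val dir = Files.createTempDirectory("spark-local-test").toFile
    val block = new File(dir, "rdd_0_0")
    Files.write(block.toPath, Array[Byte](1, 2, 3))

    // While the file exists, the data is returned.
    assert(getBlockData(dir, "rdd_0_0").exists(_.sameElements(Array[Byte](1, 2, 3))))

    // Remove the file behind the store's back, then validate that None comes back.
    assert(block.delete())
    assert(getBlockData(dir, "rdd_0_0").isEmpty)
  }
}
```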

@sarutak
Member Author

sarutak commented May 6, 2014

Thank you for your comment, mridulm!
I will revise the patch.

I hit the bug when I removed the block files in /tmp/spark-local-*/ on the local filesystem of an Executor.

@sarutak sarutak closed this Jul 8, 2014
@sarutak sarutak deleted the SPARK-1667 branch July 8, 2014 15:25
@aarondav
Contributor

aarondav commented Jul 8, 2014

Did you mean to close this PR?

bzhaoopenstack pushed a commit to bzhaoopenstack/spark that referenced this pull request Sep 11, 2019
Use old bazel for kind