Skip to content

Conversation

@amanomer
Copy link
Contributor

@amanomer amanomer commented Nov 9, 2019

What changes were proposed in this pull request?

Use persist on RDD which is used for more than one action.

Why are the changes needed?

RDD dataset.toDF.rdd will generate two rdds: training and validation. Some actions will be operated on these two rdds, but dataset.toDF.rdd is not persisted, which will cause re-computation.

Does this PR introduce any user-facing change?

No

How was this patch tested?

Tested manually

@AmplabJenkins
Copy link

Can one of the admins verify this patch?

@amanomer amanomer changed the title [SPARK-29815] Missing persist in ml.tuning.CrossValidator.fit() [SPARK-29815][ML] Missing persist in ml.tuning.CrossValidator.fit() Nov 9, 2019
@amanomer amanomer closed this Nov 9, 2019
@srowen
Copy link
Member

srowen commented Nov 10, 2019

#26454

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants