[SPARK-8151] [MLLIB] pipeline components should correctly implement copy #6622
When writing copy for CrossValidator, I recall we decided not to copy the extra params to the estimator param.
This case seems analogous, so we probably should not copy the extra params to classifier here.
(I could imagine arguments for either behavior, but we should be consistent.)
After discussion: We will override copy() for CrossValidator too!
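To make the two behaviors under discussion concrete, here is a minimal plain-Python sketch, not Spark code. `Component` and `MetaEstimator` are hypothetical stand-ins for a pipeline component and a CrossValidator-like wrapper, and keying `extra` by `(uid, name)` is an assumption made for illustration. It shows the variant in which the wrapper's `copy` forwards the extra params to the nested estimator.

```python
class Component:
    """Hypothetical stand-in for a pipeline component with params."""

    def __init__(self, uid, **params):
        self.uid = uid
        self.params = dict(params)

    def copy(self, extra=None):
        # New instance with the same uid; only the entries of `extra`
        # addressed to this uid are applied.
        new = Component(self.uid, **self.params)
        for (uid, name), value in (extra or {}).items():
            if uid == self.uid:
                new.params[name] = value
        return new


class MetaEstimator(Component):
    """Hypothetical CrossValidator-like wrapper around another component."""

    def __init__(self, uid, estimator, **params):
        super().__init__(uid, **params)
        self.estimator = estimator

    def copy(self, extra=None):
        own = super().copy(extra)
        # Forward `extra` to the nested estimator as well, so params
        # addressed to it are not silently dropped.
        return MetaEstimator(self.uid, self.estimator.copy(extra), **own.params)


lr = Component("lr", maxIter=10)
cv = MetaEstimator("cv", lr, numFolds=3)
cv2 = cv.copy({("lr", "maxIter"): 50, ("cv", "numFolds"): 5})
print(cv2.params, cv2.estimator.params)
```

Without the override, `cv2.estimator` would still be the original `lr` with `maxIter=10`; the originals are left untouched in either case.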
Test build #34115 has finished for PR 6622 at commit
Changed the title to
Implemented `copy` for every pipeline component. Letting Jenkins run for the MiMa checks.
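As a rough illustration of what "implementing copy for every component" can involve, the plain-Python sketch below models a shared default-copy helper plus a model class that has to override `copy` because it carries fitted state. All names (`Params`, `default_copy`, `Scaler`, `ScalerModel`) are hypothetical and chosen for this example; this is not Spark's implementation, and `extra` is keyed by param name only to keep the sketch short.

```python
class Params:
    """Hypothetical base class holding a uid and a param map."""

    def __init__(self, uid):
        self.uid = uid
        self.param_map = {}

    def set(self, name, value):
        self.param_map[name] = value
        return self

    def default_copy(self, extra=None):
        # Works only for classes whose constructor takes just a uid:
        # build a fresh instance, then copy params and apply `extra`.
        new = type(self)(self.uid)
        new.param_map.update(self.param_map)
        new.param_map.update(extra or {})
        return new

    def copy(self, extra=None):
        raise NotImplementedError


class Scaler(Params):
    """Stateless component: the default copy is enough."""

    def copy(self, extra=None):
        return self.default_copy(extra)


class ScalerModel(Params):
    """Fitted model: carries state that the default copy cannot rebuild."""

    def __init__(self, uid, mean):
        super().__init__(uid)
        self.mean = mean

    def copy(self, extra=None):
        new = ScalerModel(self.uid, self.mean)  # carry the fitted state over
        new.param_map.update(self.param_map)
        new.param_map.update(extra or {})
        return new


model = ScalerModel("scaler_1", mean=2.0).set("inputCol", "x")
model2 = model.copy({"outputCol": "y"})
scaler2 = Scaler("scaler_0").set("inputCol", "x").copy()
```

The point of the sketch is that a generic helper covers components constructed from a uid alone, while anything with extra constructor arguments needs its own `copy`.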
Test build #34402 has finished for PR 6622 at commit
Test build #34467 has finished for PR 6622 at commit
Why does IDFModel need this? It does not use minDocFreq.
A few new comments plus the old ones, but it looks good.
Test build #35190 has finished for PR 6622 at commit
LGTM except for the style tests.
Test build #35234 has finished for PR 6622 at commit
Merged into master and branch-1.4.
Otherwise, extra params get ignored in `PipelineModel.transform`. jkbradley

Author: Xiangrui Meng <[email protected]>

Closes #6622 from mengxr/SPARK-8087 and squashes the following commits:

0e4c8c4 [Xiangrui Meng] fix merge issues
26fc1f0 [Xiangrui Meng] address comments
e607a04 [Xiangrui Meng] merge master
b85b57e [Xiangrui Meng] fix examples/compile
d6f7891 [Xiangrui Meng] rename defaultCopyWithParams to defaultCopy
84ec278 [Xiangrui Meng] remove setter checks due to generics
2cf2ed0 [Xiangrui Meng] snapshot
291814f [Xiangrui Meng] OneVsRest.copy
1dfe3bd [Xiangrui Meng] PipelineModel.copy should copy stages

(cherry picked from commit 43c7ec6)
Signed-off-by: Xiangrui Meng <[email protected]>
Otherwise, extra params get ignored in `PipelineModel.transform`. @jkbradley
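The failure mode described here can be reproduced with a small plain-Python toy, assuming that `transform` with extra params is implemented as "copy with the extra params, then transform". `AddConstant` and `PipelineModelSketch` are hypothetical names, and the `copy_stages` flag exists only to contrast the two behaviors; none of this is Spark's actual code.

```python
class AddConstant:
    """Hypothetical stage that adds `amount` to every value."""

    def __init__(self, uid, amount=1):
        self.uid = uid
        self.amount = amount

    def copy(self, extra=None):
        extra = extra or {}
        # Apply an override only if it is addressed to this stage's uid.
        return AddConstant(self.uid, extra.get((self.uid, "amount"), self.amount))

    def transform(self, values):
        return [v + self.amount for v in values]


class PipelineModelSketch:
    """Hypothetical pipeline model; `copy_stages` toggles the fix."""

    def __init__(self, stages, copy_stages=True):
        self.stages = list(stages)
        self.copy_stages = copy_stages

    def copy(self, extra=None):
        if self.copy_stages:
            stages = [s.copy(extra) for s in self.stages]
        else:
            stages = self.stages  # shallow copy: `extra` never reaches the stages
        return PipelineModelSketch(stages, self.copy_stages)

    def transform(self, values, extra=None):
        # Assumed pattern: transform-with-params is copy-then-transform.
        model = self.copy(extra) if extra else self
        for stage in model.stages:
            values = stage.transform(values)
        return values


stages = [AddConstant("a", 1), AddConstant("b", 10)]
extra = {("b", "amount"): 100}

ignored = PipelineModelSketch(stages, copy_stages=False).transform([0, 1], extra)
applied = PipelineModelSketch(stages).transform([0, 1], extra)
print(ignored)  # [11, 12]: the override for stage "b" was dropped
print(applied)  # [101, 102]: the override reached stage "b"
```

In the shallow variant the override is accepted without error and simply has no effect, which is what makes this kind of bug easy to miss.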