[SPARK-12566] [SPARK-14324] [ML] [WIP] GLM model family, link function support in SparkR:::glm #12294
Test build #55513 has finished for PR 12294 at commit
We can use rFormulaModel to avoid fitting again.
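A minimal sketch of what this suggestion could look like, assuming the wrapper fits an `RFormula` once to read the output schema and then builds a `Pipeline`. The toy data, column names, and object name are hypothetical and not taken from the PR. Because `Pipeline.fit` uses stages that are already transformers as they are, passing the fitted `rFormulaModel` instead of the unfitted `rFormula` avoids a second fit of the formula.

```scala
import org.apache.spark.ml.{Pipeline, PipelineStage}
import org.apache.spark.ml.feature.RFormula
import org.apache.spark.ml.regression.GeneralizedLinearRegression
import org.apache.spark.sql.SparkSession

// Hypothetical standalone example; not the wrapper code from this PR.
object ReuseRFormulaModel {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("reuse-rformula-model")
      .getOrCreate()
    import spark.implicits._

    // Toy data, purely illustrative.
    val data = Seq(
      (1.1, 0.5, 1.2),
      (1.9, 1.5, 0.3),
      (3.2, 2.5, 2.1),
      (3.8, 3.5, 0.7)
    ).toDF("y", "x1", "x2")

    // Fit the formula once.
    val rFormula = new RFormula().setFormula("y ~ x1 + x2")
    val rFormulaModel = rFormula.fit(data)

    // The fitted model can be used to inspect the output schema
    // (for example, to recover feature names) without fitting again.
    val schema = rFormulaModel.transform(data).schema
    println(schema.fieldNames.mkString(", "))

    val glr = new GeneralizedLinearRegression()
      .setFamily("gaussian")
      .setLink("identity")

    // Pass the already-fitted RFormulaModel as the first stage.
    // Pipeline.fit keeps transformer stages as-is, so only glr is fitted here.
    val stages = Array[PipelineStage](rFormulaModel, glr)
    val pipelineModel = new Pipeline().setStages(stages).fit(data)

    pipelineModel.transform(data).select("y", "prediction").show()
    spark.stop()
  }
}
```

Passing `rFormula` itself as the first stage would also work, but the pipeline would then fit the formula a second time on the same data.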
LGTM except one place where we can avoid re-fitting. @yanboliang Could you create follow-up JIRAs so we can add the missing feature back? Thanks!
@mengxr We already have SPARK-13925 to add summary statistics for SparkR glm. I will send a PR after this gets in.
Test build #55557 has finished for PR 12294 at commit
Test build #55605 has finished for PR 12294 at commit
LGTM. Merged into master. Thanks!
…lumn

## What changes were proposed in this pull request?
SparkR does not support the vector type, which is the default type of the feature column in ML. R's `predict` also does not output an intermediate feature column, so SparkR `predict` should not output the feature column either. This PR fixes the issue only for `naiveBayes` and `survreg`. `kmeans` already takes the right code path, and `glm` will be fixed in the SparkRWrapper refactor (#12294).

## How was this patch tested?
No new tests.

cc mengxr shivaram

Author: Yanbo Liang <[email protected]>

Closes #11958 from yanboliang/spark-14147.
What changes were proposed in this pull request?
formula, family, data, epsilon and maxit.
How was this patch tested?
Unit tests.
cc @mengxr @jkbradley @hhbyyh