[ML][PYTHON][SPARK-13008] Put one alg per line in pyspark.ml all lists #10927

jkbradley · 2016-01-26T20:17:16Z

This is to fix a long-time annoyance: Whenever we add a new algorithm to pyspark.ml, we have to add it to the __all__ list at the top. Since we keep it alphabetized, it often creates a lot more changes than needed. It is also easy to add the Estimator and forget the Model. I'm going to switch it to have one algorithm per line.

This also alphabetizes a few out-of-place classes in pyspark.ml.feature. No changes have been made to the moved classes.

CC: @thunterdb

SparkQA · 2016-01-26T20:37:20Z

Test build #50121 has finished for PR 10927 at commit bb0f1ef.

This patch fails Python style tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
- class ChiSqSelector(JavaEstimator, HasFeaturesCol, HasOutputCol, HasLabelCol):
- class ChiSqSelectorModel(JavaModel):
- class PCA(JavaEstimator, HasInputCol, HasOutputCol):
- class PCAModel(JavaModel):
- class RFormula(JavaEstimator, HasFeaturesCol, HasLabelCol):
- class RFormulaModel(JavaModel):

SparkQA · 2016-01-26T21:01:11Z

Test build #50123 has finished for PR 10927 at commit 639a562.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

mengxr · 2016-02-02T18:56:00Z

LGTM but it has merge conflicts with master now.

SparkQA · 2016-03-02T00:53:29Z

Test build #52270 has finished for PR 10927 at commit 12b15fb.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

mengxr · 2016-03-02T05:27:08Z

Merged into master. Thanks!

This is to fix a long-time annoyance: Whenever we add a new algorithm to pyspark.ml, we have to add it to the ```__all__``` list at the top. Since we keep it alphabetized, it often creates a lot more changes than needed. It is also easy to add the Estimator and forget the Model. I'm going to switch it to have one algorithm per line. This also alphabetizes a few out-of-place classes in pyspark.ml.feature. No changes have been made to the moved classes. CC: thunterdb Author: Joseph K. Bradley <[email protected]> Closes apache#10927 from jkbradley/ml-python-all-list.

reordered __all__ items to put one alg per line

12b15fb

jkbradley force-pushed the ml-python-all-list branch from 639a562 to 12b15fb Compare March 2, 2016 00:32

asfgit closed this in 9495c40 Mar 2, 2016

jkbradley deleted the ml-python-all-list branch March 8, 2016 18:52

yanboliang mentioned this pull request Mar 17, 2016

[SPARK-11940][PYSPARK][ML] Python API for ml.clustering.LDA #10242

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ML][PYTHON][SPARK-13008] Put one alg per line in pyspark.ml all lists #10927

[ML][PYTHON][SPARK-13008] Put one alg per line in pyspark.ml all lists #10927

Uh oh!

jkbradley commented Jan 26, 2016

Uh oh!

SparkQA commented Jan 26, 2016

Uh oh!

SparkQA commented Jan 26, 2016

Uh oh!

mengxr commented Feb 2, 2016

Uh oh!

SparkQA commented Mar 2, 2016

Uh oh!

mengxr commented Mar 2, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[ML][PYTHON][SPARK-13008] Put one alg per line in pyspark.ml all lists #10927

[ML][PYTHON][SPARK-13008] Put one alg per line in pyspark.ml all lists #10927

Uh oh!

Conversation

jkbradley commented Jan 26, 2016

Uh oh!

SparkQA commented Jan 26, 2016

Uh oh!

SparkQA commented Jan 26, 2016

Uh oh!

mengxr commented Feb 2, 2016

Uh oh!

SparkQA commented Mar 2, 2016

Uh oh!

mengxr commented Mar 2, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants