Skip to content

Commit 6468d6f

Browse files
huaxingaozhengruifeng
authored andcommitted
[SPARK-30776][ML] Support FValueSelector for continuous features and continuous labels
### What changes were proposed in this pull request? Add FValueRegressionSelector for continuous features and continuous labels. ### Why are the changes needed? Currently Spark only supports selection of categorical features, while there are many requirements for the selection of continuous distribution features. This PR adds FValueSelector for continuous features and continuous labels. ANOVASelector for continuous features and categorical labels will be added later using a separate PR. ### Does this PR introduce any user-facing change? Yes. Add a new Selector ### How was this patch tested? Add new tests Closes apache#27679 from huaxingao/spark_30776. Authored-by: Huaxin Gao <[email protected]> Signed-off-by: zhengruifeng <[email protected]>
1 parent 1426ad8 commit 6468d6f

File tree

5 files changed

+758
-13
lines changed

5 files changed

+758
-13
lines changed

mllib/src/main/scala/org/apache/spark/ml/feature/ChiSqSelector.scala

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ import org.apache.hadoop.fs.Path
2121

2222
import org.apache.spark.annotation.Since
2323
import org.apache.spark.ml._
24-
import org.apache.spark.ml.attribute.{AttributeGroup, _}
24+
import org.apache.spark.ml.attribute._
2525
import org.apache.spark.ml.linalg._
2626
import org.apache.spark.ml.param._
2727
import org.apache.spark.ml.param.shared._

0 commit comments

Comments
 (0)