-
Notifications
You must be signed in to change notification settings - Fork 42
Open
Milestone
Description
In order to find the best value for K (the number of clusters), it would be nice to get the variance of the distance of clustered points to their cluster's centroid.
Inspired by https://www.youtube.com/watch?v=4b5d3muPQmA
Also see https://en.wikipedia.org/wiki/Elbow_method_(clustering)
I also believe the current v3 implementation of RandomInitialization is wrong 🤷♂️
Proposed change
$result = (new Kmeans\Algorithm($init))->clusterize($points, $K);
echo $result->getTotalVariance();Metadata
Metadata
Assignees
Labels
No labels