documentation and numRuns warning change

FlytxtRnD · FlytxtRnD · commit c446c58da68b · 2015-07-08T17:37:32.000+05:30
diff --git a/docs/mllib-clustering.md b/docs/mllib-clustering.md
@@ -33,6 +33,7 @@ guaranteed to find a globally optimal solution, and when run multiple times on
 a given dataset, the algorithm returns the best clustering result).
 * *initializationSteps* determines the number of steps in the k-means\|\| algorithm.
 * *epsilon* determines the distance threshold within which we consider k-means to have converged.
+* *initialModel* is an optional set of cluster centers used for initialization. If this parameter is supplied, only one run is performed.
 
 **Examples**
 
diff --git a/mllib/src/main/scala/org/apache/spark/mllib/clustering/KMeans.scala b/mllib/src/main/scala/org/apache/spark/mllib/clustering/KMeans.scala
@@ -209,12 +209,15 @@ class KMeans private (
     val initStartTime = System.nanoTime()
 
     // Only one run is allowed when initialModel is given
-    val numRuns = if (initialModel.nonEmpty) 1 else runs
-    logWarning("Ignoring runs; one run is allowed when initialModel is given.")
+    val numRuns = if (initialModel.nonEmpty){
+      if (runs >1 ) logWarning("Ignoring runs; one run is allowed when initialModel is given.")
+      1
+    } else runs
+
 
     val centers = initialModel match {
       case Some(kMeansCenters) => {
-        Array(kMeansCenters.clusterCenters.map(s => new VectorWithNorm(s, Vectors.norm(s, 2.0))))
+        Array(kMeansCenters.clusterCenters.map(s => new VectorWithNorm(s)))
       }
       case None => {
         if (initializationMode == KMeans.RANDOM) {