@@ -1170,22 +1170,25 @@ class PowerIterationClustering(HasMaxIter, HasPredictionCol, JavaTransformer, Ja
11701170 is a symmetric matrix whose entries are non-negative similarities between items.
11711171 PIC takes this matrix (or graph) as an adjacency matrix. Specifically, each input row
11721172 includes:
1173- - :py:class:`idCol`: vertex ID
1174- - :py:class:`neighborsCol`: neighbors of vertex in :py:class:`idCol`
1175- - :py:class:`similaritiesCol`: non-negative weights (similarities) of edges between the
1176- vertex in :py:class:`idCol` and each neighbor in :py:class:`neighborsCol`
1177- PIC returns a cluster assignment for each input vertex. It appends a new column
1178- :py:class:`predictionCol` containing the cluster assignment in :py:class:`[0,k)` for
1179- each row (vertex).
1180-
1181- Notes:
1182- - [[PowerIterationClustering]] is a transformer with an expensive [[transform]] operation.
1173+
1174+ - :py:class:`idCol`: vertex ID
1175+ - :py:class:`neighborsCol`: neighbors of vertex in :py:class:`idCol`
1176+ - :py:class:`similaritiesCol`: non-negative weights (similarities) of edges between the
1177+ vertex in :py:class:`idCol` and each neighbor in :py:class:`neighborsCol`
1178+
1179+ PIC returns a cluster assignment for each input vertex. It appends a new column
1180+ :py:class:`predictionCol` containing the cluster assignment in :py:class:`[0,k)` for
1181+ each row (vertex).
1182+
1183+ Notes:
1184+
1185+ - [[PowerIterationClustering]] is a transformer with an expensive [[transform]] operation.
11831186 Transform runs the iterative PIC algorithm to cluster the whole input dataset.
1184- - Input validation: This validates that similarities are non-negative but does NOT validate
1187+ - Input validation: This validates that similarities are non-negative but does NOT validate
11851188 that the input matrix is symmetric.
11861189
1187- @see <a href=http://en.wikipedia.org/wiki/Spectral_clustering>
1188- Spectral clustering (Wikipedia)</a>
1190+ @see <a href=http://en.wikipedia.org/wiki/Spectral_clustering>
1191+ Spectral clustering (Wikipedia)</a>
11891192
11901193 >>> from pyspark.sql.types import ArrayType, DoubleType, LongType, StructField, StructType
11911194 >>> similarities = [((long)(1), [0], [0.5]), ((long)(2), [0, 1], [0.7,0.5]), \
0 commit comments