@@ -11,7 +11,7 @@ displayTitle: <a href="mllib-guide.html">MLlib</a> - Dimensionality Reduction
1111of reducing the number of variables under consideration.
1212It can be used to extract latent features from raw and noisy features
1313or compress data while maintaining the structure.
14- MLlib provides support for dimensionality reduction on tall-and-skinny matrices .
14+ MLlib provides support for dimensionality reduction on the RowMatrix class .
1515
1616## Singular value decomposition (SVD)
1717
@@ -39,8 +39,7 @@ If we keep the top $k$ singular values, then the dimensions of the resulting low
3939* ` $\Sigma$ ` : ` $k \times k$ ` ,
4040* ` $V$ ` : ` $n \times k$ ` .
4141
42- MLlib provides SVD functionality to row-oriented matrices that have only a few columns,
43- say, less than $1000$, but many rows, i.e., * tall-and-skinny* matrices.
42+ MLlib provides SVD functionality to row-oriented matrices, provided in the RowMatrix class.
4443
4544<div class =" codetabs " >
4645<div data-lang =" scala " markdown =" 1 " >
@@ -119,14 +118,13 @@ statistical method to find a rotation such that the first coordinate has the lar
119118possible, and each succeeding coordinate in turn has the largest variance possible. The columns of
120119the rotation matrix are called principal components. PCA is used widely in dimensionality reduction.
121120
122- MLlib supports PCA for tall-and-skinny matrices stored in row-oriented format.
121+ MLlib supports PCA for matrices stored in row-oriented format.
123122
124123<div class =" codetabs " >
125124<div data-lang =" scala " markdown =" 1 " >
126125
127- The following code demonstrates how to compute principal components on a tall-and-skinny ` RowMatrix `
126+ The following code demonstrates how to compute principal components on a ` RowMatrix `
128127and use them to project the vectors into a low-dimensional space.
129- The number of columns should be small, e.g, less than 1000.
130128
131129{% highlight scala %}
132130import org.apache.spark.mllib.linalg.Matrix
@@ -144,7 +142,7 @@ val projected: RowMatrix = mat.multiply(pc)
144142
145143<div data-lang =" java " markdown =" 1 " >
146144
147- The following code demonstrates how to compute principal components on a tall-and-skinny ` RowMatrix `
145+ The following code demonstrates how to compute principal components on a ` RowMatrix `
148146and use them to project the vectors into a low-dimensional space.
149147The number of columns should be small, e.g, less than 1000.
150148
0 commit comments