[MLlib] [SPARK-6713] Iterators in columnSimilarities for mapPartitionsWithIndex #5364

rezazadeh · 2015-04-05T09:02:59Z

Use Iterators in columnSimilarities to allow mapPartitionsWithIndex to spill to disk. This could happen in a dense and large column - this way Spark can spill the pairs onto disk instead of building all the pairs before handing them to Spark.

Another PR coming to update documentation.

srowen · 2015-04-05T09:24:05Z

Yeah that looks like a great change to avoid allocating so much memory at once.

AmplabJenkins · 2015-04-05T10:38:20Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/29721/
Test PASSed.

mengxr · 2015-04-06T20:15:43Z

Merged into master. Thanks!

This could happen during mapPartitionsWithIndex in a dense and large column - this way Spark can spill the pairs onto disk instead of building all the pairs before handing them to Spark. (See SPARK-6713, apache/spark#5364, from which this code is lifted.)

Iterators in columnSimilarities for flatMap

47c90ba

rezazadeh changed the title ~~[MLlib] [SPARK-6713] Iterators in columnSimilarities for flatMap~~ [MLlib] [SPARK-6713] Iterators in columnSimilarities for mapPartitionsWithIndex Apr 5, 2015

asfgit closed this in 30363ed Apr 6, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[MLlib] [SPARK-6713] Iterators in columnSimilarities for mapPartitionsWithIndex #5364

[MLlib] [SPARK-6713] Iterators in columnSimilarities for mapPartitionsWithIndex #5364

Uh oh!

rezazadeh commented Apr 5, 2015

Uh oh!

srowen commented Apr 5, 2015

Uh oh!

AmplabJenkins commented Apr 5, 2015

Uh oh!

mengxr commented Apr 6, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[MLlib] [SPARK-6713] Iterators in columnSimilarities for mapPartitionsWithIndex #5364

[MLlib] [SPARK-6713] Iterators in columnSimilarities for mapPartitionsWithIndex #5364

Uh oh!

Conversation

rezazadeh commented Apr 5, 2015

Uh oh!

srowen commented Apr 5, 2015

Uh oh!

AmplabJenkins commented Apr 5, 2015

Uh oh!

mengxr commented Apr 6, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants