@dongjoon-hyun
Member

What changes were proposed in this pull request?

This PR fixes DataFrame.describe() by forcing materialization of the result Seq so that it is serializable. Currently, on Scala 2.10, joining the DataFrame returned by describe() throws a Task not serializable Spark exception.
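To illustrate the idea of forcing materialization, here is a minimal sketch in plain Scala, without Spark. It is not the actual patch: `MaterializeSketch` and `isJavaSerializable` are hypothetical names, and the use of `.toArray.toSeq` to force evaluation is an assumption made for illustration. Whether the lazy value serializes depends on the Scala version, so the sketch only prints what it observes.

```scala
import java.io.{ByteArrayOutputStream, ObjectOutputStream}
import scala.util.Try

object MaterializeSketch {
  // Hypothetical helper: true if Java serialization of `value` succeeds.
  def isJavaSerializable(value: AnyRef): Boolean =
    Try {
      val out = new ObjectOutputStream(new ByteArrayOutputStream())
      try out.writeObject(value) finally out.close()
    }.isSuccess

  def main(args: Array[String]): Unit = {
    // On older Scala versions, Iterator#toSeq can return a lazily evaluated
    // Stream whose unevaluated tail is held as a closure.
    val lazySeq: Seq[Seq[Int]] = Iterator(1, 2, 3, 4).grouped(2).toSeq

    // Copying into an array evaluates every element, so the resulting Seq
    // is strict and no longer carries pending computation.
    val strictSeq: Seq[Seq[Int]] = lazySeq.toArray.toSeq

    println(s"lazy:   ${lazySeq.getClass.getName} serializable=${isJavaSerializable(lazySeq)}")
    println(s"strict: ${strictSeq.getClass.getName} serializable=${isJavaSerializable(strictSeq)}")
  }
}
```

The point of the sketch is the second `val`: a collection that is fully evaluated before it is handed to code that ships it to executors cannot drag a deferred closure along with it.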

How was this patch tested?

Manual. (After building with Scala 2.10, test on bin/spark-shell and bin/pyspark.)

@dongjoon-hyun
Member Author

cc @davies .
This is a PR for branch 1.6.

@SparkQA

SparkQA commented Jun 25, 2016

Test build #61220 has finished for PR 13902 at commit f114f08.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@davies
Contributor

davies commented Jun 25, 2016

Merged into 1.6

asfgit pushed a commit that referenced this pull request Jun 25, 2016
## What changes were proposed in this pull request?

This PR fixes `DataFrame.describe()` by forcing materialization to make the `Seq` serializable. Currently, `describe()` of `DataFrame` throws `Task not serializable` Spark exceptions when joining in Scala 2.10.

## How was this patch tested?

Manual. (After building with Scala 2.10, test on bin/spark-shell and bin/pyspark.)

Author: Dongjoon Hyun

Closes #13902 from dongjoon-hyun/SPARK-16173-branch-1.6.

@dongjoon-hyun
Member Author

Thank you, @davies !

@rxin
Contributor

rxin commented Jun 25, 2016

@dongjoon-hyun can you close the pr?

@dongjoon-hyun
Member Author

dongjoon-hyun commented Jun 25, 2016

Oh, sure. I forgot that it is not closed automatically.
Thank you for pinging me, @rxin .

zzcclp pushed a commit to zzcclp/spark that referenced this pull request Jun 27, 2016
Closes apache#13902 from dongjoon-hyun/SPARK-16173-branch-1.6.

(cherry picked from commit b7acc1b)

dongjoon-hyun deleted the SPARK-16173-branch-1.6 branch July 20, 2016 07:40