@ueshin (Member) commented Oct 17, 2017

What changes were proposed in this pull request?

This is a follow-up of #18732.
This PR modifies the GroupedData.apply() method to implicitly convert a pandas UDF to a grouped UDF.
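To illustrate the idea of the implicit conversion, here is a minimal pure-Python sketch. It is not Spark's implementation: `UdfType`, `Udf`, `as_grouped`, and `apply_to_groups` are hypothetical names invented for this example, and plain lists stand in for pandas data. It only shows the shape of the change, assuming that apply() accepts a vectorized UDF and retags it as a grouped one before evaluating it per group.

```python
class UdfType(object):
    """Hypothetical stand-in for the UDF type constants discussed in this PR."""
    ROW_AT_A_TIME = 0   # one call per row
    PANDAS_SCALAR = 1   # vectorized, one call per batch of values
    PANDAS_GROUPED = 2  # vectorized, one call per group


class Udf(object):
    """A function tagged with the way it should be evaluated."""
    def __init__(self, func, udf_type):
        self.func = func
        self.udf_type = udf_type


def as_grouped(udf):
    """Return a copy of a vectorized UDF retagged as a grouped UDF."""
    if udf.udf_type == UdfType.ROW_AT_A_TIME:
        raise ValueError("apply() needs a vectorized UDF, not a row-at-a-time one")
    return Udf(udf.func, UdfType.PANDAS_GROUPED)


def apply_to_groups(groups, udf):
    """Toy version of groupby().apply(): convert implicitly, then run per group."""
    grouped = as_grouped(udf)
    return {key: grouped.func(values) for key, values in groups.items()}


def subtract_mean(values):
    """Center one group's values around that group's mean."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]


# The caller declares an ordinary vectorized UDF; no grouped type is requested.
centered = Udf(subtract_mean, UdfType.PANDAS_SCALAR)
result = apply_to_groups({"a": [1.0, 3.0], "b": [5.0]}, centered)
print(result)  # {'a': [-1.0, 1.0], 'b': [0.0]}
```

In this sketch the original UDF object is left untouched and a retagged copy is used, so the same function could still be used as a scalar UDF elsewhere. Whether the actual patch copies or mutates is not stated in this conversation.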

How was this patch tested?

Existing tests.

SparkQA commented Oct 17, 2017

Test build #82841 has finished for PR 19517 at commit 7b386c4.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

SparkQA commented Oct 17, 2017

Test build #82842 has finished for PR 19517 at commit 7e43bb4.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@ueshin ueshin changed the title [SPARK-20396][SQL][PySpark][FOLLOW-UP] groupby().apply() with pandas udf [WIP][SPARK-20396][SQL][PySpark][FOLLOW-UP] groupby().apply() with pandas udf Oct 18, 2017
@gatorsmile (Member) commented

retest this please



class PythonUdfType(object):
    # row-based UDFs
Member

Nit: Please change "row-based UDFs" to "row-at-a-time UDFs" everywhere.

Member Author

Sure, I'll update it.

import org.apache.spark.sql.types.DataType

private[spark] object PythonUdfType {
  // row-based UDFs
Member

The same here.

Member Author

Sure, I'll update it, too.

@gatorsmile (Member) commented

LGTM.

@ueshin Could you remove [WIP] from the title of this PR?

@ueshin ueshin changed the title [WIP][SPARK-20396][SQL][PySpark][FOLLOW-UP] groupby().apply() with pandas udf [SPARK-20396][SQL][PySpark][FOLLOW-UP] groupby().apply() with pandas udf Oct 20, 2017
SparkQA commented Oct 20, 2017

Test build #82922 has finished for PR 19517 at commit 59d61a4.

  • This patch fails due to an unknown error code, -9.
  • This patch merges cleanly.
  • This patch adds no public classes.

SparkQA commented Oct 20, 2017

Test build #82921 has finished for PR 19517 at commit 7e43bb4.

  • This patch fails due to an unknown error code, -9.
  • This patch merges cleanly.
  • This patch adds no public classes.

@gatorsmile (Member) commented

retest this please

SparkQA commented Oct 20, 2017

Test build #82926 has finished for PR 19517 at commit 59d61a4.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@gatorsmile (Member) commented

retest this please

SparkQA commented Oct 20, 2017

Test build #82936 has finished for PR 19517 at commit 59d61a4.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@asfgit asfgit closed this in b8624b0 Oct 20, 2017