Skip to content

Conversation

@cloud-fan
Copy link
Contributor

No description provided.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the generic get and array method is useful in interpreted mode, but it's a lot painful to handle Any between scala and java, so I put this interface in scala.

@cloud-fan
Copy link
Contributor Author

cc @rxin

@SparkQA
Copy link

SparkQA commented Jul 28, 2015

Test build #38685 has finished for PR 7724 at commit 53cf9fc.

  • This patch fails RAT tests.
  • This patch does not merge cleanly.
  • This patch adds the following public classes (experimental):
    • abstract class ArrayData extends SpecializedGetters
    • class GenericArrayData(val array: Array[Any]) extends ArrayData

@cloud-fan cloud-fan changed the title [SPARK-9390][SQL][WIP] create a wrapper for array type [SPARK-9390][SQL] create a wrapper for array type Jul 28, 2015
@cloud-fan
Copy link
Contributor Author

cc @rxin, I underestimated this work, It took me a lot of time to cover all corner cases...

@SparkQA
Copy link

SparkQA commented Jul 28, 2015

Test build #38707 has finished for PR 7724 at commit 21d71c6.

  • This patch fails RAT tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • abstract class ArrayData extends SpecializedGetters with Serializable
    • class GenericArrayData(array: Array[Any]) extends ArrayData

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we should avoid using InternalRow in test directly, which means we are manually converting external type to internal type. Using Row and let converter do this work seems better.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That makes sense.

@SparkQA
Copy link

SparkQA commented Jul 28, 2015

Test build #38708 has finished for PR 7724 at commit 7efc4c6.

  • This patch fails RAT tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • abstract class ArrayData extends SpecializedGetters with Serializable
    • class GenericArrayData(array: Array[Any]) extends ArrayData

@SparkQA
Copy link

SparkQA commented Jul 28, 2015

Test build #38710 has finished for PR 7724 at commit 7394d0a.

  • This patch fails RAT tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • abstract class ArrayData extends SpecializedGetters with Serializable
    • class GenericArrayData(array: Array[Any]) extends ArrayData

@SparkQA
Copy link

SparkQA commented Jul 28, 2015

Test build #38718 has finished for PR 7724 at commit 1694fe6.

  • This patch fails Scala style tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • abstract class ArrayData extends SpecializedGetters with Serializable
    • class GenericArrayData(array: Array[Any]) extends ArrayData

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can you move this into its own file

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we need specialized version of toArray, and get rid of calling generic getter inside it. We can do it after adding special getter for map and removing generic getter.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

@SparkQA
Copy link

SparkQA commented Jul 29, 2015

Test build #38790 has finished for PR 7724 at commit 485e1d9.

  • This patch fails to build.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Jul 29, 2015

Test build #38797 has finished for PR 7724 at commit 08b6cfd.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • abstract class ArrayData extends SpecializedGetters with Serializable
    • class GenericArrayData(array: Array[Any]) extends ArrayData

@SparkQA
Copy link

SparkQA commented Jul 29, 2015

Test build #38804 has finished for PR 7724 at commit b66ed9b.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • abstract class ArrayData extends SpecializedGetters with Serializable
    • class GenericArrayData(array: Array[Any]) extends ArrayData

@SparkQA
Copy link

SparkQA commented Jul 29, 2015

Test build #38818 has finished for PR 7724 at commit 9952110.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • abstract class ArrayData extends SpecializedGetters with Serializable
    • class GenericArrayData(array: Array[Any]) extends ArrayData

@SparkQA
Copy link

SparkQA commented Jul 29, 2015

Test build #38824 has finished for PR 7724 at commit ab98286.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • abstract class ArrayData extends SpecializedGetters with Serializable
    • class GenericArrayData(array: Array[Any]) extends ArrayData

@cloud-fan
Copy link
Contributor Author

retest this please.

@SparkQA
Copy link

SparkQA commented Jul 29, 2015

Test build #145 has finished for PR 7724 at commit ab98286.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Jul 29, 2015

Test build #38831 has finished for PR 7724 at commit ab98286.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • abstract class ArrayData extends SpecializedGetters with Serializable
    • class GenericArrayData(array: Array[Any]) extends ArrayData

@SparkQA
Copy link

SparkQA commented Jul 29, 2015

Test build #38846 has finished for PR 7724 at commit e9f2fc6.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • abstract class ArrayData extends SpecializedGetters with Serializable
    • class GenericArrayData(array: Array[Any]) extends ArrayData

@SparkQA
Copy link

SparkQA commented Jul 29, 2015

Test build #38844 has finished for PR 7724 at commit 0772ca0.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Jul 30, 2015

Test build #38931 has finished for PR 7724 at commit 3a0147d.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • abstract class ArrayData extends SpecializedGetters with Serializable
    • class GenericArrayData(array: Array[Any]) extends ArrayData

@cloud-fan
Copy link
Contributor Author

cc @rxin I think it's close, there is still a pyspark doc string test failue:

File "/home/jenkins/workspace/SparkPullRequestBuilder/python/pyspark/ml/recommendation.py", line 73, in __main__.ALS
Failed example:
    model.userFactors.orderBy("id").collect()
Expected:
    [Row(id=0, features=[...]), Row(id=1, ...), Row(id=2, ...)]
Got:
    [Row(id=0, features=(None, None, None, None, None, None, None, None, None, None)), Row(id=1, features=(None, None, None, None, None, None, None, None, None, None)), Row(id=2, features=(None, None, None, None, None, None, None, None, None, None))]

Does any body have an idea on it?

One possible reason I guess is: before ArrayData, we use ArraySeq as external array value, but now we use WrappedArray, will this make difference?

@rxin
Copy link
Contributor

rxin commented Jul 30, 2015

cc @davies can you take a look?

@rxin
Copy link
Contributor

rxin commented Jul 30, 2015

@SparkQA
Copy link

SparkQA commented Jul 30, 2015

Test build #38981 has finished for PR 7724 at commit a24c0a5.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • abstract class ArrayData extends SpecializedGetters with Serializable
    • class GenericArrayData(array: Array[Any]) extends ArrayData

@cloud-fan
Copy link
Contributor Author

retest this please.

@SparkQA
Copy link

SparkQA commented Jul 30, 2015

Test build #153 has finished for PR 7724 at commit a24c0a5.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Jul 30, 2015

Test build #38990 has finished for PR 7724 at commit a24c0a5.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • abstract class ArrayData extends SpecializedGetters with Serializable
    • class GenericArrayData(array: Array[Any]) extends ArrayData

@cloud-fan
Copy link
Contributor Author

retest this please.

@SparkQA
Copy link

SparkQA commented Jul 30, 2015

Test build #157 has finished for PR 7724 at commit a24c0a5.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Jul 30, 2015

Test build #39007 has finished for PR 7724 at commit a24c0a5.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • abstract class ArrayData extends SpecializedGetters with Serializable
    • class GenericArrayData(array: Array[Any]) extends ArrayData

@SparkQA
Copy link

SparkQA commented Jul 30, 2015

Test build #39025 has finished for PR 7724 at commit d0408a1.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@cloud-fan
Copy link
Contributor Author

cc @rxin, I'll specialize toArray when working on MapData.

@rxin
Copy link
Contributor

rxin commented Jul 30, 2015

Thanks - I've merged this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants