@jkbradley
Member

What changes were proposed in this pull request?

Adds structured streaming tests using testTransformer for these suites:

  • IDF
  • Imputer
  • Interaction
  • MaxAbsScaler
  • MinHashLSH
  • MinMaxScaler
  • NGram

How was this patch tested?

It is a bunch of tests!

Contributor

Why does the "value" column use the java.lang.Double type?

Member Author

Because it's nullable (a Scala Double cannot hold null).
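
To illustrate the point about nullability, here is a minimal sketch in plain Scala with no Spark dependency. The object name, helper names, and row values are hypothetical and are not taken from the PR. It only shows that the boxed java.lang.Double can carry null, which a primitive Scala Double cannot.

```scala
// A minimal sketch; NullableValueSketch and its rows are hypothetical,
// not the data used in the PR's test suites.
object NullableValueSketch {
  // java.lang.Double is a reference type, so a missing value can be null.
  // Declaring the second element as a Scala Double would reject the null row
  // at compile time.
  val rows: Seq[(Int, java.lang.Double)] = Seq[(Int, java.lang.Double)](
    (0, java.lang.Double.valueOf(1.0)),
    (1, java.lang.Double.valueOf(3.0)),
    (2, null)
  )

  // Count the rows whose value is missing.
  def missingCount(data: Seq[(Int, java.lang.Double)]): Int =
    data.count(_._2 == null)

  // Convert the nullable boxed values into Options for safer handling.
  def toOptions(data: Seq[(Int, java.lang.Double)]): Seq[(Int, Option[Double])] =
    data.map { case (id, v) => (id, Option(v).map(_.doubleValue())) }

  def main(args: Array[String]): Unit = {
    println(missingCount(rows))
    println(toOptions(rows))
  }
}
```

Option(v) maps a null reference to None, so code downstream of the conversion never has to compare against null directly.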

Contributor

Use vector1 === vector2 here.
Also, the error message looks strange. How about s"scaled value $vector2 does not equal expected value $vector1"?

Contributor

Why not have an "expected" column here to compare against?

Member Author

I don't think that's necessary for testing that this works with structured streaming. (I can't see how streaming would mess up the correctness of the algorithm.)

Contributor

vector1 === vector2

Member Author

@jkbradley jkbradley Apr 3, 2018

True, === is more standard (though both are fine)

Contributor

Just curious: why change Seq[String] to Seq[_]? Does it fix a potential issue?

Member Author

String is not actually checked because of erasure, so IntelliJ complained with a style warning before this change.
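
The erasure point can be sketched in plain Scala. The object and method names below are hypothetical and are not the code changed in this PR. The sketch assumes only standard pattern-matching behavior: the element type of a Seq is erased at runtime, so a Seq[String] pattern accepts any Seq.

```scala
// A minimal sketch; ErasureSketch and its methods are hypothetical helpers.
object ErasureSketch {
  // The String type argument is erased at runtime, so this pattern matches any Seq.
  // @unchecked suppresses the compiler's unchecked warning for the illustration.
  def looksLikeStrings(value: Any): Boolean = value match {
    case _: Seq[String @unchecked] => true
    case _ => false
  }

  // Seq[_] states honestly what the match can verify: only that the value is a Seq.
  def describe(value: Any): String = value match {
    case s: Seq[_] => s"sequence of ${s.length} element(s)"
    case other => s"not a sequence: $other"
  }

  def main(args: Array[String]): Unit = {
    println(looksLikeStrings(Seq(1, 2, 3))) // matches even though the elements are Ints
    println(describe(Seq("a", "b")))
    println(describe("abc"))
  }
}
```

Writing the pattern as Seq[_] does not change runtime behavior. It removes the false impression that the element type is being checked.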

@SparkQA

SparkQA commented Apr 3, 2018

Test build #88851 has finished for PR 20964 at commit 6dff94e.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@WeichenXu123
Contributor

LGTM. 👍

@jkbradley
Member Author

Thanks! I'll rerun tests since they are stale and merge after they pass.

@SparkQA

SparkQA commented Apr 11, 2018

Test build #4151 has finished for PR 20964 at commit 6dff94e.

  • This patch passes all tests.
  • This patch does not merge cleanly.
  • This patch adds no public classes.

@jkbradley
Member Author

I rebased off of master because of the merge warning in the last tests.
I did not have to resolve any conflicts.
I'll merge this once tests pass.

@SparkQA

SparkQA commented Apr 11, 2018

Test build #89174 has finished for PR 20964 at commit 592670c.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@jkbradley
Member Author

Merging with master

jkbradley added a commit to jkbradley/spark that referenced this pull request Apr 11, 2018

Author: Joseph K. Bradley

Closes apache#20964 from jkbradley/SPARK-22883-part2.
@asfgit asfgit closed this in 75a1830 Apr 11, 2018
@jkbradley jkbradley deleted the SPARK-22883-part2 branch April 11, 2018 17:06
asfgit pushed a commit that referenced this pull request Apr 11, 2018
This backports #20964 to branch-2.3.

Author: Joseph K. Bradley

Closes #21042 from jkbradley/SPARK-22883-part2-2.3backport.