[SPARK-35981][PYTHON][TEST] Use check_exact=False to loosen the check precision #33179

ueshin · 2021-07-01T22:30:16Z

What changes were proposed in this pull request?

We should use check_exact=False because the value check in StatsTest.test_cov_corr_meta is too strict.

Why are the changes needed?

In some environment, the precision could be different in pandas' DataFrame.corr function and the test StatsTest.test_cov_corr_meta fails.

AssertionError: DataFrame.iloc[:, 0] (column name="a") are different
DataFrame.iloc[:, 0] (column name="a") values are different (14.28571 %)
[index]: [a, b, c, d, e, f, g]
[left]:  [1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0]
[right]: [1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 4.807406715958909e-17]

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Modified tests should still pass.

ueshin · 2021-07-01T22:32:44Z

cc @HyukjinKwon @itholic @xinrong-databricks

SparkQA · 2021-07-01T23:15:47Z

Test build #140540 has finished for PR 33179 at commit 7e46edd.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2021-07-01T23:39:21Z

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45053/

SparkQA · 2021-07-02T00:15:33Z

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45053/

HyukjinKwon · 2021-07-02T08:58:06Z

Merged to master.

… precision ### What changes were proposed in this pull request? We should use `check_exact=False` because the value check in `StatsTest.test_cov_corr_meta` is too strict. ### Why are the changes needed? In some environment, the precision could be different in pandas' `DataFrame.corr` function and the test `StatsTest.test_cov_corr_meta` fails. ``` AssertionError: DataFrame.iloc[:, 0] (column name="a") are different DataFrame.iloc[:, 0] (column name="a") values are different (14.28571 %) [index]: [a, b, c, d, e, f, g] [left]: [1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0] [right]: [1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 4.807406715958909e-17] ``` ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Modified tests should still pass. Closes apache#33179 from ueshin/issuse/SPARK-35981/corr. Authored-by: Takuya UESHIN <[email protected]> Signed-off-by: Hyukjin Kwon <[email protected]>

…check precision ### What changes were proposed in this pull request? This is a cherry-pick of #33179. We should use `check_exact=False` because the value check in `StatsTest.test_cov_corr_meta` is too strict. ### Why are the changes needed? In some environment, the precision could be different in pandas' `DataFrame.corr` function and the test `StatsTest.test_cov_corr_meta` fails. ``` AssertionError: DataFrame.iloc[:, 0] (column name="a") are different DataFrame.iloc[:, 0] (column name="a") values are different (14.28571 %) [index]: [a, b, c, d, e, f, g] [left]: [1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0] [right]: [1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 4.807406715958909e-17] ``` ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Modified tests should still pass. Closes #33193 from ueshin/issuse/SPARK-35981/3.2/corr. Authored-by: Takuya UESHIN <[email protected]> Signed-off-by: Dongjoon Hyun <[email protected]>

Use check_exact=False to loosen the check precision.

7e46edd

github-actions bot added CORE PYTHON labels Jul 1, 2021

HyukjinKwon approved these changes Jul 2, 2021

View reviewed changes

HyukjinKwon closed this in 7769644 Jul 2, 2021

ueshin mentioned this pull request Jul 2, 2021

[SPARK-35981][PYTHON][TEST][3.2] Use check_exact=False to loosen the check precision #33193

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-35981][PYTHON][TEST] Use check_exact=False to loosen the check precision #33179

[SPARK-35981][PYTHON][TEST] Use check_exact=False to loosen the check precision #33179

Uh oh!

ueshin commented Jul 1, 2021 •

edited

Loading

Uh oh!

ueshin commented Jul 1, 2021

Uh oh!

SparkQA commented Jul 1, 2021

Uh oh!

SparkQA commented Jul 1, 2021

Uh oh!

SparkQA commented Jul 2, 2021

Uh oh!

HyukjinKwon commented Jul 2, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[SPARK-35981][PYTHON][TEST] Use check_exact=False to loosen the check precision #33179

[SPARK-35981][PYTHON][TEST] Use check_exact=False to loosen the check precision #33179

Uh oh!

Conversation

ueshin commented Jul 1, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

ueshin commented Jul 1, 2021

Uh oh!

SparkQA commented Jul 1, 2021

Uh oh!

SparkQA commented Jul 1, 2021

Uh oh!

SparkQA commented Jul 2, 2021

Uh oh!

HyukjinKwon commented Jul 2, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ueshin commented Jul 1, 2021 •

edited

Loading