[SPARK-32907][ML][PYTHON] Adaptively blockify instances - AFT,LiR,LoR #30355

zhengruifeng · 2020-11-12T12:26:34Z

What changes were proposed in this pull request?

use maxBlockSizeInMB instead of blockSize (#rows) to control the stacking of vectors;

Why are the changes needed?

the performance gain is mainly related to the nnz of block.

Does this PR introduce any user-facing change?

yes, param blockSize -> blockSizeInMB in master

How was this patch tested?

updated testsuites

rename param add comment

SparkQA · 2020-11-12T13:45:13Z

Test build #131000 has finished for PR 30355 at commit f8782a4.

This patch fails PySpark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2020-11-12T14:13:10Z

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/35606/

SparkQA · 2020-11-12T14:35:27Z

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/35606/

SparkQA · 2020-11-13T02:30:16Z

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/35634/

SparkQA · 2020-11-13T02:53:40Z

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/35634/

SparkQA · 2020-11-13T03:06:46Z

Test build #131028 has finished for PR 30355 at commit 5f9e70f.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

WeichenXu123

Let's document the default 0.0 behavior (set optimal block size, current 1MB) in the HasMaxBlockSizeInMB param doc.
otherwise LGTM.

SparkQA · 2020-11-18T06:27:57Z

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/35855/

SparkQA · 2020-11-18T06:50:28Z

Test build #131252 has finished for PR 30355 at commit 48b2814.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2020-11-18T06:55:57Z

Kubernetes integration test status failure
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/35855/

SparkQA · 2020-11-18T09:25:01Z

Test build #131268 has finished for PR 30355 at commit 317bde9.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2020-11-18T10:20:18Z

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/35871/

SparkQA · 2020-11-18T10:50:01Z

Kubernetes integration test status failure
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/35871/

WeichenXu123 · 2020-11-18T15:01:39Z

LGTM.

WeichenXu123 · 2020-11-18T15:03:35Z

Merge to master.

zhengruifeng · 2020-11-19T01:19:35Z

@WeichenXu123 Thank you so much!

aft_lir_lor

f8782a4

rename param add comment

zhengruifeng added ML PYSPARK PYTHON labels Nov 12, 2020

github-actions bot added CORE MLLIB labels Nov 12, 2020

zhengruifeng removed the CORE label Nov 13, 2020

fix py lir

5f9e70f

github-actions bot added the CORE label Nov 13, 2020

HyukjinKwon changed the title ~~[SPARK-32907][ML][PYTHON] adaptively blockify instances - AFT,LiR,LoR~~ [SPARK-32907][ML][PYTHON] Adaptively blockify instances - AFT,LiR,LoR Nov 13, 2020

WeichenXu123 requested changes Nov 17, 2020

View reviewed changes

update doc

48b2814

use local variable

317bde9

WeichenXu123 approved these changes Nov 18, 2020

View reviewed changes

WeichenXu123 closed this in 689c294 Nov 18, 2020

zhengruifeng deleted the adaptively_blockify_aft_lir_lor branch November 19, 2020 01:19

zhengruifeng mentioned this pull request Dec 18, 2020

[SPARK-31454][ML] An optimized K-Means based on DenseMatrix and GEMM #28229

Closed

[SPARK-32907][ML][PYTHON] Adaptively blockify instances - AFT,LiR,LoR #30355

[SPARK-32907][ML][PYTHON] Adaptively blockify instances - AFT,LiR,LoR #30355

Uh oh!

Conversation

zhengruifeng commented Nov 12, 2020

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

SparkQA commented Nov 12, 2020

Uh oh!

SparkQA commented Nov 12, 2020

Uh oh!

SparkQA commented Nov 12, 2020

Uh oh!

SparkQA commented Nov 13, 2020

Uh oh!

SparkQA commented Nov 13, 2020

Uh oh!

SparkQA commented Nov 13, 2020

Uh oh!

WeichenXu123 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Nov 18, 2020

Uh oh!

SparkQA commented Nov 18, 2020

Uh oh!

SparkQA commented Nov 18, 2020

Uh oh!

SparkQA commented Nov 18, 2020

Uh oh!

SparkQA commented Nov 18, 2020

Uh oh!

SparkQA commented Nov 18, 2020

Uh oh!

WeichenXu123 commented Nov 18, 2020

Uh oh!

WeichenXu123 commented Nov 18, 2020

Uh oh!

zhengruifeng commented Nov 19, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

WeichenXu123 left a comment •

edited

Loading