[SPARK-26003][SQL][2.4] Improve SQLAppStatusListener.aggregateMetrics performance #25860

gatorsmile · 2019-09-19T21:59:24Z

This PR cherry-picks #23002 to Spark 2.4.

What changes were proposed in this pull request?

In SQLAppStatusListener.aggregateMetrics, metricIds is used only to filter the relevant metrics, yet it is built as a Seq and also sorted. Each membership check against a Seq is a linear scan, so the filtering can become quite inefficient when many metrics are involved. This PR proposes using a Set instead.
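To illustrate the idea, here is a minimal sketch of the difference between filtering with a Seq and filtering with a Set. It is not the actual patch: the object name `MetricFilterSketch`, the `Update` type alias, and both helper functions are hypothetical stand-ins, and the real listener works on Spark's own accumulator update types.

```scala
object MetricFilterSketch {
  // Hypothetical stand-in for an accumulator update: (accumulatorId, value).
  type Update = (Long, Long)

  // Seq-based filtering: every contains() call scans the Seq linearly,
  // so the cost grows with (number of updates) x (number of metric ids).
  def filterWithSeq(metricIds: Seq[Long], updates: Seq[Update]): Seq[Update] =
    updates.filter { case (id, _) => metricIds.contains(id) }

  // Set-based filtering: contains() is a hash lookup, effectively constant
  // time per update, and no sorting of the ids is needed.
  def filterWithSet(metricIds: Set[Long], updates: Seq[Update]): Seq[Update] =
    updates.filter { case (id, _) => metricIds.contains(id) }

  def main(args: Array[String]): Unit = {
    val ids = Seq(3L, 1L, 2L)
    val updates = Seq((1L, 10L), (7L, 99L), (3L, 5L))

    val viaSeq = filterWithSeq(ids.sorted, updates)
    val viaSet = filterWithSet(ids.toSet, updates)

    // Both variants keep the same updates; only the lookup cost differs.
    assert(viaSeq == viaSet)
    println(viaSet)
  }
}
```

Because the ids are only used for membership tests, neither the ordering nor the duplicates of a Seq carry any information here, which is why a Set is a drop-in replacement in this sketch.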

How was this patch tested?

NA

Closes #23002 from mgaido91/SPARK-26003.

## What changes were proposed in this pull request?

In `SQLAppStatusListener.aggregateMetrics`, we use the `metricIds` only to filter the relevant metrics. And this is a Seq which is also sorted. When there are many metrics involved, this can be pretty inefficient. The PR proposes to use a Set for it.

## How was this patch tested?

NA

Closes apache#23002 from mgaido91/SPARK-26003.

Authored-by: Marco Gaido <[email protected]>
Signed-off-by: Wenchen Fan <[email protected]>

gatorsmile · 2019-09-19T22:00:36Z

cc @mgaido91 @zsxwing @cloud-fan

zsxwing · 2019-09-19T22:01:41Z

LGTM

dongjoon-hyun

+1, LGTM.

SparkQA · 2019-09-20T01:48:21Z

Test build #111027 has finished for PR 25860 at commit 39c1bcd.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

dongjoon-hyun · 2019-09-20T02:21:19Z

Thank you all! Merged to branch-2.4.

… performance

This PR is to cherry-pick #23002 to Spark 2.4

---

## What changes were proposed in this pull request?

In `SQLAppStatusListener.aggregateMetrics`, we use the `metricIds` only to filter the relevant metrics. And this is a Seq which is also sorted. When there are many metrics involved, this can be pretty inefficient. The PR proposes to use a Set for it.

## How was this patch tested?

NA

Closes #23002 from mgaido91/SPARK-26003.

Closes #25860 from gatorsmile/cherrypickSPARK-26003.

Authored-by: Marco Gaido <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>

mgaido91 · 2019-09-20T07:21:18Z

a late LGTM, thanks @gatorsmile @dongjoon-hyun

dongjoon-hyun changed the title ~~[SPARK-26003] [Backport-2.4] Improve SQLAppStatusListener.aggregateMetrics performance~~ [SPARK-26003][SQL][2.4] Improve SQLAppStatusListener.aggregateMetrics performance Sep 19, 2019

dongjoon-hyun added the SQL label Sep 19, 2019

dongjoon-hyun approved these changes Sep 19, 2019

viirya approved these changes Sep 19, 2019

maropu approved these changes Sep 20, 2019

dongjoon-hyun closed this Sep 20, 2019
