[SPARK-23413][UI] Fix sorting tasks by Host / Executor ID at the Stage page #20601

attilapiros · 2018-02-13T18:23:20Z

What changes were proposed in this pull request?

Fixing exception got at sorting tasks by Host / Executor ID:

        java.lang.IllegalArgumentException: Invalid sort column: Host
	at org.apache.spark.ui.jobs.ApiHelper$.indexName(StagePage.scala:1017)
	at org.apache.spark.ui.jobs.TaskDataSource.sliceData(StagePage.scala:694)
	at org.apache.spark.ui.PagedDataSource.pageData(PagedTable.scala:61)
	at org.apache.spark.ui.PagedTable$class.table(PagedTable.scala:96)
	at org.apache.spark.ui.jobs.TaskPagedTable.table(StagePage.scala:708)
	at org.apache.spark.ui.jobs.StagePage.liftedTree1$1(StagePage.scala:293)
	at org.apache.spark.ui.jobs.StagePage.render(StagePage.scala:282)
	at org.apache.spark.ui.WebUI$$anonfun$2.apply(WebUI.scala:82)
	at org.apache.spark.ui.WebUI$$anonfun$2.apply(WebUI.scala:82)
	at org.apache.spark.ui.JettyUtils$$anon$3.doGet(JettyUtils.scala:90)
	at javax.servlet.http.HttpServlet.service(HttpServlet.java:687)
	at javax.servlet.http.HttpServlet.service(HttpServlet.java:790)
	at org.spark_project.jetty.servlet.ServletHolder.handle(ServletHolder.java:848)
	at org.spark_project.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:584)

Moreover some refactoring to avoid similar problems by introducing constants for each header name and reusing them at the identification of the corresponding sorting index.

How was this patch tested?

Manually:

squito · 2018-02-13T20:32:35Z

core/src/main/scala/org/apache/spark/ui/jobs/StagePage.scala

+    HEADER_STATUS -> TaskIndexNames.STATUS,
+    HEADER_LOCALITY -> TaskIndexNames.LOCALITY,
+    HEADER_EXECUTOR -> TaskIndexNames.EXECUTOR,
+    HEADER_HOST -> TaskIndexNames.EXECUTOR,


sorting by host and executor is not the same ... you might have executors 1 & 5 on host A, and execs 2,3,4 on host B.

The 2.2 UI had both executor and host in the same column: https://github.com/apache/spark/blob/branch-2.2/core/src/main/scala/org/apache/spark/ui/jobs/StagePage.scala#L1203

I think we either need to go back to having one column for both, or add an index on host.

thoughts @vanzin ?

Seems we'd better have a new TaskIndexNames for host column.

Hmmm... I agree that the correct thing would be to sort by host and have an index on that. The problem is that we'd be changing the data on disk, breaking compatibility with previous versions of the disk store. So unless that change goes into 2.3.0, that means revving the disk version number, which would require re-parsing all logs. And that kinda sucks. (I hope by the next major version I - or someone - get time to better investigate versioning of the disk data.)

Given this affects 2.3 we could potentially consider it a blocker. @sameeragarwal probably won't be very happy though.

2.2 actually sorts by executor id, and doesn't have a separate host column (added in SPARK-21675). That's one of these small changes I missed while merging all the SHS stuff.

another alternative is to disable sorting by host, and just fix sorting by executor. That could go into 2.3.1 without breaking compatibility.

I think that is good idea. I can extend taskHeadersAndCssClasses to store Tuple3 objects where the additional Boolean property flags whether the column is sortable. And for a not sortable column we are skipping the headerLink.

or even go back to the 2.2 behavior, with executor & host in the same column.

I do think having a separate column for host, and having it be sortable, is actually better ... but just trying to think of simple solutions.

Looks like we'll have a new RC, so I'll jump in the bandwagon and mark this one a blocker too. We can then add the new index in 2.3.0.

SparkQA · 2018-02-13T21:53:41Z

Test build #87411 has finished for PR 20601 at commit d51602f.

This patch fails PySpark unit tests.
This patch merges cleanly.
This patch adds no public classes.

jiangxb1987 · 2018-02-14T08:23:37Z

core/src/main/scala/org/apache/spark/ui/jobs/StagePage.scala

+  val HEADER_INPUT_SIZE = "Input Size / Records"
+  val HEADER_OUTPUT_SIZE = "Output Size / Records"
+  val HEADER_SHUFFLE_READ_TIME = "Shuffle Read Blocked Time"
+  val HEADER_SHUFFLE_TOTAL_READS = "Shuffle Read Size / Records"


nit: HEADER_SHUFFLE_TOTAL_READS -> HEADER_SHUFFLE_READ_SIZE ?

In the header constants naming I have followed the existing task index names:

HEADER_SHUFFLE_TOTAL_READS -> TaskIndexNames.SHUFFLE_TOTAL_READS,

vanzin · 2018-02-14T10:08:35Z

I'm trying to think of a way that we can avoid these issues going forward - not that I expect this code to change much. Maybe have a unit test that makes sure all declared header constants are mapped to some index, or something like that, and fails if you add a new header constant without a mapping.

vanzin · 2018-02-15T13:21:28Z

@attilapiros could you take a look at the test case Ryan added in #20615 and add something like that to your patch? It'd be nice to catch these things in unit tests.

attilapiros · 2018-02-15T14:13:46Z

Yes, of course. The test of @zsxwing is perfect to avoid similar problems in the future.

vanzin · 2018-02-15T15:28:54Z

LGTM pending tests.

SparkQA · 2018-02-15T16:00:21Z

Test build #87477 has finished for PR 20601 at commit c8ef968.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

attilapiros · 2018-02-15T16:26:27Z

jenkins retest this please

SparkQA · 2018-02-15T17:50:24Z

Test build #87478 has finished for PR 20601 at commit 22179e8.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

vanzin · 2018-02-15T17:53:57Z

ah, flaky tests. retest this please

zsxwing · 2018-02-15T19:06:39Z

LGTM

SparkQA · 2018-02-15T19:29:17Z

Test build #87481 has finished for PR 20601 at commit 22179e8.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

squito · 2018-02-15T19:50:36Z

Everything that might have changed from this has passed, the failures are known flaky tests:

https://issues.apache.org/jira/browse/SPARK-23369

https://issues.apache.org/jira/browse/SPARK-23390

merging to master / 2.3

…e page ## What changes were proposed in this pull request? Fixing exception got at sorting tasks by Host / Executor ID: ``` java.lang.IllegalArgumentException: Invalid sort column: Host at org.apache.spark.ui.jobs.ApiHelper$.indexName(StagePage.scala:1017) at org.apache.spark.ui.jobs.TaskDataSource.sliceData(StagePage.scala:694) at org.apache.spark.ui.PagedDataSource.pageData(PagedTable.scala:61) at org.apache.spark.ui.PagedTable$class.table(PagedTable.scala:96) at org.apache.spark.ui.jobs.TaskPagedTable.table(StagePage.scala:708) at org.apache.spark.ui.jobs.StagePage.liftedTree1$1(StagePage.scala:293) at org.apache.spark.ui.jobs.StagePage.render(StagePage.scala:282) at org.apache.spark.ui.WebUI$$anonfun$2.apply(WebUI.scala:82) at org.apache.spark.ui.WebUI$$anonfun$2.apply(WebUI.scala:82) at org.apache.spark.ui.JettyUtils$$anon$3.doGet(JettyUtils.scala:90) at javax.servlet.http.HttpServlet.service(HttpServlet.java:687) at javax.servlet.http.HttpServlet.service(HttpServlet.java:790) at org.spark_project.jetty.servlet.ServletHolder.handle(ServletHolder.java:848) at org.spark_project.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:584) ``` Moreover some refactoring to avoid similar problems by introducing constants for each header name and reusing them at the identification of the corresponding sorting index. ## How was this patch tested? Manually: ![screen shot 2018-02-13 at 18 57 10](https://user-images.githubusercontent.com/2017933/36166532-1cfdf3b8-10f3-11e8-8d32-5fcaad2af214.png) Author: “attilapiros” <[email protected]> Closes apache#20601 from attilapiros/SPARK-23413. (cherry picked from commit 1dc2c1d)

squito · 2018-02-15T19:59:31Z

ack I merged to master but screwed up on 2.3 -- fixing that here: #20623

SparkQA · 2018-02-15T20:57:08Z

Test build #87487 has finished for PR 20601 at commit 22179e8.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

initial version

d51602f

squito reviewed Feb 13, 2018

View reviewed changes

jiangxb1987 reviewed Feb 14, 2018

View reviewed changes

vanzin mentioned this pull request Feb 15, 2018

[SPARK-23430][WebUI]ApiHelper.COLUMN_TO_INDEX should match headers of the task table #20615

Closed

adding host as an index

c8ef968

Adding @zsxwing test code

22179e8

asfgit closed this in 1dc2c1d Feb 15, 2018

squito mentioned this pull request Feb 15, 2018

[SPARK-23413][UI] Fix sorting tasks by Host / Executor ID at the Stag… #20623

Closed

attilapiros deleted the SPARK-23413 branch April 26, 2018 20:07

[SPARK-23413][UI] Fix sorting tasks by Host / Executor ID at the Stage page #20601

[SPARK-23413][UI] Fix sorting tasks by Host / Executor ID at the Stage page #20601

Uh oh!

Conversation

attilapiros commented Feb 13, 2018

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Feb 13, 2018

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

vanzin commented Feb 14, 2018

Uh oh!

vanzin commented Feb 15, 2018

Uh oh!

attilapiros commented Feb 15, 2018

Uh oh!

vanzin commented Feb 15, 2018

Uh oh!

SparkQA commented Feb 15, 2018

Uh oh!

attilapiros commented Feb 15, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SparkQA commented Feb 15, 2018

Uh oh!

vanzin commented Feb 15, 2018

Uh oh!

zsxwing commented Feb 15, 2018

Uh oh!

SparkQA commented Feb 15, 2018

Uh oh!

squito commented Feb 15, 2018

Uh oh!

squito commented Feb 15, 2018

Uh oh!

SparkQA commented Feb 15, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

attilapiros commented Feb 15, 2018 •

edited

Loading