[SPARK-21809] : Change Stage Page to use datatables to support sorting columns and searching #19270

pgandhi999 · 2017-09-18T20:59:28Z

Support column sort and search for Stage Page using jQuery DataTable and REST API. Before this commit, the Stage page generated a hard-coded HTML table that could not support search. Supporting search and sort (over all applications rather than the 20 entries in the current page) in any case will greatly improve the user experience.
Created the stagespage-template.html for displaying application information in datables. Added REST api endpoint and javascript code to fetch data from the endpoint and display it on the data table.
Because of the above change, certain functionalities in the page had to be modified to support the addition of datatables. For example, the toggle checkbox 'Select All' previously would add the checked fields as columns in the Task table and as rows in the Summary Metrics table, but after the change, only columns are added in the Task Table as it got tricky to add rows dynamically in the datatables.

How was this patch tested?

I have attached the screenshots of the Stage Page UI before and after the fix.
Before:

Accumulators Table:

After:

Accumulators Table:

…g columns and searching Converted static html tables to datatables on stage page [SPARK-21809] : Change Stage Page to use datatables to support sorting columns and searching

[SPARK-21809] : Fixing code to pass ScalaStyle Check

pgandhi999 · 2017-09-18T21:05:32Z

@ajbozarth Thank you for your comment on the previous PR. I have closed that one. Apologies for the confusion caused in the previous PR!

ajbozarth · 2017-09-18T21:12:31Z

Thanks, I'll try to review this by EOD tomorrow

ajbozarth · 2017-09-18T22:24:59Z

I'll look at the html/js code tomorrow, but it looks like there still unrelated code that adds new fields, is that code supposed to be there or is it for another task?

pgandhi999 · 2017-09-19T13:51:38Z

All of the code is part of the same task. Can you please be more specific about the code that you have doubts about, and I can elaborate further on it.

tgravescs · 2017-09-19T15:43:05Z

ok to test

SparkQA · 2017-09-19T15:58:27Z

Test build #81936 has finished for PR 19270 at commit c588953.

This patch fails MiMa tests.
This patch merges cleanly.
This patch adds no public classes.

pgandhi999 · 2017-09-19T16:01:27Z

The error logs for test build #81683 state that method this(Long,Int,Int,Long,Long,Long,Long,Long,Long)Unit in class org.apache.spark.status.api.v1.ExecutorStageSummary does not have a correspondent in current version. All I have done is add new fields in the api ExecutorStageSummary and have not modified any existing ones. It should be fine but please let me know if it is not.

ajbozarth · 2017-09-19T23:07:08Z

On a second look I think I figured out my misunderstanding, and I've realized a through review will take quite a bit of time, I'll do my best to finish by the end of the week but no promises. As for the MiMa failure, any change to a public api (even additions) must be added to the MiMa excludes.

pgandhi999 · 2017-09-20T01:04:29Z

No problem. Thank you for your valuable comments.

Adding Problem filter for ExecutorStageSummary to MiMa Excludes

SparkQA · 2017-09-20T16:23:58Z

Test build #82003 has finished for PR 19270 at commit dd12be7.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

ajbozarth · 2017-09-20T19:36:19Z

I'm still going through the code but I also checked out, built and ran you changes and found that the page doesn't work in the web UI only in the SHS. Did you test this on both the Web UI and SHS? I'll continue my read through and testing of your code while you fix this.

pgandhi999 · 2017-09-21T19:26:08Z

I believe you mean the opposite of what you wrote. My changes are visible in the web ui(while the app is running) and not in the SHS(once the job is done). Yep I see that and am working on the fix. Thank you.

ajbozarth · 2017-09-21T20:52:44Z

For me it's definitely the UI that doesn't work and the SHS that does, I'' see if I can recreate and screenshot the js error I'm getting for you

ajbozarth · 2017-09-21T20:55:46Z

tgravescs · 2017-09-22T18:10:07Z

I just tried this out and it appears to be working for me for a running application, haven't tried the history UI yet. @ajbozarth What browser are you using and what are you running (a wordcount, or similar)? I tried both in chrome and firefox.

I just checked out his pull request and built that.

one thing I noticed was if you select everything under the "Show additional metrics", then refresh the page, is saves the settings but the check boxes aren't checked anymore. "Select All" should be called what it used to be "(De)select All"

It also seems to be missing some of the options there when you do a shuffle there should be Shuffle read/write metrics: "Shuffle Read Blocked Time" and "Shuffle Remote Reads"

ajbozarth · 2017-09-26T22:43:04Z

Ok so I'm still doing more testing but I've narrowed the above problem. The above error is occurring when using either local or standalone, the error doesn't appear when using yarn. I'll continue my testing and review.

pgandhi999 · 2017-09-26T22:49:17Z

Ok, I will look into it. I am currently fixing ui bugs and unit tests, so will commit those changes first, then will look into the above issue. Thank you.

…g columns and searching [SPARK-21809] : Fixed a couple of ui issues; the changes are visible both in webui and SHS when running on yarn. Fixed unit tests. Removed two obsolete unit tests.

SparkQA · 2017-09-27T22:01:48Z

Test build #82252 has finished for PR 19270 at commit 098a93d.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

pgandhi999 · 2017-09-28T13:53:54Z

@ajbozarth There were two unit tests in StagePageSuite.scala that were failing as they are no longer valid for the modified ui that generate datatables dynamically from Javascript. I have removed them for the time being, but if you have any other suggestions, let me know.

…ui and history on local and standalone [SPARK-21809] : Fixing issue of not being able to see changes in web ui and history on local and standalone mode as well as on yarn

SparkQA · 2017-09-29T18:53:32Z

Test build #82321 has finished for PR 19270 at commit c1f85ae.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

pgandhi999 · 2017-09-29T19:11:44Z

@ajbozarth I have fixed the issue of my changes not working in the web ui for local, standalone and yarn. Let me know if you are still facing issues with the testing.

ajbozarth · 2017-09-29T22:54:04Z

Thanks, I'll pull the latest changes and keep testing. And thanks for your quick responses, I understand large changes like this take forever to review and can get frustrating for the submitter.

pgandhi999 · 2017-09-30T18:46:20Z

No problem, @ajbozarth , and thank you for your valuable feedback. I really appreciate it.

ajbozarth

I haven't gotten through tasks pages.js completely yet but wanted to post my comments so far. (also I think taskspages.js should either be taskspage.js or stagepage.js)

Two bits of functionality I found missing:

Show additional metrics don't toggle the metrics in the summary table, they're just always on
# Complete Tasks doesn't link to the Tasks table anymore

ajbozarth · 2017-10-03T22:24:14Z

core/src/main/scala/org/apache/spark/status/api/v1/AllStagesResource.scala

  }

-  def convertTaskData(uiData: TaskUIData, lastUpdateTime: Option[Long]): TaskData = {
+  private def getGettingResultTime(info: TaskInfo, currentTime: Long): Long = {


currentTime is never used

ajbozarth · 2017-10-03T22:28:31Z

core/src/main/scala/org/apache/spark/ui/exec/ExecutorsTab.scala

  def activeStorageStatusList: Seq[StorageStatus] = storageStatusListener.storageStatusList
-
  def deadStorageStatusList: Seq[StorageStatus] = storageStatusListener.deadStorageStatusList
-


Why remove these lines? They don't seem to be an issue

ajbozarth · 2017-10-03T22:42:18Z

core/src/main/scala/org/apache/spark/ui/exec/ExecutorsTab.scala

    execTaskSummary.isBlacklisted = isBlacklisted
  }

+  def getExecutorHost(eid: String): String = {


Is this how getExecutorHost works in other classes? How did you decide to implement it this way?

As the list active StorageStatusList stores the info of all the active executors at any point, I thought it could be used in that manner. I tested it and it seems to work fine. If you think there could be a potential issue with this approach, do let me know and we can discuss further.

ajbozarth · 2017-10-03T22:46:26Z

core/src/test/scala/org/apache/spark/ui/UISeleniumSuite.scala

      for {
        stageId <- 0 to 1
-        attemptId <- 0 to 1
+        attemptId <- 1 to 0


Why is this changed?

So this test was initially failing due to following reason. For stage id 1 and attempt id 0, the stage is designed to fail. So ideally, for this case, when the test tries to connect to the backend to get the json file in line 352, it would exit. But as there is no code writen to handle the exception, the test would quit and fail as the last case never ran. So, by changing the order, I ensured that all the stage success cases would run and the last one would fail and test will pass. I went as far as I could to debug and this was what I could find. If you think there is something more to this, let me know and we can discuss further.

ajbozarth · 2017-10-03T22:52:04Z