You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/monitoring.md
+143Lines changed: 143 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -609,7 +609,150 @@ A list of the available metrics, with a short description:
609
609
</tr>
610
610
</table>
611
611
612
+
### Executor Metrics
612
613
614
+
Executor-level metrics are sent from each executor to the driver as part of the Heartbeat to describe the performance metrics of Executor itself like JVM heap memory, GC infomation. Metrics `peakExecutorMetrics.*` are only enabled if `spark.eventLog.logStageExecutorMetrics.enabled` is true.
615
+
A list of the available metrics, with a short description:
616
+
617
+
<tableclass="table">
618
+
<tr><th>Executor Level Metric name</th>
619
+
<th>Short description</th>
620
+
</tr>
621
+
<tr>
622
+
<td>totalGCTime</td>
623
+
<td>Elapsed time the JVM spent in garbage collection summed in this Executor.
624
+
The value is expressed in milliseconds.</td>
625
+
</tr>
626
+
<tr>
627
+
<td>totalInputBytes</td>
628
+
<td>Total input bytes summed in this Executor.</td>
629
+
</tr>
630
+
<tr>
631
+
<td>totalShuffleRead</td>
632
+
<td>Total shuffer read bytes summed in this Executor.</td>
633
+
</tr>
634
+
<tr>
635
+
<td>totalShuffleWrite</td>
636
+
<td>Total shuffer write bytes summed in this Executor.</td>
637
+
</tr>
638
+
<tr>
639
+
<td>maxMemory</td>
640
+
<td>Total amount of memory available for storage, in bytes.</td>
<td>Total available off heap memory for storage, in bytes. This amount can vary over time, depending on the MemoryManager implementation.</td>
661
+
</tr>
662
+
<tr>
663
+
<td>peakMemoryMetrics.*</td>
664
+
<td>Peak value of memory (and GC) metrics:</td>
665
+
</tr>
666
+
<tr>
667
+
<td> .JVMHeapMemory</td>
668
+
<td>Peak memory usage of the heap that is used for object allocation.
669
+
The heap consists of one or more memory pools. The used and committed size of the returned memory usage is the sum of those values of all heap memory pools whereas the init and max size of the returned memory usage represents the setting of the heap memory which may not be the sum of those of all heap memory pools.
670
+
The amount of used memory in the returned memory usage is the amount of memory occupied by both live objects and garbage objects that have not been collected, if any.</td>
<td>Peak memory usage of non-heap memory that is used by the Java virtual machine. The non-heap memory consists of one or more memory pools. The used and committed size of the returned memory usage is the sum of those values of all non-heap memory pools whereas the init and max size of the returned memory usage represents the setting of the non-heap memory which may not be the sum of those of all non-heap memory pools.</td>
<td>Resident Set Size for other kind of process. Enabled if spark.eventLog.logStageExecutorProcessTreeMetrics.enabled is true.</td>
735
+
</tr>
736
+
<tr>
737
+
<td> .MinorGCCount</td>
738
+
<td>Total minor GC count. For example, the garbage collector is one of Copy, PS Scavenge, ParNew, G1 Young Generation and so on.</td>
739
+
</tr>
740
+
<tr>
741
+
<td> .MinorGCTime</td>
742
+
<td>Elapsed total minor GC time.
743
+
The value is expressed in milliseconds.</td>
744
+
</tr>
745
+
<tr>
746
+
<td> .MajorGCCount</td>
747
+
<td>Total major GC count. For example, the garbage collector is one of MarkSweepCompact, PS MarkSweep, ConcurrentMarkSweep, G1 Old Generation and so on.</td>
748
+
</tr>
749
+
<tr>
750
+
<td> .MajorGCTime</td>
751
+
<td>Elapsed total major GC time.
752
+
The value is expressed in milliseconds.</td>
753
+
</tr>
754
+
</table>
755
+
The computation of RSS and Vmem are based on [proc(5)](http://man7.org/linux/man-pages/man5/proc.5.html)
0 commit comments