Skip to content

Commit 31d435e

Browse files
KaiXinXiaoLeirxin
authored andcommitted
Add a config option to print DAG.
Add a config option "spark.rddDebug.enable" to check whether to print DAG info. When "spark.rddDebug.enable" is true, it will print information about DAG in the log. Author: KaiXinXiaoLei <[email protected]> Closes apache#4257 from KaiXinXiaoLei/DAGprint and squashes the following commits: d9fe42e [KaiXinXiaoLei] change log info c27ee76 [KaiXinXiaoLei] change log info 83c2b32 [KaiXinXiaoLei] change config option adcb14f [KaiXinXiaoLei] change the file. f4e7b9e [KaiXinXiaoLei] add a option to print DAG
1 parent 08488c1 commit 31d435e

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

core/src/main/scala/org/apache/spark/SparkContext.scala

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1420,6 +1420,9 @@ class SparkContext(config: SparkConf) extends Logging with ExecutorAllocationCli
14201420
val callSite = getCallSite
14211421
val cleanedFunc = clean(func)
14221422
logInfo("Starting job: " + callSite.shortForm)
1423+
if (conf.getBoolean("spark.logLineage", false)) {
1424+
logInfo("RDD's recursive dependencies:\n" + rdd.toDebugString)
1425+
}
14231426
dagScheduler.runJob(rdd, cleanedFunc, partitions, callSite, allowLocal,
14241427
resultHandler, localProperties.get)
14251428
progressBar.foreach(_.finishAll())

0 commit comments

Comments
 (0)