| Property Name | Default | Meaning | ||||
|---|---|---|---|---|---|---|
spark.akka.frameSize |
+ spark.rpc.message.maxSize |
128 | Maximum message size (in MB) to allow in "control plane" communication; generally only applies to map output size information sent between executors and the driver. Increase this if you are running - jobs with many thousands of map and reduce tasks and see messages about the frame size. - | -|||
spark.akka.heartbeat.interval |
- 1000s | -
- This is set to a larger value to disable the transport failure detector that comes built in to
- Akka. It can be enabled again, if you plan to use this feature (Not recommended). A larger
- interval value reduces network overhead and a smaller value ( ~ 1 s) might be more
- informative for Akka's failure detector. Tune this in combination of spark.akka.heartbeat.pauses
- if you need to. A likely positive use case for using failure detector would be: a sensistive
- failure detector can help evict rogue executors quickly. However this is usually not the case
- as GC pauses and network lags are expected in a real Spark cluster. Apart from that enabling
- this leads to a lot of exchanges of heart beats between nodes leading to flooding the network
- with those.
- |
-||||
spark.akka.heartbeat.pauses |
- 6000s | -
- This is set to a larger value to disable the transport failure detector that comes built in to Akka.
- It can be enabled again, if you plan to use this feature (Not recommended). Acceptable heart
- beat pause for Akka. This can be used to control sensitivity to GC pauses. Tune
- this along with spark.akka.heartbeat.interval if you need to.
- |
-||||
spark.akka.threads |
- 4 | -- Number of actor threads to use for communication. Can be useful to increase on large clusters - when the driver has a lot of CPU cores. - | -||||
spark.akka.timeout |
- 100s | -- Communication timeout between Spark nodes. + jobs with many thousands of map and reduce tasks and see messages about the RPC message size. | ||||
spark.executor.port |
- (random) | -- Port for the executor to listen on. This is used for communicating with the driver. - This is only relevant when using the Akka RPC backend. - | -||||
spark.fileserver.port |
- (random) | -- Port for the driver's HTTP file server to listen on. - This is only relevant when using the Akka RPC backend. - | -||||
spark.network.timeout |
120s |
Default timeout for all network interactions. This config will be used in place of
- spark.core.connection.ack.wait.timeout, spark.akka.timeout,
+ spark.core.connection.ack.wait.timeout,
spark.storage.blockManagerSlaveTimeoutMs,
spark.shuffle.io.connectionTimeout, spark.rpc.askTimeout or
spark.rpc.lookupTimeout if they are not configured.
@@ -1418,8 +1362,7 @@ Apart from these, the following properties are also available, and may be useful
Use fs for file server.
|
||||
| Config Namespace | Component | |||||
spark.ssl.akka |
- Akka communication channels | -|||||
spark.ssl.fs |
HTTP file server and broadcast server | @@ -137,7 +132,7 @@ configure those ports.7077 | Submit job to cluster / Join cluster |
SPARK_MASTER_PORT |
- Akka-based. Set to "0" to choose a port randomly. Standalone mode only. | +Set to "0" to choose a port randomly. Standalone mode only. |
| Standalone Master | @@ -145,7 +140,7 @@ configure those ports.(random) | Schedule executors | SPARK_WORKER_PORT |
- Akka-based. Set to "0" to choose a port randomly. Standalone mode only. | +Set to "0" to choose a port randomly. Standalone mode only. |
spark.driver.portspark.executor.portspark.fileserver.port