Skip to content

Commit ec3e76f

Browse files
parentesrowen
authored andcommitted
[SPARK-6343] Doc driver-worker network reqs
Attempt at making the driver-worker networking requirement more explicit and up-front in the documentation (see https://issues.apache.org/jira/browse/SPARK-6343). Update cluster overview diagram to show connections from workers to driver. Add a bullet below about how driver listens / accepts connections from workers. Author: Peter Parente <[email protected]> Closes #5382 from parente/SPARK-6343 and squashes the following commits: 0b2fb9d [Peter Parente] [SPARK-6343] Doc driver-worker network reqs (cherry picked from commit b9c51c0) Signed-off-by: Sean Owen <[email protected]>
1 parent 4453c59 commit ec3e76f

File tree

3 files changed

+5
-1
lines changed

3 files changed

+5
-1
lines changed

docs/cluster-overview.md

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -33,7 +33,11 @@ There are several useful things to note about this architecture:
3333
2. Spark is agnostic to the underlying cluster manager. As long as it can acquire executor
3434
processes, and these communicate with each other, it is relatively easy to run it even on a
3535
cluster manager that also supports other applications (e.g. Mesos/YARN).
36-
3. Because the driver schedules tasks on the cluster, it should be run close to the worker
36+
3. The driver program must listen for and accept incoming connections from its executors throughout
37+
its lifetime (e.g., see [spark.driver.port and spark.fileserver.port in the network config
38+
section](configuration.html#networking)). As such, the driver program must be network
39+
addressable from the worker nodes.
40+
4. Because the driver schedules tasks on the cluster, it should be run close to the worker
3741
nodes, preferably on the same local area network. If you'd like to send requests to the
3842
cluster remotely, it's better to open an RPC to the driver and have it submit operations
3943
from nearby than to run a driver far away from the worker nodes.

docs/img/cluster-overview.png

5.42 KB
Loading

docs/img/cluster-overview.pptx

-23.1 KB
Binary file not shown.

0 commit comments

Comments
 (0)