Initial architecture documentation. #401

mccheah · 2017-07-28T00:34:12Z

Closes #400.

Initial full documentation for the submission client. Templates for the external shuffle service and the scheduler backend.

mccheah · 2017-07-28T00:35:55Z

resource-managers/kubernetes/architecture-docs/submission-client.md

+---
+
+
+Similarly to YARN and Standalone mode, it is common for Spark applications to be deployed on Kubernetes through the


On the whole this document seems verbose - if we could use bulleted lists or diagrams in place of some of these discourses that would improve readability, but I'm not sure how to best represent the information in that way.

ifilonenko · 2017-07-28T04:50:53Z

Thank you for this

erikerlandson · 2017-07-28T15:52:17Z

LGTM

liyinan926 · 2017-07-28T16:27:39Z

resource-managers/kubernetes/architecture-docs/submission-client.md

+
+## Init-Containers
+
+The submission client and the scheduler backend both use init-containers to localize resources before the driver and


"The driver and executor Pods both contain an init-container to ..." might be more accurate.

liyinan926 · 2017-07-28T16:29:10Z

resource-managers/kubernetes/architecture-docs/submission-client.md

+pod spec in a YML file: https://github.com/apache-spark-on-k8s/spark/issues/38
+- The resource staging server can be backed by a distributed file store like HDFS to improve robustness and scalability
+- Additional driver bootstrap steps need to be added to support communication with Kerberized HDFS clusters:
+https://github.com/apache-spark-on-k8s/spark/pull/391


Missing periods at the end of each bullet point.

liyinan926 · 2017-07-28T16:33:52Z

resource-managers/kubernetes/architecture-docs/submission-client.md

+
+
+Similarly to YARN and Standalone mode, it is common for Spark applications to be deployed on Kubernetes through the
+`spark-submit` process. Applications are deployed on Kubernetes via sending YML files to the Kubernetes API server.


s/YML/YAML. Also this is probably not accurate. Applications are deployed on Kubernetes by creating Kubernetes API objects via the API server. Such Kubernetes API objects are typically declared in YAML files.

liyinan926 · 2017-07-28T16:35:22Z

resource-managers/kubernetes/architecture-docs/submission-client.md

+# Future Work
+
+- The driver's pod specification should be highly customizable, to the point where users may want to specify a template
+pod spec in a YML file: https://github.com/apache-spark-on-k8s/spark/issues/38


s/YML/YAML.

I've seen both used interchangeably - is there a standard to use YAML in the Kubernetes community?

The k8s documentation seems consistently using YAML. https://kubernetes.io/docs/search/?q=YAML.

tnachen · 2017-08-01T17:44:52Z

resource-managers/kubernetes/architecture-docs/submission-client.md

+    /**
+     * Represents a step in preparing the Kubernetes driver.
+     */
+    private[spark] trait DriverConfigurationStep {


My 2cents around putting interface code in arch docs, is that it's probably easier for the maintainers if we document to this detail once we move out of beta. Otherwise I will suspect it's a constant moving target where we're not really maintaining this interface as general public API

I think it's fine if this has to be a constantly moving target - this is an architecture document so it should reflect the semantics for those who are contributing to the project.

liyinan926 · 2017-08-02T20:24:07Z

LGTM.

foxish · 2017-08-03T21:17:32Z

rerun integration tests please

foxish · 2017-08-03T23:11:02Z

rerun integration tests please

mccheah · 2017-08-04T21:11:12Z

Note that this still is missing documentation on the external shuffle service and the scheduler backend itself.

Initial full documentation for the submission client. Templates for the external shuffle service and the scheduler backend.

erikerlandson · 2017-08-08T17:02:36Z

resync w/ head of branch-2.2-kubernetes

The generated file is correct but the expected file in the test was not.

* Initial architecture documentation. Initial full documentation for the submission client. Templates for the external shuffle service and the scheduler backend. * Add title to scheduler backend doc. * edits for PR review feedback

mccheah commented Jul 28, 2017

View reviewed changes

ifilonenko self-requested a review July 28, 2017 04:51

liyinan926 reviewed Jul 28, 2017

View reviewed changes

ifilonenko approved these changes Aug 1, 2017

View reviewed changes

tnachen reviewed Aug 1, 2017

View reviewed changes

foxish mentioned this pull request Aug 3, 2017

Cutting the Spark 2.2 release #398

Closed

10 tasks

mccheah and others added 3 commits August 8, 2017 10:00

Initial architecture documentation.

01d439f

Initial full documentation for the submission client. Templates for the external shuffle service and the scheduler backend.

Add title to scheduler backend doc.

d828df2

edits for PR review feedback

83d6f55

erikerlandson force-pushed the architecture-docs branch from 27c00f1 to 83d6f55 Compare August 8, 2017 17:01

erikerlandson merged commit 24cd9ee into branch-2.2-kubernetes Aug 8, 2017

ifilonenko pushed a commit to ifilonenko/spark that referenced this pull request Feb 26, 2019

Fix expected docker file. (apache-spark-on-k8s#401)

9698d3b

The generated file is correct but the expected file in the test was not.

		---


		Similarly to YARN and Standalone mode, it is common for Spark applications to be deployed on Kubernetes through the


		## Init-Containers

		The submission client and the scheduler backend both use init-containers to localize resources before the driver and



		Similarly to YARN and Standalone mode, it is common for Spark applications to be deployed on Kubernetes through the
		`spark-submit` process. Applications are deployed on Kubernetes via sending YML files to the Kubernetes API server.

Initial architecture documentation. #401

Initial architecture documentation. #401

Uh oh!

Conversation

mccheah commented Jul 28, 2017

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ifilonenko commented Jul 28, 2017

Uh oh!

erikerlandson commented Jul 28, 2017

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

liyinan926 commented Aug 2, 2017

Uh oh!

foxish commented Aug 3, 2017

Uh oh!

foxish commented Aug 3, 2017

Uh oh!

mccheah commented Aug 4, 2017

Uh oh!

erikerlandson commented Aug 8, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants