This repository was archived by the owner on Jan 9, 2020. It is now read-only.

Conversation

@mccheah commented Aug 22, 2017

This is the first of several measures to make KubernetesClusterSchedulerBackend feasible to test. Requires #445, though only for convenience rather than as a semantic dependency.

The idea is to start breaking the functionality of KubernetesClusterSchedulerBackend down into multiple individually unit-testable units. The logic that builds the executor pod structure was by far the longest single method that could be isolated with relative ease.
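To make the shape of that refactor concrete, a rough sketch of what the extracted interface could look like is below; the trait name matches the one discussed later in this thread, but the method name and parameters are illustrative, not the exact signature in this PR:

    import io.fabric8.kubernetes.api.model.Pod

    // Sketch only: the real factory in this PR may take different arguments.
    // Putting pod construction behind a small trait lets it be unit-tested
    // (and stubbed) without standing up the whole scheduler backend.
    trait ExecutorPodFactory {
      def createExecutorPod(
          executorId: String,
          applicationId: String,
          driverUrl: String,
          executorEnvs: Seq[(String, String)],
          nodeToLocalTaskCount: Map[String, Int]): Pod
    }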

}.getOrElse(containerWithExecutorLimitCores)
val withMaybeShuffleConfigPod = shuffleServiceConfig.map { config =>
config.shuffleDirs.foldLeft(executorPod) { (builder, dir) =>
new PodBuilder(builder)
Author
Ah we lost the indentation here, I'll fix that.

looks fixed now
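For anyone skimming the snippet above: the foldLeft threads a single pod through one PodBuilder pass per shuffle directory. A minimal sketch of that pattern with the fabric8 client follows; the volume-naming scheme is illustrative and not the code in this PR:

    import io.fabric8.kubernetes.api.model.{Pod, PodBuilder}

    object ShuffleDirVolumesSketch {
      // Sketch: add one hostPath volume per shuffle directory to an existing pod.
      def withShuffleDirVolumes(basePod: Pod, shuffleDirs: Seq[String]): Pod =
        shuffleDirs.zipWithIndex.foldLeft(basePod) { case (pod, (dir, index)) =>
          new PodBuilder(pod)
            .editOrNewSpec()
              .addNewVolume()
                .withName(s"shuffle-dir-$index")
                .withNewHostPath()
                  .withPath(dir)
                .endHostPath()
              .endVolume()
            .endSpec()
            .build()
        }
    }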

@foxish left a comment

Refactor LGTM

ConfigurationUtils.parsePrefixedKeyValuePairs(
sparkConf,
KUBERNETES_NODE_SELECTOR_PREFIX,
"node-selector")
Member
"node selector" for consistency?

Author
That would be a break in the configuration I think. Aside from that, SparkConf keys have never had spaces in them.

I think that last string is only used for log output -- node selector seems like it would be fine
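For readers outside the codebase, the gist of the exchange above: the helper collects every SparkConf key under a prefix into a map, and the last argument most likely only feeds log or error messages, so renaming it would not affect user-facing configuration keys. A minimal sketch of that behavior, assuming SparkConf.getAllWithPrefix (the real ConfigurationUtils may differ):

    import org.apache.spark.SparkConf

    object ConfigurationUtilsSketch {
      // Sketch of a prefixed key/value parser. `configType` (e.g. "node-selector")
      // would typically only appear in log or error messages, which is why the
      // naming debate above is about readability rather than config compatibility.
      def parsePrefixedKeyValuePairs(
          sparkConf: SparkConf,
          prefix: String,
          configType: String): Map[String, String] = {
        // Collects every key of the form <prefix><suffix>, keyed by <suffix>.
        sparkConf.getAllWithPrefix(prefix).toMap
      }
    }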

"-DsimpleDriverConf=simpleDriverConfValue" +
" -Ddriverconfwithspaces='driver conf with spaces value'")
sparkConf.set("spark.files", driverJvmOptionsFile.getAbsolutePath)
sparkConf.set(SparkLauncher.EXECUTOR_EXTRA_JAVA_OPTIONS,
Member
This is a separate change from the refactor, correct?

@mccheah commented Aug 28, 2017

I believe the diff is corrupted and I have to fix it in git.

@mccheah commented Aug 28, 2017

Actually I just need to rebase against branch-2.2-kubernetes and make the diff that way.

This is the first of several measures to make
KubernetesClusterSchedulerBackend feasible to test.
@mccheah changed the base branch from support-executor-java-options to branch-2.2-kubernetes on August 29, 2017 18:31
Author
Believe this change is unintentional

@mccheah force-pushed the separate-executor-pod-construction branch from dbb113d to dc6b186 on August 29, 2017 18:38
@mccheah commented Aug 29, 2017

@foxish rebase complete.

please don't revert this change -- it went in in a PR and somehow your PRs keep coming close to reverting it.. ?

import org.apache.spark.deploy.kubernetes.submit.{InitContainerUtil, MountSmallFilesBootstrap}
import org.apache.spark.util.Utils

// Strictly an extension of KubernetesClusterSchedulerBakcne that is factored out for testing.

typo: KubernetesClusterSchedulerBakcne

by extension, you mean the method in this trait is the same as a method in KubernetesClusterSchedulerBackend? That implies to me it should do multiple inheritance.

I'm not sure the "strictly an extension" language makes sense -- maybe instead say that it's only used in the scheduler backend?

Author
Extension meaning a plugin, or functionality that is pretty much only used in the scheduler backend.
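Put differently, the factory is an injected collaborator of the scheduler backend rather than a subtype of it. Reusing the hypothetical ExecutorPodFactory trait sketched near the top of this thread, the consuming side could look roughly like this (class and method names are illustrative):

    import io.fabric8.kubernetes.api.model.Pod

    // Illustrative only: the backend depends on the trait, so tests can hand it a stub.
    class SchedulerBackendSketch(executorPodFactory: ExecutorPodFactory) {

      // The backend decides when an executor is needed; the factory decides
      // what the pod looks like. That split is what makes both halves testable.
      def buildPodForExecutor(executorId: String): Pod = {
        executorPodFactory.createExecutorPod(
          executorId,
          applicationId = "spark-application-id",
          driverUrl = "spark://CoarseGrainedScheduler@driver-svc:7078",
          executorEnvs = Seq.empty,
          nodeToLocalTaskCount = Map.empty)
      }
    }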

import ExecutorPodFactoryImpl._

private val EXECUTOR_ID_COUNTER = new AtomicLong(0L)

where is this used? I think it should be deleted because KubernetesClusterSchedulerBackend.scala already has one
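The suggestion above amounts to keeping ID generation in one place (the scheduler backend) and having the factory only consume the ID it is handed; a second counter in ExecutorPodFactoryImpl would be dead, or worse, divergent state. A tiny sketch of that split (names illustrative):

    import java.util.concurrent.atomic.AtomicLong

    // Sketch: the backend keeps the single source of truth for executor IDs...
    object ExecutorIdAllocatorSketch {
      private val executorIdCounter = new AtomicLong(0L)

      def nextExecutorId(): String = executorIdCounter.incrementAndGet().toString
    }

    // ...and the factory is simply called with that ID, e.g.
    //   factory.createExecutorPod(ExecutorIdAllocatorSketch.nextExecutorId(), ...)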

import org.apache.spark.deploy.kubernetes.constants.ANNOTATION_EXECUTOR_NODE_AFFINITY
import org.apache.spark.internal.Logging

// Strictly an extension of ExecutorPodFactory but extracted out for testing.

not sure this comment adds much -- it's good practice to have smaller more modular pieces anyway for understanding, regardless of testing purposes

@ash211 commented Aug 30, 2017

@mccheah conflicts on the GiB -> MiB conversion fix since you moved that elsewhere

Move MiB change to ExecutorPodFactory.
@mccheah commented Aug 30, 2017

rerun integration tests please

@ash211 commented Sep 6, 2017

Rerun unit tests please

@mccheah commented Sep 6, 2017

@aash @foxish good to merge this?

@ash211 merged commit fa02fb1 into branch-2.2-kubernetes Sep 6, 2017
@ash211 commented Sep 6, 2017

test coverage is unchanged (this is an internal refactor) and enables more granular testing in followup PRs
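As one concrete example of the more granular testing this unlocks: with pod construction behind a factory interface, the scheduler-backend side can be exercised against a stub factory, and the factory itself can be tested without a running backend. A ScalaTest sketch, reusing the hypothetical ExecutorPodFactory trait and SchedulerBackendSketch from earlier in this thread; the real suites added in follow-up PRs may look quite different:

    import io.fabric8.kubernetes.api.model.{Pod, PodBuilder}
    import org.scalatest.FunSuite

    class SchedulerBackendSketchSuite extends FunSuite {

      // Stub factory: returns a minimal pod named after the requested executor.
      private val stubFactory = new ExecutorPodFactory {
        override def createExecutorPod(
            executorId: String,
            applicationId: String,
            driverUrl: String,
            executorEnvs: Seq[(String, String)],
            nodeToLocalTaskCount: Map[String, Int]): Pod = {
          new PodBuilder()
            .withNewMetadata().withName(s"executor-$executorId").endMetadata()
            .build()
        }
      }

      test("backend delegates pod construction to the factory") {
        val backend = new SchedulerBackendSketch(stubFactory)
        assert(backend.buildPodForExecutor("7").getMetadata.getName == "executor-7")
      }
    }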

ifilonenko pushed a commit to ifilonenko/spark that referenced this pull request Feb 26, 2019
…k8s#452)

* Move executor pod construction to a separate class.

This is the first of several measures to make
KubernetesClusterSchedulerBackend feasible to test.

* Revert change to README

* Address comments.

* Resolve merge conflicts.

Move MiB change to ExecutorPodFactory.
puneetloya pushed a commit to puneetloya/spark that referenced this pull request Mar 11, 2019
…k8s#452)

* Move executor pod construction to a separate class.

This is the first of several measures to make
KubernetesClusterSchedulerBackend feasible to test.

* Revert change to README

* Address comments.

* Resolve merge conflicts.

Move MiB change to ExecutorPodFactory.