Generate the application ID label irrespective of app name. #331
Conversation
ash211
left a comment
We should add an integration test verifying that the restrictions on Spark app names have loosened. After this, do we expect application names of exactly 63 characters, and containing non-DNS-label characters like _, to succeed?
.doc("Prefix to use in front of the executor pod names.")
.internal()
.stringConf
.createWithDefault("spark")
Will this default ever be used? It seems the value will be set before it's ever accessed.
It's tricky: the value is always set on the submission client, but to read it directly we need to use sparkConf.get(...). If the parameter were an optional config entry, we'd have to call .get on the Option and require it to be present. I think it's ok if the contract is that the value is always provided but there's still a default here. At worst the executor names are incorrect, but since all of our logic is keyed off the ID label, it should not matter for correctness.
compromise with ...getOrElse("spark") ?
That would have more or less the same effect as this, so we might as well encode the default here; code in the config class is preferred over complexity in the scheduler backend class.
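To make the trade-off concrete, here is a minimal sketch of the two alternatives discussed above, using Spark's internal ConfigBuilder API. The entry key and doc string are from the diff above; the package, object, and method names are hypothetical:

```scala
package org.apache.spark.sketch // under org.apache.spark so private[spark] APIs resolve

import org.apache.spark.SparkConf
import org.apache.spark.internal.config.ConfigBuilder

object PodNamePrefixSketch {
  // Approach taken in this PR: bake the fallback into the entry itself,
  // so readers get the default through a plain sparkConf.get(...).
  val EXECUTOR_POD_NAME_PREFIX =
    ConfigBuilder("spark.kubernetes.executor.podNamePrefix")
      .doc("Prefix to use in front of the executor pod names.")
      .internal()
      .stringConf
      .createWithDefault("spark")

  // Alternative floated above: an optional entry (hypothetical key), which
  // pushes the fallback to every call site via getOrElse.
  val EXECUTOR_POD_NAME_PREFIX_OPT =
    ConfigBuilder("spark.kubernetes.executor.podNamePrefix.opt")
      .internal()
      .stringConf
      .createOptional

  def resolvePrefix(conf: SparkConf): String =
    conf.get(EXECUTOR_POD_NAME_PREFIX) // default applied if unset

  def resolvePrefixAlt(conf: SparkConf): String =
    conf.get(EXECUTOR_POD_NAME_PREFIX_OPT).getOrElse("spark")
}
```

With the default encoded in the entry, every reader gets the same fallback for free, which is the point made above about keeping the config class simple.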
.withName(kubernetesDriverPodName)
.addToLabels(allLabels.asJava)
.addToAnnotations(parsedCustomAnnotations.asJava)
.addToAnnotations(SPARK_APP_NAME_ANNOTATION, appName)
annotations don't have the same restrictions that labels do?
Doesn't seem that way, since we put full init-container specs in the annotations.
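For context (not from this PR): Kubernetes caps label values at 63 characters with a restricted alphabet, while annotation values are essentially free-form text (only the total size of the annotation map is limited), which is why full init-container specs can live there. A quick sketch of the label-value rule, using the pattern from the Kubernetes validation code:

```scala
// Kubernetes label value: empty, or <= 63 chars, alphanumeric at both ends,
// with '-', '_', '.' and alphanumerics allowed in between.
def isValidLabelValue(v: String): Boolean =
  v.length <= 63 && v.matches("(([A-Za-z0-9][-A-Za-z0-9_.]*)?[A-Za-z0-9])?")

// A typical human-readable Spark app name fails the check:
assert(!isValidLabelValue("My Spark App (test run)"))
// ...while a generated ID passes:
assert(isValidLabelValue("spark-0f1e2d3c4b5a"))
```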
…8s/spark into use-generated-id-label
@ash211 I added an integration test.
I just tried a test and underscores aren't allowed in pod names either, unfortunately. The Kubernetes documentation has the exact requirements (lowercase alphanumeric characters, dashes, and dots): https://kubernetes.io/docs/concepts/overview/working-with-objects/names/.
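For reference, the rule the linked page describes is the DNS-1123 subdomain check. A quick sketch, with the regex and length limit taken from the Kubernetes validation rules:

```scala
// DNS-1123 subdomain: lowercase alphanumerics, '-' and '.', where each
// dot-separated segment starts and ends alphanumeric; 253 chars max overall.
def isValidDns1123Subdomain(name: String): Boolean =
  name.length <= 253 &&
    name.matches("[a-z0-9]([-a-z0-9]*[a-z0-9])?(\\.[a-z0-9]([-a-z0-9]*[a-z0-9])?)*")

assert(!isValidDns1123Subdomain("my_app"))   // underscores rejected
assert(isValidDns1123Subdomain("my-app.v2")) // dashes and dots are fine
```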
if we rebase this on |
ash211
left a comment
LGTM -- this looks to significantly reduce the exposure where a user-provided Spark app name is reused for something in Kubernetes and is then rejected.
Planning to merge when builds are green
rerun integration test please
private[spark] val SPARK_ROLE_LABEL = "spark-role"
private[spark] val SPARK_POD_DRIVER_ROLE = "driver"
private[spark] val SPARK_POD_EXECUTOR_ROLE = "executor"
private[spark] val SPARK_APP_NAME_ANNOTATION = "spark-app-name"
Should "reserved" identifiers like "spark-app-name" in annotation keys be documented?
They could be but the error message might be self-documenting enough, and I'm not sure if it's worth the space. Do you anticipate this being an annotation that people will want to set?
It seems more like a low-probability corner case. If it's a lot of documentation, it might not be worth it; if it's low effort, I'd consider it worth doing.
Mostly I'd be worried if there's potential for silent failures.
We send the notification here: https://github.com/apache-spark-on-k8s/spark/pull/331/files#diff-8c861c0709460ebb9571f0d44791b6beR92
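The linked check presumably rejects a user-supplied custom annotation that collides with the reserved key rather than silently overwriting it. A hypothetical sketch of that kind of guard (names and message are assumptions, not copied from the diff):

```scala
// Hypothetical guard: fail fast when a user-provided custom annotation
// collides with the reserved spark-app-name key, instead of silently
// clobbering it later in the pod builder.
def validateCustomAnnotations(
    customAnnotations: Map[String, String],
    reservedKey: String = "spark-app-name"): Unit = {
  require(!customAnnotations.contains(reservedKey),
    s"Annotation with key $reservedKey is reserved for Spark's internal use" +
      " and may not be set by the user.")
}
```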
* Generate the application ID label irrespective of app name.
* Add an integration test.
* Fix scalastyle
Closes #330.
Use the application name as a prefix for Kubernetes resource names, but the restrictions Kubernetes enforces on labels make it untenable to put the application name in any label. Therefore, use a generated UUID as the "application ID", which we use both for registering with the external shuffle service and for locating the application's executor pods.
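A minimal sketch of the scheme the description outlines: generate a label-safe ID once at submission, attach it to the driver and executor pods as a label, and keep the human-readable app name in an annotation only. The helper name is hypothetical; the ID shape matches what the PR describes:

```scala
import java.util.UUID

// Hypothetical helper: the generated ID is lowercase hex plus a fixed
// prefix, so it is always a valid Kubernetes label value regardless of
// what the user picked for spark.app.name.
def generateKubernetesAppId(): String =
  s"spark-${UUID.randomUUID().toString.replaceAll("-", "")}"

// e.g. "spark-3f1c9a..." -- 38 characters, well under the 63-char label limit.
```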