-
Notifications
You must be signed in to change notification settings - Fork 28.9k
[SPARK-6653][yarn] New config to specify port for sparkYarnAM actor system #5719
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
…rkYarnAM actor system
|
Does this need to be documented too? |
|
I think we should document it. |
…rkYarnAM actor system
|
added "spark.yarn.am.port" in running-on-yarn.md |
|
Test build #718 has finished for PR 5719 at commit
|
docs/running-on-yarn.md
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we should be more specific in the description of this. an application master could have many ports open, what is this one for. I think either put in name or atleast put in description.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also kind of confusing is that in other places we use the spark.yarn.am.* namespace to refer only to the yarn-client application master. If this applies to both yarn-client and yarn-cluster, the naming is kind of inconsistent.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@sryza what do you suggest.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Unless I'm missing something this is only used in yarn client mode. This the communication channel (rpc) between AM on cluster and driver on the client side. It looks like the rest of spark uses things just like spark.executor.port, spark.driver.port, etc.. So this kind of follows that convention. Personally I would like it more descriptive but since that is kind of convention I'm ok with it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@WangTaoTheTonic I am not 100% sure but I think it does communicate with driver also, in runAMEndpoint method it is creating RpcEndpoint to communicate with driver. Can someone please confirm, I will update description accordingly.
|
So, isn't a single port here kind of limiting? If you're unlucky and two AMs start on the same node, one of them will fail. Wouldn't it be better to have a port range here? |
|
@vanzin user is expected to specify different port number for different application. |
|
That is not very user-friendly at all. |
|
@vanzin, Rpc.create will call |
I see. In that case the documentation added in this change is wrong, since the actual port will be anything between |
|
There is also a separate jira to add support for port ranges. SPARK-4449 |
|
ok to test |
docs/running-on-yarn.md
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this channel to communicate with driver either?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is not the channel used to communicate with executors.
This channel is used in yarn client mode to communicate between the spark driver running on a gateway and the application master (which is doing container requests) running on yarn.
It also looks like I was wrong my previous comment. It actually is also used in yarn cluster mode. Its used for the dynamic executor feature, where it is handling the kill from the scheduler backend.
|
Test build #31258 has finished for PR 5719 at commit
|
|
this is odd, I don't see our comments showing up out here. @zuxqoj can you look at the Files changed tab and update based on the comments |
…ystem updated config description as suggested by @tgravescs
|
LGTM pending tests |
|
Test build #31661 has finished for PR 5719 at commit
|
docs/running-on-yarn.md
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nits: yarn -> YARN, spark -> Spark, 'application master' -> 'Application Master'. Is this really the most accurate description of the AM role? it's where you find the Spark UI, right? the rest of this info isn't so useful to callers. It's just the port where the AM listens.
…ystem Incorporating @srowen feedback
|
Test build #31688 has finished for PR 5719 at commit
|
|
Jenkins, retest this please. |
|
Test build #31690 has finished for PR 5719 at commit
|
…system Author: shekhar.bansal <[email protected]> Closes #5719 from zuxqoj/master and squashes the following commits: 5574ff7 [shekhar.bansal] [SPARK-6653][yarn] New config to specify port for sparkYarnAM actor system 5117258 [shekhar.bansal] [SPARK-6653][yarn] New config to specify port for sparkYarnAM actor system 9de5330 [shekhar.bansal] [SPARK-6653][yarn] New config to specify port for sparkYarnAM actor system 456a592 [shekhar.bansal] [SPARK-6653][yarn] New configuration property to specify port for sparkYarnAM actor system 803e93e [shekhar.bansal] [SPARK-6653][yarn] New configuration property to specify port for sparkYarnAM actor system (cherry picked from commit fc8feaa) Signed-off-by: Sean Owen <[email protected]>
…system Author: shekhar.bansal <[email protected]> Closes apache#5719 from zuxqoj/master and squashes the following commits: 5574ff7 [shekhar.bansal] [SPARK-6653][yarn] New config to specify port for sparkYarnAM actor system 5117258 [shekhar.bansal] [SPARK-6653][yarn] New config to specify port for sparkYarnAM actor system 9de5330 [shekhar.bansal] [SPARK-6653][yarn] New config to specify port for sparkYarnAM actor system 456a592 [shekhar.bansal] [SPARK-6653][yarn] New configuration property to specify port for sparkYarnAM actor system 803e93e [shekhar.bansal] [SPARK-6653][yarn] New configuration property to specify port for sparkYarnAM actor system
…system Author: shekhar.bansal <[email protected]> Closes apache#5719 from zuxqoj/master and squashes the following commits: 5574ff7 [shekhar.bansal] [SPARK-6653][yarn] New config to specify port for sparkYarnAM actor system 5117258 [shekhar.bansal] [SPARK-6653][yarn] New config to specify port for sparkYarnAM actor system 9de5330 [shekhar.bansal] [SPARK-6653][yarn] New config to specify port for sparkYarnAM actor system 456a592 [shekhar.bansal] [SPARK-6653][yarn] New configuration property to specify port for sparkYarnAM actor system 803e93e [shekhar.bansal] [SPARK-6653][yarn] New configuration property to specify port for sparkYarnAM actor system
…system Author: shekhar.bansal <[email protected]> Closes apache#5719 from zuxqoj/master and squashes the following commits: 5574ff7 [shekhar.bansal] [SPARK-6653][yarn] New config to specify port for sparkYarnAM actor system 5117258 [shekhar.bansal] [SPARK-6653][yarn] New config to specify port for sparkYarnAM actor system 9de5330 [shekhar.bansal] [SPARK-6653][yarn] New config to specify port for sparkYarnAM actor system 456a592 [shekhar.bansal] [SPARK-6653][yarn] New configuration property to specify port for sparkYarnAM actor system 803e93e [shekhar.bansal] [SPARK-6653][yarn] New configuration property to specify port for sparkYarnAM actor system
No description provided.