@DaveCTurner DaveCTurner commented Aug 10, 2020

Once a cluster is running entirely on 7.x there is no longer any point in
timing out a join attempt. Timing out and retrying with the same master
achieves nothing, and an in-flight join attempt to one master no longer blocks
attempts to join other masters. This commit deprecates the now-unnecessary
`cluster.join.timeout` setting and removes its effect from the joining process.

Relates #60873 which removes this setting in master.
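
To make "deprecates the setting and removes its effect" concrete, here is a minimal, self-contained sketch. It is not the Elasticsearch settings API: the class `DeprecatedSettingSketch` and its methods are hypothetical names used only for illustration. It assumes a cluster that is entirely on 7.x, where the configured value is still accepted and reported as deprecated but never used as a timeout.

```java
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.OptionalLong;

// Hypothetical, simplified model of a deprecated no-op setting.
// This is an illustration only, not the real Elasticsearch Setting class.
final class DeprecatedSettingSketch {
    static final String JOIN_TIMEOUT_KEY = "cluster.join.timeout";

    // Returns one warning if the deprecated key is configured, otherwise none.
    static List<String> deprecationWarnings(Map<String, String> nodeSettings) {
        if (nodeSettings.containsKey(JOIN_TIMEOUT_KEY)) {
            return Collections.singletonList(
                "[" + JOIN_TIMEOUT_KEY + "] is deprecated and has no effect on joining");
        }
        return Collections.emptyList();
    }

    // Assumes an all-7.x cluster: the configured value is ignored,
    // so no timeout is ever applied to a join attempt.
    static OptionalLong joinTimeoutMillis(Map<String, String> nodeSettings) {
        return OptionalLong.empty();
    }

    public static void main(String[] args) {
        Map<String, String> settings = Collections.singletonMap(JOIN_TIMEOUT_KEY, "30s");
        System.out.println(deprecationWarnings(settings));
        System.out.println(joinTimeoutMillis(settings).isPresent());
    }
}
```

The point of the sketch is that configuring the key stays harmless for existing deployments: it produces a warning rather than an error, and the join logic no longer reads it.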

@DaveCTurner DaveCTurner added the >enhancement, >deprecation, :Distributed Coordination/Cluster Coordination, and v7.10.0 labels Aug 10, 2020
@elasticmachine
Collaborator

Pinging @elastic/es-distributed (:Distributed/Cluster Coordination)

@elasticmachine elasticmachine added the Team:Distributed (Obsolete) label Aug 10, 2020
DaveCTurner added a commit to DaveCTurner/elasticsearch that referenced this pull request Aug 10, 2020
There is no point in timing out a join attempt any more. Timing out and
retrying with the same master is pointless, and an in-flight join
attempt to one master no longer blocks attempts to join other masters.
This commit removes this unnecessary setting.

Relates elastic#60872 in which this setting was deprecated.
@DaveCTurner
Contributor Author

DaveCTurner commented Aug 10, 2020

NB: this PR is against 7.x only. The code has diverged sufficiently from the master branch (and the docs changes needed are sufficiently different) that the simplest approach is two independent PRs that will be merged together.

@DaveCTurner DaveCTurner requested a review from ywelsch August 10, 2020 08:35
public static final String VALIDATE_JOIN_ACTION_NAME = "internal:cluster/coordination/join/validate";
public static final String START_JOIN_ACTION_NAME = "internal:cluster/coordination/start_join";

// the timeout for each join attempt
Contributor Author

We probably don't need this for Zen1 joining either, although the process of bootstrapping the cluster during a rolling upgrade adds some complexity. Rather than spend time on the analysis, I decided to leave the behaviour in mixed-version clusters alone.
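
A rough sketch of the resulting behaviour, under the assumption that the timeout is still honoured only while the cluster contains pre-7.0 (Zen1) nodes. The class `JoinTimeoutPolicySketch` and the `Mode` enum are hypothetical and do not correspond to names in the actual change.

```java
import java.time.Duration;
import java.util.Optional;

// Hypothetical illustration of the policy described above; not the real JoinHelper code.
final class JoinTimeoutPolicySketch {
    // Assumed two-way split: a mixed-version cluster still using Zen1 joins,
    // or a cluster whose nodes are all on 7.x.
    enum Mode { MIXED_VERSION_ZEN1, ALL_NODES_7X }

    // Returns the timeout to apply to a join attempt, or empty for "never time out".
    static Optional<Duration> timeoutFor(Mode mode, Duration configured) {
        if (mode == Mode.MIXED_VERSION_ZEN1) {
            // Behaviour in mixed-version clusters is left alone.
            return Optional.of(configured);
        }
        // In an all-7.x cluster the setting has no effect.
        return Optional.empty();
    }

    public static void main(String[] args) {
        Duration configured = Duration.ofSeconds(60);
        System.out.println(timeoutFor(Mode.MIXED_VERSION_ZEN1, configured));
        System.out.println(timeoutFor(Mode.ALL_NODES_7X, configured));
    }
}
```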

Contributor

@ywelsch ywelsch left a comment

LGTM

@DaveCTurner
Contributor Author

@elasticmachine please run elasticsearch-ci/1 (looks like a transient (?) Gradle (?) issue)

@DaveCTurner DaveCTurner merged commit f44c28b into elastic:7.x Aug 10, 2020
@DaveCTurner DaveCTurner deleted the 2020-08-10-deprecate-join-timeout branch August 10, 2020 12:57
DaveCTurner added a commit that referenced this pull request Aug 10, 2020
There is no point in timing out a join attempt any more. Timing out and
retrying with the same master is pointless, and an in-flight join
attempt to one master no longer blocks attempts to join other masters.
This commit removes this unnecessary setting.

Relates #60872 in which this setting was deprecated.
jrodewig added a commit that referenced this pull request Sep 16, 2021
We deprecated the `cluster.join.timeout` setting in 7.10 with PR #60872.
However, we didn't add a related item to the 7.10 deprecation docs. This adds
the missing item.

Relates to #60873.