Skip cluster state serialization to closed channel #67413

DaveCTurner · 2021-01-13T09:38:42Z

Today if a client requests a cluster state and then closes the
connection then we still do all the work of computing and serializing
the cluster state before finally dropping it all on the floor.

With this commit we introduce checks to make sure that the HTTP channel
is still open before starting the serialization process. We also make
the tasks themselves cancellable and abort any ongoing waiting if the
channel is closed (mainly to make the cancellability testing easier).
Finally we introduce a more detailed description of the task to help
identify cases where clients are inefficiently requesting more
components of the cluster state than they need.
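
As a rough illustration of the first point only, the check amounts to testing the channel before doing the expensive work. The sketch below is self-contained and uses hypothetical stand-ins (`FakeChannel`, `ChannelClosedException`, `serializeIfOpen`); none of these names are the classes touched by this PR.

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.function.Supplier;

// Hypothetical stand-ins for illustration; these are not Elasticsearch classes.
final class ClosedChannelSketch {

    /** Minimal model of an HTTP channel that can report whether it is still open. */
    static final class FakeChannel {
        private final AtomicBoolean open = new AtomicBoolean(true);

        boolean isOpen() {
            return open.get();
        }

        void close() {
            open.set(false);
        }
    }

    /** Thrown instead of producing a response that nobody will read. */
    static final class ChannelClosedException extends RuntimeException {
        ChannelClosedException() {
            super("channel closed before serialization");
        }
    }

    /**
     * Runs the expensive serializer only if the channel is still open.
     * The supplier is never invoked once the channel has been closed.
     */
    static String serializeIfOpen(FakeChannel channel, Supplier<String> expensiveSerializer) {
        if (!channel.isOpen()) {
            throw new ChannelClosedException();
        }
        return expensiveSerializer.get();
    }

    public static void main(String[] args) {
        FakeChannel channel = new FakeChannel();
        System.out.println(serializeIfOpen(channel, () -> "{\"cluster_name\":\"demo\"}"));

        channel.close();
        try {
            serializeIfOpen(channel, () -> "never computed");
        } catch (ChannelClosedException e) {
            System.out.println("skipped: " + e.getMessage());
        }
    }
}
```

Passing the serializer as a `Supplier` keeps the check and the work in one place, so a closed channel costs a single boolean read rather than a full serialization.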

elasticmachine · 2021-01-13T09:38:46Z

Pinging @elastic/es-distributed (Team:Distributed)

original-brownbear

One question; I'll give the test another read, but otherwise this looks nice :)

original-brownbear · 2021-01-13T12:15:46Z

server/src/main/java/org/elasticsearch/rest/action/admin/cluster/RestClusterStateAction.java

                                                singletonMap(Metadata.CONTEXT_MODE_PARAM, Metadata.CONTEXT_MODE_API), request);
-                                        response.getState().toXContent(builder, params);
+                                        final ClusterState responseState = response.getState();
+                                        if (responseState != null) {


Only question pretty much on this one: why can this be null all of a sudden?

It was always thus:

elasticsearch/server/src/main/java/org/elasticsearch/action/admin/cluster/state/TransportClusterStateAction.java

Line 109 in c4c3c8b

listener.onResponse(new ClusterStateResponse(state.getClusterName(), null, true));

TBF this only happens if it times out waiting for a particular metadata version, which in practice means the client is CCR and therefore not going via the REST layer anyway.
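
A minimal sketch of why the guard is needed, assuming a response whose state is null after a timeout. `Response` and `render` are hypothetical names for illustration and do not reproduce the real `ClusterStateResponse` or the REST output format.

```java
// Hypothetical model of a response whose state may be absent after a timeout.
final class NullStateSketch {

    static final class Response {
        final String clusterName;
        final String stateJson;      // null when waiting for the metadata version timed out
        final boolean waitTimedOut;

        Response(String clusterName, String stateJson, boolean waitTimedOut) {
            this.clusterName = clusterName;
            this.stateJson = stateJson;
            this.waitTimedOut = waitTimedOut;
        }
    }

    /** Renders the response, emitting the state only when one is present. */
    static String render(Response response) {
        StringBuilder builder = new StringBuilder();
        builder.append("{\"cluster_name\":\"").append(response.clusterName).append("\"");
        if (response.stateJson != null) {
            // Without this guard a timed-out response would fail here.
            builder.append(",\"state\":").append(response.stateJson);
        }
        return builder.append("}").toString();
    }

    public static void main(String[] args) {
        System.out.println(render(new Response("demo", "{\"version\":7}", false)));
        System.out.println(render(new Response("demo", null, true)));
    }
}
```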

original-brownbear

LGTM :)

Today if a client requests a cluster state and then closes the connection then we still do all the work of computing and serializing the cluster state before finally dropping it all on the floor. With this commit we introduce checks to make sure that the HTTP channel is still open before starting the serialization process. We also make the tasks themselves cancellable and abort any ongoing waiting if the channel is closed (mainly to make the cancellability testing easier). Finally we introduce a more detailed description of the task to help identify cases where clients are inefficiently requesting more components of the cluster state than they need. Backport of #67413

A small followup to #67413 and #68965: the underlying actions of the `GET /_cat/segments` API are now cancellable, so we may as well cancel them if needed.

DaveCTurner added the >enhancement, :Distributed Coordination/Cluster Coordination, v8.0.0, and v7.12.0 labels Jan 13, 2021

DaveCTurner requested a review from original-brownbear January 13, 2021 09:38

elasticmachine added the Team:Distributed (Obsolete) label Jan 13, 2021

DaveCTurner added 2 commits January 13, 2021 09:39

Check for cancellation first

8296afb

Make FakeRestRequest closeable

7d24138

original-brownbear reviewed Jan 13, 2021

original-brownbear approved these changes Jan 13, 2021

DaveCTurner merged commit 31cf058 into elastic:master Jan 13, 2021

DaveCTurner deleted the 2021-01-13-cancel-cluster-state-requests branch January 13, 2021 13:24

DaveCTurner added a commit that referenced this pull request Jan 13, 2021

Revert "Skip cluster state serialization to closed channel (#67413)"

46fa3dd

This reverts commit 563d07f.

DaveCTurner added the backport pending label Jan 13, 2021

DaveCTurner mentioned this pull request Jan 13, 2021

Skip cluster state serialization to closed channel #67450

Merged

DaveCTurner removed the backport pending label Jan 14, 2021

DaveCTurner mentioned this pull request Jan 14, 2021

Improve robustness of monitoring APIs #55550

Closed

DaveCTurner mentioned this pull request Feb 16, 2021

Make GET /_cat/segments cancellable #69020

Merged

DaveCTurner added a commit that referenced this pull request Feb 16, 2021

Make GET /_cat/segments cancellable (#69020)

b6598c6

A small followup to #67413 and #68965: the underlying actions of the `GET /_cat/segments` API are now cancellable, so we may as well cancel them if needed.

DaveCTurner added a commit that referenced this pull request Feb 16, 2021

Make GET /_cat/segments cancellable (#69020)

4b8c8f8

A small followup to #67413 and #68965: the underlying actions of the `GET /_cat/segments` API are now cancellable, so we may as well cancel them if needed.

DaveCTurner mentioned this pull request Apr 29, 2021

Fix ClusterStateRestCancellationIT #72407

Merged

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021
