Log primary-replica resync failures #27421

jasontedor · 2017-11-17T01:52:28Z

Today we do not fail a replica shard if the primary-replica resync to that replica fails. Yet, we should at least log the failure messages. This commit causes this to be the case.

Relates #24841, relates #27418

Today we do not fail a replica shard if the primary-replica resync to that replica fails. Yet, we should at least log the failure messages. This commit causes this to be the case.

dakrone

LGTM

ywelsch

I'm not sure it's a good idea to do this. For example, when shutting down a cluster (full cluster restart), this might result in these warnings being logged, which could look alarming to users even though there is no reason to be alarmed. The primary-replica resync being best-effort at the moment anyhow, what advantage can the user gain from seeing these warnings? If it's for debugging purposes, I'm fine logging this as debug here.

jasontedor · 2017-11-17T11:49:10Z

@ywelsch I disagree; today we log and only log primary-replica resync completed with {} operations and that is misleading. I do not want to throw out having these error messages in the logs when there is a genuine failure (the baby) solely to not have them in the logs when this occurs during shutdown (the bathwater, which will occur rarely).

ywelsch · 2017-11-17T12:17:55Z

I agree that the current log message is misleading. I'm not sure though that we should log every replication failure on a primary-replica resync, though. I'll reach out to discuss.

ywelsch

LGTM if log-level is changed to info

jasontedor · 2017-11-17T18:34:31Z

I discussed this with @ywelsch and we agreed that info logging is acceptable here.

Today we do not fail a replica shard if the primary-replica resync to that replica fails. Yet, we should at least log the failure messages. This commit causes this to be the case. Relates #27421

Log primary-replica resync failures

367be38

Today we do not fail a replica shard if the primary-replica resync to that replica fails. Yet, we should at least log the failure messages. This commit causes this to be the case.

jasontedor added :Sequence IDs v6.0.1 v6.1.0 v7.0.0 labels Nov 17, 2017

jasontedor requested a review from ywelsch November 17, 2017 01:52

dakrone approved these changes Nov 17, 2017

View reviewed changes

ywelsch suggested changes Nov 17, 2017

View reviewed changes

ywelsch approved these changes Nov 17, 2017

View reviewed changes

jasontedor added 2 commits November 17, 2017 12:36

warn -> info

ac87aa8

annotation

bd23fdb

jasontedor merged commit da11515 into elastic:master Nov 17, 2017

jasontedor deleted the log-resync-failures branch November 17, 2017 18:34

clintongormley added the >enhancement label Dec 6, 2017

clintongormley added :Distributed Indexing/Engine Anything around managing Lucene and the Translog in an open shard. and removed :Sequence IDs labels Feb 14, 2018

jimczi added v7.0.0-beta1 and removed v7.0.0 labels Feb 7, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Log primary-replica resync failures #27421

Log primary-replica resync failures #27421

Uh oh!

jasontedor commented Nov 17, 2017

Uh oh!

dakrone left a comment

Uh oh!

ywelsch left a comment

Uh oh!

jasontedor commented Nov 17, 2017

Uh oh!

ywelsch commented Nov 17, 2017

Uh oh!

ywelsch left a comment

Uh oh!

jasontedor commented Nov 17, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Log primary-replica resync failures #27421

Log primary-replica resync failures #27421

Uh oh!

Conversation

jasontedor commented Nov 17, 2017

Uh oh!

dakrone left a comment

Choose a reason for hiding this comment

Uh oh!

ywelsch left a comment

Choose a reason for hiding this comment

Uh oh!

jasontedor commented Nov 17, 2017

Uh oh!

ywelsch commented Nov 17, 2017

Uh oh!

ywelsch left a comment

Choose a reason for hiding this comment

Uh oh!

jasontedor commented Nov 17, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants