
Should replication lag be calculated from heartbeat metrics? #53

@genuss

Currently I use mongodb_consistent_backup as the backup tool for my config-server replica set, where updates are very rare. The tool calculates replication lag from the time of the last oplog operation. In my case this often produces false positives for too-high replication lag. Would it be more sensible to use the lastHeartBeat* metrics, or am I missing something here?
As an example, please see the attached metrics from one of my secondary replicas:
[image: attached metrics]
Note that the oplog lag goes up to 10 seconds, although the heartBeat metric doesn't exceed 2 seconds (my heartbeatInterval).
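
To make the comparison concrete, here is a minimal sketch of the two calculations. It is not the tool's actual code. It assumes a document shaped like `replSetGetStatus` output as seen from the primary, using the `date`, `members`, `stateStr`, `optimeDate` and `lastHeartbeat` fields. The function name `lag_metrics`, the host names and the timestamps are made up for illustration.

```python
from datetime import datetime, timedelta


def lag_metrics(status):
    """Return {member_name: (oplog_lag_s, heartbeat_age_s)} for each secondary.

    `status` is assumed to look like replSetGetStatus output fetched
    from the primary (hypothetical helper, not part of any tool).
    """
    primary = next(m for m in status["members"] if m["stateStr"] == "PRIMARY")
    result = {}
    for member in status["members"]:
        if member["stateStr"] != "SECONDARY":
            continue
        # Oplog-based lag: how far the secondary's last applied operation
        # is behind the primary's last operation.
        oplog_lag = (primary["optimeDate"] - member["optimeDate"]).total_seconds()
        # Heartbeat-based value: time since the last heartbeat response
        # from this member was received.
        heartbeat_age = (status["date"] - member["lastHeartbeat"]).total_seconds()
        result[member["name"]] = (oplog_lag, heartbeat_age)
    return result


# Made-up sample resembling the situation described above.
now = datetime(2020, 1, 1, 12, 0, 10)
sample = {
    "date": now,
    "members": [
        {"name": "cfg1:27019", "stateStr": "PRIMARY", "optimeDate": now},
        {
            "name": "cfg2:27019",
            "stateStr": "SECONDARY",
            "optimeDate": now - timedelta(seconds=10),
            "lastHeartbeat": now - timedelta(seconds=1),
        },
    ],
}

print(lag_metrics(sample))
```

One caveat with this sketch: the heartbeat value only shows that the member answered recently, not how much data it has applied, so the two numbers measure different things.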
