Add node shutdown API for shutting down nodes cleanly

This issue supersedes #49064, which will be closed.

The node shutdown API should provide a safe way for operators to shutdown a node ensuring all relevant orchestration steps are taken to prevent cluster instability and data loss. The feature can be used to decommission, power cycle or upgrade nodes.

An example of marking a node as part of the shutdown:

```json
PUT /_nodes/<node_id>/shutdown
{
  "type": "remove",¹
  "reason": "shutdown of node so we can remove it from the cluster"²
}
¹ The type of decommission, in this case either a "remove" (the node is never coming back) or a "restart"
² A user-enterable free text block description of the reason why the node is being shut down
```

And retrieving the shutdown status:

```json
GET /_nodes/<node_id>/shutdown

{
  "node": "data-node-1",
  "node_id": "node-id-1",
  "type": "remove",
  "reason": "shutdown of node so we can remove it from the cluster"
  "status": {¹
     "shutdown_status": "IN_PROGRESS",²
     "shard_migration": {
       "status": "IN_PROGRESS",
       "shard_migrations_remaining": 7,³
       "time_started": "<user readable date>",
       "time_started_millis": 234091892
     },
     "persistent_tasks": {⁴
       "status": "IN_PROGRESS",
       "tasks_remaining": 2,⁵
       "error": "ICouldntStopTheTasksException[i can't do that dave]...etc stacktrack etc...",
       "time_started": "<user readable date>",
       "time_started_millis": 128391987
     },
     "plugins": {⁶
       "status": "NOT_STARTED",
     },
     "data_loss_on_removal": false⁷
  },
  "time_since_shutdown": "1.2h",⁸
  "time_since_shutdown_millis": 4320000,
  "shutdown_started": "<user readable date>",9
  "shutdown_started_millis": 128391987
}
1. Shows the current state of the shutdown for this node. This can be used by operators to track progress
2. Overall shutdown status. Possible values are: "IN_PROGRESS", "COMPLETE", "STALLED". IF the shutdown is STALLED a error field will also be returned containing the reason the shutdown is stalled (e.g. no nodes can take remaining shards)
3. How many shards remain to be migrated off of this node
4. Whether in progress persistent tasks have been halt and new tasks have been blocked
5. The number of tasks that need to be completed before shutdown
6. Whether plugins have indicated that they are ready for shutdown
7. Whether data loss could occur if the node was terminated now
8. How long the shutdown has been ongoing.
9. When the shutdown was initiated.
```

-------------------------------------------------

Here are some high-level tasks that need to be completed for this:

- [x] Add cluster state building blocks for tracking node shutdown status (@gwbrown) #70044
  - [x] Implement full status API that reads shutdown status (@gwbrown) #71162
- [x] Add REST scaffolding and feature flag for the shutdown APIs (@dakrone) #70697
- [x] Mechanism for migrating data away from a decommissioned node
  - [x] Allocation decider (@gwbrown) #71658
  - [x] Ensure status is updated for data migration (@gwbrown) #73873
- [x] Mechanism to handle persistent tasks
  - [x] Ensure persistent tasks are not assigned to nodes shutting down (@dakrone) #72260
- [x] Mechanism for a node being restarted to retain its data (@gwbrown) #75606
- [x] Method to avoid needing to stop ILM (@dakrone) #73690
- [x] Check within the plugin lifecycle for the safety of shutdown (@dakrone) #73690
  - [x] Update ML to make use of the `ShutdownAwarePlugin` and stop its work while shutting down
- [x] Convert system property feature flag into yml setting that cannot be enabled on a non-release build (@dakrone) #74267
- [x] Remove feature flag (when ready for release) (@gwbrown) #76588
  - [x] Flip feature flag to default to "true" for snapshot builds (@dakrone) #75962

---

Phase 2:
- [x] Add "REPLACE" shutdown type
  - [x] Add REST and cluster state support for the "REPLACE" shutdown type (@gwbrown)
  - [x] Add allocation decider and change existing deciders to handle node replacements (@dakrone)
- [ ] Upgrades to persistent task handling
  - [ ] Cancel pre-existing tasks running on a node that is marked as shutting down (@dakrone)
  - [ ] Hook persistent task state into shutdown status API (@dakrone)
- [ ] Enhance data tier allocation decider to allow migrating to a different tier if all nodes in a certain tier are shutdown (possibly?)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add node shutdown API for shutting down nodes cleanly #70338

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Add node shutdown API for shutting down nodes cleanly #70338

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions