Skip to content

[ML] Job opening fails during .ml-state creation #36271

@pheyos

Description

@pheyos

Found in version

  • 7.0.0 b3663

Steps to reproduce
Perform the follwoing steps on a new created instance / cluster (i.e. the .ml-state index does not exist yet):

  • Create a machine learning job and start the datafeed
  • When the lookback completes and the job is being closed, open a second job

Expected result

  • The second job is opened without errors

Actual result

  • When the first job is closed, the index .ml-state is created. With bad luck on timing it happens that the index is not yet green when the second job should open, such that the opening fails with the message
Could not open job because no suitable nodes were found, allocation explanation
[Not opening job [remote_ip_request_rate], because not all primary shards are active
for the following indices [.ml-state]]

Additional information

  • This happens particularly often when using the nginx recognizer module to create multiple jobs at a time

Metadata

Metadata

Labels

:mlMachine learning>bug

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions