[ML] Repeated info in log message from all ML nodes leads to long message #29950

@elasticmachine

Original comment by @sophiec20:

Found in 5.5 "build" : { "hash" : "62e486b", "date" : "2017-06-06T13:54:18.605Z" }

23-node cluster: 3 master nodes and 20 combined ML and data nodes.
Once the cluster reached the limit of max open jobs, trying to open another job returned the following error.

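The response repeats the same "node is full" explanation for each of the 20 ML nodes, adds an "isn't a ml node" entry for each of the 3 master nodes, and includes the whole explanation twice: once under root_cause and once at the top level. This will only get worse as the number of nodes increases.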

This is not a priority, since we currently recommend only a small number of dedicated nodes.

== Opening job for streamingtv299...
{"error":{"root_cause":[{"type":"status_exception","reason":"Could not open job because no suitable nodes were found, allocation explanation [Not opening job [streamingtv299] on node [{ip-10-0-3-237}{iVQhSWkEQV2QBcNgAbry5Q}{L2IqCUuAQLymdOw47osyVg}{10.0.3.237}{10.0.3.237:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-50}{HhByQF_ORUqD0NiClqPArg}{pmkQnJAhRBeUUx6pCZzgtQ}{10.0.3.50}{10.0.3.50:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-231}{cA5e1J2yS4-E96bBlaRJ7w}{XBoP_nupR6K5qB9ve6V3KA}{10.0.3.231}{10.0.3.231:9300}], because this node isn't a ml node.|Not opening job [streamingtv299] on node [{ip-10-0-3-56}{9nywi9T5S5mz0MXEibFaCw}{wqon-rXpStm2SuIYfga7zA}{10.0.3.56}{10.0.3.56:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-162}{pvnUofCaRwOiO8LUiJ4TbQ}{Rwwf5uc0SKG9IS28nifBhw}{10.0.3.162}{10.0.3.162:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-196}{gZ36z6RYQfqKUaLqMTjiKw}{NFRpcdtPTYerbM5C1Hj3jQ}{10.0.3.196}{10.0.3.196:9300}], because this node isn't a ml node.|Not opening job [streamingtv299] on node [{ip-10-0-3-110}{VntXc0TpQKGzhK6OprukSw}{fam8-p92SWCf4QNg5I3dpA}{10.0.3.110}{10.0.3.110:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-126}{bkigpzP5SsW4j4Z0ox5g-Q}{bR8YYWd7TfGrGjv0sTsZ7Q}{10.0.3.126}{10.0.3.126:9300}{ml.enabled=true}], because this node is full. 
Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-209}{9rluv4g5RXKbzM9clOaWGA}{bvM7WC3DQhm9VLBzIwCSQg}{10.0.3.209}{10.0.3.209:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-203}{J8r3z4gpQBW7DjAhLaMdnA}{Fq7BKus3T02JSJt7DoA5Cg}{10.0.3.203}{10.0.3.203:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-58}{c-jxUMWFR9S5HUasmhppaw}{RpTzjgLfTnag4kt__Qd1UQ}{10.0.3.58}{10.0.3.58:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-57}{x1u2qSaZQ9mSuwzsc9uS5w}{akMho_T0TJSttYCrbiolVw}{10.0.3.57}{10.0.3.57:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-197}{cWYjfUh8R2Gyut_sQGOG3Q}{qjZUBsfdRsuTazL_Lc2f6Q}{10.0.3.197}{10.0.3.197:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-215}{-eE-JaQMQgqYdRB4FJY5KQ}{WYfKVgAdT4yC2I7cpIa1Cg}{10.0.3.215}{10.0.3.215:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-41}{oyVlA-MBRW6dzjhWNwDAWg}{QUbWs3F4SUOOYx3bMDdXAA}{10.0.3.41}{10.0.3.41:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-68}{A4G6q0_QR7WIjuCCr0zfMQ}{1jJwyj94QoKLSKRW4LQjlw}{10.0.3.68}{10.0.3.68:9300}{ml.enabled=true}], because this node is full. 
Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-228}{VIxQSGk-SGuhcoPjYnMlgg}{lqz7p1NLTnWpsJyfG92-QA}{10.0.3.228}{10.0.3.228:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-44}{_lVGfCsVQ6eZlRc7SKqQWw}{Lwepw_9tQnefUECnO11fpQ}{10.0.3.44}{10.0.3.44:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-32}{wqVpvt-cT1aHdB8ASGh_Rg}{7rqe4GuqTAW7iGrvc3uaUw}{10.0.3.32}{10.0.3.32:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-182}{5nZNKrlKTdOt1aZW75AkYQ}{kDOS7T6cS8KWtKnDjRSqpA}{10.0.3.182}{10.0.3.182:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-47}{MQlVJm70SFSu3ZElRf2Hlg}{2TWEPfQQQQaefdIvSyx2MA}{10.0.3.47}{10.0.3.47:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-10}{C2y3cPk_QjyyZDZb-GHKwQ}{ePrCTvllTLaMiIqRjOJVWA}{10.0.3.10}{10.0.3.10:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-33}{GhXTuf3hT5WDUEYciix-yw}{gONvo5mLRL2_VGWOugPfJA}{10.0.3.33}{10.0.3.33:9300}], because this node isn't a ml node.]"}],"type":"status_exception","reason":"Could not open job because no suitable nodes were found, allocation explanation [Not opening job [streamingtv299] on node [{ip-10-0-3-237}{iVQhSWkEQV2QBcNgAbry5Q}{L2IqCUuAQLymdOw47osyVg}{10.0.3.237}{10.0.3.237:9300}{ml.enabled=true}], because this node is full. 
Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-50}{HhByQF_ORUqD0NiClqPArg}{pmkQnJAhRBeUUx6pCZzgtQ}{10.0.3.50}{10.0.3.50:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-231}{cA5e1J2yS4-E96bBlaRJ7w}{XBoP_nupR6K5qB9ve6V3KA}{10.0.3.231}{10.0.3.231:9300}], because this node isn't a ml node.|Not opening job [streamingtv299] on node [{ip-10-0-3-56}{9nywi9T5S5mz0MXEibFaCw}{wqon-rXpStm2SuIYfga7zA}{10.0.3.56}{10.0.3.56:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-162}{pvnUofCaRwOiO8LUiJ4TbQ}{Rwwf5uc0SKG9IS28nifBhw}{10.0.3.162}{10.0.3.162:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-196}{gZ36z6RYQfqKUaLqMTjiKw}{NFRpcdtPTYerbM5C1Hj3jQ}{10.0.3.196}{10.0.3.196:9300}], because this node isn't a ml node.|Not opening job [streamingtv299] on node [{ip-10-0-3-110}{VntXc0TpQKGzhK6OprukSw}{fam8-p92SWCf4QNg5I3dpA}{10.0.3.110}{10.0.3.110:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-126}{bkigpzP5SsW4j4Z0ox5g-Q}{bR8YYWd7TfGrGjv0sTsZ7Q}{10.0.3.126}{10.0.3.126:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-209}{9rluv4g5RXKbzM9clOaWGA}{bvM7WC3DQhm9VLBzIwCSQg}{10.0.3.209}{10.0.3.209:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-203}{J8r3z4gpQBW7DjAhLaMdnA}{Fq7BKus3T02JSJt7DoA5Cg}{10.0.3.203}{10.0.3.203:9300}{ml.enabled=true}], because this node is full. 
Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-58}{c-jxUMWFR9S5HUasmhppaw}{RpTzjgLfTnag4kt__Qd1UQ}{10.0.3.58}{10.0.3.58:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-57}{x1u2qSaZQ9mSuwzsc9uS5w}{akMho_T0TJSttYCrbiolVw}{10.0.3.57}{10.0.3.57:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-197}{cWYjfUh8R2Gyut_sQGOG3Q}{qjZUBsfdRsuTazL_Lc2f6Q}{10.0.3.197}{10.0.3.197:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-215}{-eE-JaQMQgqYdRB4FJY5KQ}{WYfKVgAdT4yC2I7cpIa1Cg}{10.0.3.215}{10.0.3.215:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-41}{oyVlA-MBRW6dzjhWNwDAWg}{QUbWs3F4SUOOYx3bMDdXAA}{10.0.3.41}{10.0.3.41:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-68}{A4G6q0_QR7WIjuCCr0zfMQ}{1jJwyj94QoKLSKRW4LQjlw}{10.0.3.68}{10.0.3.68:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-228}{VIxQSGk-SGuhcoPjYnMlgg}{lqz7p1NLTnWpsJyfG92-QA}{10.0.3.228}{10.0.3.228:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-44}{_lVGfCsVQ6eZlRc7SKqQWw}{Lwepw_9tQnefUECnO11fpQ}{10.0.3.44}{10.0.3.44:9300}{ml.enabled=true}], because this node is full. 
Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-32}{wqVpvt-cT1aHdB8ASGh_Rg}{7rqe4GuqTAW7iGrvc3uaUw}{10.0.3.32}{10.0.3.32:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-182}{5nZNKrlKTdOt1aZW75AkYQ}{kDOS7T6cS8KWtKnDjRSqpA}{10.0.3.182}{10.0.3.182:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-47}{MQlVJm70SFSu3ZElRf2Hlg}{2TWEPfQQQQaefdIvSyx2MA}{10.0.3.47}{10.0.3.47:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-10}{C2y3cPk_QjyyZDZb-GHKwQ}{ePrCTvllTLaMiIqRjOJVWA}{10.0.3.10}{10.0.3.10:9300}{ml.enabled=true}], because this node is full. Number of opened jobs [10], max_running_jobs [10]|Not opening job [streamingtv299] on node [{ip-10-0-3-33}{GhXTuf3hT5WDUEYciix-yw}{gONvo5mLRL2_VGWOugPfJA}{10.0.3.33}{10.0.3.33:9300}], because this node isn't a ml node.]"},"status":429}
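
One possible way to shorten the message is to group the per-node reasons and report a count for each distinct reason instead of listing every node. The following is only a sketch in Python of that idea, not Elasticsearch code: `summarize_explanation` and `NODE_PATTERN` are hypothetical names, and the only assumption about the input is what the response above shows, namely reasons joined with `|` and a node descriptor of the form `on node [{...}{...}]`.

```python
import re
from collections import Counter

# Hypothetical helper, not part of Elasticsearch. Matches the node descriptor
# as it appears in the response above: " on node [{name}{id}...{attrs}]".
NODE_PATTERN = re.compile(r" on node \[(?:\{[^}]*\})+\]")


def summarize_explanation(explanation: str) -> str:
    """Collapse a '|'-separated allocation explanation into distinct reasons."""
    reasons = Counter()
    for part in explanation.split("|"):
        # Remove the node descriptor so identical reasons compare equal.
        reason = NODE_PATTERN.sub("", part.strip())
        reasons[reason] += 1
    # Counter keeps first-seen order, so reasons appear as in the original.
    return "|".join(f"{reason} (x{count})" for reason, count in reasons.items())


if __name__ == "__main__":
    sample = (
        "Not opening job [j1] on node [{a}{id1}{10.0.0.1:9300}{ml.enabled=true}], "
        "because this node is full. Number of opened jobs [10], max_running_jobs [10]|"
        "Not opening job [j1] on node [{b}{id2}{10.0.0.2:9300}], "
        "because this node isn't a ml node."
    )
    print(summarize_explanation(sample))
```

Applied to the response above, this would reduce the 23 entries to two distinct reasons with counts. Nodes that report different job counts would still produce separate entries, because the grouping is on the exact reason text.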
