
Conversation

@droberts195 commented Apr 21, 2020

The ML info endpoint returns the max_model_memory_limit setting
if one is configured. However, it is still possible to create
a job that cannot run anywhere in the current cluster because
no node in the cluster has enough memory to accommodate it.

This change adds an extra piece of information,
limits.effective_max_model_memory_limit, to the ML info
response that returns the biggest model memory limit that could
be run in the current cluster assuming no other jobs were
running.

The idea is that the ML UI will be able to warn users who try to
create jobs with higher model memory limits that their jobs will
not be able to start unless they add a bigger ML node to their
cluster.
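
For illustration, here is a minimal Java sketch of the idea behind the new field, assuming a hypothetical node attribute name, an illustrative memory percentage and native-code overhead (this is not the actual Elasticsearch implementation): the effective limit is the largest amount of native memory any single ML node could give to one job's model if nothing else were running.

    import java.util.List;
    import java.util.Map;

    // Sketch only: effective_max_model_memory_limit is, conceptually, the largest model
    // memory limit that any single ML node in the current cluster could accommodate if
    // no other jobs were running. The attribute name, percentage and overhead constant
    // below are illustrative assumptions, not the exact values or logic used by
    // Elasticsearch.
    public class EffectiveLimitSketch {

        // Illustrative placeholder for memory reserved for the ML native code itself.
        static final long NATIVE_CODE_OVERHEAD_BYTES = 30L * 1024 * 1024;

        static long effectiveMaxModelMemoryBytes(List<Map<String, String>> mlNodeAttributes,
                                                 int maxMachineMemoryPercent) {
            long max = 0;
            for (Map<String, String> attrs : mlNodeAttributes) {
                long machineMemory = Long.parseLong(attrs.getOrDefault("ml.machine_memory", "0"));
                // Only a percentage of each machine's memory is usable by ML native
                // processes, and part of that is reserved for the native code itself.
                long usable = machineMemory * maxMachineMemoryPercent / 100 - NATIVE_CODE_OVERHEAD_BYTES;
                max = Math.max(max, usable);
            }
            return Math.max(max, 0);
        }

        public static void main(String[] args) {
            // Example: two ML nodes with 16GB and 64GB of machine memory, 30% usable by ML.
            List<Map<String, String>> nodes = List.of(
                Map.of("ml.machine_memory", String.valueOf(16L * 1024 * 1024 * 1024)),
                Map.of("ml.machine_memory", String.valueOf(64L * 1024 * 1024 * 1024))
            );
            System.out.println(effectiveMaxModelMemoryBytes(nodes, 30) + " bytes");
        }
    }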

Relates elastic/kibana#63942

@elasticmachine
Collaborator

Pinging @elastic/ml-core (:ml)

    if (maxModelMemoryLimit != null && maxModelMemoryLimit.getBytes() > 0) {
-       limits.put("max_model_memory_limit", maxModelMemoryLimit);
+       limits.put("max_model_memory_limit", maxModelMemoryLimit.getStringRep());
        if (currentEffectiveMaxModelMemoryLimit == null || currentEffectiveMaxModelMemoryLimit.compareTo(maxModelMemoryLimit) > 0) {
Member commented on this diff:
It might be nice to indicate that there is room available for larger jobs if they increased their MAX_MODEL_MEMORY_LIMIT setting.

But, in the scenarios where the user could take action, it seems to me that they SHOULD already know the native memory available.

@droberts195 (Author) replied:

The main scenario where MAX_MODEL_MEMORY_LIMIT is used is in Cloud, where it's controlled by the Cloud environment.

The other scenario where we envisage it being used is when an administrator wants to stop users from consuming all the resources with a single job.

In both cases, the user seeing the effect of the restriction wouldn't have the power to increase the limit. It's extremely unlikely there would be a scenario where the user being affected by the limit had the power to change it. Superusers who are using ML and have complete control of their hardware probably don't have the setting set at all.

In the event that both the hard maximum and the effective maximum constrain the size of a job, the UI should report the hard maximum.

For Elastic Cloud there is a desire for the UI to suggest upgrading to more powerful nodes if limits are hit, as that's just a case of a few clicks in the Cloud console (and paying more). But I think this endpoint still provides enough information to facilitate that, because within the Cloud environment we're already setting a hard maximum limit.
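
For illustration, a minimal sketch of that warning choice, assuming hypothetical names and limits expressed in bytes (the real check would live in the Kibana UI; this only shows the precedence described above):

    // Hypothetical helper, not actual Kibana or Elasticsearch code. When both limits
    // would constrain the job, the configured hard maximum is the one to report.
    static String modelMemoryWarning(long requestedBytes, Long hardMaxBytes, Long effectiveMaxBytes) {
        if (hardMaxBytes != null && requestedBytes > hardMaxBytes) {
            return "model_memory_limit is greater than the xpack.ml.max_model_memory_limit setting";
        }
        if (effectiveMaxBytes != null && requestedBytes > effectiveMaxBytes) {
            return "no ML node in the cluster currently has enough memory for this model_memory_limit; "
                + "add a bigger ML node or lower the limit";
        }
        return null; // no warning needed
    }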

@droberts195
Author

Jenkins test this please

@droberts195
Author

Jenkins test this please

@droberts195
Author

Jenkins run elasticsearch-ci/packaging-sample-unix-docker

@droberts195 changed the title from "[ML] Add effective current max model memory limit to ML info" to "[ML] Add effective max model memory limit to ML info" on Apr 22, 2020
We decided that using two words was overly verbose
@droberts195 merged commit d1a9b3a into elastic:master on Apr 22, 2020
@droberts195 deleted the add_current_mem_limit_to_info branch on Apr 22, 2020 at 10:37
droberts195 pushed a commit to droberts195/elasticsearch that referenced this pull request Apr 22, 2020
Backport of elastic#55529
droberts195 pushed a commit that referenced this pull request Apr 22, 2020
Backport of #55529
russcam added a commit to elastic/elasticsearch-net that referenced this pull request Jun 24, 2020
russcam added a commit to elastic/elasticsearch-net that referenced this pull request Jun 29, 2020
github-actions bot pushed a commit to elastic/elasticsearch-net that referenced this pull request Jun 29, 2020
github-actions bot pushed a commit to elastic/elasticsearch-net that referenced this pull request Jun 29, 2020
russcam added a commit to elastic/elasticsearch-net that referenced this pull request Jun 29, 2020
russcam added a commit to elastic/elasticsearch-net that referenced this pull request Jun 29, 2020