Skip to content

Commit ec109dd

Browse files
[DOCS] Fixes adaptive_allocations examples (#113248) (#113254)
Co-authored-by: Jan Kuipers <[email protected]>
1 parent 0155456 commit ec109dd

File tree

3 files changed

+10
-2
lines changed

3 files changed

+10
-2
lines changed

docs/reference/inference/service-elasticsearch.asciidoc

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -179,6 +179,7 @@ PUT _inference/text_embedding/my-e5-model
179179
"min_number_of_allocations": 3,
180180
"max_number_of_allocations": 10
181181
},
182+
"num_threads": 1,
182183
"model_id": ".multilingual-e5-small"
183184
}
184185
}

docs/reference/inference/service-elser.asciidoc

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -147,7 +147,8 @@ PUT _inference/sparse_embedding/my-elser-model
147147
"enabled": true,
148148
"min_number_of_allocations": 3,
149149
"max_number_of_allocations": 10
150-
}
150+
},
151+
"num_threads": 1
151152
}
152153
}
153154
------------------------------------------------------------

docs/reference/search/search-your-data/semantic-search-semantic-text.asciidoc

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -36,7 +36,11 @@ PUT _inference/sparse_embedding/my-elser-endpoint <1>
3636
{
3737
"service": "elser", <2>
3838
"service_settings": {
39-
"num_allocations": 1,
39+
"adaptive_allocations": { <3>
40+
"enabled": true,
41+
"min_number_of_allocations": 3,
42+
"max_number_of_allocations": 10
43+
},
4044
"num_threads": 1
4145
}
4246
}
@@ -46,6 +50,8 @@ PUT _inference/sparse_embedding/my-elser-endpoint <1>
4650
be used and ELSER creates sparse vectors. The `inference_id` is
4751
`my-elser-endpoint`.
4852
<2> The `elser` service is used in this example.
53+
<3> This setting enables and configures {ml-docs}/ml-nlp-elser.html#elser-adaptive-allocations[adaptive allocations].
54+
Adaptive allocations make it possible for ELSER to automatically scale up or down resources based on the current load on the process.
4955

5056
[NOTE]
5157
====

0 commit comments

Comments
 (0)