From f6d7de0af1f67c914cd8ea58b419eae4578f42b0 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Istv=C3=A1n=20Zolt=C3=A1n=20Szab=C3=B3?= Date: Thu, 24 Sep 2020 11:50:35 +0200 Subject: [PATCH 1/3] [DOCS] Adds DFA limitation item about number of training documents. --- .../stack/ml/df-analytics/ml-dfa-limitations.asciidoc | 11 +++++++++++ 1 file changed, 11 insertions(+) diff --git a/docs/en/stack/ml/df-analytics/ml-dfa-limitations.asciidoc b/docs/en/stack/ml/df-analytics/ml-dfa-limitations.asciidoc index db69b8a15..49cce6d5d 100644 --- a/docs/en/stack/ml/df-analytics/ml-dfa-limitations.asciidoc +++ b/docs/en/stack/ml/df-analytics/ml-dfa-limitations.asciidoc @@ -58,6 +58,17 @@ training percent. Run a few {dfanalytics-jobs} to see how the runtime scales with the increased number of data points and how the quality of results scales with an increased training percentage. + +[float] +[[dfa-training-docs]] +== The number of training documents cannot be more than 2^32 + +A {dfanalytics-job} that would use more than 2^32 documents for training cannot +be started. The limitation applies only for documents participating in training +the model. If your source index contains more than 2^32 documents, set the +`training_percent` to a value that represents fewer documents than 2^32. + + [float] [[dfa-missing-fields-limitations]] == Documents with missing values in analyzed fields are skipped From 6ff2dce0d41c2a8dd149118d85fa7ee5f24ffdad Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Istv=C3=A1n=20Zolt=C3=A1n=20Szab=C3=B3?= Date: Thu, 24 Sep 2020 12:01:32 +0200 Subject: [PATCH 2/3] [DOCS] Changes limitation title. --- docs/en/stack/ml/df-analytics/ml-dfa-limitations.asciidoc | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/docs/en/stack/ml/df-analytics/ml-dfa-limitations.asciidoc b/docs/en/stack/ml/df-analytics/ml-dfa-limitations.asciidoc index 49cce6d5d..055852eba 100644 --- a/docs/en/stack/ml/df-analytics/ml-dfa-limitations.asciidoc +++ b/docs/en/stack/ml/df-analytics/ml-dfa-limitations.asciidoc @@ -61,12 +61,12 @@ with an increased training percentage. [float] [[dfa-training-docs]] -== The number of training documents cannot be more than 2^32 +== {dfanalytics-jobs-cap} cannot use more than 2^32 documents for training A {dfanalytics-job} that would use more than 2^32 documents for training cannot be started. The limitation applies only for documents participating in training the model. If your source index contains more than 2^32 documents, set the -`training_percent` to a value that represents fewer documents than 2^32. +`training_percent` to a value that represents less than 2^32 documents. [float] From 4e0056f2cdb30fa68f8b17ba261cc2d812526ec6 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Istv=C3=A1n=20Zolt=C3=A1n=20Szab=C3=B3?= Date: Thu, 24 Sep 2020 12:37:27 +0200 Subject: [PATCH 3/3] [DOCS] Fixes markup. --- docs/en/stack/ml/df-analytics/ml-dfa-limitations.asciidoc | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/docs/en/stack/ml/df-analytics/ml-dfa-limitations.asciidoc b/docs/en/stack/ml/df-analytics/ml-dfa-limitations.asciidoc index 055852eba..724041f97 100644 --- a/docs/en/stack/ml/df-analytics/ml-dfa-limitations.asciidoc +++ b/docs/en/stack/ml/df-analytics/ml-dfa-limitations.asciidoc @@ -61,12 +61,12 @@ with an increased training percentage. [float] [[dfa-training-docs]] -== {dfanalytics-jobs-cap} cannot use more than 2^32 documents for training +== {dfanalytics-jobs-cap} cannot use more than 2^32^ documents for training -A {dfanalytics-job} that would use more than 2^32 documents for training cannot +A {dfanalytics-job} that would use more than 2^32^ documents for training cannot be started. The limitation applies only for documents participating in training -the model. If your source index contains more than 2^32 documents, set the -`training_percent` to a value that represents less than 2^32 documents. +the model. If your source index contains more than 2^32^ documents, set the +`training_percent` to a value that represents less than 2^32^ documents. [float]