@dimitris-athanasiou
Contributor

…(#53915)

While `CustomProcessor` is generic and allows for flexibility, new
requirements make cross validation a concept that is hard to
abstract behind a custom processor. In particular, we would like to
add `data_counts` to the DFA job stats; counting training vs. test
docs would be a useful statistic. We would also like to add a
different cross validation strategy for multiclass classification.

This commit renames custom processors to cross validation splitters,
which allows for those enhancements without cryptically doing
things as a side effect of the abstract custom processing.
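
To illustrate the motivation, here is a minimal sketch of what a dedicated splitter abstraction could look like. It is not the code in this PR: the names `SplitterSketch`, `RandomSplitterSketch`, `isTraining`, `trainingDocs` and `testDocs` are hypothetical, and the random split by training fraction is an assumed strategy chosen only for the example. The point is that the training/test decision and the resulting doc counts become an explicit part of the contract rather than a side effect of generic row processing.

```java
import java.util.Random;

/**
 * Hypothetical sketch: names are illustrative and are not taken from
 * the Elasticsearch codebase.
 */
interface SplitterSketch {
    /** Returns true if the row is used for training, false if it is held out for testing. */
    boolean isTraining(String[] row);
}

/** Assumed strategy: assign each row to training with a fixed probability. */
final class RandomSplitterSketch implements SplitterSketch {
    private final double trainingFraction;
    private final Random random;
    private long trainingDocs;
    private long testDocs;

    RandomSplitterSketch(double trainingFraction, long seed) {
        if (trainingFraction < 0.0 || trainingFraction > 1.0) {
            throw new IllegalArgumentException("trainingFraction must be in [0, 1]");
        }
        this.trainingFraction = trainingFraction;
        this.random = new Random(seed);
    }

    @Override
    public boolean isTraining(String[] row) {
        // nextDouble() is in [0, 1), so a fraction of 1.0 always selects training
        // and a fraction of 0.0 never does.
        boolean training = random.nextDouble() < trainingFraction;
        if (training) {
            trainingDocs++;
        } else {
            testDocs++;
        }
        return training;
    }

    /** Number of rows assigned to training so far. */
    long trainingDocs() {
        return trainingDocs;
    }

    /** Number of rows held out for testing so far. */
    long testDocs() {
        return testDocs;
    }
}
```

With a shape like this, a different strategy for multiclass classification would be another implementation of the same interface, and the counts could feed a `data_counts` style statistic directly.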

Backport of #53915

@dimitris-athanasiou
Contributor Author

run elasticsearch-ci/master-fwc

@dimitris-athanasiou
Contributor Author

run elasticsearch-ci/master-fwc

@dimitris-athanasiou dimitris-athanasiou merged commit 3873510 into elastic:7.x Mar 23, 2020
@dimitris-athanasiou dimitris-athanasiou deleted the refactor-dfa-custom-processor-to-cross-validation-splitter-7x branch March 23, 2020 15:15