Use an index to store enrich policies #47475

hub-cap · 2019-10-02T21:14:27Z

This commit changes the enrich store's backing storage. It was
previously stored in cluster state, and now it is stored in an index. A
side effect of this is that listeners had to be added in the enrich
stores API. The actions no longer need to be master actions, so they
have been changed too. The named writable that was used in cluster state
is also deleted in this commit.

This commit changes the enrich store's backing storage. It was previously stored in cluster state, and now it is stored in an index. A side effect of this is that listeners had to be added in the enrich stores API. The actions no longer need to be master actions, so they have been changed too. The named writable that was used in cluster state is also deleted in this commit.

elasticmachine · 2019-10-02T21:14:29Z

Pinging @elastic/es-core-features (:Core/Features/Ingest)

martijnvg

This looks good! I left of couple of comments.

martijnvg · 2019-10-03T13:28:48Z

x-pack/plugin/enrich/src/main/java/org/elasticsearch/xpack/enrich/EnrichStore.java

+
+        if (enrichIndex == null) {
+            // create the index
+            client.admin().indices().prepareCreate(ENRICH_INDEX)


Like we discussed in chat, we should either use index template or keep using the create index api call with the right settings and mappings.

martijnvg · 2019-10-03T13:29:16Z

...rich/src/main/java/org/elasticsearch/xpack/enrich/action/TransportGetEnrichPolicyAction.java

+import java.util.stream.Collectors;

-public class TransportGetEnrichPolicyAction extends TransportMasterNodeReadAction<GetEnrichPolicyAction.Request,
+public class TransportGetEnrichPolicyAction extends HandledTransportAction<GetEnrichPolicyAction.Request,


Can the get policy api remain a master node action? I think we are going to add additional things to this API for the UI. Like status and that information can only be read from elected master node. (this is where the policy executor lives)

martijnvg · 2019-10-03T13:30:02Z

x-pack/plugin/enrich/src/main/java/org/elasticsearch/xpack/enrich/EnrichPlugin.java

    }

-    @Override
-    public List<NamedWriteableRegistry.Entry> getNamedWriteables() {


martijnvg · 2019-10-03T13:32:00Z

x-pack/plugin/enrich/src/main/java/org/elasticsearch/xpack/enrich/EnrichStore.java

-    public static void putPolicy(String name, EnrichPolicy policy, ClusterService clusterService, Consumer<Exception> handler) {
-        assert clusterService.localNode().isMasterNode();
-
+    public static void putPolicy(String name, EnrichPolicy policy, ClusterService clusterService, Client client,


Maybe replace the ClusterService parameter with ClusterState here and in other methods, since we only seem to invoke ClusterService#state() method now.

martijnvg · 2019-10-03T13:35:02Z

...ugin/enrich/src/main/java/org/elasticsearch/xpack/enrich/EnrichPolicyMaintenanceService.java

-                @Override
-                public void onFailure(Exception e) {
-                    logger.error("Failed to get indices during enrich index maintenance task", e);
+        EnrichStore.getPolicies(clusterService.state(), client, ActionListener.wrap(


Perhaps the get policies call should be done inside:

final EnrichPolicyLocks.EnrichPolicyExecutionState executionState = enrichPolicyLocks.captureExecutionState(); if (executionState.isAnyPolicyInFlight() == false) { ... }

hub-cap · 2019-10-03T15:31:13Z

After discussing the use of the template registry for setting up the index for enrich, we came to the conclusion that we should not use an index, and rely on cluster state which is the default. The policies are small, we get a lot of things for free such as getting the policies when we query cluster state doing diagnostics, and we dont have to worry about the index going away during a migration or some failure that can occur with indexes that will generally speaking not occur in cluster state. There is a chance we can intro a bug into the cluster state, and while that is annoying, it can be fixed. These reasons led us to decide to stop this work and let it continue to be in cluster state.

hub-cap added >non-issue :Data Management/Ingest Node Execution or management of Ingest Pipelines including GeoIP labels Oct 2, 2019

jakelandis mentioned this pull request Oct 2, 2019

[ingest] Enrich documents prior to indexing #32789

Closed

55 tasks

precommit

c43e649

martijnvg reviewed Oct 3, 2019

View reviewed changes

hub-cap closed this Oct 3, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use an index to store enrich policies #47475

Use an index to store enrich policies #47475

Uh oh!

hub-cap commented Oct 2, 2019

Uh oh!

elasticmachine commented Oct 2, 2019

Uh oh!

martijnvg left a comment

Uh oh!

martijnvg Oct 3, 2019

Uh oh!

martijnvg Oct 3, 2019

Uh oh!

martijnvg Oct 3, 2019

Uh oh!

martijnvg Oct 3, 2019

Uh oh!

martijnvg Oct 3, 2019

Uh oh!

hub-cap commented Oct 3, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Use an index to store enrich policies #47475

Use an index to store enrich policies #47475

Uh oh!

Conversation

hub-cap commented Oct 2, 2019

Uh oh!

elasticmachine commented Oct 2, 2019

Uh oh!

martijnvg left a comment

Choose a reason for hiding this comment

Uh oh!

martijnvg Oct 3, 2019

Choose a reason for hiding this comment

Uh oh!

martijnvg Oct 3, 2019

Choose a reason for hiding this comment

Uh oh!

martijnvg Oct 3, 2019

Choose a reason for hiding this comment

Uh oh!

martijnvg Oct 3, 2019

Choose a reason for hiding this comment

Uh oh!

martijnvg Oct 3, 2019

Choose a reason for hiding this comment

Uh oh!

hub-cap commented Oct 3, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants