Conversation

@original-brownbear (Contributor) commented Apr 16, 2020


We don't really need `LinkedHashSet` here. We can assume that all the
entries are unique, just use a list, and use the list utilities to
create the cheapest possible version of that list.
This also fixes a bug in `addSnapshot` that would mutate the existing
linked hash set on the current instance (fortunately this never caused a real-world bug)
and brings the collection in line with the Javadoc on its getter, which claims immutability.
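The mutation bug and its fix can be illustrated in isolation. A minimal sketch with hypothetical names (`SnapshotHolder` is not the actual `RepositoryData` code), showing the copy-on-write pattern the change moves to:

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of the copy-on-write pattern: the instance's
// collection is never mutated in place; an update builds a new immutable
// list and returns a new holder object.
class SnapshotHolder {
    private final List<String> snapshotIds;

    SnapshotHolder(List<String> snapshotIds) {
        // List.copyOf produces a compact, truly immutable copy, so the
        // getter below can hand it out without defensive copying.
        this.snapshotIds = List.copyOf(snapshotIds);
    }

    // Returns a NEW holder instead of mutating this one; callers holding
    // a reference to the old instance never observe the change.
    SnapshotHolder addSnapshot(String snapshotId) {
        List<String> copy = new ArrayList<>(snapshotIds);
        copy.add(snapshotId);
        return new SnapshotHolder(copy);
    }

    List<String> getSnapshotIds() {
        return snapshotIds; // immutable, matching what the Javadoc promises
    }
}
```

With the old `computeIfAbsent(...).add(...)` approach, the first line of `addSnapshot` would instead have reached into the current instance's set and mutated it, leaking the update to anyone still holding the old instance.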
@elasticmachine (Collaborator) commented:

Pinging @elastic/es-distributed (:Distributed/Snapshot/Restore)

```diff
-Map<IndexId, Set<SnapshotId>> allIndexSnapshots = new HashMap<>(indexSnapshots);
+Map<IndexId, List<SnapshotId>> allIndexSnapshots = new HashMap<>(indexSnapshots);
 for (final IndexId indexId : shardGenerations.indices()) {
-    allIndexSnapshots.computeIfAbsent(indexId, k -> new LinkedHashSet<>()).add(snapshotId);
```
@original-brownbear (Contributor, Author):

This was broken: we were mutating the existing `LinkedHashSet`.

```java
List<SnapshotId> remaining;
List<SnapshotId> snapshotIds = this.indexSnapshots.get(indexId);
assert snapshotIds != null;
if (snapshotIds.contains(snapshotId)) {
```
@original-brownbear (Contributor, Author):

Not great that we're quadratic here now (a linear `contains` scan inside the loop), but I don't think it really matters much relative to the significant space and GC savings.

@original-brownbear (Contributor, Author) commented:

Found this while examining a heap dump for a cluster running into #55153.
For a few thousand indices and ~1k snapshots, the heap was full of uncollected java.util.LinkedHashMap.Entry objects (the sets/maps themselves were mostly already collected, but collecting the entries is apparently trickier for the JVM) under load on the snapshot status APIs, coming from the somewhat long-lived RepositoryData instances. The outright overhead of LinkedHashSet is also massive on its face, at least 44 bytes per element: if the cluster contains 100 live indices at a time and we have 100 snapshots, this easily comes out to roughly half a MB of overhead per instance.
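As a sanity check on that figure, a back-of-envelope calculation (a rough estimate only; the exact per-entry cost depends on the JVM, compressed oops, etc.):

```java
// Rough estimate of LinkedHashSet bookkeeping overhead: each
// java.util.LinkedHashMap.Entry carries an object header plus key, value,
// hash, and before/after/next references, on the order of ~44 bytes on
// top of the element itself.
public class LinkedHashSetOverhead {
    static long overheadBytes(int indices, int snapshotsPerIndex, int bytesPerEntry) {
        return (long) indices * snapshotsPerIndex * bytesPerEntry;
    }

    public static void main(String[] args) {
        // 100 live indices x 100 snapshots x ~44 bytes per entry
        long bytes = overheadBytes(100, 100, 44);
        System.out.println(bytes + " bytes"); // 440000 bytes, ~0.44 MB per instance
    }
}
```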

@ywelsch (Contributor) left a comment:

Change looks ok, but there is too much unnecessary list copying going on.

```java
} else {
    final List<SnapshotId> copy = new ArrayList<>(snapshotIds);
    copy.add(snapshotId);
    allIndexSnapshots.put(indexId, List.copyOf(copy));
```
@ywelsch (Contributor):

why create a copy of the copy?

@original-brownbear (Contributor, Author):

These RepositoryData instances live for quite a while, so I figured the cost of doing another copy is worth the lower storage overhead and shorter path to the GC root compared to wrapping with Collections.unmodifiableList. I could technically make this more efficient by copying to a SnapshotId[] and then just wrapping that array, but I figured this wasn't that much slower and is nicer to read.
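The trade-off being weighed here can be demonstrated directly. A small sketch (not PR code) contrasting wrapping with copying:

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Collections.unmodifiableList only wraps the backing list: no copy, but
// the mutable ArrayList stays reachable (a longer path to the GC root)
// and later mutations show through the view. List.copyOf pays for a copy
// up front and yields a compact list with no tie to the original.
public class WrapVersusCopy {
    public static void main(String[] args) {
        List<String> backing = new ArrayList<>(List.of("s1", "s2"));

        List<String> wrapped = Collections.unmodifiableList(backing); // view only
        List<String> copied = List.copyOf(backing);                   // detached copy

        backing.add("s3"); // later mutation of the backing list

        System.out.println(wrapped.size()); // 3: the view tracks the backing list
        System.out.println(copied.size());  // 2: the copy is unaffected
    }
}
```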

@ywelsch (Contributor):

I looked into how List.copyOf is implemented, and lo and behold, it copies the elements twice: it first calls Collection.toArray(), and then creates another copy of that temporary array in List.of (using a manual for loop, FFS).
This means that the list is copied three times here, plus the resize of the ArrayList when calling copy.add(snapshotId), leading to yet another full copy...
High-level languages ftw.
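The single-copy route alluded to earlier (copy once into a `SnapshotId[]`, then just wrap the array) might look like the sketch below; `append` and the use of `String` ids are hypothetical, not code from the PR:

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Hypothetical single-copy append: toArray(T[]) fills the pre-sized
// target array in one pass, and Arrays.asList plus unmodifiableList
// merely wrap it, so the elements are copied exactly once.
public class SingleCopyAppend {
    static List<String> append(List<String> source, String element) {
        String[] grown = new String[source.size() + 1];
        source.toArray(grown);              // the one and only element copy
        grown[grown.length - 1] = element;  // fill the reserved last slot
        return Collections.unmodifiableList(Arrays.asList(grown));
    }
}
```

As the thread concludes below, the extra copies of `List.copyOf` were ultimately judged an acceptable price for readability.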

@original-brownbear (Contributor, Author):

:) you win => I pushed 8319e21. Probably not worth the hassle to go further than this, then.

```diff
-set.remove(snapshotId);
+remaining = new ArrayList<>(snapshotIds);
+remaining.remove(listIndex);
+remaining = List.copyOf(remaining);
```
@ywelsch (Contributor):

same thing here, copy of copy

```diff
 }
 assert indexId != null;
-indexSnapshots.put(indexId, snapshotIds);
+indexSnapshots.put(indexId, List.copyOf(snapshotIds));
```
@ywelsch (Contributor):

copy of copy

@ywelsch (Contributor) left a comment:

LGTM

@original-brownbear original-brownbear merged commit 8fd81df into elastic:master Apr 20, 2020
@original-brownbear original-brownbear deleted the smaller-repository-data branch April 20, 2020 15:23
@original-brownbear (Contributor, Author):

Thanks Yannick!

original-brownbear added a commit to original-brownbear/elasticsearch that referenced this pull request Apr 20, 2020
original-brownbear added a commit that referenced this pull request Apr 20, 2020
@original-brownbear original-brownbear restored the smaller-repository-data branch August 6, 2020 18:35