@@ -58,6 +58,16 @@ You may want to split chunks manually if:
values between ``300`` and ``400``, *but* all values of your shard
keys between ``250`` and ``500`` are in a single chunk.
+ .. warning::
+
+    Be careful when splitting chunks. When you shard a collection that
+    has existing data, MongoDB automatically creates chunks to spread
+    the collection evenly. Before you perform additional splits, know
+    the resulting chunk sizes, both by number of documents and by data
+    size. Avoid splits that leave some chunks much larger than others:
+    balancing is based on the count of chunks, not on their size, so
+    uneven chunks may cause extreme load and data-distribution problems.
+
Use :method:`sh.status()` to determine the current chunk ranges across
the cluster.
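
As a rough illustration of how chunk ranges reported by ``sh.status()`` might be checked against a hot key range, the sketch below filters a hand-written list of chunk records. The ``chunks`` array, the shard names, and the ``chunksCovering`` helper are hypothetical, and the records are simplified: real chunk metadata stores ``min`` and ``max`` as shard key documents rather than bare numbers.

```javascript
// Hypothetical, simplified chunk records. Real chunk metadata stores
// min and max as shard key documents, not bare numbers.
const chunks = [
  { min: 0, max: 250, shard: "shard0000" },
  { min: 250, max: 500, shard: "shard0001" },
  { min: 500, max: 1000, shard: "shard0000" },
];

// Return the chunks whose half-open range [min, max) overlaps the
// inclusive key range [low, high].
function chunksCovering(chunkList, low, high) {
  return chunkList.filter((c) => c.min <= high && low < c.max);
}

// Keys between 300 and 400 are the busy range in this example.
const hot = chunksCovering(chunks, 300, 400);

// A single chunk holding the whole busy range is a candidate for a
// manual split.
console.log(
  hot.length === 1
    ? "single chunk covers the range"
    : "range spans several chunks"
);
```

With these example ranges, only the ``250`` to ``500`` chunk overlaps ``300`` to ``400``, which mirrors the situation described earlier in which a manual split may be worthwhile.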
@@ -101,11 +111,13 @@ chunk splits.
Create Chunks (Pre-Splitting)
-----------------------------

+ Pre-splitting lets you preemptively split chunks in an empty collection
+ and is used *only* in certain situations.
In most situations a :term:`sharded cluster` will create and distribute
chunks automatically without user intervention. However, in a limited
number of use profiles, MongoDB cannot create enough chunks or
- distribute data fast enough to support required throughput. Consider
- the following scenarios:
+ distribute data fast enough to support required throughput.
+ For example, if:

- you must partition an existing data collection that resides on a
single shard.
@@ -123,7 +135,7 @@ the following scenarios:
Preemptively splitting chunks increases cluster throughput for these
operations by reducing the overhead of migrating chunks that hold
data during the write operation. MongoDB only creates splits after an
- insert operation, and can only migrate a single chunk at a time. Chunk
+ insert operation and can migrate only a single chunk at a time. Chunk
migrations are resource-intensive and further complicated by large
write volume to the migrating chunk.
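
To make the idea of pre-splitting concrete, here is a minimal sketch that computes evenly spaced split points for a numeric shard key. The ``evenSplitPoints`` helper, the ``records.people`` namespace, and the ``userId`` field are hypothetical names chosen for illustration, and the sketch assumes keys are spread roughly uniformly over a range much wider than the number of chunks. Other key distributions would call for different split points.

```javascript
// Hypothetical helper: evenly spaced split points for a numeric shard
// key, assuming keys are roughly uniform over [min, max).
function evenSplitPoints(min, max, chunkCount) {
  if (!Number.isInteger(chunkCount) || chunkCount < 1) {
    throw new Error("chunkCount must be a positive integer");
  }
  if (!(max > min)) {
    throw new Error("max must be greater than min");
  }
  const width = (max - min) / chunkCount;
  const points = [];
  // N chunks need N - 1 interior boundaries.
  for (let i = 1; i < chunkCount; i++) {
    points.push(Math.floor(min + i * width));
  }
  return points;
}

// In the mongo shell, against an empty sharded collection, each point
// could then be passed to sh.splitAt(). The namespace and field name
// here are placeholders:
//
//   evenSplitPoints(0, 1000, 4).forEach(function (p) {
//     sh.splitAt("records.people", { userId: p });
//   });
```

Keeping the arithmetic in a plain function makes it possible to inspect the split points before issuing any split commands against the cluster.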