Commit 6d23e36

HADOOP-19015. Increase fs.s3a.connection.maximum to 500 to minimize risk of Timeout waiting for connection from pool. (#6372)

HADOOP-19015. Increase fs.s3a.connection.maximum to 500 to minimize the risk of Timeout waiting for connection from the pool

Contributed By: Mukund Thakur

Conflicts:
    hadoop-common-project/hadoop-common/src/main/resources/core-default.xml
    hadoop-tools/hadoop-aws/src/main/java/org/apache/hadoop/fs/s3a/Constants.java
    hadoop-tools/hadoop-aws/src/site/markdown/tools/hadoop-aws/performance.md

1 parent 585f966 commit 6d23e36

3 files changed, 11 additions and 5 deletions

hadoop-common-project/hadoop-common/src/main/resources/core-default.xml

Lines changed: 1 addition & 1 deletion
@@ -1362,7 +1362,7 @@

 <property>
   <name>fs.s3a.connection.maximum</name>
-  <value>96</value>
+  <value>500</value>
   <description>Controls the maximum number of simultaneous connections to S3.
     This must be bigger than the value of fs.s3a.threads.max so as to stop
     threads being blocked waiting for new HTTPS connections.
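
The description above states a sizing rule: `fs.s3a.connection.maximum` must be larger than `fs.s3a.threads.max`. A minimal sketch of checking that rule against a set of overrides might look like the following. The class `S3APoolSizing` and its method are hypothetical and not part of hadoop-aws; a plain `Map` stands in for a Hadoop `Configuration`, and the defaults passed in `main` (10 threads, 500 connections) are the ones shown in this commit's `performance.md` table.

```java
import java.util.Map;

/** Hypothetical helper, not part of hadoop-aws: checks the sizing rule from the description. */
final class S3APoolSizing {
    static final String THREADS_MAX = "fs.s3a.threads.max";
    static final String CONNECTION_MAX = "fs.s3a.connection.maximum";

    /** Returns true when the connection limit is strictly larger than the thread limit. */
    static boolean connectionsExceedThreads(Map<String, String> conf,
                                            int defaultThreads,
                                            int defaultConnections) {
        int threads = Integer.parseInt(
            conf.getOrDefault(THREADS_MAX, Integer.toString(defaultThreads)));
        int connections = Integer.parseInt(
            conf.getOrDefault(CONNECTION_MAX, Integer.toString(defaultConnections)));
        return connections > threads;
    }

    public static void main(String[] args) {
        // No overrides: 500 connections against 10 threads satisfies the rule.
        System.out.println(connectionsExceedThreads(Map.of(), 10, 500));
        // Raising the thread limit past the connection limit violates it.
        System.out.println(connectionsExceedThreads(Map.of(THREADS_MAX, "600"), 10, 500));
    }
}
```

Running `main` prints `true` and then `false`.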

hadoop-tools/hadoop-aws/src/main/java/org/apache/hadoop/fs/s3a/Constants.java

Lines changed: 7 additions & 1 deletion
@@ -152,7 +152,13 @@ private Constants() {

   // number of simultaneous connections to s3
   public static final String MAXIMUM_CONNECTIONS = "fs.s3a.connection.maximum";
-  public static final int DEFAULT_MAXIMUM_CONNECTIONS = 96;
+
+  /**
+   * Default value for {@link #MAXIMUM_CONNECTIONS}: {@value}.
+   * Future releases are likely to increase this value.
+   * Keep in sync with the value in {@code core-default.xml}
+   */
+  public static final int DEFAULT_MAXIMUM_CONNECTIONS = 500;

   /**
    * Configuration option to configure expiration time of
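
The new Javadoc asks that the constant be kept in sync with `core-default.xml`. One way such a check could be sketched, using only the JDK's XML parser, is shown below. The class `DefaultSyncCheck` is hypothetical, the XML is an inline stand-in rather than the real file, and the constant is copied here for illustration instead of being read from `Constants.java`.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

/** Hypothetical check, not part of Hadoop: compares an XML default with a Java constant. */
final class DefaultSyncCheck {
    // Copy of the value this commit sets in Constants.java (assumption for the sketch).
    static final int DEFAULT_MAXIMUM_CONNECTIONS = 500;

    /** Returns the trimmed value of the named property, or null if it is absent. */
    static String propertyValue(String xml, String name) {
        try {
            Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder()
                .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
            NodeList props = doc.getElementsByTagName("property");
            for (int i = 0; i < props.getLength(); i++) {
                Element p = (Element) props.item(i);
                String n = p.getElementsByTagName("name").item(0).getTextContent().trim();
                if (n.equals(name)) {
                    return p.getElementsByTagName("value").item(0).getTextContent().trim();
                }
            }
            return null;
        } catch (Exception e) {
            throw new IllegalStateException("could not parse configuration XML", e);
        }
    }

    public static void main(String[] args) {
        // Inline stand-in for core-default.xml; a real test would load the file itself.
        String xml = "<configuration><property>"
            + "<name>fs.s3a.connection.maximum</name><value>500</value>"
            + "</property></configuration>";
        int fromXml = Integer.parseInt(propertyValue(xml, "fs.s3a.connection.maximum"));
        if (fromXml != DEFAULT_MAXIMUM_CONNECTIONS) {
            throw new AssertionError("XML default and Java constant disagree");
        }
        System.out.println("in sync: " + fromXml);
    }
}
```

A check of this kind would fail as soon as one of the two values is edited without the other, which is the drift the Javadoc warns about.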

hadoop-tools/hadoop-aws/src/site/markdown/tools/hadoop-aws/performance.md

Lines changed: 3 additions & 3 deletions
@@ -211,19 +211,19 @@ for parallel IO (especially uploads) by setting the properties
 | property | meaning | default |
 |----------|---------|---------|
 | `fs.s3a.threads.max`| Threads in the AWS transfer manager| 10 |
-| `fs.s3a.connection.maximum`| Maximum number of HTTP connections | 10|
+| `fs.s3a.connection.maximum`| Maximum number of HTTP connections | 500 |

 We recommend using larger values for processes which perform
 a lot of IO: `DistCp`, Spark Workers and similar.

 ```xml
 <property>
   <name>fs.s3a.threads.max</name>
-  <value>20</value>
+  <value>10</value>
 </property>
 <property>
   <name>fs.s3a.connection.maximum</name>
-  <value>20</value>
+  <value>500</value>
 </property>
 ```
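
The commit title refers to a timeout while waiting for a connection from the pool. The toy model below illustrates the general mechanism with a `java.util.concurrent.Semaphore`: once every permit is leased, the next caller waits and then gives up. It is only an assumed simplification with a hypothetical class name, not the AWS SDK's actual connection pool.

```java
import java.util.concurrent.Semaphore;
import java.util.concurrent.TimeUnit;

/** Toy model of a bounded connection pool; not the AWS SDK implementation. */
final class PoolExhaustionSketch {
    /** Tries to lease one connection, giving up after timeoutMillis. */
    static boolean lease(Semaphore pool, long timeoutMillis) {
        try {
            return pool.tryAcquire(timeoutMillis, TimeUnit.MILLISECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }

    public static void main(String[] args) {
        Semaphore pool = new Semaphore(2);   // pool with two connections
        System.out.println(lease(pool, 50)); // true: first connection leased
        System.out.println(lease(pool, 50)); // true: second connection leased
        System.out.println(lease(pool, 50)); // false: pool exhausted, caller times out
        pool.release();                      // one connection is returned
        System.out.println(lease(pool, 50)); // true: a connection is available again
    }
}
```

A larger limit makes the third case less likely when many threads hold connections at once, which is the stated motivation for raising the default.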
