Commit e0d41e8
committed
[SPARK-37530][CORE] Spark reads many paths very slow though newAPIHadoopFile
### What changes were proposed in this pull request?
Same as #18441, we parallelize FileInputFormat.listStatus for newAPIHadoopFile
### Why are the changes needed?

Spark can be slow when accessing external storage at driver side, improve perf by parallelizing
### Does this PR introduce _any_ user-facing change?
no
### How was this patch tested?
passing GA
Closes #34792 from yaooqinn/SPARK-37530.
Authored-by: Kent Yao <[email protected]>
Signed-off-by: Kent Yao <[email protected]>1 parent ae9aeba commit e0d41e8
1 file changed
+4
-0
lines changedLines changed: 4 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
123 | 123 | | |
124 | 124 | | |
125 | 125 | | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
126 | 130 | | |
127 | 131 | | |
128 | 132 | | |
| |||
0 commit comments