|
2 | 2 | aggregate without grouping |
3 | 3 | ================================================================================================ |
4 | 4 |
|
5 | | -Java HotSpot(TM) 64-Bit Server VM 1.8.0_151-b12 on Mac OS X 10.12.6 |
6 | | -Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz |
7 | | - |
| 5 | +OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64 |
| 6 | +Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz |
8 | 7 | agg w/o group: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative |
9 | 8 | ------------------------------------------------------------------------------------------------ |
10 | | -agg w/o group wholestage off 40943 / 47670 51.2 19.5 1.0X |
11 | | -agg w/o group wholestage on 979 / 1452 2142.8 0.5 41.8X |
| 9 | +agg w/o group wholestage off 65374 / 70665 32.1 31.2 1.0X |
| 10 | +agg w/o group wholestage on 1178 / 1209 1779.8 0.6 55.5X |
12 | 11 |
|
13 | 12 |
|
14 | 13 | ================================================================================================ |
15 | 14 | stat functions |
16 | 15 | ================================================================================================ |
17 | 16 |
|
18 | | -Java HotSpot(TM) 64-Bit Server VM 1.8.0_151-b12 on Mac OS X 10.12.6 |
19 | | -Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz |
20 | | - |
| 17 | +OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64 |
| 18 | +Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz |
21 | 19 | stddev: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative |
22 | 20 | ------------------------------------------------------------------------------------------------ |
23 | | -stddev wholestage off 6752 / 6756 15.5 64.4 1.0X |
24 | | -stddev wholestage on 921 / 950 113.9 8.8 7.3X |
25 | | - |
26 | | -Java HotSpot(TM) 64-Bit Server VM 1.8.0_151-b12 on Mac OS X 10.12.6 |
27 | | -Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz |
| 21 | +stddev wholestage off 8667 / 8851 12.1 82.7 1.0X |
| 22 | +stddev wholestage on 1266 / 1273 82.8 12.1 6.8X |
28 | 23 |
|
| 24 | +OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64 |
| 25 | +Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz |
29 | 26 | kurtosis: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative |
30 | 27 | ------------------------------------------------------------------------------------------------ |
31 | | -kurtosis wholestage off 31176 / 34086 3.4 297.3 1.0X |
32 | | -kurtosis wholestage on 1036 / 1076 101.2 9.9 30.1X |
| 28 | +kurtosis wholestage off 41218 / 41231 2.5 393.1 1.0X |
| 29 | +kurtosis wholestage on 1347 / 1357 77.8 12.8 30.6X |
33 | 30 |
|
34 | 31 |
|
35 | 32 | ================================================================================================ |
36 | 33 | aggregate with linear keys |
37 | 34 | ================================================================================================ |
38 | 35 |
|
39 | | -Java HotSpot(TM) 64-Bit Server VM 1.8.0_151-b12 on Mac OS X 10.12.6 |
40 | | -Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz |
41 | | - |
| 36 | +OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64 |
| 37 | +Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz |
42 | 38 | Aggregate w keys: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative |
43 | 39 | ------------------------------------------------------------------------------------------------ |
44 | | -codegen = F 10796 / 11265 7.8 128.7 1.0X |
45 | | -codegen = T hashmap = F 7349 / 8640 11.4 87.6 1.5X |
46 | | -codegen = T hashmap = T 944 / 995 88.8 11.3 11.4X |
| 40 | +codegen = F 9309 / 9389 9.0 111.0 1.0X |
| 41 | +codegen = T hashmap = F 4417 / 4435 19.0 52.7 2.1X |
| 42 | +codegen = T hashmap = T 1289 / 1298 65.1 15.4 7.2X |
47 | 43 |
|
48 | 44 |
|
49 | 45 | ================================================================================================ |
50 | 46 | aggregate with randomized keys |
51 | 47 | ================================================================================================ |
52 | 48 |
|
53 | | -Java HotSpot(TM) 64-Bit Server VM 1.8.0_151-b12 on Mac OS X 10.12.6 |
54 | | -Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz |
55 | | - |
| 49 | +OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64 |
| 50 | +Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz |
56 | 51 | Aggregate w keys: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative |
57 | 52 | ------------------------------------------------------------------------------------------------ |
58 | | -codegen = F 13825 / 14425 6.1 164.8 1.0X |
59 | | -codegen = T hashmap = F 5962 / 6289 14.1 71.1 2.3X |
60 | | -codegen = T hashmap = T 1856 / 2074 45.2 22.1 7.4X |
| 53 | +codegen = F 11424 / 11426 7.3 136.2 1.0X |
| 54 | +codegen = T hashmap = F 6441 / 6496 13.0 76.8 1.8X |
| 55 | +codegen = T hashmap = T 2333 / 2344 36.0 27.8 4.9X |
61 | 56 |
|
62 | 57 |
|
63 | 58 | ================================================================================================ |
64 | 59 | aggregate with string key |
65 | 60 | ================================================================================================ |
66 | 61 |
|
67 | | -Java HotSpot(TM) 64-Bit Server VM 1.8.0_151-b12 on Mac OS X 10.12.6 |
68 | | -Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz |
69 | | - |
| 62 | +OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64 |
| 63 | +Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz |
70 | 64 | Aggregate w string key: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative |
71 | 65 | ------------------------------------------------------------------------------------------------ |
72 | | -codegen = F 4094 / 4161 5.1 195.2 1.0X |
73 | | -codegen = T hashmap = F 2844 / 3192 7.4 135.6 1.4X |
74 | | -codegen = T hashmap = T 1917 / 2011 10.9 91.4 2.1X |
| 66 | +codegen = F 4751 / 4890 4.4 226.5 1.0X |
| 67 | +codegen = T hashmap = F 3146 / 3182 6.7 150.0 1.5X |
| 68 | +codegen = T hashmap = T 2211 / 2261 9.5 105.4 2.1X |
75 | 69 |
|
76 | 70 |
|
77 | 71 | ================================================================================================ |
78 | 72 | aggregate with decimal key |
79 | 73 | ================================================================================================ |
80 | 74 |
|
81 | | -Java HotSpot(TM) 64-Bit Server VM 1.8.0_151-b12 on Mac OS X 10.12.6 |
82 | | -Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz |
83 | | - |
| 75 | +OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64 |
| 76 | +Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz |
84 | 77 | Aggregate w decimal key: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative |
85 | 78 | ------------------------------------------------------------------------------------------------ |
86 | | -codegen = F 3831 / 3910 5.5 182.7 1.0X |
87 | | -codegen = T hashmap = F 2097 / 2338 10.0 100.0 1.8X |
88 | | -codegen = T hashmap = T 427 / 478 49.1 20.4 9.0X |
| 79 | +codegen = F 3029 / 3062 6.9 144.4 1.0X |
| 80 | +codegen = T hashmap = F 1534 / 1569 13.7 73.2 2.0X |
| 81 | +codegen = T hashmap = T 575 / 578 36.5 27.4 5.3X |
89 | 82 |
|
90 | 83 |
|
91 | 84 | ================================================================================================ |
92 | 85 | aggregate with multiple key types |
93 | 86 | ================================================================================================ |
94 | 87 |
|
95 | | -Java HotSpot(TM) 64-Bit Server VM 1.8.0_151-b12 on Mac OS X 10.12.6 |
96 | | -Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz |
97 | | - |
| 88 | +OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64 |
| 89 | +Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz |
98 | 90 | Aggregate w multiple keys: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative |
99 | 91 | ------------------------------------------------------------------------------------------------ |
100 | | -codegen = F 5916 / 6192 3.5 282.1 1.0X |
101 | | -codegen = T hashmap = F 3981 / 4011 5.3 189.8 1.5X |
102 | | -codegen = T hashmap = T 3288 / 3337 6.4 156.8 1.8X |
| 92 | +codegen = F 7506 / 7521 2.8 357.9 1.0X |
| 93 | +codegen = T hashmap = F 4791 / 4808 4.4 228.5 1.6X |
| 94 | +codegen = T hashmap = T 3553 / 3585 5.9 169.4 2.1X |
103 | 95 |
|
104 | 96 |
|
105 | 97 | ================================================================================================ |
106 | 98 | max function bytecode size of wholestagecodegen |
107 | 99 | ================================================================================================ |
108 | 100 |
|
109 | | -Java HotSpot(TM) 64-Bit Server VM 1.8.0_151-b12 on Mac OS X 10.12.6 |
110 | | -Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz |
111 | | - |
| 101 | +OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64 |
| 102 | +Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz |
112 | 103 | max function bytecode size: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative |
113 | 104 | ------------------------------------------------------------------------------------------------ |
114 | | -codegen = F 475 / 578 1.4 725.1 1.0X |
115 | | -codegen = T hugeMethodLimit = 10000 276 / 331 2.4 420.9 1.7X |
116 | | -codegen = T hugeMethodLimit = 1500 449 / 510 1.5 685.4 1.1X |
| 105 | +codegen = F 608 / 656 1.1 927.1 1.0X |
| 106 | +codegen = T hugeMethodLimit = 10000 402 / 419 1.6 613.5 1.5X |
| 107 | +codegen = T hugeMethodLimit = 1500 616 / 619 1.1 939.9 1.0X |
117 | 108 |
|
118 | 109 |
|
119 | 110 | ================================================================================================ |
120 | 111 | cube |
121 | 112 | ================================================================================================ |
122 | 113 |
|
123 | | -Java HotSpot(TM) 64-Bit Server VM 1.8.0_151-b12 on Mac OS X 10.12.6 |
124 | | -Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz |
125 | | - |
| 114 | +OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64 |
| 115 | +Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz |
126 | 116 | cube: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative |
127 | 117 | ------------------------------------------------------------------------------------------------ |
128 | | -cube wholestage off 2536 / 2537 2.1 483.8 1.0X |
129 | | -cube wholestage on 1055 / 1237 5.0 201.1 2.4X |
| 118 | +cube wholestage off 3229 / 3237 1.6 615.9 1.0X |
| 119 | +cube wholestage on 1285 / 1306 4.1 245.2 2.5X |
130 | 120 |
|
131 | 121 |
|
132 | 122 | ================================================================================================ |
133 | 123 | hash and BytesToBytesMap |
134 | 124 | ================================================================================================ |
135 | 125 |
|
136 | | -Java HotSpot(TM) 64-Bit Server VM 1.8.0_151-b12 on Mac OS X 10.12.6 |
137 | | -Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz |
138 | | - |
| 126 | +OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64 |
| 127 | +Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz |
139 | 128 | BytesToBytesMap: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative |
140 | 129 | ------------------------------------------------------------------------------------------------ |
141 | | -UnsafeRowhash 231 / 242 90.8 11.0 1.0X |
142 | | -murmur3 hash 116 / 122 180.3 5.5 2.0X |
143 | | -fast hash 61 / 65 346.5 2.9 3.8X |
144 | | -arrayEqual 147 / 151 142.9 7.0 1.6X |
145 | | -Java HashMap (Long) 101 / 136 206.8 4.8 2.3X |
146 | | -Java HashMap (two ints) 102 / 123 205.1 4.9 2.3X |
147 | | -Java HashMap (UnsafeRow) 968 / 980 21.7 46.1 0.2X |
148 | | -LongToUnsafeRowMap (opt=false) 480 / 503 43.7 22.9 0.5X |
149 | | -LongToUnsafeRowMap (opt=true) 89 / 98 234.7 4.3 2.6X |
150 | | -BytesToBytesMap (off Heap) 1125 / 1151 18.6 53.6 0.2X |
151 | | -BytesToBytesMap (on Heap) 1168 / 1192 18.0 55.7 0.2X |
152 | | -Aggregate HashMap 41 / 57 516.4 1.9 5.7X |
| 130 | +UnsafeRowhash 328 / 330 64.0 15.6 1.0X |
| 131 | +murmur3 hash 167 / 167 125.4 8.0 2.0X |
| 132 | +fast hash 84 / 85 249.0 4.0 3.9X |
| 133 | +arrayEqual 192 / 192 109.3 9.1 1.7X |
| 134 | +Java HashMap (Long) 144 / 147 145.9 6.9 2.3X |
| 135 | +Java HashMap (two ints) 147 / 153 142.3 7.0 2.2X |
| 136 | +Java HashMap (UnsafeRow) 785 / 788 26.7 37.4 0.4X |
| 137 | +LongToUnsafeRowMap (opt=false) 456 / 457 46.0 21.8 0.7X |
| 138 | +LongToUnsafeRowMap (opt=true) 125 / 125 168.3 5.9 2.6X |
| 139 | +BytesToBytesMap (off Heap) 885 / 885 23.7 42.2 0.4X |
| 140 | +BytesToBytesMap (on Heap) 860 / 864 24.4 41.0 0.4X |
| 141 | +Aggregate HashMap 56 / 56 373.9 2.7 5.8X |
153 | 142 |
|
154 | 143 |
|
0 commit comments