Commit 9ff26dd
committed
Add support for float8 activation for Int4GroupwisePreshuffleTensor
Summary:
Added basic op support like linear and bmm, we have both float8 and bf16 in the same Tensor
because it's the same dtype, only difference is whether the activation is quantized or not. Although
there is some differneces in implementation:
bf16 activaton:
* group_scale
* group_zero
fp8 activation
* group_scale
* row_scale
Test Plan:
python test/dtypes/test_float8_activation_int4_groupwise_preshuffle.py
Reviewers:
Subscribers:
Tasks:
Tags:
stack-info: PR: #2437, branch: jerryzh168/stack/41 parent ad1efd7 commit 9ff26dd
File tree
6 files changed
+289
-77
lines changed- test
- integration
- quantization/quantize_
- torchao/quantization
- quantize_
- int4
6 files changed
+289
-77
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
Lines changed: 64 additions & 24 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
4 | 4 | | |
5 | 5 | | |
6 | 6 | | |
| 7 | + | |
7 | 8 | | |
8 | 9 | | |
9 | 10 | | |
10 | 11 | | |
11 | 12 | | |
| 13 | + | |
| 14 | + | |
12 | 15 | | |
13 | 16 | | |
14 | 17 | | |
| |||
22 | 25 | | |
23 | 26 | | |
24 | 27 | | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
25 | 61 | | |
26 | 62 | | |
27 | 63 | | |
| |||
32 | 68 | | |
33 | 69 | | |
34 | 70 | | |
35 | | - | |
36 | | - | |
37 | | - | |
38 | | - | |
39 | | - | |
40 | | - | |
41 | | - | |
42 | | - | |
43 | | - | |
44 | | - | |
45 | | - | |
46 | | - | |
47 | | - | |
48 | | - | |
49 | 71 | | |
50 | 72 | | |
51 | | - | |
| 73 | + | |
| 74 | + | |
52 | 75 | | |
53 | 76 | | |
54 | 77 | | |
55 | 78 | | |
56 | 79 | | |
57 | | - | |
| 80 | + | |
58 | 81 | | |
59 | 82 | | |
60 | 83 | | |
61 | | - | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
62 | 88 | | |
63 | 89 | | |
64 | 90 | | |
| |||
74 | 100 | | |
75 | 101 | | |
76 | 102 | | |
77 | | - | |
| 103 | + | |
78 | 104 | | |
79 | 105 | | |
80 | 106 | | |
81 | | - | |
| 107 | + | |
| 108 | + | |
82 | 109 | | |
83 | 110 | | |
84 | | - | |
| 111 | + | |
85 | 112 | | |
86 | 113 | | |
87 | 114 | | |
88 | | - | |
| 115 | + | |
89 | 116 | | |
90 | 117 | | |
91 | 118 | | |
92 | | - | |
| 119 | + | |
93 | 120 | | |
94 | 121 | | |
95 | | - | |
| 122 | + | |
| 123 | + | |
96 | 124 | | |
97 | | - | |
| 125 | + | |
98 | 126 | | |
99 | 127 | | |
100 | 128 | | |
101 | 129 | | |
102 | 130 | | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
| 139 | + | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
103 | 143 | | |
104 | 144 | | |
105 | 145 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2040 | 2040 | | |
2041 | 2041 | | |
2042 | 2042 | | |
| 2043 | + | |
| 2044 | + | |
2043 | 2045 | | |
2044 | 2046 | | |
2045 | 2047 | | |
| |||
2067 | 2069 | | |
2068 | 2070 | | |
2069 | 2071 | | |
2070 | | - | |
| 2072 | + | |
| 2073 | + | |
| 2074 | + | |
2071 | 2075 | | |
2072 | 2076 | | |
2073 | 2077 | | |
| |||
2077 | 2081 | | |
2078 | 2082 | | |
2079 | 2083 | | |
| 2084 | + | |
| 2085 | + | |
| 2086 | + | |
| 2087 | + | |
| 2088 | + | |
| 2089 | + | |
| 2090 | + | |
| 2091 | + | |
| 2092 | + | |
| 2093 | + | |
| 2094 | + | |
| 2095 | + | |
| 2096 | + | |
| 2097 | + | |
2080 | 2098 | | |
2081 | 2099 | | |
2082 | 2100 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | | - | |
| 1 | + | |
2 | 2 | | |
3 | 3 | | |
4 | 4 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
0 commit comments