Commit f64d5a1
authored
Add bias support for Int8DynActInt4WeightLinear (#1845)
**Summary:** Previously, when we see a linear with bias, we simply
do not swap it to `Int8DynActInt4WeightLinear` and leave it as is.
Now we do swap it, but bias is not quantized and passed to F.linear
in full precision.
Fixes #1821
**Test Plan:**
python test/quantization/test_quant_api.py -k test_8da4w_quantizer_linear_bias1 parent 073b4f0 commit f64d5a1
File tree
4 files changed
+40
-30
lines changed- test/quantization
- torchao/quantization
- qat
4 files changed
+40
-30
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1043 | 1043 | | |
1044 | 1044 | | |
1045 | 1045 | | |
1046 | | - | |
1047 | | - | |
1048 | | - | |
1049 | | - | |
1050 | | - | |
1051 | | - | |
1052 | | - | |
1053 | | - | |
1054 | | - | |
1055 | | - | |
1056 | | - | |
1057 | | - | |
1058 | | - | |
1059 | | - | |
1060 | | - | |
1061 | | - | |
| 1046 | + | |
| 1047 | + | |
| 1048 | + | |
| 1049 | + | |
1062 | 1050 | | |
1063 | 1051 | | |
1064 | 1052 | | |
| |||
1070 | 1058 | | |
1071 | 1059 | | |
1072 | 1060 | | |
| 1061 | + | |
1073 | 1062 | | |
1074 | 1063 | | |
1075 | 1064 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
115 | 115 | | |
116 | 116 | | |
117 | 117 | | |
118 | | - | |
| 118 | + | |
119 | 119 | | |
120 | | - | |
121 | | - | |
| 120 | + | |
| 121 | + | |
122 | 122 | | |
123 | 123 | | |
124 | 124 | | |
| |||
272 | 272 | | |
273 | 273 | | |
274 | 274 | | |
| 275 | + | |
| 276 | + | |
| 277 | + | |
| 278 | + | |
| 279 | + | |
| 280 | + | |
| 281 | + | |
| 282 | + | |
| 283 | + | |
| 284 | + | |
| 285 | + | |
| 286 | + | |
| 287 | + | |
| 288 | + | |
| 289 | + | |
275 | 290 | | |
276 | 291 | | |
277 | 292 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
923 | 923 | | |
924 | 924 | | |
925 | 925 | | |
| 926 | + | |
926 | 927 | | |
927 | 928 | | |
928 | 929 | | |
| |||
956 | 957 | | |
957 | 958 | | |
958 | 959 | | |
959 | | - | |
| 960 | + | |
960 | 961 | | |
961 | 962 | | |
962 | 963 | | |
| |||
970 | 971 | | |
971 | 972 | | |
972 | 973 | | |
| 974 | + | |
973 | 975 | | |
974 | 976 | | |
975 | 977 | | |
| |||
1003 | 1005 | | |
1004 | 1006 | | |
1005 | 1007 | | |
1006 | | - | |
1007 | 1008 | | |
1008 | 1009 | | |
1009 | 1010 | | |
| |||
1034 | 1035 | | |
1035 | 1036 | | |
1036 | 1037 | | |
| 1038 | + | |
| 1039 | + | |
| 1040 | + | |
| 1041 | + | |
| 1042 | + | |
1037 | 1043 | | |
1038 | 1044 | | |
1039 | 1045 | | |
1040 | 1046 | | |
1041 | 1047 | | |
1042 | 1048 | | |
1043 | 1049 | | |
| 1050 | + | |
1044 | 1051 | | |
1045 | 1052 | | |
1046 | 1053 | | |
| |||
1062 | 1069 | | |
1063 | 1070 | | |
1064 | 1071 | | |
1065 | | - | |
1066 | | - | |
1067 | | - | |
1068 | | - | |
1069 | | - | |
| 1072 | + | |
| 1073 | + | |
1070 | 1074 | | |
1071 | 1075 | | |
1072 | 1076 | | |
1073 | 1077 | | |
1074 | 1078 | | |
1075 | 1079 | | |
1076 | | - | |
| 1080 | + | |
1077 | 1081 | | |
1078 | 1082 | | |
1079 | 1083 | | |
| |||
1084 | 1088 | | |
1085 | 1089 | | |
1086 | 1090 | | |
| 1091 | + | |
1087 | 1092 | | |
1088 | 1093 | | |
1089 | 1094 | | |
| |||
1130 | 1135 | | |
1131 | 1136 | | |
1132 | 1137 | | |
1133 | | - | |
| 1138 | + | |
1134 | 1139 | | |
1135 | 1140 | | |
1136 | 1141 | | |
| |||
1172 | 1177 | | |
1173 | 1178 | | |
1174 | 1179 | | |
1175 | | - | |
1176 | 1180 | | |
1177 | 1181 | | |
1178 | 1182 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
208 | 208 | | |
209 | 209 | | |
210 | 210 | | |
211 | | - | |
| 211 | + | |
212 | 212 | | |
213 | 213 | | |
214 | 214 | | |
| |||
237 | 237 | | |
238 | 238 | | |
239 | 239 | | |
| 240 | + | |
| 241 | + | |
240 | 242 | | |
241 | 243 | | |
242 | 244 | | |
| |||
0 commit comments