Commit ee7136c
llama: add support for QRWKV6 model architecture (#11001)
llama: add support for QRWKV6 model architecture (#11001)
* WIP: Add support for RWKV6Qwen2
Signed-off-by: Molly Sophia <[email protected]>
* RWKV: Some graph simplification
Signed-off-by: Molly Sophia <[email protected]>
* Add support for RWKV6Qwen2 with cpu and cuda GLA
Signed-off-by: Molly Sophia <[email protected]>
* RWKV6[QWEN2]: Concat lerp weights together to reduce cpu overhead
Signed-off-by: Molly Sophia <[email protected]>
* Fix some typos
Signed-off-by: Molly Sophia <[email protected]>
* code format changes
Signed-off-by: Molly Sophia <[email protected]>
* Fix wkv test & add gla test
Signed-off-by: Molly Sophia <[email protected]>
* Fix cuda warning
Signed-off-by: Molly Sophia <[email protected]>
* Update README.md
Signed-off-by: Molly Sophia <[email protected]>
* Update ggml/src/ggml-cuda/gla.cu
Co-authored-by: Georgi Gerganov <[email protected]>
* Fix fused lerp weights loading with RWKV6
Signed-off-by: Molly Sophia <[email protected]>
* better sanity check skipping for QRWKV6 in llama-quant
thanks @compilade
Signed-off-by: Molly Sophia <[email protected]>
Co-authored-by: compilade <[email protected]>
---------
Signed-off-by: Molly Sophia <[email protected]>
Co-authored-by: Georgi Gerganov <[email protected]>
Co-authored-by: compilade <[email protected]>1 parent c6860cc commit ee7136c
File tree
23 files changed
+863
-125
lines changed- ggml
- include
- src
- ggml-cpu
- ggml-cuda
- ggml-sycl
- ggml-vulkan
- gguf-py/gguf
- src
- tests
23 files changed
+863
-125
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
99 | 99 | | |
100 | 100 | | |
101 | 101 | | |
| 102 | + | |
102 | 103 | | |
103 | 104 | | |
104 | 105 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
326 | 326 | | |
327 | 327 | | |
328 | 328 | | |
| 329 | + | |
329 | 330 | | |
330 | 331 | | |
331 | 332 | | |
| |||
3316 | 3317 | | |
3317 | 3318 | | |
3318 | 3319 | | |
| 3320 | + | |
| 3321 | + | |
3319 | 3322 | | |
3320 | 3323 | | |
3321 | 3324 | | |
| |||
3331 | 3334 | | |
3332 | 3335 | | |
3333 | 3336 | | |
3334 | | - | |
3335 | | - | |
3336 | | - | |
3337 | | - | |
| 3337 | + | |
| 3338 | + | |
| 3339 | + | |
| 3340 | + | |
| 3341 | + | |
| 3342 | + | |
| 3343 | + | |
| 3344 | + | |
| 3345 | + | |
| 3346 | + | |
| 3347 | + | |
| 3348 | + | |
| 3349 | + | |
| 3350 | + | |
| 3351 | + | |
| 3352 | + | |
| 3353 | + | |
| 3354 | + | |
| 3355 | + | |
| 3356 | + | |
3338 | 3357 | | |
3339 | 3358 | | |
3340 | 3359 | | |
3341 | 3360 | | |
| 3361 | + | |
| 3362 | + | |
| 3363 | + | |
| 3364 | + | |
| 3365 | + | |
| 3366 | + | |
| 3367 | + | |
| 3368 | + | |
| 3369 | + | |
| 3370 | + | |
| 3371 | + | |
| 3372 | + | |
| 3373 | + | |
| 3374 | + | |
| 3375 | + | |
| 3376 | + | |
| 3377 | + | |
| 3378 | + | |
| 3379 | + | |
| 3380 | + | |
| 3381 | + | |
| 3382 | + | |
| 3383 | + | |
| 3384 | + | |
| 3385 | + | |
| 3386 | + | |
| 3387 | + | |
| 3388 | + | |
| 3389 | + | |
| 3390 | + | |
| 3391 | + | |
| 3392 | + | |
| 3393 | + | |
| 3394 | + | |
| 3395 | + | |
| 3396 | + | |
| 3397 | + | |
| 3398 | + | |
| 3399 | + | |
| 3400 | + | |
| 3401 | + | |
| 3402 | + | |
| 3403 | + | |
| 3404 | + | |
| 3405 | + | |
| 3406 | + | |
| 3407 | + | |
| 3408 | + | |
| 3409 | + | |
| 3410 | + | |
| 3411 | + | |
| 3412 | + | |
| 3413 | + | |
| 3414 | + | |
3342 | 3415 | | |
3343 | 3416 | | |
3344 | 3417 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
501 | 501 | | |
502 | 502 | | |
503 | 503 | | |
| 504 | + | |
504 | 505 | | |
505 | 506 | | |
506 | 507 | | |
| |||
1859 | 1860 | | |
1860 | 1861 | | |
1861 | 1862 | | |
| 1863 | + | |
| 1864 | + | |
| 1865 | + | |
| 1866 | + | |
| 1867 | + | |
| 1868 | + | |
| 1869 | + | |
| 1870 | + | |
| 1871 | + | |
1862 | 1872 | | |
1863 | 1873 | | |
1864 | 1874 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
11803 | 11803 | | |
11804 | 11804 | | |
11805 | 11805 | | |
11806 | | - | |
| 11806 | + | |
11807 | 11807 | | |
11808 | | - | |
| 11808 | + | |
11809 | 11809 | | |
11810 | 11810 | | |
11811 | 11811 | | |
| |||
12000 | 12000 | | |
12001 | 12001 | | |
12002 | 12002 | | |
| 12003 | + | |
| 12004 | + | |
| 12005 | + | |
| 12006 | + | |
| 12007 | + | |
| 12008 | + | |
| 12009 | + | |
| 12010 | + | |
| 12011 | + | |
| 12012 | + | |
| 12013 | + | |
| 12014 | + | |
| 12015 | + | |
| 12016 | + | |
| 12017 | + | |
| 12018 | + | |
| 12019 | + | |
| 12020 | + | |
| 12021 | + | |
| 12022 | + | |
| 12023 | + | |
| 12024 | + | |
| 12025 | + | |
| 12026 | + | |
| 12027 | + | |
| 12028 | + | |
| 12029 | + | |
| 12030 | + | |
| 12031 | + | |
| 12032 | + | |
| 12033 | + | |
| 12034 | + | |
| 12035 | + | |
| 12036 | + | |
| 12037 | + | |
| 12038 | + | |
| 12039 | + | |
| 12040 | + | |
| 12041 | + | |
| 12042 | + | |
| 12043 | + | |
| 12044 | + | |
| 12045 | + | |
| 12046 | + | |
| 12047 | + | |
| 12048 | + | |
| 12049 | + | |
| 12050 | + | |
| 12051 | + | |
| 12052 | + | |
| 12053 | + | |
| 12054 | + | |
| 12055 | + | |
| 12056 | + | |
| 12057 | + | |
| 12058 | + | |
| 12059 | + | |
| 12060 | + | |
| 12061 | + | |
| 12062 | + | |
| 12063 | + | |
| 12064 | + | |
| 12065 | + | |
| 12066 | + | |
| 12067 | + | |
| 12068 | + | |
| 12069 | + | |
| 12070 | + | |
| 12071 | + | |
| 12072 | + | |
| 12073 | + | |
| 12074 | + | |
| 12075 | + | |
| 12076 | + | |
| 12077 | + | |
| 12078 | + | |
| 12079 | + | |
| 12080 | + | |
| 12081 | + | |
| 12082 | + | |
| 12083 | + | |
| 12084 | + | |
| 12085 | + | |
| 12086 | + | |
| 12087 | + | |
| 12088 | + | |
| 12089 | + | |
| 12090 | + | |
| 12091 | + | |
| 12092 | + | |
| 12093 | + | |
| 12094 | + | |
| 12095 | + | |
| 12096 | + | |
| 12097 | + | |
| 12098 | + | |
| 12099 | + | |
| 12100 | + | |
| 12101 | + | |
| 12102 | + | |
| 12103 | + | |
| 12104 | + | |
| 12105 | + | |
| 12106 | + | |
| 12107 | + | |
| 12108 | + | |
| 12109 | + | |
| 12110 | + | |
| 12111 | + | |
| 12112 | + | |
| 12113 | + | |
| 12114 | + | |
| 12115 | + | |
| 12116 | + | |
| 12117 | + | |
| 12118 | + | |
| 12119 | + | |
| 12120 | + | |
| 12121 | + | |
| 12122 | + | |
| 12123 | + | |
| 12124 | + | |
| 12125 | + | |
| 12126 | + | |
| 12127 | + | |
| 12128 | + | |
| 12129 | + | |
| 12130 | + | |
| 12131 | + | |
| 12132 | + | |
| 12133 | + | |
| 12134 | + | |
| 12135 | + | |
| 12136 | + | |
| 12137 | + | |
| 12138 | + | |
| 12139 | + | |
| 12140 | + | |
| 12141 | + | |
| 12142 | + | |
| 12143 | + | |
| 12144 | + | |
| 12145 | + | |
| 12146 | + | |
| 12147 | + | |
| 12148 | + | |
| 12149 | + | |
| 12150 | + | |
| 12151 | + | |
| 12152 | + | |
| 12153 | + | |
| 12154 | + | |
| 12155 | + | |
| 12156 | + | |
| 12157 | + | |
| 12158 | + | |
| 12159 | + | |
| 12160 | + | |
| 12161 | + | |
| 12162 | + | |
| 12163 | + | |
| 12164 | + | |
| 12165 | + | |
| 12166 | + | |
| 12167 | + | |
| 12168 | + | |
| 12169 | + | |
| 12170 | + | |
| 12171 | + | |
| 12172 | + | |
| 12173 | + | |
| 12174 | + | |
| 12175 | + | |
| 12176 | + | |
| 12177 | + | |
| 12178 | + | |
| 12179 | + | |
| 12180 | + | |
| 12181 | + | |
| 12182 | + | |
| 12183 | + | |
| 12184 | + | |
| 12185 | + | |
| 12186 | + | |
| 12187 | + | |
| 12188 | + | |
| 12189 | + | |
| 12190 | + | |
| 12191 | + | |
| 12192 | + | |
| 12193 | + | |
12003 | 12194 | | |
12004 | 12195 | | |
12005 | 12196 | | |
| |||
12749 | 12940 | | |
12750 | 12941 | | |
12751 | 12942 | | |
| 12943 | + | |
| 12944 | + | |
| 12945 | + | |
| 12946 | + | |
12752 | 12947 | | |
12753 | 12948 | | |
12754 | 12949 | | |
| |||
13047 | 13242 | | |
13048 | 13243 | | |
13049 | 13244 | | |
| 13245 | + | |
13050 | 13246 | | |
13051 | 13247 | | |
13052 | 13248 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
37 | 37 | | |
38 | 38 | | |
39 | 39 | | |
| 40 | + | |
40 | 41 | | |
41 | 42 | | |
42 | 43 | | |
| |||
2167 | 2168 | | |
2168 | 2169 | | |
2169 | 2170 | | |
| 2171 | + | |
| 2172 | + | |
| 2173 | + | |
2170 | 2174 | | |
2171 | 2175 | | |
2172 | 2176 | | |
| |||
3011 | 3015 | | |
3012 | 3016 | | |
3013 | 3017 | | |
| 3018 | + | |
3014 | 3019 | | |
3015 | 3020 | | |
3016 | 3021 | | |
| |||
0 commit comments