Commit 67302b0
Fix bnb for the weights refactor (#42043)
* small fix
* nits
* ish
* up
* rev
* fix more tie weights keys
* small fixes
* nit
* update
* fix and fix
* fix a test
* glubs
* current shitty changes
* ship validated ones
* more
* more update
* more
* more
* more
* mllama
* more up
* fix ernie
* fix xopies
* up more
* more fixes
* up
* up
* fix-copies
* fix more
* more updates
* AI UPDATE
* up
* hoey
* make it fast
* fix
* lol
* fix asjusting
* more fixes
* _dtype nit
* up
* nit
* update
* update
* remove semaphores
* fix import to avoid jit execution
* try to remove custom tiing logic when its stupid
* fix more individual models
* fix whisper as well
* fix?
* fox umt5
* improve tqdm bar
* cleanup a bit
* oupsi
* some updates
* improve
* remove all buffering -> much faster without it
* remove some tie_weights custome funcs when not needed
* more fixes related to strict matching regex
* remove ALL custom tie weights
* small update
* revert change to init scheme (no need for params)
* fix
* mixtral init
* try less strict source check
* tied weight first shot to the fiiiixxxxxx
* does this help?
* :)
* fix some ppolry defined tied_weights_keys for now
* fixes for more models torch_bc
* nits and fixes
* last update
* Revert "tied weight first shot to the fiiiixxxxxx"
This reverts commit 3fea865.
* here we go again
* an attempt
* up?
* nits
* Fix bnb loading !
* rm print
* subclass nn.Parameters
* up
* lol
* Ouiiii
* fix led
* fix long cat flash
* fix qwen and long cat flash
* properly fix qwen init
* just push this for now
* propnet is dumb
* update
* rm import
* update
* push
* Update src/transformers/core_model_loading.py
Co-authored-by: Matthew Douglas <[email protected]>
* remove explict sharing of some tied keys.
* update decoder.bias
* moe case
* Fix loadedparam
* rm report
* more changes to untangle old hardcoded ting
* fixup
* fix big faileurs
* Fix tests single gpu
* should fix it
* fix prophnet
* fix resize token embeddings
* nits
* fix xcodex
* asyncio?
* fix smart apply
* fix data-2-vec
* [build-ci-image]
* checkout
* uupdate
* fix hunyuan
* update error message
* fix deformable detr
* fixes
* fix init weights for non param gate up projs
* shared todo?
* guard needed for compressed-tensors
* deal with buffers
* update some models
* big revert, don't break this behaviour
* ty @SunMarc this fixes the buffers
Co-authored-by: SunMarc <[email protected]>
* mt5 fuck
* fix lxmbert
* nuke slow test fetcher
* fix
* fix zamba and deepcopy for now
* fix zamba tied weight keys! ~
* fix-copies
* update fetch terst
* fix gradient for test modeling common!
* break "shared" for now I will fix tomorrow changes are properly isoalted now :)
* does this fix marian? probably not
* fix some vlms
* D fine seems to handle this well
* glob is fine actually
* fix dab detr
* small steps
* opusy
* fix some more models?
* yups
* better erro
* fix?
* fix double escape
* escape wehere it makes sense
* ??
* fix ibert
* fix tvp as well
* more fxes
* try always download ref PR
* ONONONO
* big fixup
* more fixup
* small step
* small nits
* nits
* brut force some stuff
* fix vilt
* make sure special models that always need tie always tie
* cleaning up
* small nits
* fix zamba and bridge tower!
* just fixup
* potential culprits
* revert bark and fix bridgetower
* remove now non existant tie_weights
* ?
* lol reformer actually had nothing tied!
* wow these two fucking models were really not well made
* fix sam family!
* fix bark revision
* fix speech2test ?
* push this for now....
* upsy
* the fuck
* fix rtdetr
* update
* proper
* wow that one 's annoying
* update
* try to find the culprit
* get some help on common
* nit about general init and cls.padding_idx
* revert num workers update
* remove old loading func
* fix glob
* add annotations
* fix re
* small improvements
* clean some stuff
* improvements
* someone did not understannnnnnd what I tried to dooo or does BNB not support that either?
* gluos
* fix case when `.` is just not there
* for now let's do this
* fix
* fix small test
* style
* fix merge conflits
* style
* 8bit fixed ?
* fix
* fix 8bit dtype
* fix
* rm copy
* Apply suggestions from code review
Co-authored-by: Arthur <[email protected]>
* style
* test
* fix
* finally ?
* Apply style fixes
* fix
* fix
* Apply style fixes
* tie weights
* warning
* Apply style fixes
* init
* default
---------
Co-authored-by: Arthur <[email protected]>
Co-authored-by: Cyril Vallez <[email protected]>
Co-authored-by: Matthew Douglas <[email protected]>
Co-authored-by: Arthur <[email protected]>
Co-authored-by: SunMarc <[email protected]>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
1 parent 9f31104 commit 67302b0
File tree
13 files changed, +627 -185 lines changed
- src/transformers
- integrations
- quantizers
- tests
- quantization/bnb
- utils