Conversation

@SunMarc (Member) commented Nov 5, 2025

What does this PR do?

This PR fixes bnb support (8-bit + 4-bit) in the new weight-loading logic.

Testing

from transformers import AutoTokenizer, AutoModelForCausalLM, BitsAndBytesConfig

# Quantize a full-precision checkpoint on the fly
model_name = "meta-llama/Llama-3.2-3B-Instruct"
quantization_config = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_quant_type="nf4")

# Alternatively, load a prequantized checkpoint (don't pass quantization_config):
# model_name = "unsloth/Llama-3.2-3B-Instruct-bnb-4bit"

tokenizer = AutoTokenizer.from_pretrained(model_name)

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    quantization_config=quantization_config,
    device_map=0,
)

input_text = "Write me a poem about Machine Learning."
input_ids = tokenizer(input_text, return_tensors="pt").to("cuda")

outputs = model.generate(**input_ids, do_sample=False, max_new_tokens=1024)
print(tokenizer.decode(outputs[0]))
  • check why memory usage is way too high when quantizing on the fly
  • add bnb tests
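
Since the PR covers both paths, the 8-bit case can be smoke-tested with a minimal variant of the script above. This is a sketch: load_in_8bit=True is the standard BitsAndBytesConfig flag, and the model and prompt are simply reused from the test above.

from transformers import AutoTokenizer, AutoModelForCausalLM, BitsAndBytesConfig

model_name = "meta-llama/Llama-3.2-3B-Instruct"
# 8-bit on-the-fly quantization; no 4-bit-specific options needed
quantization_config = BitsAndBytesConfig(load_in_8bit=True)

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    quantization_config=quantization_config,
    device_map=0,
)

inputs = tokenizer("Write me a poem about Machine Learning.", return_tensors="pt").to("cuda")
outputs = model.generate(**inputs, do_sample=False, max_new_tokens=64)
print(tokenizer.decode(outputs[0]))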

@SunMarc (Member, Author) commented Nov 17, 2025

@bot /style

github-actions bot (Contributor) commented Nov 17, 2025

Style bot fixed some files and pushed the changes.

@ArthurZucker (Collaborator) left a comment

LGTM. Just tying twice is my nightmare, but good otherwise.

Comment on lines +487 to +489

if hf_quantizer is not None and hf_quantizer.param_needs_quantization(model, t):
    converter.quantization_operation = hf_quantizer.get_quantize_ops()
    _dtype = dtype

@ArthurZucker (Collaborator):

nice
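
For context, a minimal sketch of the shape of the two hooks this branch relies on. The method names param_needs_quantization and get_quantize_ops come from the diff above; everything inside the class body here is hypothetical, not the actual bnb quantizer implementation.

import torch.nn as nn

class SketchQuantizer:
    """Illustrative only; mirrors the shape of the two hooks used above."""

    def param_needs_quantization(self, model: nn.Module, param_name: str) -> bool:
        # Hypothetical rule: quantize weights of Linear modules, skip the rest.
        if not param_name.endswith(".weight"):
            return False
        module = model.get_submodule(param_name.rsplit(".", 1)[0])
        return isinstance(module, nn.Linear)

    def get_quantize_ops(self):
        # Hypothetical: return the operation the weight converter applies when
        # materializing the parameter (the real op packs to 4-/8-bit storage).
        def quantize(tensor):
            return tensor  # placeholder
        return quantize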

@SunMarc (Member, Author) commented Nov 18, 2025

run-slow: bnb

github-actions bot (Contributor):

This comment contains run-slow, running the specified jobs:

models: []
quantizations: ["quantization/bnb"]

@SunMarc (Member, Author) commented Nov 18, 2025

> LGTM. Just tying twice is my nightmare, but good otherwise.

I've upstreamed some code from accelerate to fix tied weights. This should make things easier, and we can better tweak device-map-related code in the future.
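
For context, a minimal sketch of what weight tying means here, using a toy module (assumption: this mirrors how e.g. lm_head is tied to the input embeddings in causal LMs). Any loading or device-placement logic has to treat the tied names as one tensor:

import torch.nn as nn

class ToyLM(nn.Module):
    def __init__(self, vocab_size=100, hidden=16):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.lm_head = nn.Linear(hidden, vocab_size, bias=False)
        # Tie: both names now point at the same Parameter object.
        self.lm_head.weight = self.embed.weight

model = ToyLM()
assert model.lm_head.weight is model.embed.weight
# A device map must keep tied parameters together, e.g. grouped as:
tied_parameters = [["embed.weight", "lm_head.weight"]]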

@SunMarc (Member, Author) commented Nov 18, 2025

@bot /style

github-actions bot (Contributor):

Style fix is beginning... View the workflow run here.

@SunMarc (Member, Author) commented Nov 18, 2025

@bot /style

github-actions bot (Contributor) commented Nov 18, 2025

Style bot fixed some files and pushed the changes.

github-actions bot (Contributor):

CI Results

Workflow Run ⚙️

✅ No failing test specific to this PR 🎉 !

@SunMarc (Member, Author) commented Nov 18, 2025

run-slow: bnb

github-actions bot (Contributor):

This comment contains run-slow, running the specified jobs:

models: []
quantizations: ["quantization/bnb"]

github-actions bot (Contributor):

CI Results

Workflow Run ⚙️

✅ No failing test specific to this PR 🎉 !

Comment on lines 592 to 594

if tied_parameters is None and len(model.all_tied_weights_keys) > 0:
    # create a list of lists of tied params
    tied_parameters = [list(t) for t in model.all_tied_weights_keys.items()]

@SunMarc (Member, Author):

changed this
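
To illustrate the structure being built in the snippet above, here is a hypothetical all_tied_weights_keys mapping each tied target name to its source (the exact schema in transformers may differ):

# Hypothetical mapping of tied-weight keys (target -> source).
all_tied_weights_keys = {"lm_head.weight": "model.embed_tokens.weight"}

# Each dict item becomes one group of names that must stay together.
tied_parameters = [list(t) for t in all_tied_weights_keys.items()]
print(tied_parameters)  # [['lm_head.weight', 'model.embed_tokens.weight']]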

Comment on lines +615 to +624

def infer_auto_device_map(
    model: nn.Module,
    max_memory: Optional[dict[Union[int, str], Union[int, str]]] = None,
    no_split_module_classes: Optional[list[str]] = None,
    verbose: bool = False,
    clean_result: bool = True,
    offload_buffers: bool = False,
    tied_parameters: Optional[list[list[str]]] = None,
    hf_quantizer: "HfQuantizer | None" = None,
):

@SunMarc (Member, Author):

Removed dtype and special_dtypes; compute_module_sizes now relies on hf_quantizer instead.
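
A minimal sketch of the idea, not the actual accelerate/transformers implementation: when sizing modules for the device map, ask the quantizer for the storage footprint of parameters it will quantize, instead of threading explicit dtype overrides through. The quantized_nbytes hook below is hypothetical.

from collections import defaultdict
import torch.nn as nn

def compute_module_sizes_sketch(model: nn.Module, hf_quantizer=None):
    """Byte size per module, letting a quantizer override parameter sizes."""
    sizes = defaultdict(int)
    for name, param in model.named_parameters():
        nbytes = param.numel() * param.element_size()
        if hf_quantizer is not None and hf_quantizer.param_needs_quantization(model, name):
            # Hypothetical hook: the quantizer reports post-quantization
            # storage (e.g. ~0.5 byte per element for nf4).
            nbytes = hf_quantizer.quantized_nbytes(param)
        # Attribute the parameter's size to every parent module prefix
        # ("" is the whole model).
        parts = name.split(".")
        for i in range(len(parts)):
            sizes[".".join(parts[:i])] += nbytes
    return dict(sizes)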

github-actions bot (Contributor):

[For maintainers] Suggested jobs to run (before merge)

run-slow: bnb, finegrained_fp8

@ArthurZucker merged commit 67302b0 into main on Nov 18, 2025 (22 of 24 checks passed).
@ArthurZucker deleted the fix-bnb branch on Nov 18, 2025 at 17:28.