Check model inputs - hidden states #40994

zucchini-nlp · 2025-09-19T08:44:06Z

What does this PR do?

In most vision models, output.hidden_states holds the hidden states taken right after each encoder block, i.e. before the final layernorm. For these models, therefore, output.hidden_states[-1] != output.last_hidden_state.

Currently, check_model_inputs assumes that the last hidden state is the correct one to return as the final entry of hidden_states, which is true for language models only. This PR adds a kwarg to check_model_inputs that decides whether or not to replace the last hidden state.
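To make the mismatch concrete, here is a toy, framework-free sketch of the two behaviours. It is not the transformers implementation: `toy_encoder`, `toy_block`, `layer_norm`, and the flag name `replace_last_hidden_state` are all hypothetical names chosen for illustration, and the actual kwarg added by this PR may be named and wired differently.

```python
from statistics import fmean, pvariance


def layer_norm(x, eps=1e-5):
    """Normalize a vector to zero mean and unit variance (no learned affine)."""
    mean = fmean(x)
    var = pvariance(x)
    return [(v - mean) / (var + eps) ** 0.5 for v in x]


def toy_block(x, scale):
    """Stand-in for an encoder block: a simple residual update."""
    return [v + scale * v for v in x]


def toy_encoder(x, scales, replace_last_hidden_state):
    """Run the blocks, collect per-layer states, then apply the final norm.

    `replace_last_hidden_state` is a hypothetical flag, not the PR's kwarg.
    """
    hidden_states = [x]  # embedding output
    for scale in scales:
        x = toy_block(x, scale)
        hidden_states.append(x)  # state right after each block (pre-norm)
    last_hidden_state = layer_norm(x)
    if replace_last_hidden_state:
        # LM-style: the final entry becomes the post-norm state
        hidden_states[-1] = last_hidden_state
    return tuple(hidden_states), last_hidden_state


# Vision-style behaviour: keep the pre-norm state as the final entry
states, last = toy_encoder([1.0, 2.0, 4.0], [0.5, 0.25], replace_last_hidden_state=False)
print(states[-1] == last)  # False: the last collected state is pre-norm
```

With the flag set to True, the final entry of the collected states is identical to the post-norm output, which matches the language-model convention described above. With it set to False, the pre-norm state stays available, which is what the description says some VLMs need.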

TBH, I think the way it is done in LMs is ultimately the correct version, and we probably need to "break" vision models. But I can't think of a way to obtain the pre-norm last hidden states, which are needed for some VLMs.

github-actions · 2025-09-19T08:45:18Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: aimv2, apertus, arcee, aria, audio_spectrogram_transformer, aya_vision, bitnet, blip, blip_2, cohere, cohere2, cohere2_vision, csm, deepseek_v2, deepseek_v3, deit

HuggingFaceDocBuilderDev · 2025-09-19T08:53:16Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

zucchini-nlp added 2 commits September 18, 2025 16:42

update all models

9ee3f43

fix copies

63bb723

zucchini-nlp requested a review from ArthurZucker September 19, 2025 08:45

zucchini-nlp removed the request for review from ArthurZucker September 19, 2025 10:54
