Support conversion of Qwen3-Embedding models #15023

iamlemec · 2025-08-01T21:27:41Z

This adds GGUF conversion support for the Qwen3-Embedding class of models and makes pooling work properly by default. I'm not sure how the official Qwen GGUFs were produced, but I'll try to get them updated if this gets merged. More details:

Though the pre-tokenizer is in fact the same qwen2 as usual, the HF tokenizer adds an EOT token that makes the checksum different from the usual text generation models
For some reason the tensor names are not prefixed with model. like most other models in this class
Actually loads the default pooling mode so it works out of the box

ggerganov

Thanks for looking into this - it would be great to get this model supported.

CISC · 2025-08-02T08:47:08Z

Ooops, was a bit trigger happy, forgot convert_hf_to_gguf_update.py...

@iamlemec care to add the model there in a new PR?

Edit: It has to go in pre_computed_hashes (since qwen2 is in models already), which means the entry in convert_hf_to_gguf.py will be moved towards the top as well.

iamlemec · 2025-08-02T09:21:37Z

@CISC Ah I see, will have that up in a moment. See #15030.

Mushoz · 2025-08-27T09:06:12Z

@iamlemec Did you ever speak to Qwen about the official GGUFs? Are they okay to use in their current form or would they have to be updated?

iamlemec · 2025-08-28T16:55:11Z

@Mushoz looks like they uploaded new GGUFs to HF! Just tested the 0.6B version and it works as expected with correct pooling.

Mushoz · 2025-08-28T20:31:52Z

@iamlemec which ones are you talking about?

I am looking at this one: https://huggingface.co/Qwen/Qwen3-Embedding-0.6B-GGUF/tree/main
But that has been updated 14th of July, way before this PR was merged.

Same holds true for the 8b version: https://huggingface.co/Qwen/Qwen3-Embedding-8B-GGUF/tree/main

iamlemec · 2025-08-28T21:21:01Z

@Mushoz Huh, you're right. Maybe I missed that update before and was still using the old ones? Anyway, the ones you linked to have the pooling type properly specified and seem to work as expected!

Support conversion of Qwen3-Embedding models

5a33e3d

github-actions bot added the python python script changes label Aug 1, 2025

ggerganov approved these changes Aug 1, 2025

View reviewed changes

CISC approved these changes Aug 2, 2025

View reviewed changes

CISC merged commit 339bd02 into ggml-org:master Aug 2, 2025
49 of 50 checks passed

Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Aug 2, 2025

model : support Qwen3-Embedding (ggml-org#15023)

9368bab

CISC mentioned this pull request Aug 3, 2025

Misc. bug: convert_hf_to_gguf.py not working on qwen3-embedding and qwen3-embedding lora tuned models #14459

Closed

Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Aug 5, 2025

model : support Qwen3-Embedding (ggml-org#15023)

5914d64

abc-nix mentioned this pull request Aug 12, 2025

llama : support qwen3 rerank and embeddings #14029

Open

kripper mentioned this pull request Aug 26, 2025

support for qwen3-embedding and qwen3-reranker models ollama/ollama#10989

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support conversion of Qwen3-Embedding models #15023

Support conversion of Qwen3-Embedding models #15023

Uh oh!

iamlemec commented Aug 1, 2025

Uh oh!

ggerganov left a comment

Uh oh!

Uh oh!

CISC commented Aug 2, 2025 •

edited

Loading

Uh oh!

iamlemec commented Aug 2, 2025 •

edited

Loading

Uh oh!

Mushoz commented Aug 27, 2025

Uh oh!

iamlemec commented Aug 28, 2025

Uh oh!

Mushoz commented Aug 28, 2025

Uh oh!

iamlemec commented Aug 28, 2025

Uh oh!

Uh oh!

Support conversion of Qwen3-Embedding models #15023

Support conversion of Qwen3-Embedding models #15023

Uh oh!

Conversation

iamlemec commented Aug 1, 2025

Uh oh!

ggerganov left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

CISC commented Aug 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

iamlemec commented Aug 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Mushoz commented Aug 27, 2025

Uh oh!

iamlemec commented Aug 28, 2025

Uh oh!

Mushoz commented Aug 28, 2025

Uh oh!

iamlemec commented Aug 28, 2025

Uh oh!

Uh oh!

CISC commented Aug 2, 2025 •

edited

Loading

iamlemec commented Aug 2, 2025 •

edited

Loading