Conversation

CISC (Collaborator)

@CISC CISC commented Mar 31, 2025

The config items lora_rank_tokenshift and lora_rank_decay were introduced in a new release, see:
https://huggingface.co/featherless-ai/Qwerky-72B/blob/main/modeling_rwkv6qwen2.py#L268-L279

Fixes #12662
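For context, a minimal sketch of how a conversion script might pick up the two new keys. The key names come from the linked modeling code; the fallback defaults below are illustrative placeholders only, not the model's actual defaults:

```python
def read_lora_ranks(hparams: dict) -> tuple[int, int]:
    """Read the new optional config keys, falling back to a
    size-based default when a key is absent (placeholder logic)."""
    hidden_size = hparams["hidden_size"]
    default = 64 if hidden_size < 4096 else 128  # illustrative value, not the real default
    rank_tokenshift = hparams.get("lora_rank_tokenshift", default)
    rank_decay = hparams.get("lora_rank_decay", default)
    return rank_tokenshift, rank_decay
```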

@github-actions github-actions bot added the python python script changes label Mar 31, 2025
@CISC CISC requested a review from MollySophia March 31, 2025 08:22
@MollySophia (Collaborator)

The change looks good to me. I haven't had the chance to download the full 72B model yet. Have you tested this?

@CISC (Collaborator, Author)

CISC commented Mar 31, 2025

@MollySophia No, purely based on diffing the modeling code.

@MollySophia (Collaborator)

> @MollySophia No, purely based on diffing the modeling code.

I see. Then let's wait for feedback from #12662 :)

@CISC (Collaborator, Author)

CISC commented Mar 31, 2025

BTW, the QwQ-32B modeling code uses lora_rank_decay incorrectly, but since it is identical to lora_rank_tokenshift in this model, it has no practical implications.

ref:
https://huggingface.co/featherless-ai/Qwerky-QwQ-32B/blob/main/modeling_rwkv6qwen2.py#L268-L279
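To illustrate why the misuse is harmless here, a hypothetical helper (not the actual modeling code): if the decay projection is mistakenly sized with the tokenshift rank, the resulting shape only differs when the two ranks differ, so the bug is masked whenever they are equal, as in Qwerky-QwQ-32B.

```python
def decay_proj_shape(hidden_size: int, rank_tokenshift: int,
                     rank_decay: int, buggy: bool = False) -> tuple[int, int]:
    # The decay low-rank projection should use lora_rank_decay;
    # the buggy variant mistakenly uses lora_rank_tokenshift instead.
    rank = rank_tokenshift if buggy else rank_decay
    return (hidden_size, rank)

# When the two ranks are equal, the bug changes nothing:
assert decay_proj_shape(5120, 64, 64, buggy=True) == decay_proj_shape(5120, 64, 64)
```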

@CISC (Collaborator, Author)

CISC commented Mar 31, 2025

@MollySophia Looks like it's working. Even though @kanttouchthis only tested the 32B, I think it's safe to assume this change works for the 72B too.

@CISC CISC merged commit 403fbac into ggml-org:master Mar 31, 2025
5 checks passed
@CISC CISC deleted the qwerky-lora-rank-decay branch March 31, 2025 14:36
Successfully merging this pull request may close these issues.

Eval bug: Qwerky QwQ 32B (rwkv6qwen2) failed to load