Qwen3-0.6 Qwen3-8B q4_0 originial output.weight Q6_k, token_embed Q4_0 ours output.weight f32, token_embed Q6_K