-
Notifications
You must be signed in to change notification settings - Fork 13.7k
Closed
Labels
bug-unconfirmedmedium severityUsed to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
Description
What happened?
Hi, i'm working on Support for Ascend NPU backend and found a curious nb.
Way to reproduce:
- Model: NousResearch/Hermes-2-Pro-Llama-3-8B-GGUF
- Build:
1cmake .. -DCMAKE_BUILD_TYPE=debug && make -j16 - Run:
./bin/main -m /home/models/hermes_gguf/Hermes-2-Pro-Llama-3-8B-F16.gguf -p "Building a website can be done in 10 simple steps:" -ngl 0 -sm none -mg 0 -t 0 - Result: In
MUL_MATop which node id is 1110, thesrc0->nbis[2,2048,256,2048]. I think this is wrong because overlaped, but the model can still generatate words.

Name and Version
version: 2775 (ef6e54c)
built with cc (Ubuntu 9.4.0-1ubuntu1~20.04.2) 9.4.0 for aarch64-linux-gnu
What operating system are you seeing the problem on?
No response
Relevant log output
No response
Metadata
Metadata
Assignees
Labels
bug-unconfirmedmedium severityUsed to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)