Skip to content

Bug: Incorrect nb #7930

@wangshuai09

Description

@wangshuai09

What happened?

Hi, i'm working on Support for Ascend NPU backend and found a curious nb.
Way to reproduce:

  1. Model: NousResearch/Hermes-2-Pro-Llama-3-8B-GGUF
  2. Build: 1cmake .. -DCMAKE_BUILD_TYPE=debug && make -j16
  3. Run: ./bin/main -m /home/models/hermes_gguf/Hermes-2-Pro-Llama-3-8B-F16.gguf -p "Building a website can be done in 10 simple steps:" -ngl 0 -sm none -mg 0 -t 0
  4. Result: In MUL_MATop which node id is 1110, the src0->nb is [2,2048,256,2048]. I think this is wrong because overlaped, but the model can still generatate words.
    image

Name and Version

version: 2775 (ef6e54c)
built with cc (Ubuntu 9.4.0-1ubuntu1~20.04.2) 9.4.0 for aarch64-linux-gnu

What operating system are you seeing the problem on?

No response

Relevant log output

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    bug-unconfirmedmedium severityUsed to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions