Name and Version
master
Operating systems
Linux
GGML backends
Vulkan
Hardware
Main system (where I ran the git bisection):
ggml_vulkan: 0 = Intel(R) Iris(R) Xe Graphics (TGL GT2) (Intel open-source Mesa driver)
Also seeing the issue on this system:
ggml_vulkan: 0 = Intel(R) UHD Graphics 630 (CFL GT2) (Intel open-source Mesa driver)
Models
smollm:135m
Problem description & steps to reproduce
When I run llama-run (same with llama-server), inference either crashes or outputs garbage.
../build.vulkan-linux/bin/llama-run ~/models/smollm:135m "say nothing" --ngl 99 --verbose
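For context on the assertion in the logs below: `llama_sampler_dist_apply` selects a token by drawing from the candidates' probability distribution and asserts that a candidate was actually found. A rough, hypothetical Python sketch (not the actual llama.cpp code) of why NaN/garbage probabilities coming back from a broken GPU kernel can make that lookup fail:

```python
import math
import random

def softmax(logits):
    # standard softmax; a single NaN logit poisons every probability
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    s = sum(exps)
    return [e / s for e in exps]

def dist_sample(probs, rng=random.random):
    """Cumulative-sum sampling: return the first index whose cumulative
    probability covers the random draw, or None if nothing matches."""
    r = rng() * sum(probs)
    cum = 0.0
    for i, p in enumerate(probs):
        cum += p
        if r <= cum:
            return i  # "found"
    return None       # nothing found -> analogous to the failed assert

# healthy logits: some token index is always selected
assert dist_sample(softmax([2.0, 1.0, 0.1])) in (0, 1, 2)

# NaN logits (e.g. garbage from a faulty Vulkan shader): every comparison
# against NaN is False, so no index is ever "found"
assert dist_sample(softmax([float("nan"), 1.0, 0.1])) is None
```

This is only an illustration of the failure mode; whether the bad logits here come from the Vulkan backend producing NaNs or some other corruption is exactly what the bisection below is trying to pin down.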
First Bad Commit
Relevant log output
git reset --hard b6874
# rebuild
../build.vulkan-linux/bin/llama-run ~/models/smollm:135m "say nothing" --ngl 99 --verbose
<answers>
git reset --hard b6875
# rebuild --> compilation takes forever...
git reset --hard b6876
# rebuild
../build.vulkan-linux/bin/llama-run ~/models/smollm:135m "say nothing" --ngl 99 --verbose
llama_context: Vulkan0 compute buffer size = 98.25 MiB
llama_context: Vulkan_Host compute buffer size = 5.14 MiB
llama_context: graph nodes = 937
llama_context: graph splits = 2
llama-run: /var/home/kpouget/pod-virt/remoting/linux-work/llama_cpp/src/src/llama-sampling.cpp:662: void llama_sampler_dist_apply(llama_sampler*, llama_token_data_array*): Assertion `found' failed.
./run.linux.sh: line 1: 97210 Aborted
git reset --hard b6969
# rebuild
llama_context: Vulkan0 compute buffer size = 98.25 MiB
llama_context: Vulkan_Host compute buffer size = 5.14 MiB
llama_context: graph nodes = 937
llama_context: graph splits = 2
llama-run: /var/home/kpouget/pod-virt/remoting/linux-work/llama_cpp/src/src/llama-sampling.cpp:662: void llama_sampler_dist_apply(llama_sampler*, llama_token_data_array*): Assertion `found' failed.
./run.linux.sh: line 1: 108491 Aborted