webgpu : fix build on emscripten #15826

ngxson · 2025-09-05T23:22:48Z

Ref original webgpu PR: #14978

Example command:

# install emscripten: brew install emscripten

emcmake cmake -B build-wasm -DGGML_WEBGPU=ON -DLLAMA_CURL=OFF -DGGML_WEBGPU_DEBUG=ON
cmake --build build-wasm --target test-backend-ops

Green-Sky · 2025-09-06T12:45:30Z

.gitignore

 .ccache/
+
+# emscripten
+a.out.*


Why not just a.out* ?

ngxson · 2025-09-06T15:56:54Z

@ggerganov @slaren Quick question, I'm building test-backend-ops on wasm without multi-thread support for simplification. However, seems like ggml-cpu now always requires a threadpool, which makes multi-thread a requirement.

So I'm wondering, is there any ways to completely disable threadpool?

Edit: I'm referring to this code:

llama.cpp/ggml/src/ggml-cpu/ggml-cpu.c

Lines 3124 to 3130 in c4df49a

    
           if (threadpool == NULL) { 
        
               //GGML_PRINT_DEBUG("Threadpool is not specified. Will create a disposable threadpool : n_threads %d\n", n_threads); 
        
               disposable_threadpool = true; 
        
               struct ggml_threadpool_params ttp = ggml_threadpool_params_default(n_threads); 
        
               threadpool = ggml_threadpool_new_impl(&ttp, cgraph, cplan); 
        
           } else {

ggerganov · 2025-09-06T21:16:34Z

I think a threadpool is currently required - don't think there is an easy workaround. @max-krasnyansky any thoughts?

ngxson · 2025-09-08T07:25:06Z

I think a threadpool is currently required - don't think there is an easy workaround. @max-krasnyansky any thoughts?

Hmm ok that means both wllama and whisper.cpp single-thread wasm builds are currently broken. Having single-thread support would be nice, but it's not urgent though.

ggerganov · 2025-09-08T07:52:44Z

Yes, we should support to launch a single-thread compute without invoking synchronization primitives and spawning threads so that thread-less WASM works. Shouldn't be hard to implement.

Looking at the implementation, I think almost everything is inplace for that. Where does the single-thread WASM fail when you call ggml compute with n_threads == 1?

ngxson · 2025-09-08T09:18:23Z

It currently fails at this line:

llama.cpp/ggml/src/ggml-cpu/ggml-cpu.c

Line 3091 in c4df49a

GGML_ASSERT(rc == 0);

I don't have the stack trace due to some difficulty debugging in-browser, but it's very likely invoked by this line:

llama.cpp/ggml/src/ggml-cpu/ggml-cpu.c

Line 3129 in c4df49a

threadpool = ggml_threadpool_new_impl(&ttp, cgraph, cplan);

Where we try to create a threadpool of one single thread

ngxson · 2025-09-08T09:20:24Z

Also just want to note that atomic ops like atomic_store_explicit are not available in single-thread build, so this should requires some extra adaptations

ggerganov · 2025-09-08T09:30:48Z

It currently fails at this line:

llama.cpp/ggml/src/ggml-cpu/ggml-cpu.c

Line 3091 in c4df49a

GGML_ASSERT(rc == 0);

I don't have the stack trace due to some difficulty debugging in-browser, but it's very likely invoked by this line:

llama.cpp/ggml/src/ggml-cpu/ggml-cpu.c

Line 3129 in c4df49a

threadpool = ggml_threadpool_new_impl(&ttp, cgraph, cplan);

Where we try to create a threadpool of one single thread

To enter that loop, it would mean that tpp->nthreads is larger than 1, because the loop stars from 1:

    for (int j = 1; j < tpp->n_threads; j++) {
        ggml_thread_cpumask_next(tpp->cpumask, workers[j].cpumask, tpp->strict_cpu, &cpumask_iter);

        int32_t rc = ggml_thread_create(&workers[j].thrd, NULL, ggml_graph_compute_secondary_thread, &workers[j]);
        GGML_ASSERT(rc == 0);
    }

Could the calling program be using n_threads > 1 instead of n_threads == 1?

ngxson · 2025-09-08T10:13:27Z

Yeah you're right, the n_thread for CPU is not correctly set for test-backend-ops. I added it in bf9d14c and it's now working as expected

reeselevine · 2025-10-03T04:28:46Z

What's the status on this PR, and more generally, potential integration of WebGPU with wllama @ngxson?

When #16400 is merged the WebGPU backend should be able to run all the operations for a decent number of text-generation models (excepting flash attention, but my understanding is it falls back to standard multi-kernel attention for now).

There's still some work to do on optimizing the existing shaders for it to really work well, but it would be great to also start getting it ready in a demo form on the browser. Happy to help work on integration if you'd like too.

ngxson · 2025-10-07T07:55:11Z

hey @reeselevine I still have some weird issues where running this can cause the browser to hang. need to investigate this a bit more.

webgpu : fix build on emscripten

4529332

github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Sep 5, 2025

more debugging stuff

990a98a

github-actions bot added the build Compilation issues label Sep 6, 2025

test-backend-ops: force single thread on wasm

5616b9c

github-actions bot added the testing Everything test related label Sep 6, 2025

fix single-thread case for init_tensor_uniform

56d02f6

Green-Sky reviewed Sep 6, 2025

View reviewed changes

.gitignore

.ccache/

# emscripten

a.out.*

Copy link

Collaborator

Green-Sky Sep 6, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why not just a.out* ?

use jspi

1cd87e0

add pthread

8549245

github-actions bot added the script Script related label Sep 7, 2025

test: remember to set n_thread for cpu backend

bf9d14c

reeselevine mentioned this pull request Oct 7, 2025

ggml webgpu: profiling, CI updates, reworking of command submission #16452

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

webgpu : fix build on emscripten #15826

webgpu : fix build on emscripten #15826

Uh oh!

ngxson commented Sep 5, 2025 •

edited

Loading

Uh oh!

Green-Sky Sep 6, 2025

Uh oh!

ngxson commented Sep 6, 2025 •

edited

Loading

Uh oh!

ggerganov commented Sep 6, 2025

Uh oh!

ngxson commented Sep 8, 2025

Uh oh!

ggerganov commented Sep 8, 2025

Uh oh!

ngxson commented Sep 8, 2025

Uh oh!

ngxson commented Sep 8, 2025

Uh oh!

ggerganov commented Sep 8, 2025

Uh oh!

ngxson commented Sep 8, 2025

Uh oh!

reeselevine commented Oct 3, 2025

Uh oh!

ngxson commented Oct 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

webgpu : fix build on emscripten #15826

Are you sure you want to change the base?

webgpu : fix build on emscripten #15826

Uh oh!

Conversation

ngxson commented Sep 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Green-Sky Sep 6, 2025

Choose a reason for hiding this comment

Uh oh!

ngxson commented Sep 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ggerganov commented Sep 6, 2025

Uh oh!

ngxson commented Sep 8, 2025

Uh oh!

ggerganov commented Sep 8, 2025

Uh oh!

ngxson commented Sep 8, 2025

Uh oh!

ngxson commented Sep 8, 2025

Uh oh!

ggerganov commented Sep 8, 2025

Uh oh!

ngxson commented Sep 8, 2025

Uh oh!

reeselevine commented Oct 3, 2025

Uh oh!

ngxson commented Oct 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ngxson commented Sep 5, 2025 •

edited

Loading

ngxson commented Sep 6, 2025 •

edited

Loading