
Conversation

@eaplatanios
Contributor

This was broken in the 0.5.3 release when these signal calls were introduced. It results in the following error when deploying on our machines:

ValueError: signal only works in main thread of the main interpreter

The fix is borrowed from here.

It would be great if you could cut a 0.5.3.post2 release after this is merged to unblock us (and I assume others as well) from using vLLM with Llama 3.1. Thank you!
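
For reference, the change is essentially the standard guard of only registering signal handlers when running on the main thread. A rough sketch of the pattern (with an illustrative helper name, not the exact diff):

import signal
import threading

def _register_shutdown_handlers(handler):
    # signal.signal() raises ValueError outside the main thread, so only
    # register the handlers when we are actually running on it.
    if threading.current_thread() is threading.main_thread():
        signal.signal(signal.SIGINT, handler)
        signal.signal(signal.SIGTERM, handler)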

@github-actions

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which consists of a small, essential subset of CI tests to catch errors quickly. You can run other CI tests on top of the default ones by unblocking the steps in your fast-check build on the Buildkite UI.

Once the PR is approved and ready to go, please make sure to run the full CI, as it is required for merging (or just use auto-merge).

To run full CI, you can do one of these:

  • Comment /ready on the PR
  • Add the ready label to the PR
  • Enable auto-merge.

🚀

@eaplatanios eaplatanios changed the title from "[Easy] Fixed a bug in the multiprocessing GPU executor." to "[Bugfix] [Easy] Fixed a bug in the multiprocessing GPU executor." on Jul 25, 2024
@youkaichao
Member

Hi, why would this be a problem? When would this code be called from another thread?

@eaplatanios
Contributor Author

eaplatanios commented Jul 25, 2024

We are using uvicorn to start our service (with no concurrency) and initializing a vLLM engine using vllm.AsyncLLMEngine.from_engine_args(), so nothing special. I was thinking that vLLM probably starts multiple processes behind the scenes and that they all run some initialization code, including this part, but I haven't looked into the vLLM internals to know whether that's the case.
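
For context, the engine initialization on our side is roughly the following (the model name is just a placeholder):

from vllm import AsyncEngineArgs, AsyncLLMEngine

# Build the async engine from plain engine args; nothing custom on top.
engine_args = AsyncEngineArgs(model="meta-llama/Meta-Llama-3.1-8B-Instruct")
engine = AsyncLLMEngine.from_engine_args(engine_args)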

@eaplatanios
Contributor Author

I looked into this a bit more, and the main issue is that we have a FastAPI service and we initialize the vLLM engine in a separate thread so that the API can still respond with something like "Model is being loaded" while the model is loading, which can take a while for some models. It looks something like this:

import asyncio
from concurrent.futures import ThreadPoolExecutor

loop = asyncio.get_event_loop()
loop.set_default_executor(ThreadPoolExecutor())
# load_model constructs the vLLM async engine off the event loop's thread.
model_download_task = loop.run_in_executor(None, load_model)
model_download_task.add_done_callback(done_callback)

where load_model is a function that internally constructs a vLLM async engine at some point. Do you have any advice on how to work around this issue? It only popped up now because these signal handlers were added in the 0.5.3 release.

@eaplatanios
Contributor Author

@youkaichao I believe the change introduced in this PR should be sufficient to address this use case and should be innocuous for other use cases, but I'm not sure whether there are consequences I'm not aware of.

@youkaichao
Member

Got it. So you are indeed calling it from another thread. This is not the common use case.

I can accept this change, but since this is not common usage, we will not cut a release just for it. It can go into the next release.

@eaplatanios
Contributor Author

That is understandable, thank you! Do you have an estimated timeline for the next release?

@youkaichao
Member

We usually follow a bi-weekly release cadence.

@youkaichao
Member

LGTM!

@youkaichao youkaichao merged commit 084a01f into vllm-project:main Jul 26, 2024
@eaplatanios
Contributor Author

Sounds good, thank you!

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024
LeiWang1999 pushed a commit to LeiWang1999/vllm-bitblas that referenced this pull request Mar 26, 2025