[BugFix] Fix PP/async scheduling with pooling models #28899
Conversation
Signed-off-by: Nick Hill <nhill@redhat.com>
Code Review
This pull request addresses a bug related to pipeline parallelism and asynchronous scheduling for pooling models. The changes correctly prevent pooling models from going through the token sampling pipeline, which is intended only for generative models. The introduction of the is_pooling_model flag in vllm/v1/engine/core.py and the uses_sampler flag in vllm/v1/executor/ray_executor.py makes the code's intent clearer and fixes the incorrect behavior. The changes are logical, well-targeted, and appear to resolve the issue effectively. I have no further suggestions.
The two test failures here are unrelated.
Signed-off-by: Nick Hill <nhill@redhat.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
Signed-off-by: jiang1.li <jiang1.li@intel.com>
Broken by #28768. This wasn't exercised by the CI that ran on that PR.