[RL] [V1] Remove unused device argument from reset_kv_cache #28766

zhuohan123 · 2025-11-15T00:57:00Z

Purpose

The device argument for reset_prefix_cache is no longer being used in V1.

Test Plan

Existing tests should pass.

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: Zhuohan Li <zhuohan123@gmail.com>

gemini-code-assist

Code Review

This pull request aims to remove the unused device argument from the reset_prefix_cache method across various components. The changes in the modified files are consistent with this goal. However, there is a critical issue: the base class EngineClient in vllm/engine/protocol.py has not been updated. This will cause a signature mismatch for the reset_prefix_cache method in the AsyncLLM class, which inherits from EngineClient. This file needs to be included in the pull request with the necessary changes to ensure the code compiles and runs correctly.

vllm/v1/engine/async_llm.py

Signed-off-by: Zhuohan Li <zhuohan123@gmail.com>

zhuohan123 · 2025-11-15T01:00:30Z

/gemini review

gemini-code-assist

Code Review

This pull request correctly removes the unused device argument from the reset_prefix_cache function across multiple files. The changes are consistent, simplifying the function signatures and removing unnecessary logic. This refactoring improves code clarity and maintainability. I noticed a minor discrepancy: the pull request title refers to reset_kv_cache, while the changes and description correctly target reset_prefix_cache. You may want to update the title for consistency. Overall, the changes are well-executed and I have no further comments.

DarkLight1337 · 2025-11-15T07:59:50Z

Doc failure is unrelated - merging

…ject#28766) Signed-off-by: Zhuohan Li <zhuohan123@gmail.com> Signed-off-by: George D. Torres <gdavtor@gmail.com>

…ject#28766) Signed-off-by: Zhuohan Li <zhuohan123@gmail.com> Signed-off-by: Bram Wasti <bwasti@meta.com>

…ject#28766) Signed-off-by: Zhuohan Li <zhuohan123@gmail.com>

[RL] Remove unused device argument from reset_kv_cache

f187e2a

Signed-off-by: Zhuohan Li <zhuohan123@gmail.com>

zhuohan123 requested a review from njhill November 15, 2025 00:57

zhuohan123 requested review from aarnphm and chaunceyjiang as code owners November 15, 2025 00:57

mergify bot added frontend v1 labels Nov 15, 2025

gemini-code-assist bot reviewed Nov 15, 2025

View reviewed changes

vllm/v1/engine/async_llm.py Show resolved Hide resolved

fix

80b3b5e

Signed-off-by: Zhuohan Li <zhuohan123@gmail.com>

gemini-code-assist bot reviewed Nov 15, 2025

View reviewed changes

DarkLight1337 approved these changes Nov 15, 2025

View reviewed changes

DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Nov 15, 2025

Merge branch 'main' into zhuohan/remove-device-from-reset-kv-cache

214b237

zhuohan123 enabled auto-merge (squash) November 15, 2025 07:22

vllm-bot merged commit dd6ac1c into main Nov 15, 2025
52 of 53 checks passed

vllm-bot deleted the zhuohan/remove-device-from-reset-kv-cache branch November 15, 2025 07:59

geodavic pushed a commit to geodavic/vllm that referenced this pull request Nov 16, 2025

[RL] [V1] Remove unused device argument from reset_kv_cache (vllm-pro…

e9abfe7

…ject#28766) Signed-off-by: Zhuohan Li <zhuohan123@gmail.com> Signed-off-by: George D. Torres <gdavtor@gmail.com>

bwasti pushed a commit to bwasti/vllm that referenced this pull request Nov 17, 2025

[RL] [V1] Remove unused device argument from reset_kv_cache (vllm-pro…

2a62486

…ject#28766) Signed-off-by: Zhuohan Li <zhuohan123@gmail.com> Signed-off-by: Bram Wasti <bwasti@meta.com>

bringlein pushed a commit to bringlein/vllm that referenced this pull request Nov 26, 2025

[RL] [V1] Remove unused device argument from reset_kv_cache (vllm-pro…

2950205

…ject#28766) Signed-off-by: Zhuohan Li <zhuohan123@gmail.com>

devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request Nov 29, 2025

[RL] [V1] Remove unused device argument from reset_kv_cache (vllm-pro…

d37ee91

…ject#28766) Signed-off-by: Zhuohan Li <zhuohan123@gmail.com>

kitaekatt pushed a commit to kitaekatt/vllm that referenced this pull request Dec 1, 2025

[RL] [V1] Remove unused device argument from reset_kv_cache (vllm-pro…

0b5c372

…ject#28766) Signed-off-by: Zhuohan Li <zhuohan123@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[RL] [V1] Remove unused device argument from reset_kv_cache #28766

[RL] [V1] Remove unused device argument from reset_kv_cache #28766

zhuohan123 commented Nov 15, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

zhuohan123 commented Nov 15, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

DarkLight1337 commented Nov 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

[RL] [V1] Remove unused device argument from reset_kv_cache #28766

[RL] [V1] Remove unused device argument from reset_kv_cache #28766

Conversation

zhuohan123 commented Nov 15, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

zhuohan123 commented Nov 15, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

DarkLight1337 commented Nov 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

zhuohan123 commented Nov 15, 2025 •

edited by github-actions bot

Loading