Motivation.
To avoid maintaining a separate set of modeling files, we propose to remove all modeling files from vllm-ascend. To achieve this, several refactors need to be done for multi-modal models in both vllm and vllm-ascend.
Proposed Change.
vllm:
- Extract Qwen MMEncoder layer as custom op. @shen-shanshan
- Extract `apply_rotary_emb` as CustomOp. @shen-shanshan [CustomOp] Extract `apply_rotary_emb` as CustomOp and unify the dispatch logic vllm#29873
- Extract conv layer as CustomOp (a CustomOp sketch follows this list). @shen-shanshan [Model][MM] Extract conv layer as CustomOp vllm#28455
- Use caching to remove repeated sin/cos computations (see the caching sketch after this list). @gcanlin [Model][Perf] Use cos and sin cache in QwenVL vllm#28798
- Remove redundant TP logic in split_qkv. @gcanlin [Refactor] Remove redundant TP gather/split in split_qkv in QwenVL vllm#28271
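For context on the CustomOp items above: wrapping an encoder layer as a CustomOp lets an out-of-tree platform supply its own kernel through the dispatch mechanism instead of patching the modeling file. A minimal sketch, assuming vLLM's `CustomOp` base class with its `register`/`forward_native` interface; the op name and the layer shown are illustrative, not the actual PR code:

```python
import torch
import torch.nn.functional as F

from vllm.model_executor.custom_op import CustomOp


@CustomOp.register("qwen_vision_patch_conv")  # op name is illustrative
class QwenVisionPatchConv(CustomOp):
    """Patch-embedding conv of the Qwen ViT, extracted as a CustomOp.

    CustomOp.forward() dispatches to the platform-specific implementation
    (forward_cuda / forward_cpu / forward_oot), so an out-of-tree backend
    such as vllm-ascend can plug in its own kernel without touching the
    modeling file.
    """

    def __init__(self, in_channels: int, hidden_size: int, patch_size: int) -> None:
        super().__init__()
        self.weight = torch.nn.Parameter(
            torch.empty(hidden_size, in_channels, patch_size, patch_size))
        self.patch_size = patch_size

    def forward_native(self, x: torch.Tensor) -> torch.Tensor:
        # Reference implementation: a plain strided conv over image patches.
        return F.conv2d(x, self.weight, stride=self.patch_size)
```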
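The sin/cos caching item boils down to computing the rotary tables once for the largest sequence seen so far and slicing them afterwards, rather than rebuilding them on every ViT forward. A rough sketch of that idea (not the code from vllm#28798):

```python
import torch


class VisionRotaryCache:
    """Caches rotary cos/sin tables so repeated ViT forwards reuse them."""

    def __init__(self, dim: int, theta: float = 10000.0) -> None:
        self.inv_freq = 1.0 / (theta ** (torch.arange(0, dim, 2).float() / dim))
        self._cached_len = 0
        self._cos = None
        self._sin = None

    def get(self, seqlen: int, device: torch.device) -> tuple[torch.Tensor, torch.Tensor]:
        # Only recompute when a longer sequence than any seen so far arrives.
        if seqlen > self._cached_len:
            t = torch.arange(seqlen, device=device, dtype=self.inv_freq.dtype)
            freqs = torch.outer(t, self.inv_freq.to(device))
            self._cos, self._sin = freqs.cos(), freqs.sin()
            self._cached_len = seqlen
        return self._cos[:seqlen], self._sin[:seqlen]
```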
vllm-ascend:
- Patch VisionAttention layer and remove Qwen2.5-VL modeling files. @shen-shanshan [MM][Model][Perf] Remove Qwen2.5-VL modeling files and add patch for VisionAttention #4349
- Remove Qwen2-VL modeling files. @shen-shanshan [MM][Model] Remove Qwen2-VL modeling files #4534
- Remove Qwen3-VL and Qwen3-VL-MoE modeling files. @shen-shanshan [MM][Model] Remove Qwen3-VL modeling files #4577
- Implement Ascend ViT custom op and register it (see the registration sketch after this list). @shen-shanshan
- Implement `multimodal_cpu_fields` in the model runner to guarantee that `grid_thw` is moved to CPU before converting to numpy (see the sketch after this list). @zhangxinyuehfad
- Refactor `set_ascend_forward_context()` to remove the patch for ViT embedding. @gcanlin
- Remove the patch for cos/sin cache. @shen-shanshan
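Once the encoder pieces are CustomOps in vLLM, the Ascend plugin only has to register its own implementation rather than carry a full copy of the modeling file. A sketch of the intended shape, assuming an out-of-tree registration hook along the lines of `CustomOp.register_oot`; the hook, class, and op names here are assumptions, not confirmed API:

```python
import torch
import torch.nn.functional as F

from vllm.model_executor.custom_op import CustomOp


class AscendQwenVisionPatchConv(CustomOp):
    """Ascend-side override for the vision patch-conv op sketched earlier."""

    def __init__(self, weight: torch.Tensor, patch_size: int) -> None:
        super().__init__()
        self.weight = weight
        self.patch_size = patch_size

    def forward_oot(self, x: torch.Tensor) -> torch.Tensor:
        # forward_oot is the CustomOp dispatch target for out-of-tree
        # platforms; an NPU-optimised kernel would replace this fallback.
        return F.conv2d(x, self.weight, stride=self.patch_size)


# Hypothetical registration hook: tell vLLM to use the Ascend class whenever
# the op named "qwen_vision_patch_conv" is built on this platform.
CustomOp.register_oot(_decorated_op_cls=AscendQwenVisionPatchConv,
                      name="qwen_vision_patch_conv")
```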
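The `multimodal_cpu_fields` item addresses a device-placement pitfall: `grid_thw` arrives as a device tensor, and NPU tensors (like CUDA tensors) cannot be converted to numpy directly, so the model runner has to move the listed fields to CPU first. A minimal sketch of that behaviour; the runner hook and any field names beyond `grid_thw` are illustrative:

```python
import torch

# Fields the model runner keeps on CPU because they are later consumed as
# numpy arrays / Python ints (e.g. grid_thw for Qwen-VL).
multimodal_cpu_fields = frozenset({"image_grid_thw", "video_grid_thw"})


def move_cpu_fields(mm_kwargs: dict[str, torch.Tensor]) -> dict[str, torch.Tensor]:
    """Move the whitelisted multimodal fields to CPU before numpy conversion.

    Calling .numpy() on an NPU/CUDA tensor raises; calling .cpu() first makes
    the later grid_thw.numpy() calls in the vision tower safe.
    """
    return {
        key: value.cpu() if key in multimodal_cpu_fields else value
        for key, value in mm_kwargs.items()
    }
```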
Other related:
- Make mamba backend pluggable (see the selector sketch below). @shen-shanshan [Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device vllm#26487
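Making the mamba backend pluggable follows the same pattern as the attention backend selector: the backend class is resolved through the current platform instead of being hard-coded, so vllm-ascend can return its own. A rough sketch of such a selector, assuming a platform hook like `get_mamba_attn_backend_cls`; the hook name is hypothetical, not the actual vllm#26487 interface:

```python
from importlib import import_module

from vllm.platforms import current_platform


def get_mamba_attn_backend(use_v2: bool = True):
    """Resolve the mamba attention backend class for the current platform."""
    # Hypothetical platform hook: routing the choice through the platform
    # interface lets out-of-tree plugins (e.g. vllm-ascend) return their own
    # backend class path without patching this selector.
    backend_path = current_platform.get_mamba_attn_backend_cls(use_v2)
    module_name, _, class_name = backend_path.rpartition(".")
    return getattr(import_module(module_name), class_name)
```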
Feedback Period.
No response
CC List.
Any Other Things.
No response