Skip to content

Conversation

@MingYang119
Copy link

@MingYang119 MingYang119 commented Dec 1, 2025

What this PR does / why we need it?

Provide high-performance AscendC operators lightning_indexer and sparse_flash_attention to boost the execution performance of the DeepSeek v3.2 model. Meanwhile, adapt the two AscendC operators to vllm-ascend framework.

Does this PR introduce any user-facing change?

No (only underlying operator optimizations, with no user-facing changes)

How was this patch tested?

@github-actions
Copy link

github-actions bot commented Dec 1, 2025

This pull request has conflicts, please resolve those before we can evaluate the pull request.

@github-actions
Copy link

github-actions bot commented Dec 1, 2025

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

  • A PR should do only one thing, smaller PRs enable faster reviews.
  • Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
  • Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

@gemini-code-assist
Copy link
Contributor

Warning

Gemini encountered an error creating the review. You can try again by commenting /gemini review.

@MingYang119 MingYang119 changed the title add lightning_indexer and sparse_flash_attention [kernel] add AscendC op lightning_indexer and sparse_flash_attention Dec 2, 2025
@MingYang119 MingYang119 changed the title [kernel] add AscendC op lightning_indexer and sparse_flash_attention [kernel] add AscendC op: lightning_indexer and sparse_flash_attention Dec 2, 2025
@MingYang119 MingYang119 force-pushed the feat/add-li-sfa branch 3 times, most recently from 13d0195 to c56c7a1 Compare December 2, 2025 03:28
@github-actions
Copy link

github-actions bot commented Dec 2, 2025

This pull request has conflicts, please resolve those before we can evaluate the pull request.

@MingYang119 MingYang119 closed this Dec 2, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant