@ksmusz
Collaborator

@ksmusz ksmusz commented Nov 13, 2025

No description provided.

Signed-off-by: Krzysztof Smusz <ksmusz@habana.ai>
Copilot AI review requested due to automatic review settings November 13, 2025 13:56
@github-actions

🚧 CI Blocked

The main CI workflow was not started for the following reason:

This is a Draft PR. Please mark it as 'Ready for Review' to trigger the CI.

Contributor

Copilot AI left a comment

Pull Request Overview

This PR replaces manual softmax computation with the optimized torch.ops.hpu.softmax_fa2 operation in two attention functions (partial_attn_causal and partial_attn_shared) for improved performance on HPU devices.

Key Changes:

  • Replaced manual softmax implementation (max, exp, sum operations) with torch.ops.hpu.softmax_fa2 in both functions
  • Added initialization of inputM_hpu and inputL_hpu tensors required by the new operation
  • Consolidated bias addition into the initial matmul operation in partial_attn_shared
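For readers unfamiliar with the pattern, the "manual softmax (max, exp, sum)" mentioned above can be sketched in plain Python. This is only an illustration under stated assumptions: the function names below are hypothetical, the code does not reproduce the PR's diff, and it does not reflect the signature or behavior of torch.ops.hpu.softmax_fa2. It also assumes that partial attention keeps the row maximum and row sum so that results from separate blocks can be merged later, which the overview does not state explicitly.

```python
import math


def partial_softmax(scores):
    """Manual softmax pieces for one block: (unnormalized weights, max, sum)."""
    row_max = max(scores)                               # max, for numerical stability
    weights = [math.exp(s - row_max) for s in scores]   # exp of shifted scores
    row_sum = sum(weights)                              # sum, the normalizer for this block
    return weights, row_max, row_sum


def merge_partials(parts):
    """Combine per-block partial results into one normalized softmax."""
    global_max = max(m for _, m, _ in parts)
    rescaled = []
    total = 0.0
    for weights, block_max, block_sum in parts:
        # Rescale each block from its own max to the shared global max.
        scale = math.exp(block_max - global_max)
        rescaled.extend(w * scale for w in weights)
        total += block_sum * scale
    return [w / total for w in rescaled]


def reference_softmax(scores):
    """Ordinary softmax over the full row, used only as a check."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical score blocks standing in for the two attention paths.
causal_block = [0.5, 2.0, -1.0]
shared_block = [3.0, 0.1]

merged = merge_partials([partial_softmax(causal_block),
                         partial_softmax(shared_block)])
expected = reference_softmax(causal_block + shared_block)
```

Here `merged` and `expected` agree up to floating-point rounding, which is the property that lets a fused kernel replace the hand-written max/exp/sum sequence without changing the result.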

Signed-off-by: Krzysztof Smusz <ksmusz@habana.ai>
Signed-off-by: Krzysztof Smusz <ksmusz@habana.ai>
Signed-off-by: Krzysztof Smusz <ksmusz@habana.ai>
Contributor

Copilot AI left a comment

Pull Request Overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

@github-actions

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

Signed-off-by: Krzysztof Smusz <ksmusz@habana.ai>
@ksmusz ksmusz requested a review from kamil-kaczor as a code owner November 26, 2025 16:52
@github-actions

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

@ksmusz
Collaborator Author

ksmusz commented Nov 26, 2025

/run-gaudi-tests

@github-actions

✅ CI Passed

All checks passed successfully against the following vllm commit:
0353d2e162cbda776d9dbfe026e65303204a7f1f

@ksmusz
Collaborator Author

ksmusz commented Nov 27, 2025

/run-gaudi-tests

@github-actions

✅ CI Passed

All checks passed successfully against the following vllm commit:
0353d2e162cbda776d9dbfe026e65303204a7f1f

@github-actions

github-actions bot commented Dec 1, 2025

✅ CI Passed

All checks passed successfully against the following vllm commit:
0353d2e162cbda776d9dbfe026e65303204a7f1f

ksmusz and others added 3 commits December 3, 2025 12:44
Signed-off-by: Krzysztof Smusz <ksmusz@habana.ai>
Signed-off-by: Krzysztof Smusz <ksmusz@habana.ai>
@github-actions

github-actions bot commented Dec 3, 2025

✅ CI Passed

All checks passed successfully against the following vllm commit:
3a7751485b71ce5ef927e4aa03b28602cb90811c

Signed-off-by: Krzysztof Smusz <ksmusz@habana.ai>
@github-actions

github-actions bot commented Dec 4, 2025

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

Signed-off-by: Krzysztof Smusz <ksmusz@habana.ai>
@github-actions

github-actions bot commented Dec 4, 2025

✅ CI Passed

All checks passed successfully against the following vllm commit:
899e2ef558e7345b99bc0d53c2e1c60ffdca7470

@ksmusz ksmusz merged commit 3fc22d0 into vllm-project:main Dec 4, 2025
44 checks passed
4 participants