
Conversation

@wdziurdz
Contributor

@wdziurdz wdziurdz commented Sep 17, 2025

Fixes #5074

@whitneywhtsang
Contributor

Is this for testing? If it is not ready for review, please convert to draft.

@whitneywhtsang whitneywhtsang marked this pull request as draft September 25, 2025 00:33
@wdziurdz wdziurdz force-pushed the dev/wdziurdz/test-matmul-1 branch 2 times, most recently from 84862d8 to aafbe1a on October 6, 2025 12:37
@wdziurdz wdziurdz marked this pull request as ready for review October 6, 2025 12:39
@wdziurdz wdziurdz force-pushed the dev/wdziurdz/test-matmul-1 branch 3 times, most recently from 5b2d42b to 990069a on October 8, 2025 10:32
@wdziurdz wdziurdz force-pushed the dev/wdziurdz/test-matmul-1 branch from 990069a to f51fe27 on October 10, 2025 07:57
@wdziurdz wdziurdz self-assigned this Oct 10, 2025
@wdziurdz wdziurdz force-pushed the dev/wdziurdz/test-matmul-1 branch 5 times, most recently from 2d19f0a to 8c7870b on November 2, 2025 19:41
@wdziurdz wdziurdz force-pushed the dev/wdziurdz/test-matmul-1 branch 4 times, most recently from 50639fa to e5b7881 on November 12, 2025 09:08
@wdziurdz wdziurdz force-pushed the dev/wdziurdz/test-matmul-1 branch 4 times, most recently from d291f7c to b42922d on November 12, 2025 12:18
@wdziurdz
Contributor Author

The last eight failing tests could be fixed by this PR: #5128.
Example of one failing test:

AssertionError: ref_y_scale: 0.004773152060806751, tri_y_scale: 0.005022321827709675
  assert tensor(False, device='xpu:0')
   +  where tensor(False, device='xpu:0') = <built-in method all of type object at 0x7f4175d82400>(tensor([0.0002], device='xpu:0', grad_fn=<AbsBackward0>) < 1e-10)
   +    where <built-in method all of type object at 0x7f4175d82400> = torch.all
   +    and   tensor([0.0002], device='xpu:0', grad_fn=<AbsBackward0>) = <built-in method abs of Tensor object at 0x7f406d589260>()
   +      where <built-in method abs of Tensor object at 0x7f406d589260> = (tensor(0.0048, device='xpu:0', grad_fn=<DivBackward0>) - tensor([0.0050], device='xpu:0')).abs
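
For context, here is a minimal sketch (not from the PR; it assumes a PyTorch build with float8_e4m3fn support and reuses the two scale values from the log above) of the size of error a float8 round trip introduces, which a 1e-10 tolerance cannot absorb:

    import torch

    # Hypothetical repro: float8_e4m3fn has 3 mantissa bits, so a round
    # trip quantizes to a relative step of about 2**-3, far above the
    # 1e-10 threshold used by the assertion.
    y = torch.tensor([0.0050])
    y_scale = 0.0048
    rounded = (y / y_scale).to(torch.float8_e4m3fn).to(torch.float32) * y_scale
    print((rounded - y).abs())  # ~0.0002, matching the difference in the log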

With this rounding, the previously failing tests pass for the test parameters below:

    sep_scatter = mode == "ragged" and do_scatter and n_expts_act > 1 and split_k == 1
    if is_xpu():  # additional rounding on XPU for reference values
        sep_scatter = sep_scatter or (do_scatter and not fused_scatter and n_expts_tot > 1 and split_k == 1 and act_dtype_str == "float8_e4m3fn")
    y_scale = flex.out_data.expected_scale if act_is_float8 else 1

    def round_x(x, idx):
        return x.to(act_dtype).to(torch.float32) if sep_gather else x

    round_y = lambda y: (y / y_scale).to(act_dtype).to(torch.float32) * y_scale if sep_scatter else y
    ref_y = matmul_ogs_torch(x_ref, w_ref, bias_ref,  #
                             rdata, gindx, sindx, round_x=round_x, round_y=round_y, gammas=gs1_ref,
                             inner_routing_data=inner_routing_data, device=device)
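
In other words, the torch reference is pushed through the same float8 quantization round trip that the kernel output undergoes, so both sides should accumulate the same rounding error and the scale comparison holds.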

Without this fix there is a small precision difference. It is small enough to pass all the other assertions, but the test fails when comparing the actual_scale for these parameters.

Please take a look at this @etiotto @whitneywhtsang

@wdziurdz wdziurdz force-pushed the dev/wdziurdz/test-matmul-1 branch from b42922d to 914acf2 on November 12, 2025 15:30
@wdziurdz wdziurdz force-pushed the dev/wdziurdz/test-matmul-1 branch from 914acf2 to 12a2098 on November 13, 2025 08:16
Signed-off-by: Witold Dziurdz <witold.dziurdz@intel.com>
@wdziurdz wdziurdz force-pushed the dev/wdziurdz/test-matmul-1 branch from 12a2098 to d9f8c41 on November 13, 2025 13:56
@etiotto
Contributor

etiotto commented Nov 13, 2025

Given that there is a proposed fix upstream, we should wait for that change to land.

@etiotto etiotto marked this pull request as draft November 13, 2025 14:51
@@ -1,3 +1,2 @@
tests/test_matmul.py::test_op
Contributor

Most likely it will not work as-is on a770, arl-h, arl-s, and mtl.

@wdziurdz wdziurdz marked this pull request as ready for review November 14, 2025 09:34

Development

Successfully merging this pull request may close these issues.

Some python/triton_kernels/tests/test_matmul.py::test_op test cases don't work
