You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
mxfp8 inference roofline: add fusion to observed (#3223)
Summary:
Adds option to benchmark with relu -> linear to capture the impact
of fusing the activation to the quant kernel
Test Plan:
```bash
(pt_nightly_312_2) [vasiliy@devgpu023.atn1 ~/local/ao
(20251021_inference_fusion_modeling)]$ python
benchmarks/float8/float8_inference_roofline.py ~/local/tmp/test.csv
--recipe_name mxfp8_cublas --shape_gen_name pow2_extended
--enable_fusion_modeling True
```
Reviewers:
Subscribers:
Tasks:
Tags:
0 commit comments