You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Fix workspace allocation for f8f8bf16_rowwise_batched (#5098)
Summary:
Pull Request resolved: #5098
X-link: https://github.com/facebookresearch/FBGEMM/pull/2105
X-link: https://github.com/meta-pytorch/MSLK/pull/6
This diff updates the workspace allocation for f8f8bf16_rowwise_batched to make sure its on the proper device. Previously, it could default to using device 0 despite other inputs being on a different gpu.
Reviewed By: q10
Differential Revision: D86439655
fbshipit-source-id: c5652c4791b5075103876c8ae76bd65213d6a9cb
Copy file name to clipboardExpand all lines: fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/f8f8bf16_rowwise_batched_common.cuh
0 commit comments