Commit fe5c3cb
Added Conditions of SM90 and ISA7.8 for Using cvt.ftz.f32.bf16 Instruction (#165774)
Updated the conditions for generating the cvt.ftz.f32.bf16 instruction
to include sm90 and isa7.8, so that ftz is only generated when it is
supported.
---------
Co-authored-by: Justin Fargnoli <jfargnoli@nvidia.com>1 parent 7398591 commit fe5c3cb
File tree
2 files changed
+311
-34
lines changed- llvm
- lib/Target/NVPTX
- test/CodeGen/NVPTX
2 files changed
+311
-34
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2267 | 2267 | | |
2268 | 2268 | | |
2269 | 2269 | | |
2270 | | - | |
| 2270 | + | |
2271 | 2271 | | |
2272 | 2272 | | |
2273 | 2273 | | |
| |||
0 commit comments