Commit 91ef068
[FA] Port remaining performance features from advanced path (#3848)
`set_fast_math` did not work on the default path because the lowering
pass generates LLVM operations. This PR changes the fastmath flag to be
set on the LLVM IR instead, and guards the setting of the flag behind an
env var to avoid accuracy failures.
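
A minimal sketch of the opt-in guard described above, not the code from this PR. The variable name `EXAMPLE_ENABLE_FAST_MATH` and the helpers `env_flag_enabled` and `maybe_set_fast_math` are hypothetical, since the commit message does not name the env var. The `set_fast_math` argument stands in for whatever routine marks the LLVM IR with fast-math flags.

```python
import os

# Hypothetical variable name, chosen only for this sketch;
# the commit message does not say what the real variable is called.
FAST_MATH_ENV_VAR = "EXAMPLE_ENABLE_FAST_MATH"


def env_flag_enabled(name, environ=None):
    """Return True when the variable is set to a truthy value (1, on, true)."""
    environ = os.environ if environ is None else environ
    return environ.get(name, "").strip().lower() in {"1", "on", "true"}


def maybe_set_fast_math(llvm_module, set_fast_math, environ=None):
    """Call set_fast_math(llvm_module) only when the opt-in variable is enabled.

    set_fast_math is passed in as a callable so the sketch does not depend
    on Triton; it represents the step that sets fast-math flags on LLVM IR.
    """
    if env_flag_enabled(FAST_MATH_ENV_VAR, environ):
        set_fast_math(llvm_module)
        return True
    # Default: leave the module untouched, which avoids accuracy failures.
    return False
```

Keeping the flag off unless the variable is set means existing users see no change in numerical results, while benchmark runs can opt in.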

Observations:
1. Without the fastmath flag, the advanced path performs the same as
the default path.
2. With the fastmath flag set, the default path is faster than the
advanced path.

As the default path (with the env var) achieves performance no worse
than the advanced path, this PR stops running FA with the advanced path.
Closes #3286
---------
Signed-off-by: Whitney Tsang <whitney.tsang@intel.com>

1 parent 25f5666 commit 91ef068
File tree
4 files changed, +19 -23 lines changed
- .github/workflows
- include/triton/Tools/Sys
- third_party/intel
- backend