1 parent 1f945ab commit fe84d5b
generate.py
@@ -310,7 +310,7 @@ def main(
     decode_one_token = torch.compile(decode_one_token, mode="reduce-overhead", fullgraph=True)
 
     # Uncomment to squeeze more perf out of prefill
-    if args.compile_prefill:
+    if compile_prefill:
         prefill = torch.compile(prefill, fullgraph=True, dynamic=True)
 
 
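The change swaps a lookup on `args` for the `compile_prefill` value that `main` presumably receives as a parameter. The sketch below shows why that matters, assuming `args` is only created by a command-line entry point; the diff does not show where `args` is defined, so this is an assumption. The names `fake_compile`, `main_before`, and `main_after` are hypothetical, and a plain wrapper stands in for `torch.compile` so the example runs without PyTorch.

```python
def fake_compile(fn):
    """Hypothetical stand-in for torch.compile: wraps fn and tags it."""
    def wrapped(*a, **kw):
        return fn(*a, **kw)
    wrapped.compiled = True
    return wrapped


def prefill(x):
    # Placeholder for the real prefill step.
    return x + 1


def main_before(compile_prefill=False):
    # Old form: reads a global `args` instead of the parameter.
    # If no CLI entry point has defined `args`, this raises NameError.
    fn = prefill
    if args.compile_prefill:  # noqa: F821 (intentionally undefined here)
        fn = fake_compile(fn)
    return fn


def main_after(compile_prefill=False):
    # New form: uses the parameter, so the function also works
    # when imported and called directly.
    fn = prefill
    if compile_prefill:
        fn = fake_compile(fn)
    return fn
```

Under these assumptions, calling `main_before(compile_prefill=True)` from another module fails with `NameError`, while `main_after(compile_prefill=True)` honours the argument it was given.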