We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 528ad38 commit ba18a3eCopy full SHA for ba18a3e
examples/compute_context_length.py
@@ -41,9 +41,6 @@
41
model = QEFFAutoModelForCausalLM.from_pretrained(
42
model_name,
43
continuous_batching=True,
44
- comp_ctx_lengths_prefill=comp_ctx_lengths_prefill,
45
- comp_ctx_lengths_decode=comp_ctx_lengths_decode,
46
- ctx_len=ctx_len,
47
)
48
49
# model compilation for either continuous or static batching. For continuous batching full_batch_size is needed.
0 commit comments