Pull requests: quic/efficient-transformers

Pull requests list

Continuous Batching for VLMs
#610 opened Nov 5, 2025 by asmigosw Draft
removed platform sdk dependency
#609 opened Nov 5, 2025 by smedhe
Prefill+decode gpt oss (labels: 1.21.0, enhancement)
#608 opened Nov 5, 2025 by ochougul
[Exp]: Python 3.12 upgrade
#607 opened Nov 5, 2025 by abukhoy
Diffusers support
#604 opened Nov 4, 2025 by quic-amitraj Draft
Dynamo-enabled ONNX Export
#602 opened Nov 3, 2025 by smedhe Draft
Onnx Subfunction
#600 opened Oct 29, 2025 by abhishek-singh591 Draft
[Test]: Models test configs in a single Config file
#596 opened Oct 22, 2025 by abukhoy
[WIP]: Add early support for KV replication in VLMs
#594 opened Oct 18, 2025 by vbaddi
Wan 2.2 5 b
#593 opened Oct 17, 2025 by tv-karthikeya Draft
Onboarding Qwen3VlMoe
#590 opened Oct 16, 2025 by qcdipankar Draft
Add support of qwen_image
#585 opened Oct 13, 2025 by quic-amitraj Draft
removed redundancies from QEFFHybridCache
#582 opened Oct 2, 2025 by ochougul
Onboarding OmniSVG
#578 opened Sep 27, 2025 by qcdipankar Draft
Adding Compute-Context-Length (CCL)
#576 opened Sep 26, 2025 by vjanfaza
Embedding Model fix (label: wip)
#548 opened Aug 28, 2025 by quic-amitraj Draft
Added Multiframe Inference for llama4+internvl
#547 opened Aug 27, 2025 by aditjadh