Skip to content

Pull requests: quic/efficient-transformers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Onboarding Molmo Model
#554 opened Sep 8, 2025 by mohiso22 Loading…
Embedding Model fix wip Work in progress
#548 opened Aug 28, 2025 by quic-amitraj Draft
Added Multiframe Inference for llama4+internvl
#547 opened Aug 27, 2025 by aditjadh Loading…
updated notebooks
#543 opened Aug 20, 2025 by smedhe Loading…
removed platform sdk dependency
#540 opened Aug 19, 2025 by smedhe Loading…
Added memory optimization for onnx transforms
#538 opened Aug 12, 2025 by quic-rishinr Loading…
Onnx slim transform
#536 opened Aug 12, 2025 by tchawada Loading…
[QEff]: Add OpenAI Oss Models (gpt_oss) enhancement New feature or request
#534 opened Aug 6, 2025 by vbaddi Loading…
Support of Diffusers wip Work in progress
#529 opened Aug 5, 2025 by quic-amitraj Draft
Llama4 VLM Continuous Batching Support
#510 opened Jul 9, 2025 by mohiso22 Loading…
[Olmo2]: Add Support for Olmo2 CausalLM Model in QEff 1.21.0 enhancement New feature or request
#509 opened Jul 9, 2025 by vbaddi Loading…
[Llama4]: Add support for padding num_patches 1.21.0 enhancement New feature or request
#486 opened Jul 1, 2025 by vbaddi Loading…
ProTip! Updated in the last three days: updated:>2025-09-06.