quic / efficient-transformers Public

Notifications You must be signed in to change notification settings
Fork 53
Star 75

Code
Issues 1
Pull requests 34
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: quic/efficient-transformers

Labels 22 Milestones 0

New pull request New

34 Open 510 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Onboarding Molmo Model

#554 opened Sep 8, 2025 by mohiso22

Loading…

Extend On-Device Sampling Support to more Causal Language Models

#553 opened Sep 4, 2025 by quic-sanising • Draft

Onnx function for MMDit block

#552 opened Sep 3, 2025 by quic-akuruvil • Draft

TF ver 4.55.0, pytorch 2.7.1, hf hub 0.34.0 and diffusers 0.31.0

#551 opened Sep 3, 2025 by quic-hemagnih • Draft

[Docs Update:] Auto Classes are Separated from Python API

#550 opened Sep 3, 2025 by abukhoy • Draft

Embedding Model fix wip

Work in progress

#548 opened Aug 28, 2025 by quic-amitraj • Draft

Added Multiframe Inference for llama4+internvl

#547 opened Aug 27, 2025 by aditjadh

Loading…

Optimized ONNX Transform via Class Merging and Thread Pooling

#546 opened Aug 23, 2025 by abhishek-singh591

Loading…

updated notebooks

#543 opened Aug 20, 2025 by smedhe

Loading…

Transformers version 4.55 upgrade

#542 opened Aug 19, 2025 by quic-mamta • Draft

removed platform sdk dependency

#540 opened Aug 19, 2025 by smedhe

Loading…

Added memory optimization for onnx transforms

#538 opened Aug 12, 2025 by quic-rishinr

Loading…

Onnx slim transform

#536 opened Aug 12, 2025 by tchawada

Loading…

[QEff]: Add OpenAI Oss Models (gpt_oss) enhancement

New feature or request

#534 opened Aug 6, 2025 by vbaddi

Loading…

Support of Diffusers wip

Work in progress

#529 opened Aug 5, 2025 by quic-amitraj • Draft

Update PyTorch to 2.7.1+cpu, Torchvision to 0.22.1+cpu, and Python Requirement to >=3.9

#524 opened Jul 28, 2025 by abukhoy

Loading…

2 tasks done

Add Support for Frequency Penalties in On Device Sampling

#523 opened Jul 24, 2025 by quic-sanising • Draft

Logger module in Efficient Transformers 1.21.0 wip

Work in progress

#517 opened Jul 11, 2025 by quic-hemagnih • Draft

Added --iteration and --automation flags

#512 opened Jul 10, 2025 by asmigosw • Draft

Llama4 VLM Continuous Batching Support

#510 opened Jul 9, 2025 by mohiso22

Loading…

[Olmo2]: Add Support for Olmo2 CausalLM Model in QEff 1.21.0 enhancement

New feature or request

#509 opened Jul 9, 2025 by vbaddi

Loading…

Jina model support [experimental]

#502 opened Jul 8, 2025 by quic-amitraj • Draft

Reading mxfp6_matmul for QNN Compilation path from compile API arguments 1.21.0

#499 opened Jul 7, 2025 by shubhagr-qc

Loading…

[Llama4]: Add support for padding num_patches 1.21.0 enhancement

New feature or request

#486 opened Jul 1, 2025 by vbaddi

Loading…

Changing the hashing methodology for cache folder creation of models. 1.21.0

#481 opened Jun 24, 2025 by quic-dhirajku

Loading…

Previous 1 2 Next

Previous Next

ProTip! Updated in the last three days: updated:>2025-09-06.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!