-
Notifications
You must be signed in to change notification settings - Fork 60
Pull requests: quic/efficient-transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[QEff. Finetune]: Added logger and its test cases.
#630
opened Nov 21, 2025 by
quic-meetkuma
•
Draft
[QEff.Finetuning]CI enablement for Fine-Tuning
fine-tuning
#629
opened Nov 21, 2025 by
quic-akuruvil
Loading…
Created ReplicateKVHeadTransform to integrate KV-heads replication module within Qefficient library.
#625
opened Nov 19, 2025 by
quic-dhirajku
Loading…
Add Support for Guided Decoding to On Device Sampling
#624
opened Nov 19, 2025 by
quic-sanising
Loading…
Adding ccl_enabled flag during model loading and passing CCL lists during compilation process
#623
opened Nov 18, 2025 by
vjanfaza
Loading…
Adding support for BlockedKV attention in CasualLM models
#618
opened Nov 14, 2025 by
vaibverm
Loading…
Remove transformers dependencies from cache_utils and restructure cache classes
#616
opened Nov 13, 2025 by
quic-mamta
•
Draft
Prefill+decode gpt oss
1.21.0
enhancement
New feature or request
#608
opened Nov 5, 2025 by
ochougul
Loading…
Diffusers support
Diffusers
Use for PR related to diffusers in efficient-transformers.
#604
opened Nov 4, 2025 by
quic-amitraj
Loading…
Extend on-device sampling support for dual QPC VLMs
enhancement
New feature or request
#597
opened Oct 24, 2025 by
quic-xiyushi
Loading…
Modified qwen_2.5 modelling file to allow replicate_kv_script to work for custom num_kv_heads.
#595
opened Oct 18, 2025 by
quic-dhirajku
Loading…
Logger Module For Efficient Transformers
1.21.0
wip
Work in progress
#555
opened Sep 10, 2025 by
quic-hemagnih
•
Draft
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.