IQ quant performance #17842
Unanswered
ThatGuyWhoAsked
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Prior sentiment was that IQ quants were slower on apple silicon, however is that still true? My benchmarks show it being FASTER then the similar quality quantisation:
and
I also read that apply family 9 (m3 and m4) are faster at this ( I have m3) does anyone have an idea as to why it is faster?
Prior Discussion: #5617
Beta Was this translation helpful? Give feedback.
All reactions