feat: support turboquant_plus by heroims · Pull Request #512 · jundot/omlx

heroims · 2026-04-01T15:07:47Z

No description provided.

SirDominik · 2026-04-02T11:03:36Z

I gave this a try. Unfortunately, just like the previously disabled KV cache quantization, I'm not seeing any reduction in peak memory usage. On top of that, token generation speed dropped by about 5–8% in my testing — though I should note that my tests were fairly short.

feat: support turboquant_plus

9c95740

jundot force-pushed the main branch from 2d46d30 to d0f5a38 Compare April 2, 2026 02:13

heroims wants to merge 1 commit into jundot:main from heroims:main
