Skip to content

Feature Request: Rotorquant #145

@FerLuisxd

Description

@FerLuisxd

Prerequisites

  • I am running the latest code. Mention the version if possible as well.
  • I carefully followed the README.md.
  • I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
  • I reviewed the Discussions, and have a new and useful enhancement to share.

Feature Description

Add support for Rotorquant

Motivation

There is a fork that is working on it but provides no releases, also could be great to merge to work to have even more alternatives for cache compression

Possible Implementation

https://github.com/johndpope/llama-cpp-turboquant

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions