Neural Magic
Neural Magic (Acquired by Red Hat) empowers developers to optimize & deploy LLMs at scale. Our model compression & acceleration enable top performance with vLLM
Pinned Loading
Repositories
Showing 10 of 72 repositories
- vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
neuralmagic/vllm’s past year of commit activity - speculators Public
neuralmagic/speculators’s past year of commit activity - sparseml Public
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
neuralmagic/sparseml’s past year of commit activity - lm-evaluation-harness Public Forked from EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
neuralmagic/lm-evaluation-harness’s past year of commit activity - compressed-tensors Public
A safetensors extension to efficiently store sparse quantized tensors on disk
neuralmagic/compressed-tensors’s past year of commit activity - model-validation-configs Public
neuralmagic/model-validation-configs’s past year of commit activity - lmms-eval Public Forked from EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
neuralmagic/lmms-eval’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…