Qubitium

Follow

🙌

....

Qubitium-ModelCloud Qubitium

🙌

....

Follow

Golang, Python, Kotlin, Swift. I prefer strongly typed languages and I do not worship PEP. @ModelCloudAi

51 followers · 57 following

ModelCloud.ai
Earth/Epoch 2.0
https://modelcloud.ai
@qubitium

Achievements

Achievements

Pinned Loading

ModelCloud/GPTQModel ModelCloud/GPTQModel Public

Production ready LLM model compression/quantization toolkit with accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

Python 274 42
ModelCloud/Tokenicer ModelCloud/Tokenicer Public

A (nicer) tokenizer you want to use for model inference and training: with all known peventable gotchas normalized or auto-fixed.

Python 4 1
ModelCloud/Device-SMI ModelCloud/Device-SMI Public

Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separate tools such as nvidia-smi or /proc/cpuinfo and parsing it y…

Python 10 1
sgl-project/sglang sgl-project/sglang Public

SGLang is a fast serving framework for large language models and vision language models.

Python 9.6k 912
vllm-project/vllm vllm-project/vllm Public

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 37.8k 5.7k
flashinfer-ai/flashinfer flashinfer-ai/flashinfer Public

FlashInfer: Kernel Library for LLM Serving

Cuda 2k 207