Change the repository type filter
All
Repositories list
43 repositories
lightllm
PublicLightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.llmc
Public[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".lightllm-blog
PublicLLM_QAT
Publicmtc-token-healing
Publicgeneral-sam
PublicEasyLLM
Publicopencompass
PublicInternVL
PublicOmniBal
PublicTFMQ-DM
Public[CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".L2_Compression
PublicFCPTS
Public templatestatecs
Public- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models"
Dipoorlet
PublicLPCV_2023_solution
PublicOutlier_Suppression_Plus
PublicUP_LPCV2023_Plugin
Publicpyvlova
Public