Skip to content
@IST-DASLab

IST Austria Distributed Algorithms and Systems Lab

Popular repositories Loading

  1. gptq gptq Public

    Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

    Python 2.2k 186

  2. marlin marlin Public

    FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

    Python 957 80

  3. sparsegpt sparsegpt Public

    Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

    Python 851 115

  4. PanzaMail PanzaMail Public

    Python 297 19

  5. qmoe qmoe Public

    Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

    Python 278 23

  6. llmq llmq Public

    Quantized LLM training in pure CUDA/C++.

    C++ 220 14

Repositories

Showing 10 of 72 repositories
  • llmq Public

    Quantized LLM training in pure CUDA/C++.

    IST-DASLab/llmq’s past year of commit activity
    C++ 220 Apache-2.0 14 0 1 Updated Dec 1, 2025
  • local_platinum_bench Public

    This repo allows you to run Platinum Bench evals via vLLM.

    IST-DASLab/local_platinum_bench’s past year of commit activity
    Python 0 CC-BY-4.0 0 0 1 Updated Nov 26, 2025
  • QuEST Public

    Work in progress.

    IST-DASLab/QuEST’s past year of commit activity
    Jupyter Notebook 75 MIT 7 2 0 Updated Nov 26, 2025
  • EvoPress Public
    IST-DASLab/EvoPress’s past year of commit activity
    Python 37 Apache-2.0 4 1 0 Updated Nov 22, 2025
  • Quartet Public
    IST-DASLab/Quartet’s past year of commit activity
    Jupyter Notebook 110 MIT 11 2 0 Updated Nov 19, 2025
  • FP-Quant Public
    IST-DASLab/FP-Quant’s past year of commit activity
    Python 79 12 6 3 Updated Nov 16, 2025
  • GridSearcher Public

    GridSearcher simplifies running grid searches for machine learning projects in Python, emphasizing parallel execution and GPU scheduling without dependencies on SLURM or other workload managers.

    IST-DASLab/GridSearcher’s past year of commit activity
    Python 3 Apache-2.0 0 0 0 Updated Nov 15, 2025
  • nanochat Public Forked from karpathy/nanochat

    The best ChatGPT that $100 can buy.

    IST-DASLab/nanochat’s past year of commit activity
    Python 0 MIT 4,675 0 0 Updated Nov 12, 2025
  • CAGE Public
    IST-DASLab/CAGE’s past year of commit activity
    Python 0 MIT 0 0 0 Updated Nov 12, 2025
  • qutlass Public

    QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning

    IST-DASLab/qutlass’s past year of commit activity
    C++ 144 Apache-2.0 11 2 0 Updated Nov 11, 2025

Most used topics

Loading…