Skip to content
@IST-DASLab

IST Austria Distributed Algorithms and Systems Lab

Popular repositories Loading

  1. gptq gptq Public

    Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

    Python 1.8k 144

  2. sparsegpt sparsegpt Public

    Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

    Python 662 84

  3. marlin marlin Public

    FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

    Python 457 34

  4. qmoe qmoe Public

    Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

    Python 255 22

  5. PanzaMail PanzaMail Public

    Python 251 12

  6. QUIK QUIK Public

    Repository for the QUIK project, enabling the use of 4bit kernels for generative inference

    C++ 161 12

Repositories

Showing 10 of 46 repositories
  • IST-DASLab/PanzaMail’s past year of commit activity
    Python 251 Apache-2.0 12 3 2 Updated Jul 13, 2024
  • marlin Public

    FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

    IST-DASLab/marlin’s past year of commit activity
    Python 457 Apache-2.0 34 20 3 Updated Jul 10, 2024
  • MicroAdam Public

    This repository contains code for the MicroAdam paper.

    IST-DASLab/MicroAdam’s past year of commit activity
    Python 5 Apache-2.0 1 0 0 Updated Jun 28, 2024
  • IST-DASLab/ISTA-DASLab-Optimizers’s past year of commit activity
    Python 4 Apache-2.0 0 0 0 Updated Jun 27, 2024
  • IST-DASLab/AutoGPTQRoSA’s past year of commit activity
    Python 0 MIT 0 0 0 Updated Jun 27, 2024
  • GridSearcher Public

    GridSearcher simplifies running grid searches for machine learning projects in Python, emphasizing parallel execution and GPU scheduling without dependencies on SLURM or other workload managers.

    IST-DASLab/GridSearcher’s past year of commit activity
    Python 0 Apache-2.0 0 0 0 Updated Jun 21, 2024
  • spops Public
    IST-DASLab/spops’s past year of commit activity
    C++ 4 Apache-2.0 0 0 0 Updated Jun 20, 2024
  • Mathador-LM Public

    Code for the paper "Mathador-LM: A Dynamic Benchmark for Mathematical Reasoning on LLMs".

    IST-DASLab/Mathador-LM’s past year of commit activity
    Python 0 Apache-2.0 0 0 0 Updated Jun 18, 2024
  • sparsegpt Public

    Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

    IST-DASLab/sparsegpt’s past year of commit activity
    Python 662 Apache-2.0 84 13 1 Updated May 30, 2024
  • SPADE Public

    Code of SPADE: Sparsity Guided Debugging for Deep Neural Networks

    IST-DASLab/SPADE’s past year of commit activity
    Jupyter Notebook 0 0 0 0 Updated May 25, 2024

Top languages

Loading…

Most used topics

Loading…