
Add GPTQModel support for evaluating GPTQ models #2217

Merged: 25 commits into EleutherAI:main on Oct 31, 2024

Conversation

@Qubitium (Contributor) commented Aug 16, 2024

Add an option to use GPTQModel for lm_eval. GPTQModel is a replacement for AutoGPTQ for GPTQ quantization and inference, with better model support and much faster out-of-the-box inference speed. We have been using it internally with lm-eval for months without issue.

@CLAassistant commented Aug 16, 2024

CLA assistant check
All committers have signed the CLA.

@Qubitium (Contributor, Author) commented:

@baberabb Hi, can we get some action on this? What do we need to do to get this reviewed and merged?

@Qubitium (Contributor, Author) commented:

@baberabb Ruff/Lint checks passed. Awaiting review. Thanks.

@Qubitium (Contributor, Author) commented:

@baberabb Ping. Please check our PR. We will push a unit test to tests/models/test_gptq.py later today to complete the PR. Let us know if there is anything else required of us.

@Qubitium Qubitium changed the title Add GPTQModel support for inferencing GPTQ models Add GPTQModel support for evaluating GPTQ models Oct 24, 2024
@Qubitium (Contributor, Author) commented:

@baberabb Unit test added.

@baberabb (Contributor) commented:

Hi! Thanks for the PR, and sorry it took ages for us to review. This looks good to me, but I want to run it by @haileyschoelkopf as well.

P.S. The Python 3.8 test is failing because of a recent transformers update.

@CL-ModelCloud (Contributor) commented:

> Hi! Thanks for the PR, and sorry it took ages for us to review. This looks good to me, but I want to run it through @haileyschoelkopf as well.
>
> ps. 3.8 test failing because of a recent transformers update.

```
/opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages/transformers/pipelines/audio_utils.py:54: in <module>
    ffmpeg_additional_args: Optional[list[str]] = None,
E   TypeError: 'type' object is not subscriptable
        Optional   = typing.Optional
        Tuple      = typing.Tuple
        Union      = typing.Union
        __builtins__ = <builtins>
        __cached__ = '/opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages/transformers/pipelines/__pycache__/audio_utils.cpython-38.pyc'
        __doc__    = None
        __file__   = '/opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages/transformers/pipelines/audio_utils.py'
        __loader__ = <_frozen_importlib_external.SourceFileLoader object at 0x7fd6d460cc70>
        __name__   = 'transformers.pipelines.audio_utils'
        __package__ = 'transformers.pipelines'
        __spec__   = ModuleSpec(name='transformers.pipelines.audio_utils', loader=<_frozen_importlib_external.SourceFileLoader object at 0x7fd6d460cc70>, origin='/opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages/transformers/pipelines/audio_utils.py')
        datetime   = <module 'datetime' from '/opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/datetime.py'>
        ffmpeg_read = <function ffmpeg_read at 0x7fd6d467f160>
        np         = <module 'numpy' from '/opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages/numpy/__init__.py'>
        platform   = <module 'platform' from '/opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/platform.py'>
        subprocess = <module 'subprocess' from '/opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/subprocess.py'>
__________________ ERROR collecting tests/models/test_gguf.py __________________
```

The error `TypeError: 'type' object is not subscriptable` occurs because the code uses Python 3.9+ type-hinting syntax, such as `list[str]`, which is not supported in Python 3.8.

Transformers is tested on Python 3.9+.
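The incompatibility is easy to reproduce in isolation. Here is a minimal sketch (variable and function names are illustrative, not taken from the transformers source): the `typing` aliases work on Python 3.8, while evaluating a PEP 585 built-in generic such as `list[str]` raises exactly the TypeError above.

```python
import sys
from typing import List, Optional

# typing.Optional / typing.List work on Python 3.7+, so this module-level
# annotation is evaluated without error even on 3.8:
ffmpeg_args_ok: Optional[List[str]] = None

def join_args(args: Optional[List[str]] = None) -> str:
    """Return the args joined by spaces (empty string when None)."""
    return " ".join(args or [])

# PEP 585 built-in generics such as list[str] are only subscriptable at
# runtime on Python 3.9+. On 3.8 the line below raises:
#   TypeError: 'type' object is not subscriptable
# (quoting the annotation, or `from __future__ import annotations`, defers
# evaluation and keeps 3.8 imports working).
if sys.version_info >= (3, 9):
    evaluated = list[str]

print(join_args(["-i", "audio.wav"]))  # -> -i audio.wav
```

This is why the failure appears at import (collection) time rather than inside any test body: module-level annotations are evaluated when the module is loaded.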

@Qubitium (Contributor, Author) commented Oct 25, 2024

@baberabb transformers (at least its newer releases) is only validated/CI-tested on Python 3.9+. I think lm-eval's CI and setup need to be upgraded to require a minimum of Python 3.9.

Or apply the same patch @CL-ModelCloud used to all the other tests: eb10efb
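The patch itself is not reproduced in this thread, but the usual shape of such a fix is a version-based skip guard. A minimal sketch using stdlib `unittest` (the class name and skip message are hypothetical, not the contents of commit eb10efb):

```python
import sys
import unittest


# Skip the whole test case on Python < 3.9, where current transformers
# releases fail at import/collection time due to PEP 585 generics.
@unittest.skipIf(
    sys.version_info < (3, 9),
    "transformers uses PEP 585 generics (list[str]); requires Python 3.9+",
)
class TestGPTQ(unittest.TestCase):
    # Placeholder body; the real test would load a GPTQ-quantized model
    # and run an evaluation against it.
    def test_placeholder(self):
        self.assertEqual(1 + 1, 2)


if __name__ == "__main__":
    unittest.main()
```

With pytest the equivalent guard is a module-level `pytest.mark.skipif` assigned to `pytestmark`, which prevents the failing import from ever being attempted during collection.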

@baberabb merged commit 4f8e479 into EleutherAI:main on Oct 31, 2024. 8 checks passed.
@Qubitium deleted the MOD-Support-GPTQModel branch on November 1, 2024.
4 participants