Added Infra in QEfficient for execution of swiftkv models #314
base: main
Conversation
…e is not present at hugging face and checkpoint like swiftkv Signed-off-by: Hem Agnihotri <[email protected]>
Signed-off-by: Hem Agnihotri <[email protected]>
Signed-off-by: Onkar Chougule <[email protected]>
Signed-off-by: Hem Agnihotri <[email protected]>
Signed-off-by: Amit Raj <[email protected]>
Signed-off-by: Amit Raj <[email protected]>
super().__init__()
self.hidden_size = config.hidden_size
self.attention_dropout = config.attention_dropout
self.hidden_size = config.hidden_size
nit: please remove the redundant self.hidden_size initialization
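A minimal sketch of the deduplicated initializer, keeping only the attributes visible in the excerpt (the class name and the layer_idx argument are assumptions for illustration):

import torch.nn as nn

class QEffLlamaSwiftKVAttention(nn.Module):  # hypothetical class name
    def __init__(self, config, layer_idx: int):
        super().__init__()
        self.hidden_size = config.hidden_size              # assigned once
        self.attention_dropout = config.attention_dropout
        self.layer_idx = layer_idx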
def forward(
    self,
    hidden_states: torch.Tensor,
    position_ids,
nit: please add type hints for position_ids and attention_mask for consistency. (Applicable to the classes below as well.)
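For example, a sketch of the annotated signature (the Optional types and defaults are assumptions; only the parameter names come from the excerpt):

from typing import Optional, Tuple
import torch

def forward(
    self,
    hidden_states: torch.Tensor,
    position_ids: torch.LongTensor,
    attention_mask: Optional[torch.Tensor] = None,
    past_key_value: Optional[Tuple[torch.Tensor, torch.Tensor]] = None,
) -> torch.Tensor:
    ...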
"for auto-regressive decoding with k/v caching, please make sure to initialize the attention class " | ||
"with a layer index." | ||
) | ||
kv_seq_len = past_key_value.get_usable_length(kv_seq_len, self.layer_idx) |
Is this the right usage? According to the type hint, past_key_value is Tuple[torch.Tensor, torch.Tensor].
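get_usable_length() comes from the transformers Cache API and read_only() is added on QEffDynamicCache in this PR, so if the object passed in is really a cache instance, one option (a sketch under that assumption) is to widen the annotation rather than keep the tuple hint:

from typing import Optional
import torch
from transformers.cache_utils import DynamicCache  # QEffDynamicCache subclasses this per the diff below

def forward(
    self,
    hidden_states: torch.Tensor,
    position_ids: torch.LongTensor,
    past_key_value: Optional[DynamicCache] = None,  # was Optional[Tuple[torch.Tensor, torch.Tensor]]
):
    kv_seq_len = hidden_states.shape[-2]
    if past_key_value is not None:
        kv_seq_len = past_key_value.get_usable_length(kv_seq_len, self.layer_idx)
    return kv_seq_len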
)
kv_seq_len = past_key_value.get_usable_length(kv_seq_len, self.layer_idx)
cache_kwargs = {"position_ids": position_ids, "batch_index": batch_index}
key_states, value_states = past_key_value.read_only(self.layer_idx, cache_kwargs=cache_kwargs)
same comment as above
key_states = repeat_kv(key_states, self.num_key_value_groups)
value_states = repeat_kv(value_states, self.num_key_value_groups)

attn_weights = torch.matmul(query_states, key_states.transpose(2, 3)) / math.sqrt(self.head_dim)
Is this attention implementation still applicable with transformers 4.50?
self,
hidden_states: torch.Tensor,
position_ids,
past_key_value: Optional[Tuple[torch.Tensor, torch.Tensor]] = None,
Ensure the naming consistency of the past_key_value parameter across all classes. In some classes, it is referred to as past_key_value, while in others, it is named past_key_values
@@ -36,6 +36,58 @@ class QEffDynamicCache(DynamicCache):
    """

    def write_only(self, key_states, value_states, layer_idx, cache_kwargs):
        # Update the cache
please add docstrings for both methods
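For instance, a docstring sketch for write_only (the behavioral description and argument meanings are assumptions inferred from how the cache is used in the attention code above; read_only would get an analogous docstring):

def write_only(self, key_states, value_states, layer_idx, cache_kwargs):
    """
    Write key_states and value_states into the cache of layer `layer_idx` without
    returning the concatenated past states (unlike DynamicCache.update()).

    Args:
        key_states (torch.Tensor): new key states for this layer.
        value_states (torch.Tensor): new value states for this layer.
        layer_idx (int): index of the layer whose cache is written.
        cache_kwargs (dict): extra inputs such as position_ids and batch_index.
    """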
@@ -33,6 +33,7 @@
| **Phi3ForCausalLM** | Phi-3, Phi-3.5 | [microsoft/Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) | ✔️ |
| **QwenForCausalLM** | DeepSeek-R1-Distill-Qwen | [DeepSeek-R1-Distill-Qwen-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B) | ✔️ |
| | Qwen2, Qwen2.5 | [Qwen/Qwen2-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2-1.5B-Instruct) | ✔️ |
| **LlamaSwiftKVForCausalLM** | swiftkv | [Snowflake/Llama-3.1-SwiftKV-8B-Instruct](https://huggingface.co/Snowflake/Llama-3.1-SwiftKV-8B-Instruct) | ✔️ |
please also update the feature support under the Supported Features section of docs/source/quick_start.md
Added infra in QEfficient for execution of models whose modeling file is not present on Hugging Face, such as SwiftKV checkpoints.
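As a usage sketch of the new path (QEFFAutoModelForCausalLM is the existing QEfficient entry point; the compile and generate arguments shown are assumptions, not part of this PR):

from transformers import AutoTokenizer
from QEfficient import QEFFAutoModelForCausalLM

model_name = "Snowflake/Llama-3.1-SwiftKV-8B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_name)

# The SwiftKV checkpoint ships no modeling file on the Hub; QEfficient now supplies
# the LlamaSwiftKV modeling code itself, so the usual export/compile/run flow applies.
model = QEFFAutoModelForCausalLM.from_pretrained(model_name)
model.compile(num_cores=16)                                   # illustrative compile settings
model.generate(prompts=["Hello, world"], tokenizer=tokenizer)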