-
Notifications
You must be signed in to change notification settings - Fork 45
Added Infra in QEfficient for execution of swiftkv models #314
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
QEfficient/transformers/models/llama_swiftkv/modeling_llama_swiftkv.py
Outdated
Show resolved
Hide resolved
QEfficient/transformers/models/llama_swiftkv/modeling_llama_swiftkv.py
Outdated
Show resolved
Hide resolved
QEfficient/transformers/models/llama_swiftkv/modeling_llama_swiftkv.py
Outdated
Show resolved
Hide resolved
…e is not present at hugging face and checkpoint like swiftkv Signed-off-by: Hem Agnihotri <[email protected]>
Signed-off-by: Hem Agnihotri <[email protected]>
Signed-off-by: Onkar Chougule <[email protected]>
Signed-off-by: Hem Agnihotri <[email protected]>
74cd107
to
6830445
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, lets merge once CI passes
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
@@ -187,6 +190,103 @@ def check_causal_lm_pytorch_vs_kv_vs_ort_vs_ai100( | |||
assert os.path.isfile(os.path.join(os.path.dirname(qpc_path), "qconfig.json")) | |||
|
|||
|
|||
def check_non_hf_kv_vs_ort_vs_ai100( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have noticed that most of the code is same with the check_causal_lm_pytorch_vs_kv_vs_ort_vs_ai100
function except pytorch_hf_token
calculation part. Can't we refactor the code in such a way that we can reuse the common code?
Signed-off-by: Amit Raj <[email protected]>
done in #367 |
Added Infra in QEfficient for execution of models whose modelling file is not present at hugging face and checkpoint like swiftkv