SUMMARY:
This PR shows users how to add more mappings to AWQ so that it can
support more models. It turns out Qwen uses exactly the same mappings as
Llama, so I added one for Phi as well. I also updated the naming and
switched to the infer pattern employed in SmoothQuant, so the mappings
are inferred from the model rather than requiring the user to set them.
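For readers unfamiliar with the infer pattern, here is a minimal, self-contained sketch of the idea, not the code in this PR. The names `LayerMapping`, `MAPPING_REGISTRY`, and `infer_mappings` are hypothetical, and the regex patterns are illustrative assumptions based on the usual Hugging Face module names (separate `q_proj`/`k_proj`/`v_proj` for Llama-style models, fused `qkv_proj` and `gate_up_proj` for Phi-3-style models).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LayerMapping:
    """Hypothetical pairing of one layer to smooth with the layers that balance it."""
    smooth_layer: str
    balance_layers: tuple


# Assumed Llama-style layout: separate q/k/v and gate/up projections.
LLAMA_STYLE = (
    LayerMapping("re:.*input_layernorm", ("re:.*q_proj", "re:.*k_proj", "re:.*v_proj")),
    LayerMapping("re:.*v_proj", ("re:.*o_proj",)),
    LayerMapping("re:.*post_attention_layernorm", ("re:.*gate_proj", "re:.*up_proj")),
    LayerMapping("re:.*up_proj", ("re:.*down_proj",)),
)

# Assumed Phi-3-style layout: fused qkv and gate/up projections.
PHI_STYLE = (
    LayerMapping("re:.*input_layernorm", ("re:.*qkv_proj",)),
    LayerMapping("re:.*qkv_proj", ("re:.*o_proj",)),
    LayerMapping("re:.*post_attention_layernorm", ("re:.*gate_up_proj",)),
    LayerMapping("re:.*gate_up_proj", ("re:.*down_proj",)),
)

# Hypothetical registry keyed on the model's architecture class name.
# Qwen reuses the Llama entry because its layer names match.
MAPPING_REGISTRY = {
    "LlamaForCausalLM": LLAMA_STYLE,
    "Qwen2ForCausalLM": LLAMA_STYLE,
    "Phi3ForCausalLM": PHI_STYLE,
}


def infer_mappings(architecture: str):
    """Look up mappings by architecture name, falling back to the Llama-style default."""
    return MAPPING_REGISTRY.get(architecture, LLAMA_STYLE)
```

Under this sketch, supporting a new model family means adding one registry entry, and the user never has to pass mappings by hand.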
TEST PLAN:
`examples/awq/llama_example.py` runs successfully on this branch with
```python
MODEL_ID = "microsoft/Phi-4-mini-reasoning"
```
TODOs:
- [x] Merge in after #1451 lands
---------
Signed-off-by: Brian Dellabetta <[email protected]>