Bug: Critical accuracy bugs for model_type=qwen2: no causal attention and wrong tokenizer #762
What does this PR do?
What happened:
Qwen2Flash was originally written to support only Alibaba-NLP/gte-Qwen2-1.5B-instruct, which was trained, and has reference inference code, with causal=False (a.k.a. use_bidirectional_attention=True). I later asked for a flag in the config so this could actually be read from the model: https://huggingface.co/Alibaba-NLP/gte-Qwen2-1.5B-instruct/discussions/28
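A minimal sketch of reading that flag, assuming it lands in the model's config.json under the name from the linked discussion (illustrative only, not the actual TEI loading code):

```python
import json


def is_bidirectional(config_path: str) -> bool:
    """Read the attention-direction flag, defaulting to causal when absent."""
    with open(config_path) as f:
        config = json.load(f)
    # gte-Qwen2-1.5B-instruct sets this to true; models that omit the
    # flag are assumed here to want plain causal attention.
    return bool(config.get("use_bidirectional_attention", False))
```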
We now also have newer models, e.g. the Jina embedding models, which are built on qwen2 but use causal attention and a different tokenizer configuration. These models currently load, but the output embeddings they produce are incorrect.
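To make the failure mode concrete, here is a small self-contained sketch (toy single-head attention, not the TEI kernels) showing how dropping the causal mask changes the hidden states:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
seq_len, dim = 4, 8
q, k, v = (torch.randn(seq_len, dim) for _ in range(3))

scores = q @ k.T / dim**0.5

# Causal: positions may not attend to later tokens (-inf above the diagonal).
causal_mask = torch.triu(torch.full((seq_len, seq_len), float("-inf")), diagonal=1)

bidir_out = F.softmax(scores, dim=-1) @ v                 # mask dropped (the bug)
causal_out = F.softmax(scores + causal_mask, dim=-1) @ v  # mask applied

# In a single layer only the last row matches; across stacked layers every
# position, and hence any pooled embedding, diverges.
print(torch.allclose(bidir_out, causal_out))  # False
```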
Fixes # (issue)
Before submitting

Did you write any new necessary tests? If applicable, did you include or update the insta snapshots?

Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.