Fix: English tokens extraction error when word ends with "e" #9546

Closed
wants to merge 1 commit into from

Woody-Hu
Contributor

What problem does this PR solve?

#9537
#9540

Type of change

  • Bug Fix (non-breaking change which fixes an issue)

@dosubot dosubot bot added the labels size:S (this PR changes 10-29 lines, ignoring generated files) and 🐞 bug (something isn't working; pull request that fixes a bug) on Aug 19, 2025
@whhe
Contributor

whhe commented Aug 19, 2025

I found a more universal solution in #9310; it may be better to upgrade the tokenizer based on that patch.

@Woody-Hu Woody-Hu closed this Aug 19, 2025