-
-
Notifications
You must be signed in to change notification settings - Fork 122
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Correction doesn't prioritize bigram. #107
Comments
Is this similar to #92? I believe this is because bigrams are only used when a term from the input phrase is split up, e.g., when The quickest workaround for this particular example would be to have |
Thanks for your explanation. Will have a try. |
@mammothb Do you plan to add this feature in symspellpy ? Or do we continue to have this "custom frequency logic"? As we now understand were it comes from I might do a PR to fix that. For my usecase it's almost vital as I cannot afford false negative and bi-grams helps to solve that. Thanks again for this nice lib, |
There are sengkerang and selatan in gram dictionary, the frequency are 500.
And there is tangkerang selatan in bigram dictionary, the frequency is 1200.
When correcting the address "Jalan Surabaya No.17, Tengkerang Selatan, Bukit Raya".
The output is 'jalan surabaya no 17 sengkerang selatan bukit raya',
But I expect Tengkerang Selatan to be corrected as Tangkerang Selatan bigram.
Is it possible to do this?
The text was updated successfully, but these errors were encountered: