Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

False positive check inserting inserting identities with l and ł values #163

Open
sduenas opened this issue Oct 22, 2018 · 1 comment
Open
Labels

Comments

@sduenas
Copy link
Member

sduenas commented Oct 22, 2018

Values with polish characters l and ł are considered equal in the database giving false positives when a similar identities with these characters are inserted. For example, having the next identity in the database:

AAAAAAAA	Michał Smith	[email protected]	msmith	unknown

The call to the next command fails when the name is written with latin characters:

sortinghat add --name "Michal Smith" --email "[email protected]" --username "msmith" 
Error: unique identity 'Michal [email protected]' already exists in the registry

@sduenas sduenas added the bug label Oct 22, 2018
@sduenas
Copy link
Member Author

sduenas commented Oct 22, 2018

This error is caused by the usage of utf8mb4_unicode_520_ci collation. This collation assumes these two characters are the same: see this link for more info.

We should use utf8mb4_unicode_ci while we wait for a better solution on these kind of problems.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant