Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

multiple encodings #3

Open
bibliata opened this issue Mar 15, 2022 · 0 comments
Open

multiple encodings #3

bibliata opened this issue Mar 15, 2022 · 0 comments

Comments

@bibliata
Copy link

bibliata commented Mar 15, 2022

multiple encodings on same words appear as difference between morphGNT vs STRONGs

ex. φωτίζω VS φωτίζω

which decoded returns:
&# 966;&# 969;&# 964;&# 943;&# 950;&# 969;
VS
&# 966;&# 969;&# 964;&# 8055;&# 950;&# 969;
(spaces added after # because this editor displayed them in Greek)

Difference: ί (perhaps because of the apostrophe) is encoded one time with #943 and another with #8055

In a simple db comparison, this returns almost 50% difference between all Greek words used

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant