-
Notifications
You must be signed in to change notification settings - Fork 30
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comparison with Unicode TR25 data #619
Comments
Suggestion here is we tidy up and pre-load the corrected data in the format? |
@josephwright well this issue is just about getting the lists aligned or at least differences documented. We could then consider working similarly to lccode tables, initialising the format based on unicode data, then over-riding with tex specific corrections and historic choices in a separate phase, but that would be a new issue. Actually I thought pre-loading the table was an old issue, it's certainly been raised before, but a search of issues didn't show anything just now. |
I updated the query to also log (marked The list in the issue description is updated. |
The entries marked
I think the first is just an error The others are all old Greek. @wspr I must have thought these were defined in unicode-math at some point, has this changed or are they defined in some option?, I couldn't see them, if not I should probably just similarly correct |
I think they were defined once upon a time, but I came up absolutely blank when looking for examples of them anyway, so I think I assumed they were examples of glyphs from a font that may have been associated with maths but wasn’t actually used in maths typesetting.
If you have even hints that they should be included in UM I’m happy to bring them back.
(I suspect that quite a number of the geometrical symbols and graphical looking arrows are also in this category.)
|
@wspr thanks, I'll delete them for now... done, list in the description updated. |
I’m actually a bit torn on this; I like the idea of providing more symbols for people to extend their repertoire with. And these symbols *are* in STIX Two, so I’d suggest we bring them back in UM rather than keep them deleted… WDYT?
|
Fine by me, now the list in unicode-math-table matches what unicode.xml says is in unicode-math so if you add them here, I'll add them there and keep in sync... |
It's probably worth reviewing the cases where there are different assignments (rather than missing definitions) especially if we might consider pre-initialising the mathcodes from the MathClass-xx file. I have also omitted here assignments to Unicode combining characters which will necessarily require modified (or probably missing) support in tex. delimitersI plan to raise an issue at Unicode on these left and right delimiters that are marked as infix relations
radicalsThese are almost always accessed via (U)radical so the mathclass is perhaps not so important
Arabic operatorsmathord or mathop?
PunctuationIt isn't clear if Unicode
arrows etcmathrel or mathord?
triangles etcmathbin or mathord ?
daggersbin or rel?
the rest
|
U1EEF0 ( U1EEF1 ( |
Description
unicode.xml at https://github.com/w3c/xml-entities/ has data for unicode-math and MathClass-15 from TR25
I have updated it to track updates here and also added a comparison of the math classes assigned, in the stylesheet
comp-um.xsl
in that repository.The resulting log is included here. Some of the differences are intentional, but some are probably errors in MathClass-15 or unicode-math-table.
In the list below
***
denotes a character listed in MathClass-15 with no entry in unicode-math-table.???
denotes a csname unicode.xml associates with unicode-math that is not in unicode-math-table (all now fixed)!
denotes an entry in unicode-math that is not in MathClass-15Add info or delete as appropriate:
The text was updated successfully, but these errors were encountered: