Move all the proofreader and dictionary-related files to lispusers/proofreader/, with hyphen names #2220

Merged

rmkaplan
Contributor

These are interdependent but spread out in lispusers/. All together they provided the framework and implementation for spell-checking and dictionary lookup, in the early days.

The dictionary resources (e.g. the server or wordlists) are no longer available, and these files made use of internal Tedit fields and functions that no longer exist. I made a rough pass to fix up the Tedit dependencies so that at least these files compile, and maybe they would mostly work (except for one function that appears to be a version of TEDIT.SUBSTITUTE with some additional bells and whistles).

This subdirectory should eventually be moved to obsolete; this PR just pulls the files together, fixes the code, and does the hyphen filename renaming.

@MattHeffron
Contributor

I checked out the rmk105 branch and all of the files, except README.TXT, contain the NUL character at every position!!

@rmkaplan
Contributor Author

rmkaplan commented Jul 21, 2025 via email

@rmkaplan
Contributor Author

I found, moved, and pushed contentful versions of the proofreader files (I hope).

@MattHeffron
Contributor

Better, but PROOFREADER-PROOFREADER.TEDIT is still all NUL characters.
It is exactly the same size (9531 bytes) as PROOFREADER.TEDIT (which has good content),
so it seems that PROOFREADER-PROOFREADER.TEDIT shouldn't be there at all?
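
The all-NUL condition reported above can be checked mechanically. The following is a minimal Python sketch, not part of the PR; the helper names `is_all_nul` and `all_nul_files` are hypothetical, and it assumes the files of interest sit directly in one directory.

```python
from pathlib import Path


def is_all_nul(data: bytes) -> bool:
    """Return True if data is non-empty and every byte is NUL (0x00)."""
    return len(data) > 0 and data.count(0) == len(data)


def all_nul_files(directory: str) -> list:
    """Return sorted names of regular files in directory that contain only NUL bytes."""
    return sorted(
        p.name
        for p in Path(directory).iterdir()
        if p.is_file() and is_all_nul(p.read_bytes())
    )
```

Running `all_nul_files` on a checkout of the proofreader directory would list any files that still need contentful versions. Empty files are deliberately not reported, since they contain no NUL bytes at all.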

@rmkaplan
Contributor Author

rmkaplan commented Jul 22, 2025 via email

@rmkaplan
Contributor Author

I believe these renamed files will load, and some functionality might even work. But more effort would be needed to update them to modern external formats and Tedit. For example, the code seems to read characters with BIN instead of either READCCODE or the more primitive \INCCODE, and it may be assuming that character lengths are the same as byte lengths. It may only ever have worked for 7-bit ASCII or ISO8859/1.
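
The character-length versus byte-length concern can be illustrated outside of Medley. This is a small Python sketch, not the proofreader code; the helper `lengths` is hypothetical. It shows that the two counts agree for ASCII and ISO 8859-1 text but diverge once a multi-byte encoding such as UTF-8 is used.

```python
def lengths(text: str, encoding: str) -> tuple:
    """Return (character count, byte count) for text under the given encoding."""
    encoded = text.encode(encoding)
    return len(text), len(encoded)


# One byte per character: the assumption holds.
print(lengths("cafe", "ascii"))         # (4, 4)
print(lengths("caf\u00e9", "latin-1"))  # (4, 4)

# UTF-8 encodes U+00E9 as two bytes, so the counts differ.
print(lengths("caf\u00e9", "utf-8"))    # (4, 5)
```

Code that advances a position by one for every byte read would therefore drift from the true character position as soon as such text is encountered.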

But it is worth grouping these together, and disposing of them together, one way or another.

@MattHeffron
Contributor

I think that PROOFREADER-PROOFREADER.TEDIT is the proper name, given the hyphen convention

For all the other PROOFREADER-*.TEDIT files, the corresponding source and LCOM files also have the PROOFREADER- prefix. For this file, the corresponding source and LCOM files do not.
I thought that the prefix was to designate component files of a module, and the main file of the module wouldn't have the prefix. (For example, library/tedit/TEDIT.LCOM vs library/tedit/TEDIT-*.LCOM)
In that case, it seems that maybe some of the other files should have different prefixes as they are, according to their documentation, separate modules, but associated with PROOFREADER.

@rmkaplan
Contributor Author

rmkaplan commented Jul 22, 2025 via email

Contributor

@MattHeffron MattHeffron left a comment

OK, since there's no effect on current loadups.

@rmkaplan rmkaplan merged commit 840f8af into master Jul 23, 2025

2 participants