generated from freelawproject/new-project-template
-
-
Notifications
You must be signed in to change notification settings - Fork 39
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Eyecite Fails to Parse Complex Citations Correctly #185
Comments
This issue needs to be
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Recently, @anseljh highlighted missing citations in CourtListener (CL), and while investigating, I encountered some challenging parsing issues—likely edge cases.
For example, consider the following from Jasmine v. Superior Court:
Problem:
Eyecite struggles to correctly parse this structure. There are two cases here, each with two citations (one parallel for each):
In re K.F. Citations:
Yield Dynamics Citations:
Results from get_citations:
When parsing the string, Eyecite produces the following four citations:
Now each of these is correct as to the citation- but it fails down when it tries to include the date. This is where the wrench lands with our citation annotator.
Date Issues:
• The date for In re K.F. is incorrectly assigned the year of Yield Dynamics (2007). This happens because Eyecite is not separating the citations appropriately.
Plaintiff/Defendant Parsing:
• In cases like In re K.F., where there are no explicit plaintiff or defendant, Eyecite struggles with parsing parties correctly. For example, “K.F.” is being treated as a defendant.
• Similarly, in Yield Dynamics, “Inc.” is assigned as the plaintiff, when it’s part of the full title of the case.
Extra Data Repairing:
• There’s an issue with the “extra” field being populated with the following citation information.
I think we need to add
The text was updated successfully, but these errors were encountered: