Remove (or make optional) nltk dependency #33
Comments
Yes, it would be nice to remove this dependency, but the last point is indeed not that easy to drop. We do want to support longer documents, so we need some kind of sentence splitting built in. Unless of course @larsmans comes up with his fancy new super fast matching algorithm...
He has a fancy super fast matching algorithm? I thought he did a Levenshtein implementation?
He can multitask and said he wanted to do something about the matching today…
+1
Did my big mouth speak for itself again?
Currently we pull in nltk for doing:
The first two points are easy; the last one can (should?) be made optional (it is only needed if you're dealing with documents and want to split them into sentences).
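For what it's worth, one way the "optional" part could look is a guarded import with a plain-regex fallback. This is only a rough sketch, not what the project actually does: `split_sentences`, its `use_nltk` flag, and the regex are hypothetical names made up for illustration, and the fallback is much cruder than nltk's tokenizer (it will trip over abbreviations like "e.g.").

```python
import re

try:
    # nltk is treated as optional here: it is only used for sentence splitting.
    from nltk.tokenize import sent_tokenize as _nltk_sent_tokenize
except ImportError:
    _nltk_sent_tokenize = None

# Naive fallback (an assumption, not the project's method):
# split on whitespace that follows '.', '!' or '?'.
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


def split_sentences(text, use_nltk=True):
    """Split text into sentences, using nltk only when it is usable."""
    if use_nltk and _nltk_sent_tokenize is not None:
        try:
            return _nltk_sent_tokenize(text)
        except LookupError:
            # nltk is installed but its tokenizer data is missing;
            # fall through to the regex splitter.
            pass
    return [s for s in _SENTENCE_END.split(text.strip()) if s]
```

Callers who only pass in short strings would never touch nltk, and people who need better splitting on long documents could install it as an extra.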