Explore BYOD library #560
Comments
My personal takeaways:
Even though there's nothing groundbreaking in the repo or the paper, I do think it is really interesting to have an approach in which the model is evaluated against itself.
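To make the "evaluated against itself" idea concrete, here is a minimal sketch, not the BYOD implementation. It assumes the model is any callable that maps a text to a label, and it measures how often the prediction stays the same after a perturbation. The names `invariance_rate`, `toy_model`, `to_upper`, and `leetify` are hypothetical and exist only for illustration.

```python
from typing import Callable, Hashable, Sequence


def invariance_rate(
    model: Callable[[str], Hashable],
    texts: Sequence[str],
    perturb: Callable[[str], str],
) -> float:
    """Return the fraction of texts whose prediction is unchanged by `perturb`.

    No labels are needed: the model's output on the original text is
    compared with its own output on the perturbed text.
    """
    if not texts:
        raise ValueError("texts must not be empty")
    unchanged = sum(model(t) == model(perturb(t)) for t in texts)
    return unchanged / len(texts)


# Hypothetical stand-in for a real model: a keyword-based sentiment rule.
def toy_model(text: str) -> str:
    return "positive" if "good" in text.lower() else "negative"


# Two hypothetical perturbations: one the toy model tolerates, one it does not.
def to_upper(text: str) -> str:
    return text.upper()


def leetify(text: str) -> str:
    return text.replace("oo", "00")


texts = ["A good movie", "Terrible plot", "good enough"]
upper_rate = invariance_rate(toy_model, texts, to_upper)
leet_rate = invariance_rate(toy_model, texts, leetify)
```

With a real model, `toy_model` would be replaced by whatever prediction function is under test; the rest stays dependency-free and fast to run.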
I agree. Some of the tests are very simple, but they are also easy to implement and fast to run. So maybe we could add something like the toxicity one as a quick test that doesn't depend on an external library to run an ML model. Let's make a list of what is worth bringing to nlptest and add those items to the roadmap.
I just found a paper about self-evaluation. It would be interesting to read it and check whether we can implement it: https://arxiv.org/abs/2306.13651?utm_source=substack&utm_medium=email
Explore the BYOD repository for additional tests or datasets to add to nlptest.
Examples: