This repository has been archived by the owner on Apr 22, 2020. It is now read-only.

Fixed calculation of denominator for Jaccard-similarity #907

Open
wants to merge 3 commits into
base: 3.5

Conversation

oschlueter

The current implementation of Jaccard similarity doesn't discard duplicate input values when calculating the denominator. I found this by computing Jaccard on identical inputs that contain duplicates: the result was not 1.0. I added test cases that cover this.
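For illustration, here is a minimal sketch of a Jaccard calculation that deduplicates both inputs before computing the denominator. It is not the code from this pull request: the class name `JaccardSketch`, the `long[]` signature, and the choice to return 0.0 when both inputs are empty are assumptions made for the example.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical sketch, not the library's implementation.
final class JaccardSketch {

    // Jaccard similarity |A ∩ B| / |A ∪ B| computed on the distinct values of each input.
    static double jaccard(long[] a, long[] b) {
        Set<Long> setA = toSet(a);
        Set<Long> setB = toSet(b);

        // Assumption: two empty inputs yield 0.0 (other conventions exist).
        if (setA.isEmpty() && setB.isEmpty()) {
            return 0.0;
        }

        Set<Long> intersection = new HashSet<>(setA);
        intersection.retainAll(setB);

        // The union size uses the deduplicated set sizes, not the raw array lengths.
        int unionSize = setA.size() + setB.size() - intersection.size();
        return (double) intersection.size() / unionSize;
    }

    // Copies the array into a set, which drops duplicate values.
    private static Set<Long> toSet(long[] values) {
        Set<Long> set = new HashSet<>();
        for (long v : values) {
            set.add(v);
        }
        return set;
    }
}
```

With this approach, identical inputs such as `{1, 2, 2, 3}` and `{1, 2, 2, 3}` give 3 distinct shared values over 3 distinct values in the union, so the result is 1.0. A denominator based on raw array lengths would be larger than 3 and push the result below 1.0.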

@oschlueter
Author

When I first wrote the test cases, I forgot to point the overlap tests at Similarities::overlapSimilarity. Once I fixed that, I saw that its calculation is affected by the same problem, so I added a proposed fix for overlapSimilarity as well.
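The same idea can be sketched for the overlap coefficient, assuming the usual definition |A ∩ B| / min(|A|, |B|). This is an illustration only and not the proposed fix itself: the class name `OverlapSketch`, the `long[]` signature, and returning 0.0 when either input is empty are assumptions.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical sketch, not the library's implementation.
final class OverlapSketch {

    // Overlap coefficient |A ∩ B| / min(|A|, |B|) computed on distinct values.
    static double overlap(long[] a, long[] b) {
        Set<Long> setA = toSet(a);
        Set<Long> setB = toSet(b);

        // The denominator uses the smaller deduplicated set, not the shorter raw array.
        int denominator = Math.min(setA.size(), setB.size());

        // Assumption: an empty input yields 0.0 to avoid dividing by zero.
        if (denominator == 0) {
            return 0.0;
        }

        Set<Long> intersection = new HashSet<>(setA);
        intersection.retainAll(setB);
        return (double) intersection.size() / denominator;
    }

    // Copies the array into a set, which drops duplicate values.
    private static Set<Long> toSet(long[] values) {
        Set<Long> set = new HashSet<>();
        for (long v : values) {
            set.add(v);
        }
        return set;
    }
}
```

For example, `{1, 1, 2}` and `{1, 2, 3, 4}` share 2 distinct values and the smaller distinct set has 2 elements, so the result is 1.0. Using the raw length 3 of the first array as the denominator would give a value below 1.0.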
