Skip to content

feat(Matching): Use Dice Coefficient Algorithm to compare strings#2

Open
emahuni wants to merge 9 commits intomediabounds:masterfrom
emahuni:master
Open

feat(Matching): Use Dice Coefficient Algorithm to compare strings#2
emahuni wants to merge 9 commits intomediabounds:masterfrom
emahuni:master

Conversation

@emahuni
Copy link

@emahuni emahuni commented Mar 12, 2019

Add the ability to specify a similarity threshold to consider matches. This allows for a truly fuzzy match. eg: "David" can match "Dabid was here" making it useful on matching data that has typos or different in syntax and order of words.

emahuni added 9 commits March 12, 2019 04:20
Add new levenshtein distance matching
Add levenshtein distance to determine matches. This enables truly fuzzy matches even if the strings have a little difference
changed the matching package from leven to string similarity
String Similarity Returns a fraction between 0 and 1, which indicates the degree of similarity between the two strings. 0 indicates completely different strings, 1 indicates identical strings and that is easier to determine the fuzziness of the match
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant