You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
| Normalized Levenshtein |distance<br>similarity | Yes | No ||
62
-
| Weighted Levenshtein |distance | No | No ||
63
-
| Damerau-Levenshtein |distance | No | No ||
64
-
| Jaro-Winkler |similarity<br>distance | Yes | No ||
65
-
| Longest Common Subsequence |distance | No | No ||
66
-
| Metric Longest Common Subsequence |distance | Yes | No ||
67
-
| N-Gram (Kondrak) |distance | Yes | No ||
68
-
| Q-Gram |distance | No | No | Profile|
69
-
| Cosine |similarity<br>distance | Yes | No | Profile|
70
-
| Jaccard |similarity<br>distance | Yes | Yes | Set |
71
-
| Sorensen-Dice |similarity<br>distance | Yes | No | Set |
56
+
The main characteristics of each implemented algorithm are presented below. The "cost" column givs an estimation of the computational cost to compute te similarity between two strings of length m and n respectively.
| Sorensen-Dice |similarity<br>distance | Yes | No | Set | O(m+n) |
72
72
73
73
## Levenshtein
74
74
The Levenshtein distance between two words is the minimum number of single-character edits (insertions, deletions or substitutions) required to change one word into the other.
0 commit comments