-
Notifications
You must be signed in to change notification settings - Fork 0
js-ojus/damlevdist.go
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
NAME
similarity - find and print text similarity between sets of strings
SYNOPSIS
similarity combfile
similarity reffile testfile
DESCRIPTION
similarity is a program that finds the Damerau-Levenshtein distance
between strings.
The first form of invocation treats the contents of the file
'combfile' to be strings that each needs to be compared with all the
others in the file.
The second form treats those from the file 'reffile' to be correct
reference strings, against which those in the file 'testfile' should
be compared.
In all cases, the input files should have one string per line.
Blank lines are ignored. The program does trim the strings, but
users should take care of non-printable characters themselves.
The output will be one line printed for each combination of strings,
and has the following format:
pd d tl rl tstr rstr
where, 'd' is the Damerau-Levenshtein distance between strings 'tstr'
'rstr'; 'tl' and 'rl' are the line numbers of test string 'tstr' and
reference string 'rstr', respectively; and 'pd' is calculated as:
d / (len(tstr) + len(rstr)).
AUTHOR
JONNALAGADDA Srinivas
LICENSE
New BSD License
About
A program to compare a set of test words against a set of reference words (text similarity) using Damerau-Levenshtein distance.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published