Harry

A Tool for Measuring String Similarity

Documentation

Manual page

The usage of Harry is covered in a classic manual page (man page), including command line options, configuration files and different operation modes.

Programming

Harry is developed in plain C. Harry's functionality for comparing strings is organized in different modules that are documented using Doxygen annotation. Although Harry should not be directly used as a library, this reference might help integrating Harry with other software frameworks.

Background information

The following articles provide an overview of Harry and common similarity measures for strings. Harry supports edit distances and several string kernels; however, bag-of-words models are not supported and should be computed using Sally.