Fuzzy String Matching

This workflow demonstrates how to apply fuzzy matching to two strings. The String Matcher node was designed exactly for this task, but it is limited to the Levenshtein distance. You can edit the parameters of the Levenshtein distance in the node's configuration dialog.
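To make the distance concrete, here is a minimal Python sketch of the plain Levenshtein distance: the number of single-character insertions, deletions, and substitutions needed to turn one string into another. It is an illustration only, not the node's implementation. The function name `levenshtein` is hypothetical, and every edit is assumed to cost 1, whereas the node's dialog lets you adjust the parameters.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the unweighted Levenshtein distance between a and b.

    Assumption for this sketch: insertions, deletions, and substitutions
    all cost 1.
    """
    # Keep the shorter string in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a

    # previous[j] holds the distance between the first i-1 characters of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, ch_b in enumerate(b, start=1):
            substitution_cost = 0 if ch_a == ch_b else 1
            current.append(
                min(
                    previous[j] + 1,                      # delete ch_a
                    current[j - 1] + 1,                   # insert ch_b
                    previous[j - 1] + substitution_cost,  # substitute or keep
                )
            )
        previous = current
    return previous[-1]


if __name__ == "__main__":
    print(levenshtein("kitten", "sitting"))  # 3
```

A small distance means the two strings are nearly identical, so a fuzzy match can be defined as any pair whose distance stays below a chosen threshold.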

With the support of distance matrices, you have more options for comparing the strings. In the String Distance node, you choose the distance together with its parameters. Afterwards, the Similarity Search node finds the closest match (i.e. the nearest neighbor) for each value from the first table in your lookup table, the second table. If a distance is provided at the distance matrix input port, the node uses that distance.
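The idea behind this second approach can be sketched in Python: pick any string distance, compute the distances between each value and every lookup entry, and keep the entry with the smallest distance. This is a rough illustration under stated assumptions, not how the nodes are implemented. All names (`ratio_distance`, `bigram_jaccard_distance`, `distance_matrix`, `nearest_neighbors`) and the sample data are hypothetical, and the two distances shown are stand-ins chosen for brevity.

```python
from difflib import SequenceMatcher
from typing import Callable, List, Sequence, Tuple

Distance = Callable[[str, str], float]


def ratio_distance(a: str, b: str) -> float:
    """1 minus difflib's similarity ratio; 0.0 means identical strings."""
    return 1.0 - SequenceMatcher(None, a, b).ratio()


def bigram_jaccard_distance(a: str, b: str) -> float:
    """Jaccard distance between the sets of character bigrams of a and b."""
    grams_a = {a[i:i + 2] for i in range(len(a) - 1)}
    grams_b = {b[i:i + 2] for i in range(len(b) - 1)}
    if not grams_a and not grams_b:
        # Strings shorter than two characters have no bigrams.
        return 0.0 if a == b else 1.0
    return 1.0 - len(grams_a & grams_b) / len(grams_a | grams_b)


def distance_matrix(
    queries: Sequence[str], lookup: Sequence[str], distance: Distance
) -> List[List[float]]:
    """One row per query value, one column per lookup entry."""
    return [[distance(q, entry) for entry in lookup] for q in queries]


def nearest_neighbors(
    queries: Sequence[str], lookup: Sequence[str], distance: Distance
) -> List[Tuple[str, str, float]]:
    """For each query, return (query, closest lookup entry, distance)."""
    matrix = distance_matrix(queries, lookup, distance)
    result = []
    for query, row in zip(queries, matrix):
        best = min(range(len(row)), key=row.__getitem__)
        result.append((query, lookup[best], row[best]))
    return result


if __name__ == "__main__":
    # Hypothetical sample data: misspelled values and a clean lookup table.
    values = ["Jon Smith", "Berln"]
    lookup_table = ["John Smith", "Berlin", "Paris"]
    for dist in (ratio_distance, bigram_jaccard_distance):
        for value, match, d in nearest_neighbors(values, lookup_table, dist):
            print(f"{dist.__name__}: {value!r} -> {match!r} ({d:.2f})")
```

Because the distance is passed in as a function, swapping the measure does not change the search step, which mirrors the separation between choosing a distance and searching for the nearest neighbor.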


Resources

EXAMPLES Server: 08_Other_Analytics_Types/01_Text_Processing/09_Fuzzy_String_Matching*
Download a zip-archive

 

 


* Find more about the Examples Server here.
The link opens the workflow directly in KNIME Analytics Platform (requirements: Windows; KNIME Analytics Platform must be installed with the installer, version 3.2.0 or higher). In all other cases, please use the link to the zip archive or open the provided path manually.