Tika Parsing

This workflow shows how to parse files of various formats, along with any attachments they contain, using the Tika parser nodes, and how to detect the language of the content using the Tika language detector. Based on the detected language, a filter keeps only the English texts, which are then POS tagged.
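Outside KNIME, the same sequence of steps (parse, detect language, keep English, POS tag) can be sketched in Python. This is only an illustration under stated assumptions, not the workflow's implementation: the function names are hypothetical, the default helpers assume the tika-python and NLTK packages (with a Java runtime and the NLTK tokenizer and tagger data available), and attachments are not handled separately as they are by the KNIME nodes.

```python
from typing import Callable, Dict, Iterable, List, Tuple

Tagged = List[Tuple[str, str]]


def parse_with_tika(path: str) -> str:
    """Extract plain text from a file (assumes the tika-python package)."""
    from tika import parser  # lazy import: only needed when this default is used
    parsed = parser.from_file(path)
    return parsed.get("content") or ""


def detect_with_tika(text: str) -> str:
    """Return a language code such as 'en' (assumes the tika-python package)."""
    from tika import language
    return language.from_buffer(text)


def tag_with_nltk(text: str) -> Tagged:
    """POS tag a text (assumes NLTK and its tokenizer/tagger data)."""
    import nltk
    return nltk.pos_tag(nltk.word_tokenize(text))


def tag_english_documents(
    paths: Iterable[str],
    parse: Callable[[str], str] = parse_with_tika,
    detect: Callable[[str], str] = detect_with_tika,
    tag: Callable[[str], Tagged] = tag_with_nltk,
    keep_language: str = "en",
) -> Dict[str, Tagged]:
    """Parse each file, keep only texts in keep_language, and POS tag them."""
    results: Dict[str, Tagged] = {}
    for path in paths:
        text = parse(path).strip()
        if not text:
            continue  # nothing could be extracted from this file
        if detect(text) != keep_language:
            continue  # filter out non-English content
        results[path] = tag(text)
    return results
```

Passing the parser, detector, and tagger as arguments keeps the filtering logic independent of any particular library, so each step can be swapped or stubbed, much like replacing a node in the workflow.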


Resources

EXAMPLES Server: 08_Other_Analytics_Types/01_Text_Processing/16_Tika_Parsing*
Download a zip-archive

 

 


* Find out more about the Examples Server here.
The link will open the workflow directly in KNIME Analytics Platform (requirements: Windows; KNIME Analytics Platform must be installed using the installer, version 3.2.0 or higher). In other cases, please use the link to the zip archive or open the provided path manually.