Palladian is a Java-based toolkit for typical Internet information retrieval tasks. It provides a collection of text-processing algorithms focused on classification, extraction of various types of information, and retrieval of data from the Web.
The growing collection of Palladian KNIME nodes makes Palladian’s capabilities available directly within KNIME, to complement and extend existing workflows or to allow quick prototyping without writing any code. The nodes are intended to integrate with existing KNIME nodes, such as the KNIME Textprocessing and KNIME XML-Processing nodes.
The current version features the following nodes:
- Geo Nodes
- Text Classifier
- HttpRetriever, HttpResultDataExtractor, FormEncodedHttpEntityCreator, MultipartEncodedHttpEntityCreator, OAuth
- UrlExtractor, UrlNormalizer, UrlResolver, UrlDomainExtractor
- WebSearcher (deprecated)
Installation instructions for the nodes can be found here: http://tech.knime.org/community
More information about the Palladian toolkit is available here: http://palladian.ws/
If you have any questions, comments, or problems, we are happy to hear from you: firstname.lastname@example.org
The Palladian extension is released under the Palladian Free Software License Version 2.1.
The Palladian KNIME Nodes were created by Philipp Katz, Klemens Muthmann, and David Urbansky, 2011 – 2018.
There’s even more — check out the Selenium Nodes!
The Selenium Nodes let you control your browser from KNIME for advanced web scraping, task automation, and web application testing.