About the nodes
These nodes, developed in collaboration with KNIME.com, provide some basic, but robust and high-performance, chemistry functionality within KNIME. The current set of nodes includes functionality for:
- Converting between SMILES or SDF and RDKit molecules
- Generating canonical SMILES
- Substructure filtering using SMARTS or RDKit molecules
- Substructure counter with visualization of counted substructures
- Highlighting atoms in molecules for, for example, showing the results of substructure matching
- Filtering sets of molecules by presence or absence of well-defined functional groups.
- Chemical reaction enumeration
- Stripping off salts from molecules
- Picking diverse molecule subsets
- R-group decomposition
- Generating Murcko scaffolds and frameworks
- Generating 2D coordinates for molecules, optionally including a template.
- Generating 3D coordinates for molecules.
- Generating a variety of molecular fingerprints and reading and writing fingerprint files:
- RDKit fingerprints (Daylight-like topological fingerprint)
- Morgan fingerprints (ECFP/FCFP-like circular fingerprints)
- Atom pairs
- Topological torsions
- Avalon fingerprints
Some example workflows can be found on the KNIME Public server that is accessible from within each KNIME installation (look for 099_Community/01_RDKit).
About the RDKit
The RDKit is an open-source cheminformatics toolkit written in C++ that is also useable from Java or Python. It includes a collection of standard cheminformatics functionality for molecule I/O, substructure searching, chemical reactions, coordinate generation (2D or 3D), fingerprinting, etc. as well as a high-performance database cartridge for working with molecules using the PostgreSQL database. The toolkit is covered by the BSD license.
More information about the RDKit is available from the website www.rdkit.org.
The source code can be accessed at https://anonymous:email@example.com/svn/nodes4knime/trunk/org.rdkit.
The RDKit nodes are released under GPLv3.