Dear all,
The nightly builds of the RDKit nodes now contain some new open-source nodes contributed by NIBR:
- Descriptor Calculation : calculates a set of 2D descriptors that are useful for building QSAR models
- Salt Stripper : uses a set of standard definitions to remove salts from input molecules.
- Functional Group Filter : filters a set of input compounds based on counts of standard functional groups.
- Substructure Counter : counts the number of times a set of substructures (provided as an input table) appear in molecules
There are also nodes for reading and writing the FPS format used in Andrew Dalke's chem-fingerprints tools (http://code.google.com/p/chem-fingerprints/).
Finally, support for calculating the Avalon fingerprint has been added to the RDKit Fingerprint node.
The Avalon fingerprint was described in the following publication:
Gedeck, P., Rohde, B. & Bartels, C. QSAR − How Good Is It in Practice? Comparison of Descriptor Sets on an Unbiased Cross Section of Corporate Data Sets. Journal of Chemical Information and Modeling 46, 1924-1936 (2006).
The implemention used here uses an RDKit wrapper around another NIBR open-source project, the Avalon toolkit: https://sourceforge.net/p/avalontoolkit/wiki/Home/
Thanks to Manuel Schwarze, Dillip Kumar Mohanty, Swarnaprava Singh, and Sudip Ghosh for their work on the Knime nodes and for Bernd Rohde for the Avalon toolkit.
I hope these are useful to the knime community,
-greg