Hi Please can I request nodes in Indigo around the following;
- Enumerator/Combichem node which takes a scaffold(s) as input into one port which has Rx positions defined, i.e. R1, R2, R3, R4. (so basically either the scaffolds output from port1 in the decomposer node, or the scaffolds from port0 would be good templates for this). Then in a second port of the enumerator node is input a list of groups with attachment points across multiple columns (i.e. as the output is now from the decomposer node, these R Group columns would be ideal). Then within the Enumerator node you then say which R group column you want to match up to the R1 on the scaffold, which R group column you want to match up to the R2 on the scaffold etc.
- Transformation node where you can choose where the reaction is to take place and thus gives an output of the molecules with attachment points. I hope it can be used to define not only attachment points but also Rx groups for a scaffold too, so for example, being able to take Toluene, and then specifying the introduction of an R1 group at the ortho position to the methyl.
- Improvement to "Molecular Properties" node to include Polar Surface Area (PSA), Hydrogen Bond Donors (HBD), Hydrogen Bond Acceptors (HBA). I appreciate these can be worked out with looping and such like but really need a user friendly way of doing this simply from an interface. Also QSAR Properties to calculate properties like SLogP, Lipinksi Rule of 5, Molecular Volume.
- Alignment node which takes a dataset of molecules in one port, and a set of scaffold(s) in the other point and simply aligns the molecules from the dataset to the same orientation to those drawn in the scaffold set. This is useful as chemists often want structures drawn in the same orientation to aid visual interpretation of molecules. So for example if the scaffold is an indole drawn in a way such that the Nitrogen is at the bottom right, and the 6 membered ring is on the left, and 5 membered on the right, then the output structures of the dataset will have all those molecules redrawn in the same way.
- Stereo Enumerator node which will take a molecule which has chiral centres present and will enumerate the molecule into all the possible enantiomers/diastereomers. Also if double bonds are present, E and Z enumeration is undertaken.
- Isotope Calculator node which will take a molecule and calculate the common isotopic masses of the molecule with relative abundance, i.e. HCl would be 36 (75%) and 38 (25%). This would be really nice for the medicinal chemists in calculating masses from an LCMS, something missing from KNIME at the moment.
- Scaffold Detector node which does not calculate just the Maximum Common Scaffold but calculates commonly exemplified scaffolds within the dataset.
- PAINS Detector node based on the excellent work of others on this site in implementing a workflow around PAINS detection, can this be simplified into one node to provide a count per molecule on the number of problematic groups found in a molecule.
- Tautomer Standardiser node which will take a set of structures and make sure they are all drawn in the same tautomer, i.e. 2-pyridinone and 2-pyridinol are drawn in the same way. This can be very useful to have them uniform in substructure searching and matched pair detection. Currently I see no tool in KNIME capable of this.
- Chirality Finder node which only returns back molecules with chiral centres present. It would be good if possible that the results are ordered in such a way that enantiomeric and diastereomeric pairs are listed next to each other in the table.
- Substructure Dictionary Search like the RDKit node which will search a dataset of molecules using multiple substructure search queries and you can choose to return the results which match at least x number of substructures.
- Indigo Molecule to IUPAC Name Calculator node which will take structures and generate an IUPAC name from them.
- IUPAC Name To Structure node which will take IUPAC names and convert them to Indigo Molecules.
These are just a selection of nodes I would love to see in KNIME based around chemistry. The current implementation of Indigo nodes has been terrific and I would love to see them expanded to include some of the above suggestions. Others feel free to add to this list or suggest favourites etc.
Simon.