Hi, Below are a few suggestions and improvements to KNIME:
- When using the scrollbars in the workflow pane, they scroll as far as the furthest most node, likewise for the arrows next to the scrollbars. However, when building a workflow, you often get to the far right, and want more space to correctly place nodes in a desired position. Therefore is it possible to make it so that if you keep clicking on the arrows next to the scrollbars that it carries on beyond the last node and into white space which will then make workflow design less hassle. This has been a constant niggle I have had which I keep meaning to post.
- Inside a metanode, is it possible to make it so that the metanode vertical greybars with the arrows on are always a set distance away from the nearest node, so everytime you add a new node, they continue to move this set distance away. They have a habit of randomly placing themselves ontop of a node when you go back into the metanode, and sometimes when you try and move them further to the right, they jump back into the position they were in (i.e. ontop of a node which then makes editing or moving this node tricky).
- The "Delegating Loop Start" and "Delegating Loop End" nodes are great and really powerful, but its highly likely no-one knows what they do or where to find them. Would it make sense to rename them to "Recursive Loop Start" and "Recursive Loop End" and move them into the Flow Control section?
- Addition of "ChemSpider" nodes which accesses the free chemical database at www.chemspider.com. This would be really powerful to get chemical names from smiles (useful reverse process for the recent OSCAR implementation), to get molecular properties of molecules, analytical spectra, CAS numbers, and really helpfully chemical suppliers which is missing from KNIME at the moment.
- A Quickform chemical drawing program would be useful. Using the workflows through a webserver is very limiting without being able to draw the structure. I appreciate smiles can be used, but this is not very user friendly. Additionally, generation of the interactive table as a popup window for the output would be good too when using the webserver for workflows.
- Also for the quickform nodes, it would be nice to have an option in these nodes which enables them to pop up when the workflow is run in the KNIME application, asking for the user to select an option to be picked. i.e. it would be nice to set up the "String Radio Buttons" quickform node, then when a user runs the workflow, a window pops up in which the user selects the desired radio button. I can appreciate that in all cases, the user designing the workflow may or maynot want these windows to pop up, so just a tick box in the node configuration would be useful to turn this feature on or off. The reason I ask for this, is some users just want to press execute on a workflow and editing nodes can be unsettling to them, so having windows pop up for them to make options is a less scary alternative!
- "Export to E-Mail" node would be useful, where the entire table is emailed to the user specified email address in the node configuration. And also a second email node "Cell to E-Mail" node would be useful which takes one column for the email address and one column for the cell contents to be emailed, running this node then sends out as many emails as rows in the table to the different email addresses. This can be useful to update clients in the table of some regularly changing property associated to them.
- An improvement to the "Maths" node would be good which besides having the column list in the node config, would also have the flow variables list too so you can build maths expressions with the available variables. The get around is to use two Maths nodes, one which just has a variable in the expression to make a new column, and then another to take this new column and build the maths expression, but why the need for making it so cumbersome. Also should the Maths node be moved from "Misc" category and put into "Data Manipulation/Row/Other" along with the "Rule Engine" node which is already there.
- "Statistics" node is missing a vital statistic. It shows how many datapoints have missing values, but strangely does not report how many datapoints there are in total. A rather strange omission. Please can this be included.
- Improvements to "HiLite Collector" node would be nice, at the moment its rather unuser friendly. Instead of editing the view prior to node execution, it would be much more user friendly if this view popped up when the node was executed, and the node remains in executing mode until the user completes the changes and closes the window. At this point the node completes and the rest of the workflow continues.
- "Sorter" node should automatically put missing value entries at the end of the sort list. Its most frustrating having them appear at the top of the list.
- "Integer to Double" node would be good. I appreciate this is easily done with the Maths node, but most users dont realise this. It makes it very simple for novice users if both options are together.
- The quick filter box in the node repository pane is still painfully slow. Can it be made so that if the user types in letters here, the addition of another letter by the user overides the searching of the node repository. The trouble is, after every letter is entered, it searches the node repository, this slows everything down when just typing in a 5 or 6 letter word (and also then deleting this word a letter at a time is slow too). It would be better if it waits for 2 seconds before searching the node repository, so it gives the user a chance to enter or delete all the remaining letters.
- A reworking of node repository structure would be good (please please!), particularly around removal of the KNIME Labs and Community Contributions directories and moving them into more appropriate folders, i.e. RDKit/Indigo/CDK/Erlwood/ChemAxon into Chemistry, and new categories in the repository root for Text Processing, Image Processing, Internet Processing (for Palladian and Web Analytics), Scripting (for Perl, Java, Python, Octave, R, MatLab, Groovy), also moving Weka and Statistics into Mining and rename it to Modelling&Mining, and moving Parallel Looping into the Flow Control section. The current node structure must be overwhelming for new users as there are so many nodes and often in non-intuitive locations, I'm forever showing people nodes they dont know about. If they dont know they are there, they dont know to search for them! If nodes in KNIME Labs are still experimental, you could always put a little test tube symbol next the directory name to indicate this, and for Community nodes, you could always add a little symbol of a crowd of people to indicate they are Community based.
Thanks,
Simon.