I am a new KNIME user and a university student from China. After completing the self-paced online courses, I started a data mining internship, but I have run into some problems.
Currently I am working on the correlation between the tolerances of multiple parameters. The tolerance columns are named Tolerance, Tolerance #1, Tolerance #2 and so on. Each value is a string such as 2.2…4.2 (float number + delimiter … + float number). I then need a Math Formula to compute (practical result - lower bound and upper bound) / (lower bound and upper bound), and later I may also try the standard deviation.
As there are more than 30 tolerance parameters, is there an easy method (such as loops or flow variables) to process each Tolerance column one by one?
I tried String Manipulation, Cell Splitter, Column Splitter and loops (Column List Loop, Group Loop, Chunk Loop), but I still cannot get it to work.
Likewise, is there an easy method (such as loops or flow variables) to apply the Math Formula to each of them?
A screenshot showing the structure of a data sample is attached.
Processing one column at a time or a group of columns together are both acceptable.
The columns are arranged systematically: each tolerance column is next to its parameter column.
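To make the question concrete, here is a minimal Python sketch of the same processing outside KNIME. It rests on several assumptions that are not confirmed in the post: the delimiter is either the single character … or three dots, each tolerance column sits directly after its parameter column, and the formula means the deviation relative to each bound, i.e. (value - lower) / lower and (value - upper) / upper. The column names Width and Height and all sample values are made up for illustration.

```python
import re

# Assumption: a tolerance looks like "2.2…4.2" or "2.2...4.2".
_TOLERANCE = re.compile(
    r"^\s*(-?\d+(?:\.\d+)?)\s*(?:…|\.{3})\s*(-?\d+(?:\.\d+)?)\s*$"
)


def parse_tolerance(text):
    """Split a tolerance string into (lower, upper) floats."""
    match = _TOLERANCE.match(text)
    if match is None:
        raise ValueError(f"unrecognised tolerance: {text!r}")
    return float(match.group(1)), float(match.group(2))


def relative_deviation(value, lower, upper):
    """Deviation of value relative to each bound (one reading of the formula).

    A bound of 0 would divide by zero, so that part is returned as None.
    """
    def rel(bound):
        return (value - bound) / bound if bound != 0 else None

    return rel(lower), rel(upper)


def process_rows(columns, rows):
    """Apply the same calculation to every (parameter, tolerance) column pair."""
    # Assumption: each tolerance column directly follows its parameter column.
    pairs = [
        (columns[i - 1], name)
        for i, name in enumerate(columns)
        if i > 0 and name.startswith("Tolerance")
    ]
    results = []
    for row in rows:
        result = {}
        for parameter, tolerance in pairs:
            lower, upper = parse_tolerance(row[tolerance])
            result[parameter] = relative_deviation(float(row[parameter]), lower, upper)
        results.append(result)
    return results


# Hypothetical sample table with two parameter/tolerance pairs.
columns = ["Width", "Tolerance", "Height", "Tolerance #1"]
rows = [
    {"Width": 3.3, "Tolerance": "2.2…4.2", "Height": 10.0, "Tolerance #1": "8.0...12.0"},
]

for result in process_rows(columns, rows):
    print(result)
```

The point of the sketch is that the column pairs are derived from the header order, so adding a 31st tolerance column needs no extra rule.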
@Kenyx I think it would help if you could provide us with a sample file and an explanation of what you want to have as a result. You can identify blocks in an Excel file and handle them (e.g. by giving the blocks IDs) as databases - like in this example:
But you will have to invest some thought in the structure and in which parts of your data would be dynamic.
But maybe your solution is much simpler and you just have to assign rules to a data table.
Yes, I am continuing the work of the previous intern. He analyzed the dataset’s original parameter results with a metanode that consists of 20 Rule Engine nodes; each one checks a parameter against its tolerance and labels it ‘within’ or ‘out of range’. Now I want to optimize the workflow and focus the analysis on the effect of outlier values of a preceding parameter on the subsequent parameter.
I will try your recommended workflow today. Thanks.
I am getting a clearer picture of your challenge. @mlauber71 is giving you some tips for gathering and cleaning up your data.
I’m sharing a workflow that I prepared for a different post last October. I think it can be useful for your challenge, as it covers some of the points that you mention:
Iterate over groups of data
Individual math or statistical analysis (bounds and binning) by groups.
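As a rough illustration of what "iterate over groups, then compute statistics per group" means, here is a small Python sketch. It is not a reproduction of the shared workflow. The record layout, the group names A and B, and the choice of mean ± 3 standard deviations as bounds are all assumptions made for the example.

```python
from collections import defaultdict
from statistics import mean, stdev


def group_bounds(records, key="parameter", value="value", k=3.0):
    """Collect values per group, then derive mean, stdev and mean ± k·stdev bounds."""
    groups = defaultdict(list)
    for record in records:
        groups[record[key]].append(float(record[value]))

    summary = {}
    for name, values in groups.items():
        centre = mean(values)
        # The sample standard deviation needs at least two values.
        spread = stdev(values) if len(values) > 1 else 0.0
        summary[name] = {
            "mean": centre,
            "stdev": spread,
            "lower": centre - k * spread,
            "upper": centre + k * spread,
        }
    return summary


# Hypothetical long-format records: one row per measurement.
records = [
    {"parameter": "A", "value": 1.0},
    {"parameter": "A", "value": 2.0},
    {"parameter": "A", "value": 3.0},
    {"parameter": "B", "value": 10.0},
    {"parameter": "B", "value": 10.0},
]

summary = group_bounds(records)
for name, stats in summary.items():
    print(name, stats)
```

Each pass of the second loop corresponds to one iteration of a group loop: the same calculation runs on one group's rows at a time.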
Thank you @gonhaddock
Iteration and group processing will definitely be the best solution for a large number of parameters, but I don’t yet fully understand flow variables or how the different kinds of loops are used and applied. I will try it out over the weekend.
Agreed, I had the same idea at first too: change the wide format into long format. But the flow variables, loops and iteration got me stuck; I have no idea how to configure and set them up.
Otherwise there will be one rule per parameter, which would be the same huge workload.
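For what it is worth, the wide-to-long idea can be sketched in a few lines of Python. This is only an illustration under assumptions: the tolerance column directly follows its parameter column, the delimiter is … or three dots, the bounds are inclusive, and the column names and values are invented. Once the table is long, a single rule covers every parameter.

```python
def split_tolerance(text):
    """Turn "2.2…4.2" or "2.2...4.2" into (lower, upper) floats."""
    lower, upper = text.replace("...", "…").split("…")
    return float(lower), float(upper)


def wide_to_long(columns, rows):
    """Unpivot (parameter, tolerance) column pairs into one record per measurement."""
    # Assumption: each tolerance column directly follows its parameter column.
    pairs = [
        (columns[i - 1], name)
        for i, name in enumerate(columns)
        if i > 0 and name.startswith("Tolerance")
    ]
    records = []
    for row_id, row in enumerate(rows):
        for parameter, tolerance in pairs:
            lower, upper = split_tolerance(row[tolerance])
            value = float(row[parameter])
            # One rule for all parameters (inclusive bounds are an assumption).
            status = "within" if lower <= value <= upper else "out of range"
            records.append(
                {
                    "row": row_id,
                    "parameter": parameter,
                    "value": value,
                    "lower": lower,
                    "upper": upper,
                    "status": status,
                }
            )
    return records


# Hypothetical wide table: two rows, two parameter/tolerance pairs.
columns = ["Width", "Tolerance", "Height", "Tolerance #1"]
rows = [
    {"Width": 3.3, "Tolerance": "2.2…4.2", "Height": 13.0, "Tolerance #1": "8.0...12.0"},
    {"Width": 4.5, "Tolerance": "2.2…4.2", "Height": 9.0, "Tolerance #1": "8.0...12.0"},
]

long_records = wide_to_long(columns, rows)
for record in long_records:
    print(record)
```

Keeping the original row number in each record makes it possible to pivot back later, or to compare a parameter with the one measured before it in the same row.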