:â(
And how the names of columns will changes for each iteration with loop ( Row0,Row1,Row3,âŠ) even Iâm not using in this node the flow variable ?!! @ipazin , Please help : (
Br.
Have I configurate the flow variable again in this Node ?!! How to do this please !
may you help be in this ? may be is the last question ! I donât want to give up because itâs so urgent for me to resolve this !
This error means you have different column types in different iterationsâŠ
Go iteration by iteration and see what is result from first iteration and what from second one.
Donât think so it is due to second Rule engine.
Ivan
Thank you very much for your help.and iâm very grateful for you !
In fact ,In the beginning I had 996233 rows , with transpose node the loop have to treat then 996233 column to get if there a repeated value. And that taked a lot of time (more than 2 days).
Is there any Solution for make that more fast, because I have to work a lot with this workflow ! and present the result in a report data.
Thank you again.
Bests Regards
I wish you a very good day !
Some of these operations do last a bit longer. Not sure there is a faster way but more then two days is a bit long! Can you share a workflow for me to check? How many column do you have?
For execution time you have Timer Info node which will give you execution for each node so you will see node execution time so you can optimize.
I got it. Still donât understand why it takes so much time. I tried with 1000 rows and 30 columns and it was around 2 minutes so not sure. Which part takes the longest? Use Timer Info node
Couple of observations:
In Missing Value node you donât do anything - you should remove row with missing value
In GroupBy node uncheck missing
Second Rule engine node seems that you logic is opposite of what you have written before. If you result column is 0 then you write TRUE and actually if result is zero that means you do not have any repeated values.
Loop end - you should uncheck Add iteration column - you do not need it
1 - then you can do it on default tab for every string data type remove row if missing. Otherwise you have to use flow variable which is ok as well if you wish.
not sure about separating and then concatenating to be faster. You can try it.
Here is a blog about optimazing knime workflow. Maybe you can find something.
Do do a little math: you need to go into loop 99623 times. Lets say 1 loop execution (1 column) takes 1 second to execute. That means the whole loop takes 99623 seconds to execute. Divide it by 60 and it is 1660 minutes. Divide it by 60 again and you get that execution lasts more then 27 hours
Here is a link to another forum thread about speeding up your workflow: