Python Script Node ( Getting in all the Data in Data Frame) || Fill Value must be in Categories

oshin · October 14, 2020, 2:18pm

Hello,
So basically I am trying to feed in 2 tables to Python node (2=>1) , internally I am trying to do some processing which requires my entire table to be fed, I am using some aggregated values on certain columns required dynamically, hence the need for entire data frame.
Out of the 2 tables that I feed one table ( table on which my processing algorithm will work)
has 1516 rows.
However when I try to print it’s size inside Python Node ( am getting 1000).
Now this is disturbing my entire logic internally.
I tried changing the chunk size via configurations but did not help me. I am surely required to read the entire data frame for this and many other further use cases. How can I do this?
Secondly I tried running my piece of logic on Jupyter Notebook it ran successfully as expected, however when I am trying to run it via Knime Node I am getting the error “Fill Values must be in Categories”.
Help Appreciated!

MarcelW · October 14, 2020, 3:29pm

Hi @oshin,

Regarding your first problem: only when in the script editor/configuration dialog of the node, not all of the rows of the input tables are loaded into Python. This is done for performance/interactivity reasons and can be changed via the Row limit (dialog) option on the Options tab of the configuration dialog.
When actually executing the node, all of the input will be considered regardless of that option.

Marcel

oshin · October 15, 2020, 5:33am

@MarcelW Thanks that helped!
My second Problem “Fill Values must be in Categories”. is still not resolved . Any help would be appreciated! Thanks:-)

mlauber71 · October 15, 2020, 8:17am

@oshin maybe this thread can help you

oshin · October 16, 2020, 1:22pm

Hi @mlauber71 Thanks this link helped.

system · October 23, 2020, 1:22pm

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.