I've got an R Source (Table) node where I'm reading in a dataset from a csv and doing some basic preliminary cleaning. I'm using fread from the data.table package to read the 0.137gb file and it reports that there are 403739 rows and 49 columns.
The manipulation removes a row, so when I run nrow on what I assign to knime.out, I get the expected 403738. So far so good.
When I run the node, the output is 353758 rows and 49 columns! Where did the other rows go?!
I thought it might be something to do with rows entirely of missing values being automatically removed, so I tried creating a column in the dataset which would have a value (data$truth <- TRUE). It made no difference.
Any ideas?