Changing typecast of unknown column types

There are instances when columns can take on an unknown typecast denoted by a ? Icon next to the column name. This often happens after transposes or unpivoting. I am unsure why this happens, if a column contains numbers as well as text surely it should be denoted as string typecast.

handling these unknown column types in subsequent nodes is tricky. I would have hoped that there would be nodes to easily convert these, but that doesn't seem the case.

is it possible to have a node like string to number node, which changes typecast, but obviously have ones called unknown typecast to string, and unknown typecast to double.

this would significantly improve column handling.

thanks,

simon.

1 Like

Hi Simon, 

I think this is doable. Have you tried using the column rename node to cast the mystery columns as a string, then String to Number to convert them to doubles? 

Cheers,

Aaron

Yes, I have tried that.

Firstly, this is painful when column names change names, as you have to redo it all over again in the rename node.

The advantage of the proposal, is that such a node would only display the "Unknown" column types in the include box in the node, and you can use the force inclusion/exclusion protocols to select what columns would change type. This way it can be set up in an automated fashion.


Additionally, the Rename node never seems to change the typecast properly. If you change a column to string for instance. If you then look at its column type description, it is not "String" but "Non-native", and this also causes problems in some nodes. Ideally, it should change it precisely to the typecast desired, not to this "non-native" type.

Thanks,

Simon.

1 Like

Hi Simon, 

I see what you mean now.  With dynamic column name, this is not very easy.  I have gone ahead and logged this request in our tracking system. We will let you know when we have a better solution for you. 

Best,

Aaron

 

1 Like

Thanks for logging this.

would be great if something more dynamic and automated way could be introduced.

thanks,

simon.

Hello Simon, hello alll,

 

How do you currently deal with this situation? I have a Transpose node that will correctly cast the columns to doubles 90% of the time. It's the 10% that I cannot find a way to deal with.

 

Cheers,

Fred

1 Like

Hello all,

 

Attached, you will find screenshots of the workflow I use and the configuration of the two Java Snippet (simple) nodes.

castunknowntodouble_n01.png is Node 15 and castunknowntodouble_n02.png is Node 18.

The IF Switch node is controlled by the IfSwitchSelector flow variable.

 

HTH,

Fred

Hey Guys!

Anything new about the "unknown to String/double/integer" node? It would be really great, if someone could do this... :)

2 Likes

You can now use "Column Auto Type Cast"

3 Likes

I just solved the problem by avoiding that the type is changed to UNKNOWN during transpose by creating a column i.e. with STRINGS before the transpose action. The type will be automatically taken from it during transpose. I simply remove the new row again after transposing.

cheers,
Claudia

2 Likes

Hi @CGS,

welcome to KNIME Forum!

Not sure I get it. Before transposing you added string type column and after transposing you got all strings or?

Br,
Ivan

1 Like

Hi @ipazin,

yes, in my case I added a column with strings (I used the RowID node to transfer the header strings to the newly inserted column). However, I guess numbers would also work. Just that Knime can automatically assign a type to the transposed columns. The type will stay even when the table is empty after the additional row is deleted again.
Still a workaround, I know, but for me the Auto Type Cast node mentioned above did not work since I have a changing number of columns.

Best,
C

1 Like

Hi richards99,

I also met the same challenge recently and find this post.
However, I want to share another method to solve.

Yes, it will happen most after unpivoting the columns, if the columns are more than two different type ( e.g. string for column A and data for column B), after the unpivoting, the new column will show a question mark and can not manipulate for string or data operation afterwards. Especially in a loop.

So my way is add one String to Number node before Unpivoting Node. In the configuration node “Enforce Exclusion” to keep the columns that not for unpivoting. So for the rest column that we want to do unpivoting, if they are not typo of number, will automatically appear in the “Include” Tab, then convert to number.
Especially in a loop, we are not sure which column is text or number, so we can use this way to convert into number.

Hope that can help!

Regard,
Kevin C

3 Likes

@kevinchen Thanks so much. Today I encountered with this issue of unknown type columns too. In my case, the issue arises when Concatenating string rows with that of number rows. I followed your solution, but in my case I did it the other way around, which was to convert the Numbers to String. The result was that all my columns are now in String type after Concatenating.

The only minor irritation is that all integer values now have been appended with decimals of point-zero (.0) for unknown reasons. Anyways, cheers!

Update: I was working on another problem which leads me to this node I’ve never encountered before.

It is the Column Auto Type Cast Node.

It’s able to convert all the Unknown Type columns without us having to alter the upstream tasks. Hope this helps to whoever’s having issues in the future.

3 Likes

I encounter similar issues with unknown data type and the Column Auto Type Cast resolved the issue.

I don’t understand why the column rename can’t change the data type of unknown fields.

2 Likes