Knime 4.4.1 does work with my 4.3.4 workflow

Hi,

This morning I upgraded from KNIME 4.3.4 to 4.4.1. I have a workflow that I have used for about six months that creates a table with polygons which I can view with the “View Geometries with Map” node. It also outputs to a CSV file which I use to create maps in QGIS.

For some reason, running exactly the same flow, the map node pops a “no geometry found in column the_geom” error and when I load the CSV in QGIS there is no geometry.

Any suggestions where I start to try and resolve this problem?

tC/.

1 Like

Hi @TigerCole , I’m sorry to hear that.

Can you explain how you updated your Knime? Was it a fresh install of 4.4.1?

1 Like

Hi @Bruno29a

It was. I uninstalled 4.3.4 and then installed 4.4.1 into a new folder KNIME4 so there should be no issue with fragments from an earlier install.

tC/.

2 Likes

Hi @TigerCole , yes, that is the correct way.

I’m not sure what could be the issue. Normally if a workflow has issues because of versions, Knime would notify you about it before opening the workflow, and will even suggest to download/install any new nodes for you and replace them for you.

Hopefully someone from the Knime team can jump in.

1 Like

I have never had an upgrade issue before. To confirm the issue I am going to export the flow and run in on another computer that has 4.3.4 installed and see what happens.

Will post an update on what happens.

tC/.

2 Likes

@TigerCole What kind of data is being imported in the first place? Sometimes when there are CSV imports it might be that a new installation gets somewhat ‘confused’ about the columns, names or types and that might affect nodes further down the road.

So maybe you start by checking the data import and see if this still does work.

Then it could help if you could provide us with an example where the node does fail so it would be possible to inspect that.

2 Likes

Hi, @TigerCole, can you please share with us which nodes from which extension you are using?
Most community extensions are compatible with 4.4.1 but not all of them.

best,
Gabriel

3 Likes

Hi @mlauber71 and @gab1one

I am importing a CSV file that is the output from a preceding workflow.

As some feedback ,

I set up a VM with a clean install of Knime 4.3.4 and ran the same flow with the same CSV file input and the output worked exactly as expected.

I want to do 2 further tests before I call foul.

  1. Set up a new VM with a clean install of Knime 4.4.1 and do the test with the same flow and the same CSV file and confirm the result. This will ensure that there is not some issue with the install on my computer that has had a number of Knime upgrades in the past 2-3 years.
  2. Rebuild the same flow from scratch on a clean 4.4.1 build. It may be that the flow built in 4.3.4 has some issue somewhere and clean 4.1.4 nodes resolve the issue.

If both of the above tests fail I will export the flow with anonymised data and hopefully, with great community and Knime Team support, figure out what is happening and what the solution might be.

Thank you for all the support.

tC/.

3 Likes

@TigerCole CSV typically is an insufficient way to store information (preserving data types), especially if the data is from another KNIME (?) workflow. KNIME .table, Parquet or ORC might be much more stable as well as local H2 or SQLite database. If you absolutely must have a TEXT file ARFF could be another choice preserving structure and data types.

3 Likes

Hi @mlauber71

Thanks for the suggestion on alternative formats to save data for forward flows. The SQLlite idea sounds good because I am using it for another application. I will give it a try.

tC/.

1 Like

@TigerCole SQLite offers a lot of benefits and it does integrate with KNIME. And also with R and Python BTW.

The driver used by KNIME is not the latest one but you can change that easily:

2 Likes

Indeed, CSV is an insufficient format, in a similar way than JSON. On the other hand, CSV is widespread and non-proprietary. Using SDMX or a proprietary format is not always an option, and let us not even mention the unpredictable Excel format.

If type safety is a concern for CSV, one way to deal with it is to insert a “dummy row” just below the header row (so basically row number 1), in which, for each column, there is a value of either abc or 123 to push that column into either string or numeric format. For date columns, using ISO date formatted strings is probably the best choice. After importing, simply filtering the said row will accomplish the trick.

Congratulations btw for having included more options for enforcing type safety in the readers of the recent KNIME version.

1 Like

Hi @gab1one

I found the problem… I have a java snippet in the Knime flow that takes the values in a Latitude array and Longitude array and creates a WKT polygon.

The values in the arrays are double with “.” as the decimal. For some reason in 4.4.1 when node java snippet creates the WKT polygon, which is a string, the “.” in the double is changed to a “,” is incorrect.

I don’t know why it does this in Knime 4.4.1, this does not happen Knime 4.3.4. I have not changed any of the code in the snippet.

Any suggestions why this might be happening?

tC/.

language and regional settings if you are on Windows. You have set “,” as decimal separator. It’s possible in 4.3 it just used “.” by default and in 4.4 it was “fixed” to use what you have as OS setting.

2 Likes

Hi @kienerj

I wish it was that simple, it was the first thing that I checked. The default decimal for my computer s definitely “.” … checked, confirmed, and tested.

tC/.

Hello @TigerCole,

Maybe due to change in Java version with KNIME v4.4.0? (see here). Don’t know your Java code and if you can address it there (or is it related to OS settings) but can’t you replace comma with dot using String Manipulation node for example?

And please, don’t open multiple topics for same issue.

Br,
Ivan

1 Like

Hi @TigerCole , not sure why it is happening, but may be you can force it to be “.” instead of “,”. There are a few ways to do this, I think it could even be done in Java. If you can show what you are doing in your java snippet, someone for sure can help.

Alternatively, you can always manipulate the results as @ipazin suggested via String Manipulation.

1 Like

Hi @ipazin

My apologies about the new topic. I assumed a new issue and a new topic.

It does not look like a change in the Java version is an issue.

I probably can change the “,” with a String Manipulation node but it is an extra step in a flow that is processing 12 million rows that I didn’t need in 4.3.4 and I would like to figure out why it is happening. Adding the extra node is like a bandaid on a bleeder.

I don’t think that the Java code in the snippet node is complicated.

// Your custom imports:
import java.util.List;
import java.util.ArrayList;
import java.io.StringWriter;
import java.util.stream.Collectors;
import java.text.NumberFormat;
import java.text.DecimalFormat;
// system variables
public class JSnippet extends AbstractJSnippet {
  // Fields for input columns
  /** Input column: "latArr" */
  public Double[] c_latArr;
  /** Input column: "lngArr" */
  public Double[] c_lngArr;

  // Fields for output columns
  /** Output column: "the_geom" */
  public String out_WKT;

// Your custom variables:


// expression start
    public void snippet() throws TypeException, ColumnException, Abort {
// Enter your code here:

//Target: MULTIPOINT ((LON1 LAT1), (LON2 LAT2), (LON3 LAT3), (LON4 LAT4), (LON5 LAT5), (LON6 LAT6))
NumberFormat fmtr = new DecimalFormat("###.###############");      
List<String> points = new ArrayList<>();
for (int idx=0;idx<c_latArr.length;idx++){
	double lat = c_latArr[idx];
	double lng = c_lngArr[idx];

	StringWriter segment = new StringWriter();

//	segment.append("(");
	segment.append(fmtr.format(lng));
	segment.append(" ");
	segment.append(fmtr.format(lat));
//	segment.append(")");

	points.add(segment.toString());
}

//add the 1st lati and longi  to the tail to close out the multipoint spec.
{
	double lat = c_latArr[0];
	double lng = c_lngArr[0];

	StringWriter segment = new StringWriter();
//	segment.append("(");
	segment.append(fmtr.format(lng));
	segment.append(" ");
	segment.append(fmtr.format(lat));
//	segment.append(")");

	points.add(segment.toString());
}

out_WKT = "POLYGON (("+points.stream().collect(Collectors.joining(", "))+"))";

// expression end

Is there anything above that may be the cause. or a change that may resolve the problem?

If an export of the node will help, I can send it with a data sample.

Thanks.

tC/.

1 Like

Hi @TigerCole , since I don’t have the proper data and column type, I just used a single record such as this:
image

And of course, since I’m not dealing with list/arrays, my code is very simple, but I’m still using the format that you used:

NumberFormat fmtr = new DecimalFormat("###.###############");      

double lat = Double.valueOf(c_column1);
double lng = Double.valueOf(c_column2);

out_WKT1 = fmtr.format(lng);
out_WKT2 = fmtr.format(lat);

out_WKT = "POLYGON (("+out_WKT1.toString()+", "+out_WKT2.toString()+"))";

I decimals stay as “.”:

Running on Knime 4.4.1.

1 Like

Hi @bruno29a … I almost want to say that this is specific to my computer but I have replicated it on other computers as well.

What happens if you change Column1 and Column 2 to double?
Would you mind trying that?

tC/.