I am trying to read a large XML of customer data. The issue is that there are several columns within the XML around 12. The KNIME is reading these as individual columns and just putting them next to each othere, where as i need them to be read as one customer.
The problem here is that if one piece of data is blank, instead of leaving it blank, it pushes the data up, so now the data is out of sync, for example where the data should look like this:
I have slight trouble visualizing what your actual situation is. You mention having a XML, but your current output and expected output does not really reflect that. Also how is your Xpath is configured? This also regarding the output being in separate columns as you mention.
Theoretically, you should be able to play around with the empty cell option in the Xpath node to control this.
So i have a very large XML file that i am reading in. I have attached a sample size of the structure of the XML. In this you can see that in some cases there is missing values, not blank values, but the headers themselves are not there. In this case the value will move up.
So for the first customer, they don’t have a tax field, in my KNIME i would expect this field to be left blank, but instead it moves the tax field from the customer below it up one space. This is causing my data to be out of sync.
I have tried to use the “return missing cell on empty string” and i set it to “multi row” but it doesn’t resolve my issue, because technically the cell isn’t blank, there just isn’t one there at all for it to read.
Next, add another Xpath node to query the customer specific fields. Something which you probably already have in similar form. Note: the null checkbox is not required.