I am struggling with adding new rows as follows, the first table is what I have at present, and the second table is what I want to get, so is anybody know how to achieve this goal? Thank you very much in advance!
My main aim is to get texts from URL and analyse these texts. I have successfully built the workflow and it works well with xpath, however, I am struggling with the UTF-8 encoding when using htmlparser since the Chinese characters in the website page (url: http://101.227.16.139/ire/2016/11/1/IADT_CPI_AD_46.html) have been parsed into gibberish code as following:
I have checked with the URL technical team and they confirmed that the URL encode is UTF-8, however, I still got the above mess with Chinese. How can I solve this issue with Knime? Is there some node to change these xml to normal display in Chinese? THank you very much if you can help to to solve this!
I wrote the output to a csv file, and then read it in again with a file reader. the file reader allows you to specify the character encoding, and it brings back the Chinese characters.
I've also inlcuded a java snippet solution to the initial question, using lists in java and ungroup to exapnd it out.