How to convert HTML characters into plain text

David_G · October 4, 2023, 2:48pm

I have some data fields that have HTML encoding.

Example 1:
Isol. from Dryopteris abbreviata
Example 2:
C23H28O8

How do I convert these into the following text strings?

Example 1 (converted):
Isol. from Dryopteris abbreviata
where the ‘Dryopteris abbreviata’ is in italics

Example 2 (converted):
C₂₃H₂₈O₈
where the numbers are in subscript

I have tried using a Java Snippet node, with org.jsoup.parser.Parser imported under additional bundles and out_MOLF = Parser.unescapeEntities(c_MOLF, false) as my code.

I could configure the node and get it to run, but the text was not converted at all.

I hope somebody can help me.

Daniel_Weikert · October 4, 2023, 4:34pm

You might want to provide a sample upload to get better support
br

Alice_Krebs · November 10, 2023, 8:02am

Hi @David_G

Unfortunately, it's not possible to make the “normal” grey output table display the HTML-formatted text.

But you can use the JS-based Table View node, and that will display the HTML formatting without further ado.

Mind that the formatted output only appears in the interactive view.

That’s the only ‘solution’/workaround that I could think of, but maybe some really Java-savvy folks know more.
Sorry I don’t have a better answer!

Alice

Mick · November 22, 2023, 9:56am

If plain text is wanted, there is also the Markup Tag Filter, which will remove such formatting tags from the string.
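
For anyone who would rather do the same thing in a Java Snippet, here is a minimal sketch in plain Java (no extra bundles). It assumes the fields contain ordinary tags such as <i>...</i> and <sub>...</sub>; the original markup is not visible in this thread, so treat that as a guess. The class name HtmlToPlainText and the method toPlainText are made up for illustration. Digits inside <sub> are mapped to the Unicode subscript characters (U+2080 to U+2089), and every remaining tag is simply dropped, because a plain string cannot carry italics.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of KNIME or jsoup.
final class HtmlToPlainText {
    // Assumption: subscripts are written as <sub>digits</sub>.
    private static final Pattern SUB =
            Pattern.compile("<sub>(\\d+)</sub>", Pattern.CASE_INSENSITIVE);
    // Any other opening or closing tag, e.g. <i> or </i>.
    private static final Pattern TAG = Pattern.compile("</?[a-zA-Z][^>]*>");

    static String toPlainText(String html) {
        Matcher m = SUB.matcher(html);
        StringBuilder sb = new StringBuilder();
        int last = 0;
        while (m.find()) {
            // Copy the text before this <sub> element unchanged.
            sb.append(html, last, m.start());
            // Replace each ASCII digit with its Unicode subscript form.
            for (char c : m.group(1).toCharArray()) {
                sb.append((char) ('\u2080' + (c - '0')));
            }
            last = m.end();
        }
        sb.append(html, last, html.length());
        // Drop whatever tags are left; italics are lost in plain text.
        return TAG.matcher(sb).replaceAll("");
    }
}
```

With the column names from the first post, the call would be something like out_MOLF = HtmlToPlainText.toPlainText(c_MOLF). If the data is entity-encoded instead (for example &lt;sub&gt;), it would need to be unescaped first; that step is not covered here.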

David_G · November 23, 2023, 9:50am

Hi @Alice_Krebs

Thank you for your answer.

So, as I understand it, I cannot expose the HTML-formatted text to an end user in a webpage, Excel file, or CSV file.

It would be good if such nodes could be developed that would allow this.

Thanks for your help!

David

Alice_Krebs · November 24, 2023, 2:06pm

Hi @David_G

No, if you display the Interactive view of the Table View node as a webpage using our commercial product, the text will be formatted without further ado.

But you are right with regard to CSV and Excel output.
Thanks for the feedback, I will forward it to the devs.

Best,
Alice
