Remove HTML Tags

Hi,

I have a data set with some HTML tags like

<h3 class="MsoNormal"> 
<span lang="EN-US" style="font-family: 'Calibri',sans-serif; mso-ascii-theme-font: minor-latin; mso-hansi-theme-font: minor-latin;">
<p>
<p style="margin-top: .1pt; margin-right: 0cm; margin-bottom: .1pt; margin-left: 0cm; background: white;"> 

I tried to remove them with " regexReplace($Col1$,“<[^(><.)]+>” , “”) ", but it doesn’t remove them completely. Does someone has a way to remove them completely?

Thanks in regards
Kara

Hey @Karakias,

you could try the Markup Tag Filter node. It can be used to remove tags from several markup languages like HTML and XML.

I hope this helps.

Best,
Julian

5 Likes

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.