Can I recover paragraph breaks when converting documents to strings?

The Word Parser produces documents that contain paragraph breaks. These show up as dotted lines in the Document Viewer node. However, when I convert these documents to strings, these paragraph breaks are lost, and the words immediately before and after them are joined together.

It would be very useful if the paragraph breaks converted to line breaks in the resulting strings, as there are certain operations that I prefer to perform on plain strings rather than documents. Is there any way to achieve this?
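To make the difference concrete, here is a minimal plain-Python sketch. It is not KNIME code: the `paragraphs` list and both function names are hypothetical and only mimic the behaviour I'm seeing versus the behaviour I'd like, assuming the paragraphs were available as separate strings.

```python
# Hypothetical paragraphs, standing in for what the parser extracted.
paragraphs = ["First paragraph ends here", "Second paragraph starts here"]


def to_string_current(paras):
    # Mimics the observed behaviour: the breaks are dropped,
    # so the words on either side of a break are fused together.
    return "".join(paras)


def to_string_desired(paras):
    # Desired behaviour: each paragraph break becomes a line break.
    return "\n".join(paras)


print(to_string_current(paragraphs))
print(to_string_desired(paragraphs))
```

With line breaks preserved, the string could be split back into paragraphs later with an ordinary `split("\n")`.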

I note that I’m asking essentially the same question as this one from 2014 (I’m even working with Factiva outputs). But I was hoping that this particular functionality might have evolved since then.

Cheers.

Hi @sugna,

this functionality hasn’t been implemented yet. Formatting such as sections and paragraphs is lost when documents are converted to strings.

Cheers, Kilian
