extract common words from differents documents

Dear Community,

I want to know how can I extract only the common words for differents documents.

With bag of words I extract all the words for alls documents = OK

ex :

word 1 text 1
word 2 text 1
word 3 text 1
word 4 text 1
word 1 text 2
word 2 text 2

 

Now I want to filter and keep only common words for all urls : 

word 1 text 1 text 2
word 2 text 1 text 2
     

How can I do ?

Thanks

David

 

 

Hi David,

you can group by the terms and use sum as aggregation to compute the frequencies over all documents. To filter the most frequent ones use the Row Filter node.

Cheers, Kilian