Example Workflow "Document Classification"

Hi Canan,

on a bow you can perform a group by to group by terms and count the docs. Grouping on terms directly is also possible but this will take the assigned tags into account as well. Maybe you have equal words but different tags resulting in different groups. This is why we converted terms to strings to get rid of the tags. However, of course you can also group directly on terms.

Using the lemmatizer directly after POS tagging makes sense since it relies on tagged documents.

Cheers, Kilian