Apache Tika integration

This workflow shows how to parse files of various formats as well as their attachments, if exist, using Tika parser nodes and detect the languages of the content using Tika language detector. Based on the detected langauge a filtering is applied to keep only English texts which are finally POS tagged.


This is a companion discussion topic for the original entry at https://kni.me/w/OBd_qx1wJeM2jKbH