Hi,
How can I display the single syllables generated by the Hyphenator Node as terms?
Which nodes are essential?
I tried the following procedures:
1) Hyphenator followed by: Column Filter => BoW Creator
I filtered out the term column so that only the "Document" column is passed to the BoW Creator (this column is generated as a bag of words by entering a blank in the "Separator" field of the Hyphenator node). Unfortunately, the BoW Creator does not split the generated syllables into separate terms, even though the input is a Document column. Perhaps the BoW Creator treats the blank as an ordinary character. When I load documents into the BoW Creator directly from the Parser nodes at the start of the workflow, the BoW Creator works correctly.
2) Hyphenator followed by: Term to String => Column Filter => Strings to Document => Column Filter => BoW Creator
I converted the hyphenated terms into strings and then converted these strings into documents, to avoid the errors raised when executing the BoW Creator. (I did not find a node that converts terms into documents directly.)
This worked (after entering a blank in the "Separator" field of the Hyphenator node beforehand), and the syllables are listed separately as terms. However, the category assigned to each document seems to be lost, and all terms get the document class "undefined". (Carrying over the "Orig. Document" column does not work, because the BoW Creator allows only one document column.)
Previously I thought that the Keyword extractor might automatically recognize the generated syllables as single keywords. I no longer think so, because the keywords only appear hyphenated and the original term/keyword does not change. For example, terms (i.e. syllables of terms) used in a tree classification model will not be recognized when I later want to classify new documents that contain the same syllables.
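To make the goal concrete, here is a minimal Python sketch of the result I am after. It is not KNIME code and does not use any KNIME API; the function name `syllable_bag`, the sample documents, their categories, and the hand-made hyphenation are all hypothetical and only serve as illustration. It assumes the hyphenated terms use a blank as separator, as in my Hyphenator settings.

```python
from collections import Counter


def syllable_bag(documents, separator=" "):
    """Count syllables per document category.

    documents: list of (category, hyphenated_terms) pairs, where each
    hyphenated term is a string whose syllables are joined by `separator`.
    Returns a Counter keyed by (syllable, category).
    """
    bag = Counter()
    for category, terms in documents:
        for term in terms:
            # Split each hyphenated term into its syllables.
            for syllable in term.split(separator):
                if syllable:  # skip empty pieces from repeated separators
                    bag[(syllable, category)] += 1
    return bag


# Hypothetical sample data: the hyphenation is written by hand here.
docs = [
    ("sports", ["foot ball", "play er"]),
    ("music", ["play list"]),
]
bag = syllable_bag(docs)
```

The point is that every syllable becomes its own term while the category of the document it came from is kept, which is exactly what gets lost in procedure 2.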
Could you tell me how to handle this situation, and in particular which nodes are needed to process the generated syllables further as terms?
Many thanks!
Regards,
Werner