hi all
urgently please help me
I have a problem in classifying terms
term such as "online learning" to be treated as a whole not "online", "learning"
the classification result for the testing data produces only one class for all the documents which is wrong
these terms construct the document
I tried 2 different ways but no accurate results are given
i want the vector model to show the terms and thier occurence in the document and the classifier gives accurate class
attached the dataset and the workflow
help me please
Hi singing bird,
please, take a look at the following example for document classification:
knime://EXAMPLES/_Old%20Examples%20(2015%20and%20before)/009_TextMining/009001_DocumentClassification
Hope this is helpful,
Best,
Vincenzo
Hi,
there is no dedicated node to detect and tag multi words e.g. text mining or online learning. However you could use the NGram node to create 2 grams and count their frequencies. Then you can filter frequent 2 grams and use them as input for the dictionary or wildcard tagger node. These nodes search documents for terms in input dictionary, tag them and group them if they are multi words.
Cheers, Kilian