KNIME Forum Analysis. Topic Classification in Posts

This workflow performs a supervised topic classification on the forum posts. The training set consists of the description files of the KNIME nodes. Topic classes are the nodes top categories in the Node Repository (IO, Data Manipulation, etc ...) from KNIME versions prio to 3.0. Model is built on this training set and applied to forum posts. Top three topics with highest probability are chosen for the post topic class. A Tree Ensemble is used as classification model.


This is a companion discussion topic for the original entry at https://kni.me/w/TmrL3A-U0VedTlyp