Sentiment Analysis (Classification) of Documents with NGram Features

This workflow shows how to import text from a csv file, convert it to documents, preprocess the documents and transform them into numerical document vectors consisting of single word and 2-gram features. Finally two predictive models are trained on the vectors to predict the sentiment class of the documents. The two models are then compared via a ROC curve.


This is a companion discussion topic for the original entry at https://kni.me/w/fh2Dm55bz7763MiU