I am looking for a workflow with emoticon analysis to classify text in negative / positive category based on emoticons in combination with other learners / predictors. May be you have already encountered such a case or have an example workflow. It would be great if you can point me to an example or a resource on this matter. Thank you very much!
so far I have not done emoticon analysis and have no example workflow, sorry for that. There are two blog articles about sentiment analsis but they don't cover emoticons:
the Textprocessing extension does not provides dictionaries. If you have labeled data you can create feature vectors from 1 and 2 (and n) grams and build predictive model based on these features vectors. If you want to use dictionaries, you need to bring your own.
We don’t have a workflow that I know of for dealing with emoticons in this context, but it’s a good and useful idea.
Apparently (thanks @julian.bunzel!) the StanfordPTB tokenizer can parse emoticons, so in theory they could be labeled using the Dictionary Tagger - but I confess I have not tried this myself yet…
I use the Python library in KNIME for sentiment analysis with this tool which will analyze emoticons plus it understands phrases like LOL etc. You can easily tweak it under the hood as well.
You can tinker with how it’s evaluating for sentiment whether whole text or a composite based on sentence tokenizaton using NLTK for Python. They include examples to follow and I’m happy to share how I have my Python nodes setup for this.