Extraction and Tag Cloud Visualization of Named Entities from New York Times News Feeds

The workflow starts with a URL to a NY Times rss news feed. The news feed is downloaded and parsed and transformed in DocumentCells. Names of persons, organizations and locations are then recognized and the corresponding tags are assigned, in order to apply a coloring based on a tag type later on. After transformation into a bag of words, and filtering of all non-persons, -organizations, or –locations colors are assigned and the terms are visualized via a Tag Cloud.


This is a companion discussion topic for the original entry at https://kni.me/w/HpLEbu51i-t4uXU1