I have a text and I want to extract the term frequencies. After applying a stemmer node and other preprocessing (stop word filter, punctuation erasure, etc.), I used the BoW node and the term frequencies node on the following text:
- Reviews of Apple's new iPhone 8 and 8 Plus have been laudatory. However, the reviewers can't seem to get their minds off the jewel of the Apple universe, the iPhone X. Both the iPhone 8 and 8 Plus are "awesome" and better than last year's models -- but iPhone shoppers who want to be part of the future will save their money and buy an iPhone X later in the year, suggested reviewer David Pierce.
The absolute term frequencies in the output look wrong. The stemmed term "review" has an absolute term frequency of 6, and the stemmed term "iPhone" has a frequency of 10.
I only count review 3 times and iPhone 5 times.
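To double-check my manual count outside of the workflow, here is a minimal Python sketch. It is only an approximation: the prefix match in `crude_stem` is a hypothetical stand-in for a real stemmer and is not what the stemmer node does internally, and the tokenization is a simple regular expression rather than the workflow's preprocessing.

```python
import re
from collections import Counter

text = """Reviews of Apple's new iPhone 8 and 8 Plus have been laudatory. However, the reviewers can't seem to get their minds off the jewel of the Apple universe, the iPhone X. Both the iPhone 8 and 8 Plus are "awesome" and better than last year's models -- but iPhone shoppers who want to be part of the future will save their money and buy an iPhone X later in the year, suggested reviewer David Pierce."""

# Lowercase and keep alphabetic runs only (drops digits and punctuation).
tokens = re.findall(r"[a-z]+", text.lower())


def crude_stem(token: str) -> str:
    """Hypothetical stand-in for stemming: map review/reviews/reviewer(s) to 'review'."""
    return "review" if token.startswith("review") else token


counts = Counter(crude_stem(t) for t in tokens)

print("review:", counts["review"])  # 3
print("iphone:", counts["iphone"])  # 5
```

This gives the same numbers as counting by hand, so the values from the term frequencies node are exactly double what I would expect.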
Does anyone have an idea why this happens?
Attached is the output table of the term frequencies node.
Thank you for your help.