I’m doing a long tail keyword analysis, so I have my list of 10.000 queries.
Now, before using the bag of words and then the frequency calculation I need to consider 3 words as a term: e.g.
go for joe = one keyword/term
joe = 1 keyword if is not preceded by “go for”
I have to consider also that “go for joe” is written in different ways:
go 4 joe
go for joe
- How can I extract my 3 words keyword as 1?
- How to differentiate the cases in which there is only “joe”
Can someone help me?
Thanks in advance