Text preprocessing for N char node

When I do text mining on filtering all long terms in documents, I only find N char node for filtering all terms which less than N chars in documents, but I want to find the node for filtering all terms which more than N chars in documents:joy:
Any alternative way to do what I want? (I also tried to use regular expression, but I do not know how to write it since I could not find the solution in google)

Hi DerekJin,

You could try using the regex filter node using the expression “.{N,}”, where N is the number of characters you want to filter. For instance, if N = 4, you will remove all terms with at least 4 chars or more.

Hope this helps!



This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.