Hi there,
I am mining text for scientific literature review. I arrived in a Regex expression, but I would need to know if Regex is able to provide a bit more. Any advice is very appreciated.
I need to find definitons/concepts of terms/themes. The follow expression is providing a linear half solution:
word1(?:\s+\w+){0,N}\s+word2
definition(?:\s+\w+){0,2}\s+institution
a) However, I research two languages (English/Portuguese) at same time. So, an expression which I tried but not worked was:
concept|conceito|definition|definição(?:\s+\w+){0,2}\s+institution|instituição
b) Besides, it would be the paramount if there is a non-linear expression. I mean, the order doesn't mind. Something like: using just one expression with a proximity of 2 words the output would be "concept of instituion", "institution as concept", and so on.
Maybe I am asking for the impossible, but any workaround or closer manner to handle the mentioned searches are enough.
Many thanks in advance,
Cadu