I’m having trouble lemmatizing some words with the Stanford Lemmatizer.
Just in case, I’ve also tested the Snowball Stemmer, but I get the same result.
As you can see in the image below, I can’t lemmatize the words “faster” and “clearer”. I expected “quick” and “clear”, respectively, but both words come back unchanged.
On the other hand, the process converts “cleared” to “clear” correctly.
The right column is the original term, while the next two columns are the stemmed and lemmatized forms, respectively.
Do you know whether this is a bug in the extension, or am I doing something wrong?
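A side note on the stemmer column: the Snowball result is probably expected behaviour rather than a second bug. As far as the published English (Porter2) algorithm goes, the suffix “-er” is only removed when it lies inside the region the algorithm calls R2, while “-ed” is removed in an earlier step whenever the preceding part of the word contains a vowel. The sketch below is a simplified, standalone illustration of that region rule. It is not the code the extension runs, the function names are made up for this example, and it ignores the algorithm's special handling of “y” and its exception lists.

```python
VOWELS = set("aeiouy")  # simplification: "y" is always treated as a vowel here


def region_start(word: str, start: int = 0) -> int:
    """Index just past the first non-vowel that follows a vowel,
    searching from `start`; len(word) if there is no such position."""
    for i in range(start + 1, len(word)):
        if word[i] not in VOWELS and word[i - 1] in VOWELS:
            return i + 1
    return len(word)


def r1_r2(word: str) -> tuple[str, str]:
    """Return the R1 and R2 regions of `word` (hypothetical helper)."""
    r1 = region_start(word)
    r2 = region_start(word, r1)
    return word[r1:], word[r2:]


def er_in_r2(word: str) -> bool:
    """True if the word ends in "er" and that suffix lies entirely in R2."""
    r2 = region_start(word, region_start(word))
    return word.endswith("er") and len(word) - 2 >= r2


if __name__ == "__main__":
    for w in ("faster", "clearer", "computer"):
        print(w, r1_r2(w), er_in_r2(w))
```

In this sketch R2 is empty for both “faster” and “clearer”, so “-er” would be left alone, whereas “computer” has “er” inside R2. That matches a stemmer returning the two comparatives unchanged; only the lemmatizer column would then point to an actual problem.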
Hi,
I can confirm the behaviour. I also tested the same words with the command line tool mentioned here, and it yields the correct results. So either we are using an older version of the lemmatizer, or there really is a bug. However, the latter seems unlikely, as other words are lemmatized correctly. I will create a bug ticket for this.
Kind regards,
Alexander
Hi Mauricio,
Unfortunately, even the newer versions of that extension do not integrate a newer version of the Stanford Lemmatizer library. It is on us to make that update, and I have created a ticket for our developers. But maybe Scott’s advice helps!
Kind regards,
Alex