Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

"Opposite" of Porter Stemmer algorithm?

I'm looking for some way of performing the opposite of a Porter Stemmer algorithm, i.e. the string "search" would return an array "searches, searched, searching etc.."

Does something like this exist already (pref in php)?

Thank you for your help!

like image 680
Fred Avatar asked Mar 05 '11 22:03

Fred


People also ask

What is the difference between Porter Stemmer and snowball Stemmer?

Snowball Stemmer is also known as the Porter2 stemming algorithm because it is a better version of the Porter Stemmer. It is more aggressive than Porter Stemmer.

What are the types of stemming algorithms?

stemming algorithms can be classified in three groups: truncating methods, statistical methods, and mixed methods. Each of these groups has a typical way of finding the stems of the word variants.

Which stemming algorithm is best?

Snowball stemmer: This algorithm is also known as the Porter2 stemming algorithm. It is almost universally accepted as better than the Porter stemmer, even being acknowledged as such by the individual who created the Porter stemmer.

What is Lancaster Stemmer?

Lancaster Stemmer is the most aggressive stemming algorithm. It has an edge over other stemming techniques because it offers us the functionality to add our own custom rules in this algorithm when we implement this using the NLTK package. This sometimes results in abrupt results.


1 Answers

I'm not sure if this is the direction that you're looking to go but I would grab a file of dictionary words (Mac and nix machines usually have one at /usr/share/dict/words) and load it into an easily searched storage mechanism.

Next, I'd compile a list of suffixes and program in the rules applied to suffixes.

In regard to the actual question asked I have not found anything that exists like this already.

like image 121
Benjamin Powers Avatar answered Oct 08 '22 10:10

Benjamin Powers