Description: In English, a word often another word variants, such as: happy => happiness happy here called happiness stem (stem). Information retrieval system, we often do things Term normalization process, extract the stem (stemming), that is the end of the word transform the form of removal of English words. The most widely used, moderate complexity, stemming algorithms based on suffix stripped Porter Stemming Algorithm, also known as the Porter stemmer Porter Stemmer. For details, please refer to the official website. More popular retrieval system include the word in Lucene, Whoosh done filter is used Porter stemming algorithm.
To Search:
File list (Check if you may need any files):
Stemmer.java