News Feed
Jobs Feed
Sections




News Archive
feed this:

Sameer Borate's Blog:
Porter Stemming algorithm for search
April 29, 2009 @ 07:57:06

In a recent post to his blog Sameer looks at implementing a Stemming algorithm to search an array of words. It uses this library (as written by Richard Heyes).

A stemming algorithm lets you reduce each English input word to its basic root or stem (e.g. 'walking' to 'walk') so that variations on a word ('walks', 'walked', 'walking') are considered equivalent when searching. This stems can than be used in a search query rather than the original words, which generally (but not always) results in more relevant search results.

His code example uses the library to search for two different types of strings - a single word and a phrase (with stop words removed). The Stem() method is called on the word and the results are looped through to remove all matching the values in the stop words array.

0 comments voice your opinion now!
stop word search stem root query library richardheyes



Community Events











Don't see your event here?
Let us know!


unittest testing introduction language release phpunit interview code example framework conference functional series tool zendframework2 community development podcast application opinion

All content copyright, 2013 PHPDeveloper.org :: info@phpdeveloper.org - Powered by the Solar PHP Framework