News Feed
Sections




News Archive
Looking for more information on how to do PHP the right way? Check out PHP: The Right Way

Ian Barber's Blog:
Alternative Term Weighting
November 09, 2009 @ 12:24:36

In this new post from Ian Barber he takes a look at something that can come in very handy when you need something a bit more complex than the standard search results - term weighting.

The term weighting and ranking function is at the core of any information retrieval system. The vector space model with the cosine similarity is maybe the best known and most widely used, but there are plenty of alternatives. We're looking at two here, the BM25 function based around a probabilistic model, and a function based around language modeling.

He's put together a few examples on some basic weighting practices - simple string evaluation based on word occurrence, using the Okapi/BM25 method and language modeling with a little bit of probability and scoring involved.

0 comments voice your opinion now!
alternative term weighting search


blog comments powered by Disqus

Similar Posts

DevShed: Searching and Replacing Nodes with SimpleXML in PHP 5 (Part 3)

Rob Allen's Blog: Luke - Lucene Index Toolbox

System Architect: Integrate PHP application with Solr search engine

SitePoint PHP Blog: Using Solarium with SOLR for Search Solarium and GUI

ProDevTips.com: Trackbacks with Google Blog Search and Zend Feed


Community Events





Don't see your event here?
Let us know!


framework bugfix interview laravel symfony voicesoftheelephpant language list release deployment community threedevsandamaybe conference series code tips api podcast zendserver introduction

All content copyright, 2014 PHPDeveloper.org :: info@phpdeveloper.org - Powered by the Solar PHP Framework