News Feed
Sections




News Archive
Looking for more information on how to do PHP the right way? Check out PHP: The Right Way

Kapustabrothers.com:
Indexing PDF Documents with Zend_Search_Lucene
January 23, 2008 @ 07:58:00

As mentioned on the Zend Developer Zone, there's a new post on kapustabrothers.com about a method for indexing all of those PDF files your site uses with the help of the Zend Framework's Zend_Search_Lucene component.

along with many others have been trying and asking how to index and search PDF files. Once Zend released its Framework, which is a port of Java Lucene to PHP, I decided to jump on board and find a way to index and search PDF files.

He uses the XPDF software to parse out the PDF files and the ZF component to do the actual indexing and searching. XPDF extracts key information from the PDF and puts it out to a new file where Zend_Search_Lucene can get to it. Example code is included to show the automatic creation of these details and how to add them to the component's index.

0 comments voice your opinion now!
zendframework zendsearchlucene pdf index document tutorial


blog comments powered by Disqus

Similar Posts

Matthew Weir O'Phinney's Blog: svn:externals (and the Zend Framework)

DeveloperTutorials.com: Developing State-enabled Applications With PHP

Sameer Borate's Blog: Unpacking binary data in PHP

Padraic Brady's Blog: New Zend_Feed_Writer Component And Zend_Feed_Reader Enhancements (ZF 1.10)

Developer Tutorials: Introduction to PHP Programming


Community Events





Don't see your event here?
Let us know!


community api application threedevsandamaybe interview laravel library configure series developer release language list install testing introduction unittest podcast code wordpress

All content copyright, 2014 PHPDeveloper.org :: info@phpdeveloper.org - Powered by the Solar PHP Framework