News Feed
Sections




News Archive
Looking for more information on how to do PHP the right way? Check out PHP: The Right Way

Raphael Stolt's Blog:
Scraping websites with Zend_Dom_Query
October 17, 2008 @ 14:31:34

Raphael Stolt has a new blog post today with a tutorial showing how to take the Zend_Dom_Query component out of the Zend Framework and use it to scrape content from another web site.

Today I stumbled upon an interesting and reportable scenario were I had to extract information of the weekly published Drum and Bass charts provided by BBC 1Xtra. As this information currently isn't available in any consumer friendly format like for example a RSS feed, I had to go that scraping route but didn't want to hustle with a regex approach. Since version 1.6.0 the Zend_Dom_Query component has been added to the framework mainly to support functional testing of MVC applications, but it also can be used for rolling custom website scrapers in a snap. Woot, perfect match!

He includes the code for his Bbc_DnbCharts_Scraper class he's created to show how the data is pulled in (via curl) and pushed into an object to be parsed.

1 comment voice your opinion now!
scraping website zendframework zenddomquery component tutorial


blog comments powered by Disqus

Similar Posts

Amazon Web Services PHP Blog: Provision an Amazon EC2 Instance with PHP

Matthew Weier O'Phinney's Blog: ZF2 Forms in Beta5

SitePoint PHP Blog: The Joy of Regular Expressions [4]

Godaddyhostingreview Blog: How to move Magento from Production to Live Server

Evan Coury's Blog: Using Zend\Dbs TableGateway & HydratingResultSet to return rows as custom enties


Community Events





Don't see your event here?
Let us know!


opinion bugfix library interview podcast voicesoftheelephpant tips introduction framework api package series language install laravel release list symfony community deployment

All content copyright, 2014 PHPDeveloper.org :: info@phpdeveloper.org - Powered by the Solar PHP Framework