Christian Schaefer's Blog:
Using PHP Web Scraper Goutte in a Console Task in a Silex project
October 10, 2011 @ 08:26:24

In a recent post to his blog, Christian Schaefer shows how to use Goutte (a web scraper) to pull information from one site and use it in another, Silex-powered site. His tutorial uses a custom service provider for the integration.

"Since I discovered the free Facebook App hosting by heroku I keep wanting to make something useful out of it. So I thought about a small service app. Without going into details yet about its nature there was one immediate problem to be solved. How to get hold of the data? So I thought to scrape it off some website. I know this isn't very nice but unfortunately there is no feed I can use. And how to best scrape a website? Use Goutte!"

You'll need only two things: the goutte.phar file and the Silex phar file. The service provider code is a simple registration of namespaces. With that in place, scraping is as simple as creating a client object and requesting a URL with it.
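As a rough illustration of that last step, here is a minimal sketch of creating a Goutte client and requesting a URL. It is not the code from Christian's post: the location of goutte.phar, the example URL and the choice of the page title as the scraped value are all assumptions made for illustration, and the Silex service provider registration is left out because the summary does not show it.

```php
<?php
// Assumption: goutte.phar sits next to this script; adjust the path to your layout.
require_once __DIR__ . '/goutte.phar';

use Goutte\Client;

// Hypothetical target URL, used for illustration only.
$url = 'http://example.com/';

// Create the client and fetch the page; request() returns a DomCrawler Crawler.
$client  = new Client();
$crawler = $client->request('GET', $url);

// Pull one piece of data out of the markup with a CSS selector.
echo trim($crawler->filter('title')->text()), PHP_EOL;
```

In a Silex project the client would typically be created inside the service provider and fetched from the application container, so the console task only has to ask for it and call request().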

silex goutte webscraping tutorial serviceprovider phar



All content copyright, 2015 PHPDeveloper.org :: info@phpdeveloper.org - Powered by the Solar PHP Framework