
Script-Tutorials.com:
How to parse web pages using XPath
April 21, 2011 @ 09:49:59

On the Script-Tutorials.com site there's a new article showing you how to use XPath to parse web pages, complete with screenshots of the page and the code to make it happen.

Today I will show you how to build parsers for remote HTML pages in PHP by running XPath queries against web pages. XPath is a query language for selecting elements of an XML or XHTML document. To obtain the data we need, we just have to write the right query. For this work we also need the Mozilla Firefox browser with the Firebug and FirePath plugins. For our experiment, I suggest the Google Sci/Tech News page, but of course you can choose any other web page.

They provide two demos and a downloadable package with everything you need. The script loads the page into a DOM document (which works as long as the markup is well formed) and prints the matches from a few different XPath expressions. There are plenty of sites out there that can help you with examples of other XPath expressions and syntax.
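As a rough illustration of the approach described above, here is a minimal sketch using PHP's built-in DOMDocument and DOMXPath classes. It is not the code from the Script-Tutorials.com article: the inline HTML, the `story` class name, the example.com URLs and the `extractLinks()` helper are all assumptions made up for this example, and a real script would fetch the remote page instead of using a hardcoded string.

```php
<?php
// Hypothetical sample markup standing in for a fetched page,
// so the sketch runs without network access.
$html = <<<HTML
<html><body>
  <div class="story"><h2><a href="http://example.com/a">First headline</a></h2></div>
  <div class="story"><h2><a href="http://example.com/b">Second headline</a></h2></div>
</body></html>
HTML;

// Hypothetical helper: run an XPath query against an HTML string and
// return the text and href of every matching element.
function extractLinks(string $html, string $query): array
{
    $doc = new DOMDocument();

    // Collect parser warnings internally instead of printing them.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($doc);
    $nodes = $xpath->query($query);
    if ($nodes === false) {
        // The XPath expression could not be evaluated.
        return [];
    }

    $results = [];
    foreach ($nodes as $node) {
        if ($node instanceof DOMElement) {
            $results[] = [
                'title' => trim($node->textContent),
                'href'  => $node->getAttribute('href'),
            ];
        }
    }
    return $results;
}

// Select every link inside an <h2> within a div whose class is "story".
$links = extractLinks($html, '//div[@class="story"]/h2/a');
foreach ($links as $link) {
    echo $link['title'], ' -> ', $link['href'], PHP_EOL;
}
```

The XPath expression is the part you would adapt to a real page, for instance by copying a path suggested by FirePath and trimming it down to the attributes that identify the elements you want.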

xpath tutorial parse html website






All content copyright, 2015 PHPDeveloper.org :: info@phpdeveloper.org - Powered by the Solar PHP Framework