PHPDeveloper: PHP News, Views and Community

Subscribe

@phpdeveloper.org

News Archive

Community News: Latest PECL Releases (05.06.2025)

Community News: Latest PECL Releases (04.29.2025)

Community News: Latest PECL Releases (04.22.2025)

Community News: Latest PECL Releases (04.15.2025)

Community News: Latest PEAR Releases (04.14.2025)

Community News: Latest PECL Releases (04.08.2025)

Community News: Latest PEAR Releases (04.07.2025)

Community News: Latest PECL Releases (04.01.2025)

Community News: Latest PEAR Releases (03.31.2025)

Community News: Latest PECL Releases (03.25.2025)

Looking for more information on how to do PHP the right way? Check out PHP: The Right Way

SitePoint PHP Blog:
Mangling XML as Text with PHP DOM

byChris Cornutt Jul 24, 2008 @ 14:35:16

In trying to convert over several HTML pages to the DITA XML format, James Edwards came up against a problem involving recursion:

But a problem I came across several times was the sheer complexity of recursive element conversion â€” <code> becomes <jsvalue> (or one of a dozen similar elements), <a> becomes <xref> â€¦ and that's all simple enough; but each of these elements might contain the other, or further child elements like <em>, and as we walk through the DOM so the incidence of potential recursion increases, until it gets to the point where my brain explodes.

His solution involves working with both regular expressions and document fragments. He loads the node he wants to work with, its parsed to prepare it and is passed off to do the "text-based mangling" to update it. The result is them pushed back into an XML object (fragment) and this is pushed back into the main document with a replaceChild call.