Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to get values inside <![CDATA[values]] > using php DOM?

Tags:

php

xml

How can i get values inside <![CDATA[values]] > using php DOM. This is few code from my xml.

     <Destinations>          <Destination>             <![CDATA[Aghia Paraskevi, Skiatos, Greece]]>             <CountryCode>GR</CountryCode>         </Destination>          <Destination>             <![CDATA[Amettla, Spain]]>             <CountryCode>ES</CountryCode>         </Destination>          <Destination>             <![CDATA[Amoliani, Greece]]>             <CountryCode>GR</CountryCode>         </Destination>          <Destination>             <![CDATA[Boblingen,  Germany]]>             <CountryCode>DE</CountryCode>         </Destination>    </Destinations> 
like image 229
Hearaman Avatar asked Jul 13 '11 05:07

Hearaman


People also ask

How do I get data from CDATA?

Use an XML parser. Show activity on this post. you should use simple_xml and xpath if you need to extract a complex set of data.

What does <![ CDATA in XML mean?

A CDATA section is used to mark a section of an XML document, so that the XML parser interprets it only as character data, and not as markup. It comes handy when one XML data need to be embedded within another XML document.

HOW include CDATA in XML?

CDATA sections can appear inside element content and allow < and & character literals to appear. A CDATA section begins with the character sequence <! [CDATA[ and ends with the character sequence ]]>. Between the two character sequences, an XML processor ignores all markup characters such as <, >, and &.

Which sequence is not allowed within a CDATA section?

The only sequence which is not allowed within a CDATA section is the closing sequence of a CDATA section itself, ]]> . Note: CDATA sections should not be used within HTML they are considered as comments and not displayed.


2 Answers

Working with PHP DOM is fairly straightforward, and is very similar to Javascript's DOM.

Here are the important classes:

  • DOMNode — The base class for anything that can be traversed inside an XML/HTML document, including text nodes, comment nodes, and CDATA nodes
  • DOMElement — The base class for tags.
  • DOMDocument — The base class for documents. Contains the methods to load/save XML, as well as normal DOM document methods (see below).

There are a few staple methods and properties:

  • DOMDocument->load() — After creating a new DOMDocument, use this method on that object to load from a file.
  • DOMDocument->getElementsByTagName() — this method returns a node list of all elements in the document with the given tag name. Then you can iterate (foreach) on this list.
  • DOMNode->childNodes — A node list of all children of a node. (Remember, a CDATA section is a node!)
  • DOMNode->nodeType — Get the type of a node. CDATA nodes have type XML_CDATA_SECTION_NODE, which is a constant with the value 4.
  • DOMNode->textContent — get the text content of any node.

Note: Your CDATA sections are malformed. I don't know why there is an extra ]] in the first one, or an unclosed CDATA section at the end of the line, but I think it should simply be:

<![CDATA[Aghia Paraskevi, Skiatos, Greece]]> 

Putting this all together we:

  1. Create a new document object and load the XML
  2. Get all Destination elements by tag name and iterate over the list
  3. Iterate over all child nodes of each Destination element
  4. Check if the node type is XML_CDATA_SECTION_NODE
  5. If it is, echo the textContent of that node.

Code:

$doc = new DOMDocument(); $doc->load('test.xml'); $destinations = $doc->getElementsByTagName("Destination"); foreach ($destinations as $destination) {     foreach($destination->childNodes as $child) {         if ($child->nodeType == XML_CDATA_SECTION_NODE) {             echo $child->textContent . "<br/>";         }     } } 

Result:

Aghia Paraskevi, Skiatos, Greece
Amettla, Spain
Amoliani, Greece
Boblingen, Germany

like image 117
Nicole Avatar answered Sep 21 '22 09:09

Nicole


Use this:

$parseFile = simplexml_load_file($myXML,'SimpleXMLElement', LIBXML_NOCDATA) 

and next :

foreach ($parseFile->yourNode as $node ){ etc... } 
like image 34
Grzegorz Brzęczyszczykiewicz Avatar answered Sep 22 '22 09:09

Grzegorz Brzęczyszczykiewicz