Reading CDATA XML in Java

Tags:

java

parsing

xml

I'm trying to parse CDATA types in XML. The code runs fine and prints Links: in the console (about 50 times, because that's how many links I have), but the links don't appear; there is just blank space after each one. What could I be missing?

package Parse;

import java.io.File;

import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.CharacterData;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

public class XMLParse {
  public static void main(String[] args) throws Exception {
    File file = new File("c:test/returnfeed.xml");
    DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
    Document doc = builder.parse(file);

    NodeList nodes = doc.getElementsByTagName("video");
    for (int i = 0; i < nodes.getLength(); i++) {
      Element element = (Element) nodes.item(i);
      NodeList title = element.getElementsByTagName("videoURL");
      Element line = (Element) title.item(0);
      System.out.println("Links: " + getCharacterDataFromElement(line));
    }
  }
  public static String getCharacterDataFromElement(Element e) {
    Node child = e.getFirstChild();
    if (child instanceof CharacterData) {
      CharacterData cd = (CharacterData) child;
      return cd.getData();
    }
    return "";
  }
}

Result:

Links: 

Links: 

Links: 

Links: 

Links: 

Links: 

Links:

Sample XML: (Not full document)

<?xml version="1.0" ?> 
<response xmlns:uma="http://websiteremoved.com/" version="1.0">

    <timestamp>
        <![CDATA[  July 18, 2012 5:52:33 PM PDT 
          ]]> 
    </timestamp>
    <resultsOffset>
        <![CDATA[  0 
          ]]> 
    </resultsOffset>
    <status>
        <![CDATA[  success 
        ]]> 
    </status>
    <resultsLimit>
        <![CDATA[  207 
        ]]> 
    </resultsLimit>
    <resultsCount>
        <![CDATA[  207 
        ]]> 
    </resultsCount>
    <videoCollection>
        <name>
            <![CDATA[  Video API 
            ]]> 
        </name>
        <count>
            <![CDATA[  207 
            ]]> 
        </count>
        <description>
            <![CDATA[  
            ]]> 
        </description>
        <videos>
            <video>
                <id>
                    <![CDATA[  8177840 
                    ]]> 
                </id>
                <headline>
                    <![CDATA[  Test1
                    ]]> 
                </headline>
                <shortHeadline>
                    <![CDATA[  Test2
                    ]]> 
                </shortHeadline>
                <description>
                    <![CDATA[ Test3

                    ]]> 
                </description>
                <shortDescription>
                    <![CDATA[ Test4

                    ]]> 
                </shortDescription>
                <posterImage>
                    <![CDATA[ http://a.com.com/media/motion/2012/0718/los_120718_los_bucher_on_howard.jpg

                    ]]> 
                </posterImage>
                <videoURL>
                    <![CDATA[ http://com/removed/2012/0718/los_120718_los_bucher_on_howard.mp4

                    ]]> 
                </videoURL>
            </video>
        </videos>
    </videoCollection>
</response>

asked Jul 19 '12 03:07

Matt

2 Answers

Instead of checking only the first child, it would be prudent to check whether the node has other children as well. In your case (and debugging that node would have shown this), the node passed to the method getCharacterDataFromElement has multiple children: the whitespace between the opening tag and the CDATA section is parsed as a text node of its own, so getFirstChild() returns that whitespace rather than the CDATA content. I updated the code, and this should point you in the right direction:

public static String getCharacterDataFromElement(Element e) {
    // The element may hold several children, e.g. whitespace text nodes around the CDATA section.
    NodeList list = e.getChildNodes();

    for (int index = 0; index < list.getLength(); index++) {
        Node node = list.item(index);
        if (node instanceof CharacterData) {
            String data = ((CharacterData) node).getData();

            // Skip whitespace-only nodes and return the first one with real content.
            if (data != null && data.trim().length() > 0) {
                return data;
            }
        }
    }
    return "";
}
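
To see why the first child comes back blank, here is a minimal, self-contained sketch that lists the children of a videoURL element and then applies the same loop as above. The class name CdataChildrenDemo, the helper names, and the inline sample XML with its example.com URL are assumptions made up for illustration; they are not part of the original feed or code.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.CharacterData;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

public class CdataChildrenDemo {
    // Hypothetical sample, shaped like the <videoURL> element in the question.
    static final String XML =
        "<video>\n"
      + "  <videoURL>\n"
      + "    <![CDATA[ http://example.com/clip.mp4\n"
      + "    ]]>\n"
      + "  </videoURL>\n"
      + "</video>";

    // Parses the XML string and returns the first element with the given tag name.
    static Element firstElement(String xml, String tag) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
            return (Element) doc.getElementsByTagName(tag).item(0);
        } catch (Exception ex) {
            throw new IllegalStateException("could not parse sample XML", ex);
        }
    }

    // Same idea as the answer's loop: return the first child whose data is not
    // just whitespace. The result is trimmed here, which the answer does not do.
    static String firstNonBlankCharacterData(Element e) {
        NodeList children = e.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            Node child = children.item(i);
            if (child instanceof CharacterData) {
                String data = ((CharacterData) child).getData();
                if (data != null && !data.trim().isEmpty()) {
                    return data.trim();
                }
            }
        }
        return "";
    }

    public static void main(String[] args) {
        Element url = firstElement(XML, "videoURL");
        NodeList children = url.getChildNodes();
        // Print each child's node name to show what surrounds the CDATA section.
        for (int i = 0; i < children.getLength(); i++) {
            System.out.println(i + ": " + children.item(i).getNodeName());
        }
        System.out.println("Links: " + firstNonBlankCharacterData(url));
    }
}
```

With the default (non-coalescing) parser settings, the listing should show a whitespace text node on either side of the CDATA section, which is why getFirstChild() alone yields only blank space.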

answered Oct 05 '22 21:10

Sujay

I would consider using getTextContent()

String string = cdataNode.getTextContent();
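
As a rough sketch of how that could look when called on the enclosing element rather than on the CDATA node itself: getTextContent() on an element concatenates the text of all its children, including the surrounding whitespace, so the result is trimmed. The class name TextContentDemo, the helper textOf, and the inline sample XML are assumptions for illustration only.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.xml.sax.InputSource;

public class TextContentDemo {
    // Hypothetical sample, shaped like the <videoURL> element in the question.
    static final String XML =
        "<video>\n"
      + "  <videoURL>\n"
      + "    <![CDATA[ http://example.com/clip.mp4\n"
      + "    ]]>\n"
      + "  </videoURL>\n"
      + "</video>";

    // Returns the trimmed text content of the first element with the given tag,
    // or an empty string if no such element exists.
    static String textOf(String xml, String tag) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)));
            Element e = (Element) doc.getElementsByTagName(tag).item(0);
            // getTextContent() joins the whitespace text nodes and the CDATA section,
            // so trim() removes the padding around the actual value.
            return e == null ? "" : e.getTextContent().trim();
        } catch (Exception ex) {
            throw new IllegalStateException("could not parse sample XML", ex);
        }
    }

    public static void main(String[] args) {
        System.out.println("Links: " + textOf(XML, "videoURL"));
    }
}
```

In the question's loop, this would amount to replacing the call to getCharacterDataFromElement(line) with line.getTextContent().trim(), assuming every video element has a videoURL child.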

answered Oct 05 '22 20:10

armagedescu
