Getting XML Node text value with Java DOM

Tags:

I can't fetch text value with Node.getNodeValue(), Node.getFirstChild().getNodeValue() or with Node.getTextContent().

My XML is like

<add job="351">     <tag>foobar</tag>     <tag>foobar2</tag> </add>

And I'm trying to get tag value (non-text element fetching works fine). My Java code sounds like

Document doc = db.parse(new File(args[0])); Node n = doc.getFirstChild(); NodeList nl = n.getChildNodes();    Node an,an2;  for (int i=0; i < nl.getLength(); i++) {     an = nl.item(i);      if(an.getNodeType()==Node.ELEMENT_NODE) {         NodeList nl2 = an.getChildNodes();          for(int i2=0; i2<nl2.getLength(); i2++) {             an2 = nl2.item(i2);              // DEBUG PRINTS             System.out.println(an2.getNodeName() + ": type (" + an2.getNodeType() + "):");              if(an2.hasChildNodes())                 System.out.println(an2.getFirstChild().getTextContent());              if(an2.hasChildNodes())                 System.out.println(an2.getFirstChild().getNodeValue());              System.out.println(an2.getTextContent());             System.out.println(an2.getNodeValue());         }     } }

It prints out

tag type (1):  tag1 tag1 tag1 null #text type (3): _blank line_ _blank line_ ...

Thanks for the help.

546

asked Apr 21 '09 14:04

Emilio

2 Answers

I'd print out the result of an2.getNodeName() as well for debugging purposes. My guess is that your tree crawling code isn't crawling to the nodes that you think it is. That suspicion is enhanced by the lack of checking for node names in your code.

Other than that, the javadoc for Node defines "getNodeValue()" to return null for Nodes of type Element. Therefore, you really should be using getTextContent(). I'm not sure why that wouldn't give you the text that you want.

Perhaps iterate the children of your tag node and see what types are there?

Tried this code and it works for me:

String xml = "<add job=\"351\">\n" +              "    <tag>foobar</tag>\n" +              "    <tag>foobar2</tag>\n" +              "</add>"; DocumentBuilderFactory dbf = DocumentBuilderFactory.newInstance(); DocumentBuilder db = dbf.newDocumentBuilder(); ByteArrayInputStream bis = new ByteArrayInputStream(xml.getBytes()); Document doc = db.parse(bis); Node n = doc.getFirstChild(); NodeList nl = n.getChildNodes(); Node an,an2;  for (int i=0; i < nl.getLength(); i++) {     an = nl.item(i);     if(an.getNodeType()==Node.ELEMENT_NODE) {         NodeList nl2 = an.getChildNodes();          for(int i2=0; i2<nl2.getLength(); i2++) {             an2 = nl2.item(i2);             // DEBUG PRINTS             System.out.println(an2.getNodeName() + ": type (" + an2.getNodeType() + "):");             if(an2.hasChildNodes()) System.out.println(an2.getFirstChild().getTextContent());             if(an2.hasChildNodes()) System.out.println(an2.getFirstChild().getNodeValue());             System.out.println(an2.getTextContent());             System.out.println(an2.getNodeValue());         }     } }

Output was:

#text: type (3): foobar foobar #text: type (3): foobar2 foobar2

answered Sep 28 '22 23:09

jsight

If your XML goes quite deep, you might want to consider using XPath, which comes with your JRE, so you can access the contents far more easily using:

String text = xp.evaluate("//add[@job='351']/tag[position()=1]/text()",      document.getDocumentElement());

Full example:

import static org.junit.Assert.assertEquals; import java.io.StringReader;     import javax.xml.parsers.DocumentBuilder; import javax.xml.parsers.DocumentBuilderFactory; import javax.xml.xpath.XPath; import javax.xml.xpath.XPathFactory;     import org.junit.Before; import org.junit.Test; import org.w3c.dom.Document; import org.xml.sax.InputSource;  public class XPathTest {      private Document document;      @Before     public void setup() throws Exception {         String xml = "<add job=\"351\"><tag>foobar</tag><tag>foobar2</tag></add>";         DocumentBuilderFactory dbf = DocumentBuilderFactory.newInstance();         DocumentBuilder db = dbf.newDocumentBuilder();         document = db.parse(new InputSource(new StringReader(xml)));     }      @Test     public void testXPath() throws Exception {         XPathFactory xpf = XPathFactory.newInstance();         XPath xp = xpf.newXPath();         String text = xp.evaluate("//add[@job='351']/tag[position()=1]/text()",                 document.getDocumentElement());         assertEquals("foobar", text);     } }

answered Sep 29 '22 00:09

toolkit

Related questions
                            
                                How to add Log4J2 appenders at runtime programmatically?
                            
                                Maven -DskipTests ignored
                            
                                Java 8 – Create Instant from LocalDateTime with TimeZone [duplicate]
                            
                                How to convert Integer[] to int[] array in Java?
                            
                                Passing a JavaScript object using addJavascriptInterface() on Android
                            
                                What are Java's primitive types? [duplicate]
                            
                                Is there a way to check if two Collections contain the same elements, independent of order?
                            
                                Can you get basic GC stats in Java?
                            
                                What is a best practice of writing hash function in java?
                            
                                Assert keyword in Java
                            
                                Why are interface method invocations slower than concrete invocations?
                            
                                How to select an item from a dropdown list using Selenium WebDriver with java?
                            
                                How Does The Bitwise & (AND) Work In Java?
                            
                                How to create a Multidimensional ArrayList in Java?
                            
                                Apache Commons CLI - option type and default value
                            
                                Is there a way to use maven property in Java class during compilation
                            
                                Apostrophe (') in XPath query
                            
                                Java code/library for generating slugs (for use in pretty URLs)
                            
                                What is the difference between Boolean.TRUE and true in Java?
                            
                                How can I clear the Scanner buffer in Java?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With