How do I load an org.w3c.dom.Document from XML in a string?

I have a complete XML document in a string and would like a Document object. Google turns up all sorts of garbage. What is the simplest solution? (In Java 1.5)

Solution

Thanks to Matt McMinn, I have settled on this implementation. It has the right level of input flexibility and exception granularity for me. (It's good to know if the error came from malformed XML - SAXException - or just bad IO - IOException.)

    public static org.w3c.dom.Document loadXMLFrom(String xml)
        throws org.xml.sax.SAXException, java.io.IOException {
        return loadXMLFrom(new java.io.ByteArrayInputStream(xml.getBytes()));
    }

    public static org.w3c.dom.Document loadXMLFrom(java.io.InputStream is)
        throws org.xml.sax.SAXException, java.io.IOException {
        javax.xml.parsers.DocumentBuilderFactory factory =
            javax.xml.parsers.DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true);
        javax.xml.parsers.DocumentBuilder builder = null;
        try {
            builder = factory.newDocumentBuilder();
        }
        catch (javax.xml.parsers.ParserConfigurationException ex) {
        }
        org.w3c.dom.Document doc = builder.parse(is);
        is.close();
        return doc;
    }

asked Aug 28 '08 20:08

Frank Krueger

1 Answer

Whoa there!

There's a potentially serious problem with this code: it ignores the character encoding declared in the XML (which defaults to UTF-8 when no encoding is declared). When you call String.getBytes(), the platform default encoding is used to encode the Unicode characters to bytes. So the parser may think it's getting UTF-8 data when in fact it's getting EBCDIC or something… not pretty!
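To make the mismatch concrete, here is a minimal sketch that encodes a string with one charset and decodes it with another, which is roughly what happens when getBytes() uses a non-UTF-8 platform default and the parser assumes UTF-8. The class and method names are hypothetical, ISO-8859-1 merely stands in for "some other platform default", and StandardCharsets requires Java 7 or later (on Java 1.5 you would pass charset names as strings instead).

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of the original question or answer.
class EncodingMismatchDemo {
    // Encodes text with one charset and decodes the bytes with another,
    // imitating a parser that assumes a different encoding than the producer used.
    static String roundTrip(String text, Charset encodeWith, Charset decodeWith) {
        return new String(text.getBytes(encodeWith), decodeWith);
    }

    public static void main(String[] args) {
        // \u00e9 is e-acute; written as an escape so the source file's own encoding is irrelevant.
        String xml = "<name>caf\u00e9</name>";

        // Same charset on both sides: the text survives intact.
        System.out.println(roundTrip(xml, StandardCharsets.UTF_8, StandardCharsets.UTF_8));

        // Encoded as ISO-8859-1 but decoded as UTF-8: the accented character is damaged.
        System.out.println(roundTrip(xml, StandardCharsets.ISO_8859_1, StandardCharsets.UTF_8));
    }
}
```

Pure ASCII content hides the problem, which is why code like this often works in testing and fails later on real data.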

Instead, use the parse method that takes an InputSource, which can be constructed with a Reader, like this:

    import java.io.StringReader;
    import org.xml.sax.InputSource;
    …
            return builder.parse(new InputSource(new StringReader(xml)));
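Put together, the String overload might look like the sketch below. It is one possible arrangement, not the asker's final code: the class name XmlStrings is hypothetical, and rethrowing ParserConfigurationException as an IllegalStateException is an assumption made here so that a misconfigured parser fails with a clear message rather than a later NullPointerException.

```java
import java.io.IOException;
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;

// Hypothetical holder class for the sketch.
class XmlStrings {
    // Parses a complete XML document held in a String.
    // A Reader hands the parser characters, so no byte encoding is involved.
    static Document loadXMLFrom(String xml) throws SAXException, IOException {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true);
        DocumentBuilder builder;
        try {
            builder = factory.newDocumentBuilder();
        } catch (ParserConfigurationException ex) {
            // Assumption: treat a misconfigured parser as a programming error.
            throw new IllegalStateException("XML parser is misconfigured", ex);
        }
        return builder.parse(new InputSource(new StringReader(xml)));
    }

    public static void main(String[] args) throws Exception {
        Document doc = loadXMLFrom("<greeting lang=\"fr\">caf\u00e9</greeting>");
        // Prints the text content of the root element.
        System.out.println(doc.getDocumentElement().getTextContent());
    }
}
```

Malformed XML still surfaces as a SAXException, so the exception granularity the question asked for is preserved.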

It may not seem like a big deal, but ignorance of character encoding issues leads to insidious code rot akin to Y2K.

answered Sep 19 '22 18:09

erickson

Tags: java, xml, w3c, document
