Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Strip whitespace and newlines from XML in Java

Using Java, I would like to take a document in the following format:

<tag1>
 <tag2>
    <![CDATA[  Some data ]]>
 </tag2>
</tag1>

and convert it to:

<tag1><tag2><![CDATA[  Some data ]]></tag2></tag1>

I tried the following, but it isn't giving me the result I am expecting:

DocumentBuilderFactory dbfac = DocumentBuilderFactory.newInstance();
dbfac.setIgnoringElementContentWhitespace(true);
DocumentBuilder docBuilder = dbfac.newDocumentBuilder();
Document doc = docBuilder.parse(new FileInputStream("/tmp/test.xml"));

Writer out = new StringWriter();
Transformer tf = TransformerFactory.newInstance().newTransformer();
tf.setOutputProperty(OutputKeys.INDENT, "no");
tf.transform(new DOMSource(doc), new StreamResult(out));
System.out.println(out.toString());
like image 408
Jannis Ioannou Avatar asked Jul 28 '12 18:07

Jannis Ioannou


People also ask

How do I remove white space between XML tags?

For this example, if you don't want to strip out the whitespaces for all XML elements, you can instead use <xsl:strip-space element="Book,Chapter, Section"> instead.

How do you add a break in XML?

use <br/> ; or.


1 Answers

Working solution following instructions in the question's comments by @Luiggi Mendoza.

public static String trim(String input) {
    BufferedReader reader = new BufferedReader(new StringReader(input));
    StringBuffer result = new StringBuffer();
    try {
        String line;
        while ( (line = reader.readLine() ) != null)
            result.append(line.trim());
        return result.toString();
    } catch (IOException e) {
        throw new RuntimeException(e);
    }
}
like image 180
Wolfgang Avatar answered Oct 07 '22 06:10

Wolfgang