How to transform huge XML files in Java?

Tags: java, parsing, xml

As the title says, I have a huge XML file (several GB):

<root>
   <keep>
      <stuff>  ...  </stuff>
      <morestuff> ... </morestuff>
   </keep>
   <discard>
      <stuff>  ...  </stuff>
      <morestuff> ... </morestuff>
   </discard>
</root>

and I'd like to transform it into a much smaller one that retains only a few of the elements.
My parser should do the following:
1. Parse through the file until a relevant element starts.
2. Copy the whole relevant element (with its children) to the output file, then go back to step 1.

Step 1 is easy with SAX but impossible with a DOM parser, which needs the whole file in memory.
Step 2 is annoying with SAX but easy with a DOM parser or XSLT.

So is there a neat way to combine SAX and a DOM parser to do the task?

783

asked May 05 '10 13:05

user306708

1 Answer

StAX would seem to be one obvious solution: it's a pull parser rather than either the "push" of SAX or the "buffer the whole thing" approach of DOM. Can't say I've used it though. A "StAX tutorial" search may come in handy :)
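
For illustration, here is a minimal sketch of how the two steps from the question might look with the StAX event API (javax.xml.stream). It is not taken from the answer itself: the class name KeepFilter, the method filter, and the keepName parameter are hypothetical, and wrapping the kept elements in a fresh <root> element is an assumption about the desired output. Namespaces are ignored; only the local element name is compared.

```java
import java.io.Reader;
import java.io.Writer;
import javax.xml.stream.XMLEventFactory;
import javax.xml.stream.XMLEventReader;
import javax.xml.stream.XMLEventWriter;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLOutputFactory;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.events.XMLEvent;

// Hypothetical helper: streams the input and copies only the subtrees
// whose element name equals keepName, wrapped in a new <root> element.
final class KeepFilter {

    static void filter(Reader in, Writer out, String keepName) throws XMLStreamException {
        XMLEventReader reader = XMLInputFactory.newInstance().createXMLEventReader(in);
        XMLEventWriter writer = XMLOutputFactory.newInstance().createXMLEventWriter(out);
        XMLEventFactory events = XMLEventFactory.newInstance();

        writer.add(events.createStartDocument());
        writer.add(events.createStartElement("", "", "root")); // assumed wrapper element

        int depth = 0; // 0 = outside a kept element, >0 = nesting depth inside one
        while (reader.hasNext()) {
            XMLEvent event = reader.nextEvent();
            if (depth == 0) {
                // Step 1: skip events until a relevant element starts.
                if (event.isStartElement()
                        && event.asStartElement().getName().getLocalPart().equals(keepName)) {
                    depth = 1;
                    writer.add(event);
                }
            } else {
                // Step 2: copy every event until the kept element is closed again.
                if (event.isStartElement()) {
                    depth++;
                } else if (event.isEndElement()) {
                    depth--;
                }
                writer.add(event);
            }
        }

        writer.add(events.createEndElement("", "", "root"));
        writer.add(events.createEndDocument());
        writer.close();
        reader.close();
    }
}
```

For a real multi-gigabyte file, the Reader and Writer would presumably be buffered file streams (for example from java.nio.file.Files.newBufferedReader and newBufferedWriter). Only the current event is held at any time, so memory use should not grow with the file size.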

163

answered Oct 22 '22 12:10

Jon Skeet
