XML Diff and Merge

1 Answer

In my last job, we had a similar problem: We had to detect changes, insertions, and deletions of specific items between two XML files. The files weren't arbitrary XML; they had to adhere to our XSD.

Our solution was a merge-style comparison: we parsed each file (using a SAX parser, not a DOM parser, to permit arbitrarily large files) and stored the parsed data in a separate HashMap per file. Then we compared the contents of the two maps with an algorithm resembling the merge step of a merge sort.
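To make the idea concrete, here is a minimal sketch of that approach, not the code we actually used. It assumes a made-up schema in which each record is an `<item id="...">` element with text content; the class name `XmlItemDiff` and its methods are hypothetical. Each file is streamed through SAX into a map, and the two key sets are then sorted and walked with two cursors, much like the merge step of a merge sort.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical sketch: assumes records look like <item id="...">text</item>.
final class XmlItemDiff {

    /** Streams the document with SAX and returns id -> trimmed text for each <item>. */
    static Map<String, String> parse(InputSource source) {
        Map<String, String> items = new HashMap<>();
        DefaultHandler handler = new DefaultHandler() {
            private String currentId; // id of the <item> being read, or null outside one
            private final StringBuilder text = new StringBuilder();

            @Override
            public void startElement(String uri, String localName, String qName, Attributes attrs) {
                if ("item".equals(qName)) {
                    currentId = attrs.getValue("id"); // items without an id are skipped
                    text.setLength(0);
                }
            }

            @Override
            public void characters(char[] ch, int start, int length) {
                if (currentId != null) {
                    text.append(ch, start, length);
                }
            }

            @Override
            public void endElement(String uri, String localName, String qName) {
                if ("item".equals(qName) && currentId != null) {
                    items.put(currentId, text.toString().trim());
                    currentId = null;
                }
            }
        };
        try {
            SAXParserFactory.newInstance().newSAXParser().parse(source, handler);
        } catch (ParserConfigurationException | SAXException | IOException e) {
            throw new IllegalStateException("could not parse XML", e);
        }
        return items;
    }

    /** Walks both sorted key lists in step and reports deleted, inserted, and changed ids. */
    static List<String> diff(Map<String, String> oldItems, Map<String, String> newItems) {
        List<String> oldKeys = new ArrayList<>(oldItems.keySet());
        List<String> newKeys = new ArrayList<>(newItems.keySet());
        Collections.sort(oldKeys);
        Collections.sort(newKeys);

        List<String> changes = new ArrayList<>();
        int i = 0;
        int j = 0;
        while (i < oldKeys.size() || j < newKeys.size()) {
            int cmp;
            if (i == oldKeys.size()) {
                cmp = 1;  // old side exhausted: the rest are insertions
            } else if (j == newKeys.size()) {
                cmp = -1; // new side exhausted: the rest are deletions
            } else {
                cmp = oldKeys.get(i).compareTo(newKeys.get(j));
            }
            if (cmp < 0) {
                changes.add("deleted " + oldKeys.get(i++));
            } else if (cmp > 0) {
                changes.add("inserted " + newKeys.get(j++));
            } else {
                String key = oldKeys.get(i);
                if (!oldItems.get(key).equals(newItems.get(key))) {
                    changes.add("changed " + key);
                }
                i++;
                j++;
            }
        }
        return changes;
    }

    public static void main(String[] args) {
        String before = "<items><item id=\"a\">1</item><item id=\"b\">2</item></items>";
        String after = "<items><item id=\"b\">20</item><item id=\"c\">3</item></items>";
        Map<String, String> oldItems = parse(new InputSource(new StringReader(before)));
        Map<String, String> newItems = parse(new InputSource(new StringReader(after)));
        diff(oldItems, newItems).forEach(System.out::println);
    }
}
```

With plain HashMaps you could equally well probe one map with the keys of the other; the sorted two-cursor walk is shown only because it mirrors the merge-sort analogy.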

Naturally, the larger the files got, the more memory pressure we experienced, so I ultimately wrote a FileHashMap class that pushed the HashMap's value space to random access files. While theoretically slower, this solution allowed our comparisons to work with very large files, without thrashing or OutOfMemoryError conditions. (A version of that FileHashMap class is available in this library: http://www.clapper.org/software/java/util/)
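For illustration only, the general idea of keeping keys in memory and values on disk can be sketched as follows. This is not the FileHashMap class from the library above and does not reflect its API; the name `FileBackedValues` and its methods are hypothetical, and values are assumed to be strings. An in-memory index maps each key to an offset and length in a temporary random access file.

```java
import java.io.Closeable;
import java.io.File;
import java.io.IOException;
import java.io.RandomAccessFile;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch: keys stay in memory, values live in a temporary file.
final class FileBackedValues implements Closeable {
    // key -> {offset, length} of the value's UTF-8 bytes in the file
    private final Map<String, long[]> index = new HashMap<>();
    private final File path;
    private final RandomAccessFile file;

    FileBackedValues() {
        try {
            path = File.createTempFile("values", ".bin");
            path.deleteOnExit();
            file = new RandomAccessFile(path, "rw");
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Appends the value to the file; an overwritten key's old bytes are simply orphaned. */
    void put(String key, String value) {
        try {
            byte[] bytes = value.getBytes(StandardCharsets.UTF_8);
            long offset = file.length();
            file.seek(offset);
            file.write(bytes);
            index.put(key, new long[] {offset, bytes.length});
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Reads the value back from disk, or returns null if the key is unknown. */
    String get(String key) {
        long[] entry = index.get(key);
        if (entry == null) {
            return null;
        }
        try {
            byte[] bytes = new byte[(int) entry[1]];
            file.seek(entry[0]);
            file.readFully(bytes);
            return new String(bytes, StandardCharsets.UTF_8);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    Set<String> keys() {
        return index.keySet();
    }

    @Override
    public void close() {
        try {
            file.close();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } finally {
            path.delete();
        }
    }
}
```

Every lookup costs a seek and a read, which is where the "theoretically slower" trade-off comes from; in exchange, heap usage grows with the number of keys rather than with the total size of the values.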

I have no idea whether what I just described is even remotely close to what you need, but I thought I'd share it, just in case.

Good luck.

178

answered Sep 20 '22 20:09

Brian Clapper



Tags:

java

merge

diff

xml

xslt

user53552
