Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Java Pdf Diff library

Tags:

java

diff

pdf

Does anybody know of a open source Java library that will do robust diffing of the text parts of pdf files?

Ideally I would like something that would produce a diff in the form of a patch.

like image 531
mac Avatar asked May 20 '09 10:05

mac


1 Answers

Extract the pdf text with http://incubator.apache.org/pdfbox/ and create a diff with http://code.google.com/p/google-diff-match-patch.

like image 59
trunkc Avatar answered Sep 21 '22 05:09

trunkc