Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Parse Pdf File and write content in word file using java

Tags:

java

ms-word

pdf

how to Parse a PDF file and write the content in word file using Java?

like image 952
kedar kamthe Avatar asked Feb 05 '09 07:02

kedar kamthe


2 Answers

For parsing a PDF file in Java, you can use Apache PDFBox: http://incubator.apache.org/pdfbox/

For reading/writing Word (or other Office) file formats in Java, try POI: http://poi.apache.org/

Both are free.

like image 130
breakingobstacles Avatar answered Oct 05 '22 19:10

breakingobstacles


Try the iText java library:

iText is an ideal library for developers looking to enhance web- and other applications with dynamic PDF document generation and/or manipulation.

It can be used for your parsing step.

As for generating word documents - the OpenOffice Java API might be able to generate Word compatible docs (no personal experience with this API).

like image 27
gimel Avatar answered Oct 05 '22 21:10

gimel