
Convert PDF to HTML file Java API

Tags: java, html, pdf

I want to convert a PDF file to an HTML file from a Java application. The PDF file contains images, text, and so on. Does anybody know a good Java API? (Please don't suggest Aspose.) I tried Apache PDFBox but was not satisfied with the results.

asked Apr 07 '14 07:04 by user3505725



1 Answer

CSSBox Pdf2Dom is a Java library that, among other things, converts PDF to HTML. The distribution even includes a PDFToHTML command-line tool based on this library, so you can check whether the results meet your needs. However, converting PDF to HTML is always tricky. The results depend on the complexity and structure of the particular PDF file, so different tools may suit different PDF files.
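For readers who want to try the library from code rather than the command-line tool, here is a minimal sketch. It assumes Pdf2Dom and a PDFBox 2.x release are on the classpath (PDFBox 3.x loads documents differently), and that `PDFDomTree` exposes `writeText` as in the Pdf2Dom releases built on PDFBox 2.x. The class name `PdfToHtmlSketch`, the helper `htmlNameFor`, and the file names are hypothetical and only there for illustration.

```java
import java.io.File;
import java.io.PrintWriter;
import java.io.Writer;

import org.apache.pdfbox.pdmodel.PDDocument;
import org.fit.pdfdom.PDFDomTree;

public class PdfToHtmlSketch {

    // Hypothetical helper: derives "report.html" from "report.pdf".
    public static String htmlNameFor(String pdfName) {
        int dot = pdfName.lastIndexOf('.');
        String base = (dot > 0) ? pdfName.substring(0, dot) : pdfName;
        return base + ".html";
    }

    // Loads the PDF with PDFBox and lets Pdf2Dom write it out as HTML.
    // Declared as "throws Exception" because the checked exceptions of the
    // PDFDomTree constructor differ between Pdf2Dom versions.
    public static void convert(File pdfFile, File htmlFile) throws Exception {
        try (PDDocument pdf = PDDocument.load(pdfFile);       // PDFBox 2.x API (assumption)
             Writer out = new PrintWriter(htmlFile, "UTF-8")) {
            new PDFDomTree().writeText(pdf, out);
        }
    }

    public static void main(String[] args) throws Exception {
        // "input.pdf" is a placeholder default; pass a real path as the first argument.
        String in = (args.length > 0) ? args[0] : "input.pdf";
        convert(new File(in), new File(htmlNameFor(in)));
    }
}
```

As the answer says, output quality varies with the PDF, so it is worth running this on a few representative files before committing to the library.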

answered Nov 14 '22 01:11 by radkovo