Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How can I convert PDF to HTML?

What good libraries are there, in any common language, for converting PDF to HTML?

like image 468
user178644 Avatar asked Oct 28 '09 17:10

user178644


People also ask

How do I convert a PDF to HTML in Chrome?

Steps to convert a PDF to Chrome HTML. Use your file explorer to navigate to the desired PDF document. Right-click on the file and choose Open With and then Google Chrome. Your PDF document will open in a new Chrome browser window.


2 Answers

PDFBox at apache has an html extraction capability. http://pdfbox.apache.org/

like image 86
John Thorhauer Avatar answered Sep 29 '22 16:09

John Thorhauer


If you are working on a Windows box, I think Amyuni has a library for this as well. Their PDF Document Convertor is accessible as a DLL, can be used widely among the languages supported by Visual Studio, and can convert to RTF, TML, EXCEL, JPEG, and TIFF.

like image 22
William Daniel Avatar answered Sep 29 '22 16:09

William Daniel