Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Convert pdf, doc, ppt to html5 [closed]

I've googled (without any luck) for open source software that can convert doc, ppt, and pdf to HTML5. (Exactly what Scribd does) Are there open source equivalents to the type of conversion Scribd does?

If anyone knows of a paid service, that would also work. Scribd has an API, but that's for use with the flash viewer. Also, I would like to host my own content as I need further control over converted html document.

like image 849
KevMo Avatar asked Jul 07 '10 23:07

KevMo


People also ask

Can you convert PDF to a HTML5?

The quickest way to convert your PDF is to open it in Acrobat. Go to the File menu, navigate down to Export To, and select HTML Web Page. Your PDF will automatically convert and open in your default web browser. For more nuanced options, begin with your PDF open in Acrobat and click on the Export PDF tool on the right.

Can you convert PowerPoint to HTML5?

How to convert a PPT to a HTML5 file? Choose the PPT file that you want to convert. Select HTML5 as the the format you want to convert your PPT file to. Click "Convert" to convert your PPT file.


2 Answers

You're unlikely to find a single offering that does all this, especially in the open source world. It's more likely that you'll end up relying on a mishmash of things, and may even need to chain some converters in order to get to HTML. (Eg PDF -> ps -> HTML)

OpenOffice supports conversion to HTML, and can be called from the command line.

http://pdftohtml.sourceforge.net/ looks reasonably good at converting pdf to html.

For Doc that is Word ML or OpenXML format it's conceivable that you could use XSLT transforms since both input and output formats are XML. I've seen some stylesheets floating around the net that do this, but YMMV.

Incidentally, why is there a specific requirement for open source? MS Powerpoint already supports save-as-HTML for example.

like image 58
imoatama Avatar answered Oct 07 '22 23:10

imoatama


Open Office will convert pdf to html but you'll take a hit to design quality.

I suggest either: Crocodoc as a paid service (It provides different flavours for different platforms such as Python,Ruby,Java,PHP Developers are allowed to work on their APIs.) or waiting for an official Adobe tool (it's in the works).

like image 39
Mark Essel Avatar answered Oct 08 '22 00:10

Mark Essel