Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

PowerPoint file structure

I'm trying to build a JavaScript parser for .ppt files. PPTX is no big deal since it' an "open" format, but I'm really lost regarding the file structure of a .ppt file and can't find any useful information.

Given this, has anyone ever tried this, or can at least point me to where I can see the 'spec' for the .ppt, so I can build the parser?

Best Regards, Celso Santos

like image 306
Zed_Blade Avatar asked Jul 27 '12 11:07

Zed_Blade


People also ask

What is the format of PowerPoint files?

The default file format in PowerPoint version 2007 or newer is . pptx. A presentation that you can open on a PC in PowerPoint 2007 and newer versions, or that you can open on a Mac in PowerPoint 2008 and newer versions.

What is the 5 5 5 rule in PowerPoint?

To keep your audience from feeling overwhelmed, you should keep the text on each slide short and to the point. Some experts suggest using the 5/5/5 rule: no more than five words per line of text, five lines of text per slide, or five text-heavy slides in a row.

What are the 5 parts of PowerPoint?

These elements include the Office button, Quick Access toolbar, Title bar, Tabs, scroll bars and a Status bar.


2 Answers

.ppt is a binary file format. You can read the 1997-2007 spec here

Not to discourage you from trying, but you should note that this may wind up being a daunting/almost impossible task for 1 developer to implement since the entire spec represents thousands of programming hours over 10 years.

Joel Spolsky has a good article on dealing with these file formats.

Just for completion sake, here is the spec for the pptx file format.

like image 124
Brandon Boone Avatar answered Sep 24 '22 10:09

Brandon Boone


Your dreams have been answered...

Someone built a pure JavaScript PPT binary file parser. Check it out on Git https://github.com/SheetJS/js-ppt

like image 26
Nick Steele Avatar answered Sep 20 '22 10:09

Nick Steele