Is it possible to uncompress PDF by using Adobe Acrobat or Acrobat Distiller?

Tags:

Most PDF files found on the Web have compressed and unreadable data streams. Is it possible to uncompress the internal content of a PDF file using Acrobat or Acrobat Distiller, allowing us to read the source code by a text editor?

P.S. This question is inspired by this answer which explains how it can be done with GhostScript.

725

asked Sep 15 '13 13:09

Alexey Popkov

3 Answers

qpdf and pdftk have already been mentioned. To show the commands:

$ qpdf --qdf --object-streams=disable orig.pdf uncompressed-orig.pdf
$ pdftk orig.pdf output uncompressed-orig.pdf uncompress

mutool however hasn't been mentioned yet:

$ mutool clean -d -a orig.pdf uncompressed-orig.pdf

mutool is a command line tool which ships alongside the lightweight MuPDF PDF + document viewer.

I do not think you can achieve the uncompressing of PDF objects' streams with Acrobat or Distiller (unless you have additional payware plugins available).

112

answered Oct 17 '22 21:10

Kurt Pfeifle

Use cpdf:

cpdf -decompress in.pdf -o out.pdf

and then the graphic operators for each page can be read in a text editor. You'll need a copy of the standard as a reference, though.

Disclosure: I am the author of cpdf.

answered Oct 17 '22 20:10

johnwhitington

This is easy with qpdf and pdftk.

With Adobe Acrobat you can get at the internal structure after profiling a PDF (preflight with some profile (e.g. detect PDF syntax errors), then Options->Internal PDF structure) - but there's no way to get something editable with a text editor.

answered Oct 17 '22 20:10

Martin Schröder

Related questions
                            
                                Converting HTML to PDF using iText
                            
                                Keep table in one piece MigraDoc / PDFsharp
                            
                                Python silent print PDF to specific printer
                            
                                Animated slides conversion to static PDF
                            
                                Advanced PDF parser for Java
                            
                                iTextSharp - Is it possible to set a different font color for the same cell and row?
                            
                                How to extract the title of a PDF document from within a script for renaming?
                            
                                Converting MS Word Documents to PDF in ASP.NET [closed]
                            
                                Opening files in browser instead of downloading
                            
                                'PDFsharp cannot handle this PDF feature introduced with Acrobat 6' error while opening PDF file
                            
                                Wicked_PDF templates is missing
                            
                                Edit Metadata of PDF File with C# [closed]
                            
                                How to show page number (N of N) using xslt in PDF Report
                            
                                Fill pdf form with javascript (client-side only)
                            
                                Can prawn generate PDFs with links?
                            
                                How to detect if a file is PDF or TIFF?
                            
                                PyPDF 2 Decrypt Not Working
                            
                                What intent would open a pdf from a url? [duplicate]
                            
                                How to read PDF form data using iTextSharp?
                            
                                Creating a pdf file in android programmatically and writing in it

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With