How do I extract attachments from a pdf file?

1 Answers

iTextSharp is also quite capable of extracting attachments... Though you might have to use the low level objects to do so.

There are two ways to embed files in a PDF:

In a File Annotation
At the document level "EmbeddedFiles".

Once you have a file specification dictionary from either source, the file itself will be a stream within the dictionary labeled "EF" (embedded file).

So to list all the files at the document level, one would write code (in Java) as such:

Map<String, byte[]> files = new HashMap<String,byte[]>();

PdfReader reader = new PdfReader(pdfPath);
PdfDictionary root = reader.getCatalog();
PdfDictionary names = root.getAsDict(PdfName.NAMES); // may be null
PdfDictionary embeddedFilesDict = names.getAsDict(PdfName.EMBEDDEDFILES); //may be null
PdfArray embeddedFiles = embeddedFilesDict.getAsArray(PdfName.NAMES); // may be null

int len = embeddedFiles.size();
for (int i = 0; i < len; i += 2) {
  PdfString name = embeddedFiles.getAsString(i); // should always be present
  PdfDictionary fileSpec = embeddedFiles.getAsDict(i+1); // ditto

  PdfDictionary streams = fileSpec.getAsDict(PdfName.EF);
  PRStream stream = null;

  if (streams.contains(PdfName.UF))
    stream = (PRStream)streams.getAsStream(PdfName.UF);
  else
    stream = (PRStream)streams.getAsStream(PdfName.F); // Default stream for backwards compatibility

  if (stream != null) {
    files.put( name.toUnicodeString(), PdfReader.getStreamBytes((PRStream)stream));
  }
}

151

answered Oct 10 '22 16:10

Mark Storer

Related questions
                            
                                Web Request/Upload Failing at Very End
                            
                                Constructing a Generic object (not default constructor)
                            
                                Is it possible to make the WinForms Tab Control be able to do tab reordering like IE or Firefox?
                            
                                Why can I use a lambda expression in place of a callback delegate?
                            
                                How to rename excel sheet name dynamically in C#
                            
                                c# compare the data in two object models
                            
                                Unit testing and nhibernate?
                            
                                How to map a string to a date in automapper?
                            
                                .NET SupportedRuntime in App.config
                            
                                c#: keep ref parameter from constructor in class
                            
                                debug a project with references in Visual studio
                            
                                How to allow inserting on DataGridView?
                            
                                Web Controls within UserControl null?
                            
                                Simplest way to post to a facebook fan page's wall with C#!
                            
                                How to set full control to a directory
                            
                                Return result between Windows Forms in C#
                            
                                Change background color of ListView row programmatically (wpf)
                            
                                Work with an Amazon S3 response stream after response has been disposed
                            
                                Activation error occured while trying to get instance of type ICacheManager, key "Cache Manager"
                            
                                Application shown as threat by AVG

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How do I extract attachments from a pdf file?

Tags:

c#

.net

pdf

gyurisc

People also ask

1 Answers

Mark Storer

Recent Activity

Donate For Us