Is it possible to determine programatically whether a PDF is "tagged" (for accessibility)? I'm using PHP, and would like (if possible) to simply read a PDF file and return true if tagged, false if not.
I've looked at FPDF and TCPDF, but it isn't clear to me whether either can extract this information.
A tagged PDF is a PDF document that contains tags. Tags provide a logical structure that governs how the content of the PDF is presented through assistive technology. A properly tagged PDF, therefore, becomes accessible to everyone.
Finding Tagged Content from the Document Panel Select text, an image, or a table from the document panel 3. Select the Options menu at the top of the Tags pane and then select Find Tag from Selection.
There is no sure proof way to determine if a generic PDF file is modified. If you go to the document properties of a PDF file (control or command d), if the proper metadata is available, it will list the creation date and time and modified date and time.
In the official ISO PDF-1.7 specification (in the copy available for free from the Adobe website), I read on page 574:
"A Tagged PDF document shall also contain a mark information dictionary (see Table 321) with a value of true for the Marked entry."
To me that means...
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With