Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

PDF Parser API in Java [closed]

I want to convert the pdf data into our own file specifications. So pls help me out to choose the correct API for PDF parsing using java or .net. The parsing should extract each and every component(element) from the PDF pages.

like image 930
Naimur Avatar asked Jul 13 '10 08:07

Naimur


1 Answers

There's a library called IText that does what you want. It's sort of the #1 product out there and is free as in beer.

I've worked with IText before, extracting content from PDFs, and while it's not super-duper automatic, it allows you to get at everything.

Recommended, in other words.

like image 129
Carl Smotricz Avatar answered Sep 30 '22 16:09

Carl Smotricz