I want to convert the pdf data into our own file specifications. So pls help me out to choose the correct API for PDF parsing using java or .net. The parsing should extract each and every component(element) from the PDF pages.
There's a library called IText that does what you want. It's sort of the #1 product out there and is free as in beer.
I've worked with IText before, extracting content from PDFs, and while it's not super-duper automatic, it allows you to get at everything.
Recommended, in other words.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With