
Extracting text from a PDF file [closed]

Tags:

java

c#

pdf

I need to extract the text from a PDF file. This text will likely be in a table format, and it is going to be used for automatic transfer of data between an external party and our systems.

Can anyone suggest a command-line tool (e.g. a PDF-to-text converter) or a library that would be good for this?

Language options:

  • C# (preferred)
  • Java (if I must)

I found some ideas here, but I think that question was about a one-off situation, whereas I need something for a daily import:

https://stackoverflow.com/questions/488089/extracting-tables-from-pdf-files

Chris asked Aug 14 '09 04:08


1 Answer

Try this CodeProject article on converting PDF to text in C#:

http://www.codeproject.com/KB/cs/PDFToText.aspx
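
Since the question mentions table-shaped text and allows Java, here is a rough sketch of the step that comes after extraction. It assumes the PDF has already been converted to plain text with column alignment preserved (for example with `pdftotext -layout`), and that columns end up separated by two or more spaces. The class name `LayoutTableParser` and the sample data are made up for illustration, and real PDFs may need a smarter splitting rule.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class LayoutTableParser {

    // Splits layout-preserved text into rows of cells.
    // Assumption: columns are separated by two or more spaces,
    // while a single space is part of a cell value.
    public static List<List<String>> parse(String text) {
        List<List<String>> rows = new ArrayList<>();
        for (String line : text.split("\\R")) {
            String trimmed = line.trim();
            if (trimmed.isEmpty()) {
                continue; // skip blank lines between rows
            }
            rows.add(Arrays.asList(trimmed.split(" {2,}")));
        }
        return rows;
    }

    public static void main(String[] args) {
        // Hypothetical sample of what the extracted text might look like.
        String sample = "Item          Qty    Price\n"
                      + "Blue widget   3      4.50\n";
        for (List<String> row : parse(sample)) {
            System.out.println(String.join("|", row));
        }
    }
}
```

For a daily import you would probably want to validate the number of cells per row before loading anything, because a cell that is empty in the PDF simply disappears with this splitting rule.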

Bye

RRUZ answered Sep 20 '22 16:09