I was looking around for an OCR library - optimally it would be open-source - that I could use on some Arabic pdfs. Googling it didn't result in anything useful. I was wondering if anyone knows a related OCR library or even one that works on related languages (Farsi and Urdu could be relevant) that Arabic support could be added to.
Any general suggestions on how to approach this will be appreciated.
Starting with Version 3.01 of Tessaract-ocr it now supports Arabic
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With