google-cloud-vision how to read pdf file

Tags:

google-cloud-vision

I am using the Google OCR API to read both images and PDF files. Image files work fine, but for PDF files the Google OCR API documentation says the document has to be stored in Google Cloud Storage first.

Due to data confidentiality, I can't store my data in Google Cloud, and I want to send the PDF from my local system to have its text read. Is it possible to process a PDF from local disk instead of uploading the file to Google Cloud?

212

asked Aug 24 '18 01:08

ZeeKhan

1 Answer

As you said, it's not possible to do that locally. I filed a Feature Request [1] on your behalf so you can follow updates there.

In the meantime, here is a possible workaround that might address your data confidentiality concerns. It consists of using the Cloud Storage client libraries [2] to both upload and delete those files:

1. You have the PDF file locally and no buckets containing it.
2. Upload it to a bucket [3]
3. Use that bucket+file URI to read it through Cloud Vision API and store the result in a bucket
4. Download the result file into your local machine [4]
5. Delete both the PDF file and the result file from the bucket(s) [5]

This should work as long as you don't mind having those files in buckets for a brief period of time.
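
For illustration, the steps above could be sketched roughly like this in Python, assuming the `google-cloud-storage` and `google-cloud-vision` client libraries are installed and credentials are already configured. The function names (`ocr_local_pdf`, `extract_text`), the `input.pdf` / `output/` object names, the per-run prefix and the timeout are my own choices for the example, not anything prescribed by the API. Treat it as a starting point rather than a tested implementation:

```python
import json
import uuid


def extract_text(result_blobs):
    """Join the page text found in Vision's JSON output files (passed as bytes)."""
    pages = []
    for raw in result_blobs:
        for response in json.loads(raw).get("responses", []):
            pages.append(response.get("fullTextAnnotation", {}).get("text", ""))
    return "\n".join(pages)


def ocr_local_pdf(local_path, bucket_name, timeout=420):
    """Upload a local PDF, OCR it, fetch the text, then delete everything."""
    # Imported here so extract_text() can be used without the client libraries.
    from google.cloud import storage, vision

    storage_client = storage.Client()
    bucket = storage_client.bucket(bucket_name)
    run_id = uuid.uuid4().hex  # hypothetical per-run prefix to isolate the files
    pdf_blob = bucket.blob(f"{run_id}/input.pdf")
    out_prefix = f"{run_id}/output/"

    try:
        # Step 2: upload the local PDF to the bucket.
        pdf_blob.upload_from_filename(local_path)

        # Step 3: ask Cloud Vision to read it and write JSON results to the bucket.
        request = vision.AsyncAnnotateFileRequest(
            features=[vision.Feature(type_=vision.Feature.Type.DOCUMENT_TEXT_DETECTION)],
            input_config=vision.InputConfig(
                gcs_source=vision.GcsSource(uri=f"gs://{bucket_name}/{pdf_blob.name}"),
                mime_type="application/pdf",
            ),
            output_config=vision.OutputConfig(
                gcs_destination=vision.GcsDestination(uri=f"gs://{bucket_name}/{out_prefix}"),
                batch_size=2,
            ),
        )
        operation = vision.ImageAnnotatorClient().async_batch_annotate_files(requests=[request])
        operation.result(timeout=timeout)

        # Step 4: download the result files. Listing order is lexicographic,
        # so long PDFs may need their output files re-sorted by page range.
        results = storage_client.list_blobs(bucket_name, prefix=out_prefix)
        return extract_text(blob.download_as_bytes() for blob in results)
    finally:
        # Step 5: delete the PDF and the results, even if something failed above.
        for blob in storage_client.list_blobs(bucket_name, prefix=f"{run_id}/"):
            blob.delete()
```

You would call it as `text = ocr_local_pdf("document.pdf", "my-bucket")`, where both arguments are placeholders. Doing the cleanup in a `finally` block keeps the time the files spend in the bucket as short as possible, including when the OCR step raises.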

135

answered Sep 22 '22 15:09

Iñigo
