OCR confidence score from Google Vision API

Tags:

I am using Google Vision OCR for extracting text from images in python.
Using the following code snippet.
However, the confidence score always shows 0.0 which is definitely incorrect.

How to extract the OCR confidence score for individual char or word from the Google response?

Click to copy

 content = cv2.imencode('.jpg', cv2.imread(file_name))[1].tostring()
 img = types.Image(content=content)
 response1 = client.text_detection(image=img, image_context={"language_hints": ["en"]})
 response_annotations = response1.text_annotations
 for x in response1.text_annotations:
      print(x)
      print(f'confidence:{x.confidence}')

Ex: output for an iteration

Click to copy

description: "Date:"
bounding_poly {
  vertices {
    x: 127
    y: 11
  }
  vertices {
    x: 181
    y: 10
  }
  vertices {
    x: 181
    y: 29
  }
  vertices {
    x: 127
    y: 30
  }
}

confidence:0.0

238

asked Jul 01 '20 17:07

letsBeePolite

2 Answers

I managed to reproduce your issue. I used the following function and obtained confidence 0.0 for all items.

Click to copy

from google.cloud import vision

def detect_text_uri(uri):
    client = vision.ImageAnnotatorClient()
    image = vision.types.Image()
    image.source.image_uri = uri

    response = client.text_detection(image=image)
    texts = response.text_annotations
    print('Texts:')

    for text in texts:
        print('\n"{}"'.format(text.description))

        vertices = (['({},{})'.format(vertex.x, vertex.y)
                    for vertex in text.bounding_poly.vertices])

        print('bounds: {}'.format(','.join(vertices)))
        print("confidence: {}".format(text.confidence))

    if response.error.message:
        raise Exception(
            '{}\nFor more info on error messages, check: '
            'https://cloud.google.com/apis/design/errors'.format(
                response.error.message))

However, when using the same image with the "Try the API" option in the documentation I obtained a result with confidences non 0. This happened also when detecting text from a local image.

One should expect confidences to have the same value using both methods. I've opened an issue tracker, check it here.

157

answered Oct 18 '22 03:10

aemon4

Working code that retrieves the right confidence values of GOCR response.

(using document_text_detection() instead of text_detection())

Click to copy

def detect_document(path):
    """Detects document features in an image."""
    from google.cloud import vision
    import io
    client = vision.ImageAnnotatorClient()

    # [START vision_python_migration_document_text_detection]
    with io.open(path, 'rb') as image_file:
        content = image_file.read()

    image = vision.types.Image(content=content)

    response = client.document_text_detection(image=image)

    for page in response.full_text_annotation.pages:
        for block in page.blocks:
            print('\nBlock confidence: {}\n'.format(block.confidence))

            for paragraph in block.paragraphs:
                print('Paragraph confidence: {}'.format(
                    paragraph.confidence))

                for word in paragraph.words:
                    word_text = ''.join([
                        symbol.text for symbol in word.symbols
                    ])
                    print('Word text: {} (confidence: {})'.format(
                        word_text, word.confidence))

                    for symbol in word.symbols:
                        print('\tSymbol: {} (confidence: {})'.format(
                            symbol.text, symbol.confidence))

    if response.error.message:
        raise Exception(
            '{}\nFor more info on error messages, check: '
            'https://cloud.google.com/apis/design/errors'.format(
                response.error.message))
    # [END vision_python_migration_document_text_detection]
# [END vision_fulltext_detection]

# add your own path
path = "gocr_vision.png"
detect_document(path)

answered Oct 18 '22 03:10

letsBeePolite

Related questions
                            
                                How to substract a day to a date field in Datastudio
                            
                                Get user's birthday/gender using Google Sign-In in Flutter
                            
                                Configuring multiple scheme in iOS causes mismatch in flavors
                            
                                Does iOS has In App updates like feature as of Android?
                            
                                How to use airflow DataFlowPythonOperator for beam pipeline?
                            
                                How can I run uncompiled Spark Scala/spark-shell code as a Dataproc job?
                            
                                Reuse Credential for Sign in with Apple in Firebase iOS
                            
                                Firebase Test Lab Coverage with Orchestrator Permission Denied
                            
                                Cannot find module 'firebase/app' while deploying Angular Universal app
                            
                                Communication between Pods in Kubernetes. Service object or Cluster Networking?
                            
                                Ionic Capacitor firebase push notification, error:Default FirebaseApp is not initialized in this process
                            
                                Where can I find firebase-debug.log to understand why emulators did not cleanly shut down?
                            
                                How can you see the stack trace of an uncaught exception in Google Cloud Functions?
                            
                                What machine instance to use for running GPU workloads in Google Cloud Platform [closed]
                            
                                "network: session_affinity:true " property of app.yaml file is not reflecting in google app engine
                            
                                Flutter: Using path provider when app is in background
                            
                                Google Data Studio displays "null" – how to set the field value?
                            
                                how to discard initial data in a Firebase DB
                            
                                Auto increment a value in firebase with javascript
                            
                                How to send Firebase Cloud Messaging from a node server?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

OCR confidence score from Google Vision API

Tags:

image-processing

computer-vision

ocr

google-vision

google-cloud-vision

letsBeePolite

People also ask

2 Answers

aemon4

letsBeePolite

Recent Activity

Donate For Us