How to locate QR code in large image to improve decoding performance?

Background

I need to detect and decode a relatively small QR code (110x110 pixels) in a large image (2500x2000) on a Raspberry Pi. The QR code can be at any location in the frame, but the orientation is expected to be normal, i.e. top-up. We are using high quality industrial cameras and lenses, so images are generally good quality and in focus.

Currently, I am able to detect and decode the symbol reliably with pyzbar when I crop the image around the QR code using a window of approx. 600x500. If I attempt to decode the full image, the symbol is not detected or decoded.

What I Have Tried

I have written a loop that slides a crop window over the image, and attempts to decode each cropped frame separately. I move the window by 50% each iteration to ensure I don't miss any symbols at the edge of the window.
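
For illustration, a loop of this kind can be sketched as follows. This is a minimal sketch and not the original code: the function name `sliding_windows` and its parameters are hypothetical, and it only generates crop coordinates (each box would then be cropped and passed to the decoder).

```python
def sliding_windows(img_w, img_h, win_w, win_h, overlap=0.5):
    """Yield (x0, y0, x1, y1) crop boxes that cover the whole image."""
    step_x = max(1, int(win_w * (1 - overlap)))
    step_y = max(1, int(win_h * (1 - overlap)))

    def starts(total, win, step):
        # Regular grid of start offsets, plus one final window placed flush
        # with the far edge so the border of the image is never skipped.
        last = max(total - win, 0)
        offsets = list(range(0, last + 1, step))
        if offsets[-1] != last:
            offsets.append(last)
        return offsets

    for y0 in starts(img_h, win_h, step_y):
        for x0 in starts(img_w, win_w, step_x):
            yield x0, y0, min(x0 + win_w, img_w), min(y0 + win_h, img_h)


# 2500x2000 image, 600x500 window, 50% overlap
boxes = list(sliding_windows(2500, 2000, 600, 500))
window_pixels = sum((x1 - x0) * (y1 - y0) for x0, y0, x1, y1 in boxes)
coverage_factor = window_pixels / (2500 * 2000)
```

With these numbers the generator produces 56 crops, and the decoder sees about 3.4 times as many pixels as the image contains.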

I have also tried using OpenCV for detection/decoding, but the performance was no better than with pyzbar.

Problems With My Solution

Problems which affect my current project:

The sliding window approach is difficult to tune, inefficient and slow because:

1. it causes the entire area to be analyzed nearly 4 times; a side effect of shifting the window by 50%,
2. the most reliable window sizes tend to be small and require many iterations,
3. the symbol size may vary due to being closer/further from the camera.

Problems that may affect other projects where I would use this approach:

The sliding window may catch a symbol more than once, making it difficult to determine if the symbol was present more than once.
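
One possible way to handle this, shown only as a hedged sketch: convert every detection to full-image coordinates and treat two detections as the same physical symbol when they carry the same payload and their boxes overlap strongly. The helper names `iou` and `deduplicate`, the 0.5 threshold and the sample boxes are all assumptions made for illustration.

```python
def iou(a, b):
    """Intersection over union of two (x0, y0, x1, y1) boxes."""
    ix = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union else 0.0


def deduplicate(detections, min_iou=0.5):
    """Keep one (payload, box) entry per physical symbol.

    detections: list of (payload, box), with box in full-image coordinates.
    """
    unique = []
    for payload, box in detections:
        seen = any(p == payload and iou(box, b) >= min_iou for p, b in unique)
        if not seen:
            unique.append((payload, box))
    return unique


# Made-up detections for illustration
hits = [
    (b"A", (100, 100, 210, 210)),   # found in one window
    (b"A", (102, 99, 212, 209)),    # same symbol, found again in the next window
    (b"A", (900, 400, 1010, 510)),  # same payload at a different location
]
unique_hits = deduplicate(hits)
```

Here the first two entries collapse into one, while the third is kept because it sits elsewhere in the frame, so a symbol that really is present twice is still counted twice.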

The Question

How can I find the approximate location of the QR code(s) so I can crop the image accordingly?

I am interested in any solutions to improve the detection/decoding performance, but prefer ones that (a) use machine learning techniques (I'm an ML newbie but willing to learn), (b) use OpenCV image pre-processing or (c) make improvements to my basic cropping algorithm.

Sample Image

Here is one of the sample images that I'm using for testing. The lighting is purposely poor to approximate the worst-case scenario; however, the individual codes still detect and decode correctly when cropped.

[Image: QR Code Test Image 001 (https://i.stack.imgur.com/09QoQ.jpg)]

asked Jul 31 '20 16:07

Jens Ehrich

1 Answer

I think I have found a simple yet reliable way to detect the corners of the QR code. However, my approach assumes there is some contrast (the more the better) between the QR code and its surrounding area. Also, we have to keep in mind that neither pyzbar nor cv2.QRCodeDetector is 100% reliable.

So, here is my approach:

1. Resize image. After some experimentation I have come to the conclusion that pyzbar is not completely scale invariant. Although I don't have references that can back this claim, I still use small to medium images for barcode detection as a rule of thumb. You can skip this step as it might seem completely arbitrary.

import cv2
import numpy as np
from pyzbar import pyzbar

image = cv2.imread("image.jpg")
scale = 0.3  # shrink to 30% of the original size
width = int(image.shape[1] * scale)
height = int(image.shape[0] * scale)
image = cv2.resize(image, (width, height))

2. Thresholding. We can take advantage of the fact that barcodes are generally black on white surfaces. The more contrast, the better.

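gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
# THRESH_OTSU picks the threshold automatically, so the 120 passed here is ignored.
# THRESH_BINARY_INV turns the dark modules of the code white.
_, thresh = cv2.threshold(gray, 120, 255, cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU)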

[Image: image after masking (https://i.stack.imgur.com/0yjUm.png)]

3. Dilation + contours. This step is a little bit trickier, and I apologize if my English is not completely clear here. We can see from the previous image that there are black gaps between the white regions inside the QR code. If we were to just find the contours, OpenCV would treat the regions separated by these gaps as separate entities and not as parts of a whole. To make the QR code look like a single white square, we have to apply a morphological operation: namely, we have to dilate the image.

# The bigger the kernel, the more the white region increases.
# If the resizing step was ignored, then the kernel will have to be bigger
# than the one given here.
kernel = np.ones((3, 3), np.uint8)
thresh = cv2.dilate(thresh, kernel, iterations=1)
contours, _ = cv2.findContours(thresh, cv2.RETR_LIST, cv2.CHAIN_APPROX_SIMPLE)

[Image: threshold after dilation (https://i.stack.imgur.com/CZUxh.png)]

4. Filtering and getting bounding boxes. Most of the found contours are too small to contain a barcode, so we have to filter them in order to make our search space smaller. After filtering out the weak candidates, we can fetch the bounding boxes of the strong ones.

EDIT: In this case we are filtering by area (small area = weak candidate), but we can also filter by the extent of the detection. The extent is the ratio of the contour area to the area of its bounding box, so it measures how rectangular an object is, and we can use that information since we know a QR code is a square. I chose the extent to be greater than pi / 4, since that is the extent of a perfect circle, meaning we are also filtering out circular objects.
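
To make the pi / 4 threshold concrete, here is the extent worked out for three ideal shapes (a short derivation added for illustration; $A$ is the contour area, and $w$, $h$ are the width and height of the axis-aligned bounding box):

```latex
\text{extent} = \frac{A}{w \cdot h}
\qquad
\text{upright square: } \frac{s^2}{s \cdot s} = 1
\qquad
\text{circle: } \frac{\pi r^2}{(2r)^2} = \frac{\pi}{4} \approx 0.785
\qquad
\text{square rotated by } 45^\circ\text{: } \frac{s^2}{(\sqrt{2}\,s)^2} = \frac{1}{2}
```

Note that a square rotated by 45 degrees would fall below the threshold, so this filter relies on the codes being roughly upright, as stated in the question.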

bboxes = []
for cnt in contours:
  area = cv2.contourArea(cnt)
  xmin, ymin, width, height = cv2.boundingRect(cnt)
  extent = area / (width * height)
  
  # filter non-rectangular objects and small objects
  if (extent > np.pi / 4) and (area > 100):
    bboxes.append((xmin, ymin, xmin + width, ymin + height))
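
The boxes above are expressed in the coordinates of the resized image. If you would rather cut the regions out of the full-resolution image (which might help with small symbols, although I have not verified this), the boxes have to be scaled back first. Below is a minimal sketch under that assumption; `to_full_resolution` and the `pad` margin are hypothetical names, and the numbers are made up.

```python
import math


def to_full_resolution(box, scale, full_w, full_h, pad=10):
    """Map (xmin, ymin, xmax, ymax) from the resized image to the original.

    pad adds a margin in full-resolution pixels so the quiet zone around
    the symbol is kept; the result is clipped to the image bounds.
    """
    xmin, ymin, xmax, ymax = box
    x0 = max(math.floor(xmin / scale) - pad, 0)
    y0 = max(math.floor(ymin / scale) - pad, 0)
    x1 = min(math.ceil(xmax / scale) + pad, full_w)
    y1 = min(math.ceil(ymax / scale) + pad, full_h)
    return x0, y0, x1, y1


# Example with scale = 0.25 on a 2500x2000 original
full_box = to_full_resolution((30, 60, 60, 90), 0.25, 2500, 2000)
```

The returned box can then be used to slice the unresized image, e.g. `original[y0:y1, x0:x1]`, where `original` stands for a copy of the image kept before the resizing step.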

[Image: Search space (https://i.stack.imgur.com/5oHTE.jpg)]

5. Detect barcodes. We have reduced our search space to just the actual QR codes! Now we can finally use pyzbar without worrying too much about it taking too long to do barcode detection.

qrs = []
info = set()
for xmin, ymin, xmax, ymax in bboxes:
  roi = image[ymin:ymax, xmin:xmax]
  detections = pyzbar.decode(roi, symbols=[pyzbar.ZBarSymbol.QRCODE])
  for barcode in detections:
    info.add(barcode.data)
    # barcode.rect is relative to the ROI, so shift it by the ROI origin
    x, y, w, h = barcode.rect
    qrs.append((xmin + x, ymin + y, xmin + x + w, ymin + y + h))

Unfortunately, pyzbar was only able to decode the information of the largest QR code (b'3280406-001'), even though both barcodes were in the search space. To find out how many times a particular code was detected, you can use a Counter object from the collections standard module. If you don't need that information, you can just use a set as I did here.
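
A small sketch of the difference between the two, using stand-in payloads (the second value is made up for illustration; in the loop above the list would be filled with the `barcode.data` values):

```python
from collections import Counter

# Stand-in payloads; b"3280406-002" is a made-up value for illustration.
decoded = [b"3280406-001", b"3280406-002", b"3280406-001"]

counts = Counter(decoded)   # payload -> number of detections
unique = set(decoded)       # only which payloads were seen

times_seen = counts[b"3280406-001"]
```

Keep in mind that a Counter counts detections, so if two search regions overlap, the same physical symbol may be counted more than once.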

Hope this could be of help :).

answered Oct 04 '22 00:10

Sebastian Liendo

Tags: python, image-processing, computer-vision, qr-code