iOS: Real Time OCR on top of live camera feed (similar to iTunes Redeem Gift Card)

Tags: ios, ocr

Is there a way to accomplish something similar to what the iTunes and App Store apps do when you redeem a gift card using the device camera, that is, recognizing a short string of characters in real time on top of the live camera feed?

[Image: iTunes App Redeem Gift Card UI, https://i.stack.imgur.com/Ubj4l.png]

I know that in iOS 7 there is now the AVMetadataMachineReadableCodeObject class which, AFAIK, only represents barcodes. I'm more interested in detecting and reading the contents of a short string. Is this possible using publicly available API methods, or with some third-party SDK that you might know of?

There is also a video of the process in action:

https://www.youtube.com/watch?v=c7swRRLlYEo

Best,

861

asked Sep 30 '13 18:09

boliva

1 Answer

I'm working on a project that does something similar to the App Store's camera-based gift card redemption that you mentioned.

A great starting place for processing live video is a project I found on GitHub. It uses the AVFoundation framework, and you implement the AVCaptureVideoDataOutputSampleBufferDelegate methods to receive the video frames.

Once you have the image stream (video), you can use OpenCV to process it. You need to determine the area of the image you want to OCR before you run it through Tesseract. You will have to experiment with the filtering, but the broad steps you take with OpenCV are:

Convert the images to grayscale using cv::cvtColor(inputMat, outputMat, CV_RGBA2GRAY);
Threshold the images to eliminate unnecessary elements. You specify a threshold value; pixels on one side of it are set to white and everything else to black (or the reverse).
Determine the lines that form the boundary of the box (or whatever you are processing). You can either create a "bounding box" if you have eliminated everything but the desired area, or use the HoughLines algorithm (or the probabilistic version, HoughLinesP). From the detected lines, you can compute the line intersections to find the corners, then use the corners to warp the desired area into a proper rectangle (if this step is necessary in your application) prior to OCR.
Process that portion of the image with the Tesseract OCR library to get the resulting text. It is also possible to create training files for the letters in OpenCV so you can read the text without Tesseract. This could be faster, but it could also be a lot more work. The App Store app seems to do something similar in order to display the recognized text overlaid on top of the original image. This adds to the cool factor, so it just depends on what you need.
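
As a rough illustration of the first three steps, here is a minimal, dependency-free C++ sketch of grayscale conversion, fixed thresholding, and a bounding box around whatever survives the threshold. It is not OpenCV's implementation: the GrayImage and Box types and the function names are hypothetical and exist only for this example. In OpenCV the corresponding calls would be cv::cvtColor, cv::threshold and cv::boundingRect. The sketch assumes RGBA byte order, as in the cvtColor call above; frames from the iOS camera are often delivered as BGRA, in which case the channel order would need to be swapped.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper types for this sketch; they are not part of OpenCV.
struct GrayImage {
    int width;
    int height;
    std::vector<std::uint8_t> pixels;  // row-major, one byte per pixel
};

struct Box {
    int x, y, width, height;  // all zero when no foreground pixel was found
};

// Step 1: RGBA -> grayscale using the common luma weights
// (0.299 R + 0.587 G + 0.114 B), in integer arithmetic. Alpha is ignored.
GrayImage toGray(const std::vector<std::uint8_t>& rgba, int width, int height) {
    assert(width >= 0 && height >= 0);
    assert(rgba.size() == static_cast<std::size_t>(width) * height * 4);
    GrayImage out;
    out.width = width;
    out.height = height;
    out.pixels.resize(static_cast<std::size_t>(width) * height);
    for (std::size_t i = 0; i < out.pixels.size(); ++i) {
        const unsigned r = rgba[4 * i], g = rgba[4 * i + 1], b = rgba[4 * i + 2];
        out.pixels[i] = static_cast<std::uint8_t>((299 * r + 587 * g + 114 * b + 500) / 1000);
    }
    return out;
}

// Step 2: fixed threshold. Pixels brighter than `threshold` become white (255),
// everything else becomes black (0).
GrayImage binarize(const GrayImage& gray, std::uint8_t threshold) {
    GrayImage out = gray;
    for (std::uint8_t& p : out.pixels) p = (p > threshold) ? 255 : 0;
    return out;
}

// Step 3: smallest axis-aligned rectangle containing every white pixel.
Box boundingBox(const GrayImage& binary) {
    int minX = binary.width, minY = binary.height, maxX = -1, maxY = -1;
    for (int y = 0; y < binary.height; ++y) {
        for (int x = 0; x < binary.width; ++x) {
            if (binary.pixels[static_cast<std::size_t>(y) * binary.width + x] != 0) {
                minX = std::min(minX, x);
                maxX = std::max(maxX, x);
                minY = std::min(minY, y);
                maxY = std::max(maxY, y);
            }
        }
    }
    if (maxX < 0) return Box{0, 0, 0, 0};  // nothing survived the threshold
    return Box{minX, minY, maxX - minX + 1, maxY - minY + 1};
}
```

Calling boundingBox(binarize(toGray(frame, w, h), 128)) on a frame gives the region you would crop (and, if needed, warp) before handing it to the OCR step. The threshold of 128 is only a placeholder; in practice it depends on lighting and on the card you are reading.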

Some other hints:

I used the book "Instant OpenCV" to get started quickly with this. It was pretty helpful.
Download OpenCV for iOS from OpenCV.org/downloads.html
I have found adaptive thresholding to be very useful; you can read all about it by searching for "OpenCV adaptiveThreshold". Also, if your image consists mostly of light and dark elements with very little in between, you can use Otsu's binarization. This automatically determines the threshold value from the histogram of the grayscale image.
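
To make the Otsu hint more concrete, here is a small standalone C++ sketch of how such a threshold can be derived from the histogram: it tries every candidate value and keeps the one that maximizes the between-class variance of the "dark" and "light" pixel groups. The function name otsuThreshold is hypothetical, and this is an illustration of the idea rather than OpenCV's own code; in OpenCV you would normally pass the cv::THRESH_OTSU flag to cv::threshold instead.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Returns a threshold t in [0, 255] such that splitting the pixels into
// "value <= t" and "value > t" maximizes the between-class variance.
// For an image with a single gray level there is nothing to split and 0 is returned.
int otsuThreshold(const std::vector<std::uint8_t>& gray) {
    assert(!gray.empty());

    // Histogram of the 256 possible gray levels.
    std::array<std::uint64_t, 256> hist{};
    for (std::uint8_t p : gray) ++hist[p];

    const double total = static_cast<double>(gray.size());
    double sumAll = 0.0;
    for (int v = 0; v < 256; ++v) sumAll += v * static_cast<double>(hist[v]);

    double weightLow = 0.0;   // number of pixels with value <= t
    double sumLow = 0.0;      // sum of those pixel values
    double bestVariance = -1.0;
    int best = 0;

    for (int t = 0; t < 256; ++t) {
        weightLow += static_cast<double>(hist[t]);
        sumLow += t * static_cast<double>(hist[t]);
        if (weightLow == 0.0) continue;        // no dark pixels yet
        const double weightHigh = total - weightLow;
        if (weightHigh == 0.0) break;          // no light pixels left

        const double meanLow = sumLow / weightLow;
        const double meanHigh = (sumAll - sumLow) / weightHigh;
        const double diff = meanLow - meanHigh;
        // Between-class variance, up to a constant factor of 1 / total^2.
        const double variance = weightLow * weightHigh * diff * diff;
        if (variance > bestVariance) {
            bestVariance = variance;
            best = t;
        }
    }
    return best;
}
```

The returned value can then be used as the fixed threshold in the thresholding step. This works best when the histogram has two clear peaks; with uneven lighting across the frame, adaptive thresholding is usually the better fit.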

164

answered Sep 19 '22 19:09

Donovan
