Is there a way (using something like OpenCV) to detect text skew and correct it by rotating the image, much like this? <img src="https://i.stack.imgur.com/uQslJ.png" alt="enter image description here"> <img src="https://i.stack.imgur.com/rTa8X.png" alt="enter image description here"> Rotating an image seems easy enough if you know the angle, but for the images I'm processing I won't know it, so it will need to be detected somehow.


Detect and fix text skew by rotating image

3 Answers

Here is a Java version for reference. It uses the OpenCV Java bindings (org.opencv).

package com.test13;

import org.opencv.core.*;
import org.opencv.imgproc.Imgproc;
import org.opencv.imgcodecs.Imgcodecs;

public class EdgeDetection {

    // Load the native OpenCV library before any OpenCV call.
    static { System.loadLibrary(Core.NATIVE_LIBRARY_NAME); }

    public static void main(String[] args) throws Exception {
        // Read the skewed image and convert it to grayscale.
        Mat src = Imgcodecs.imread("src//data//inclined_text.jpg");
        Mat src_gray = new Mat();
        Imgproc.cvtColor(src, src_gray, Imgproc.COLOR_BGR2GRAY);
        Imgcodecs.imwrite("src//data//inclined_text_src_gray.jpg", src_gray);

        // Invert so that dark text becomes non-zero.
        // This assumes a pure white background; noisy scans may need a threshold instead.
        Mat output = new Mat();
        Core.bitwise_not(src_gray, output);
        Imgcodecs.imwrite("src//data//inclined_text_output.jpg", output);

        // Collect the coordinates of all non-zero (text) pixels.
        Mat points = new Mat();
        Core.findNonZero(output, points);

        // Fit the minimum-area rotated rectangle around the text pixels.
        MatOfPoint mpoints = new MatOfPoint(points);
        MatOfPoint2f points2f = new MatOfPoint2f(mpoints.toArray());
        RotatedRect box = Imgproc.minAreaRect(points2f);

        // Rotate the image about the rectangle's center by the rectangle's angle.
        Mat src_squares = src.clone();
        Mat rot_mat = Imgproc.getRotationMatrix2D(box.center, box.angle, 1);
        Mat rotated = new Mat();
        Imgproc.warpAffine(src_squares, rotated, rot_mat, src_squares.size(), Imgproc.INTER_CUBIC);
        Imgcodecs.imwrite("src//data//inclined_text_squares_rotated.jpg", rotated);
    }
}


answered Oct 19 '22 20:10

Anton KONG

Based on your comment above, here is code adapted from the tutorial here. It works fine for the image above.

Source: https://i.stack.imgur.com/hhIsE.png

Rotated: https://i.stack.imgur.com/BZccx.jpg

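#include <opencv2/opencv.hpp>
#include <vector>

using namespace cv;

int main() {
    // Load the image as grayscale (flag 0).
    Mat src = imread("text.png", 0);
    if (src.empty()) return 1;

    // Inverse threshold so that dark text pixels become non-zero.
    Mat thr;
    threshold(src, thr, 200, 255, THRESH_BINARY_INV);
    imshow("thr", thr);

    // Collect the coordinates of all text pixels.
    std::vector<Point> points;
    Mat_<uchar>::iterator it = thr.begin<uchar>();
    Mat_<uchar>::iterator end = thr.end<uchar>();
    for (; it != end; ++it)
        if (*it)
            points.push_back(it.pos());

    // The angle of the minimum-area rectangle around the text estimates the skew.
    // The reported angle range depends on the OpenCV version and may need normalizing.
    RotatedRect box = minAreaRect(Mat(points));
    Mat rot_mat = getRotationMatrix2D(box.center, box.angle, 1);

    Mat rotated;
    warpAffine(src, rotated, rot_mat, src.size(), INTER_CUBIC);
    imshow("rotated", rotated);
    waitKey(0);  // keep the windows open until a key is pressed
    return 0;
}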
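
One caveat with the minAreaRect approach: the rectangle's angle can be reported relative to either side of the box, so nearly level text may come back as an angle near -90 or 90 rather than near 0, and the exact range differs between OpenCV versions. A minimal sketch of one way to handle this, assuming the true skew is under 45 degrees, is to fold the angle into [-45, 45) before building the rotation matrix. The helper name fold_angle is hypothetical (it is not part of OpenCV), and the sketch is written in plain Python for brevity:

```python
def fold_angle(angle):
    """Fold a rectangle angle in degrees into the range [-45, 45).

    A rectangle looks the same after a 90-degree turn (with width and
    height swapped), so adding or subtracting 90 describes the same box.
    """
    return (angle + 45) % 90 - 45


# Reported angles that each describe a box tilted by 3 degrees one way or the other.
for reported in (-87, 3, 87, -3):
    print(reported, "->", fold_angle(reported))
```

The folded value would then be passed to getRotationMatrix2D in place of box.angle. If the text can be rotated by more than 45 degrees, this folding is ambiguous and some other cue is needed.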

Edit: Also see the answer here, which might be helpful.

answered Oct 19 '22 21:10

Haris

Here's an implementation of the Projection Profile Method for skew angle estimation. The idea is to rotate the thresholded image through a range of candidate angles and, for each angle, build a histogram of the row sums (the horizontal projection). When the text lines are horizontal, rows alternate sharply between text and gaps, so the histogram has pronounced peaks. Each angle is scored by the sum of squared differences between adjacent histogram bins, and the skew angle is the candidate within the search interval that maximizes this score. The image is then rotated by that angle to correct the skew.
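
To see why the score peaks when text lines are level, here is a minimal sketch of the scoring step on two tiny hand-made binary grids. It uses plain Python lists instead of NumPy, and the grids and the name profile_score are illustrative assumptions, not part of the implementation in this answer:

```python
def profile_score(rows):
    """Sum of squared differences between adjacent row sums."""
    histogram = [sum(row) for row in rows]
    return sum((b - a) ** 2 for a, b in zip(histogram, histogram[1:]))


# Two "text lines" that are perfectly horizontal.
aligned = [
    [0, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0, 0],
]

# The same two lines sloping downward, so their ink spreads across rows.
skewed = [
    [1, 1, 0, 0, 0, 0],
    [0, 0, 1, 1, 1, 0],
    [1, 0, 0, 0, 0, 1],
    [0, 1, 1, 1, 0, 0],
    [0, 0, 0, 0, 1, 1],
]

print("aligned score:", profile_score(aligned))
print("skewed score:", profile_score(skewed))
```

Both grids contain the same number of set pixels, but the aligned one concentrates them in two rows, which gives a much higher score than the skewed one.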

Input: (image)

Result: (image)

Skew angle: -5

import cv2
import numpy as np
from scipy.ndimage import rotate

def correct_skew(image, delta=1, limit=5):
    def determine_score(arr, angle):
        # Rotate the binary image and sum each row to get the horizontal projection.
        data = rotate(arr, angle, reshape=False, order=0)
        histogram = np.sum(data, axis=1, dtype=float)
        # Large differences between adjacent rows mean the text lines are well aligned.
        score = np.sum((histogram[1:] - histogram[:-1]) ** 2, dtype=float)
        return histogram, score

    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    # Otsu threshold, inverted so that text pixels are non-zero.
    thresh = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU)[1]

    # Score every candidate angle from -limit to +limit in steps of delta.
    scores = []
    angles = np.arange(-limit, limit + delta, delta)
    for angle in angles:
        _, score = determine_score(thresh, angle)
        scores.append(score)

    best_angle = angles[scores.index(max(scores))]

    # Rotate the original image about its center by the best-scoring angle.
    (h, w) = image.shape[:2]
    center = (w // 2, h // 2)
    M = cv2.getRotationMatrix2D(center, best_angle, 1.0)
    corrected = cv2.warpAffine(image, M, (w, h), flags=cv2.INTER_CUBIC,
                               borderMode=cv2.BORDER_REPLICATE)

    return best_angle, corrected

if __name__ == '__main__':
    image = cv2.imread('1.png')
    angle, corrected = correct_skew(image)
    print('Skew angle:', angle)
    cv2.imshow('corrected', corrected)
    cv2.waitKey()

Note: You may need to adjust the delta and limit values depending on the image. delta controls the step between candidate angles, and limit controls the maximum angle searched in either direction. The method simply tries every angle from -limit to +limit in steps of delta, so with the defaults it only corrects skew within +/- 5 degrees. To correct a larger skew, increase limit.
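
As a rough illustration of how delta and limit shape the search, the sketch below lists the candidate angles and picks the best one for a toy score function. It is plain Python; the helper names and the toy score (which pretends the true skew is 7 degrees) are assumptions for illustration only:

```python
def candidate_angles(delta=1, limit=5):
    """Angles from -limit to +limit inclusive, spaced delta apart."""
    steps = int(round(2 * limit / delta))
    return [-limit + i * delta for i in range(steps + 1)]


def best_angle(score_fn, delta=1, limit=5):
    """Return the candidate angle with the highest score."""
    return max(candidate_angles(delta, limit), key=score_fn)


def toy_score(angle):
    # Hypothetical score that peaks at a true skew of 7 degrees.
    return -(angle - 7) ** 2


print(len(candidate_angles()))                     # number of angles with the defaults
print(len(candidate_angles(delta=0.5, limit=15)))  # a finer, wider search
print(best_angle(toy_score))                       # search clipped at the default limit
print(best_angle(toy_score, limit=10))             # wide enough to reach the peak
```

With the default limit the search stops at the edge of its range and cannot reach a 7 degree skew, while a larger limit finds it. A wider limit or a smaller delta means more rotations to score, so the run time grows accordingly.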

answered Oct 19 '22 20:10

nathancy


Tags: image, image-processing, opencv, computer-vision

Sam Jarman
