I have the following image which has text and a lot of white space underneath the text. I would like to crop the white space such that it looks like the second image. <img src="https://i.stack.imgur.com/pUq4x.png" alt="enter image description here"> Cropped Image <img src="https://i.stack.imgur.com/iGdb6.png" alt="enter image description here"> Here is what I've done <pre class="prettyprint"><code>>>> img = cv2.imread("pg13_gau.jpg.png") >>> gray = cv2.cvtColor(img,cv2.COLOR_BGR2GRAY) >>> edged = cv2.Canny(gray, 30,300) >>> (img,cnts, _) = cv2.findContours(edged.copy(), cv2.RETR_TREE, cv2.CHAIN_APPROX_SIMPLE) >>> cnts = sorted(cnts, key = cv2.contourArea, reverse = True)[:10] </code></pre>

As many have alluded in the comments, the best way is to invert the image so the black text becomes white, find all the non-zero points in the image then determine what the minimum spanning bounding box would be. You can use this bounding box to finally crop your image. Finding the contours is very expensive and it isn't needed here - especially since your text is axis-aligned. You can use a combination of <code>cv2.findNonZero</code> and <code>cv2.boundingRect</code> to do what you need. Therefore, something like this would work: <pre class="prettyprint"><code>import numpy as np import cv2 img = cv2.imread('ws.png') # Read in the image and convert to grayscale gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY) gray = 255*(gray < 128).astype(np.uint8) # To invert the text to white coords = cv2.findNonZero(gray) # Find all non-zero points (text) x, y, w, h = cv2.boundingRect(coords) # Find minimum spanning bounding box rect = img[y:y+h, x:x+w] # Crop the image - note we do this on the original image cv2.imshow("Cropped", rect) # Show it cv2.waitKey(0) cv2.destroyAllWindows() cv2.imwrite("rect.png", rect) # Save the image </code></pre> The code above exactly lays out what I talked about in the beginning. We read in the image, but we also convert to grayscale as your image is in colour for some reason. The tricky part is the third line of code where I threshold below the intensity of 128 so that the dark text becomes white. This however produces a binary image, so I convert to <code>uint8</code>, then scale by 255. This essentially inverts the text. Next, given this image we find all of the non-zero coordinates with <code>cv2.findNonZero</code> and we finally put this into <code>cv2.boundingRect</code> which will give you the top-left corner of the bounding box as well as the width and height. We can finally use this to crop the image. Note we do this on the original image and not the inverted one. We use simply NumPy array indexing to do the cropping for us. Finally, we show the image to show that it works and we save it to disk. <hr> I now get this image: <img src="https://i.stack.imgur.com/MP3kk.png" alt="enter image description here"> <hr> For the second image, a good thing to do is to remove some of the right border and bottom border. We can do that by cropping the image down to that first. Next, this image contains some very small noisy pixels. I would recommend doing a morphological opening with a very small kernel, then redo the logic we talked about above. Therefore: <pre class="prettyprint"><code>import numpy as np import cv2 img = cv2.imread('pg13_gau_preview.png') # Read in the image and convert to grayscale img = img[:-20,:-20] # Perform pre-cropping gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY) gray = 255*(gray < 128).astype(np.uint8) # To invert the text to white gray = cv2.morphologyEx(gray, cv2.MORPH_OPEN, np.ones((2, 2), dtype=np.uint8)) # Perform noise filtering coords = cv2.findNonZero(gray) # Find all non-zero points (text) x, y, w, h = cv2.boundingRect(coords) # Find minimum spanning bounding box rect = img[y:y+h, x:x+w] # Crop the image - note we do this on the original image cv2.imshow("Cropped", rect) # Show it cv2.waitKey(0) cv2.destroyAllWindows() cv2.imwrite("rect.png", rect) # Save the image </code></pre> <h3>Note: Output image removed due to privacy</h3>

Opencv reads the image as a numpy array and it's much simpler to use numpy directly (<code>scikit-image</code> does the same). One possible way of doing it is to read the image as grayscale or convert to it and do the row-wise and column-wise operations as shown in the code snippet below. This will remove the columns and rows when all pixels are of <code>pixel_value</code> (white in this case). <pre class="prettyprint"><code>def crop_image(filename, pixel_value=255): gray = cv2.imread(filename, cv2.IMREAD_GRAYSCALE) crop_rows = gray[~np.all(gray == pixel_value, axis=1), :] cropped_image = crop_rows[:, ~np.all(crop_rows == pixel_value, axis=0)] return cropped_image </code></pre> and the output: <img src="https://i.stack.imgur.com/NcICm.png" alt="enter image description here">

How to remove whitespace from an image in OpenCV?

Tags:

python

image-processing

opencv

opencv3.0

I have the following image which has text and a lot of white space underneath the text. I would like to crop the white space such that it looks like the second image.

enter image description here

Cropped Image

enter image description here

Here is what I've done

Click to copy

>>> img = cv2.imread("pg13_gau.jpg.png")
>>> gray = cv2.cvtColor(img,cv2.COLOR_BGR2GRAY)
>>> edged = cv2.Canny(gray, 30,300)
>>> (img,cnts, _) = cv2.findContours(edged.copy(), cv2.RETR_TREE, cv2.CHAIN_APPROX_SIMPLE)
>>> cnts = sorted(cnts, key = cv2.contourArea, reverse = True)[:10]

925

asked Apr 18 '18 19:04

Anthony

2 Answers

As many have alluded in the comments, the best way is to invert the image so the black text becomes white, find all the non-zero points in the image then determine what the minimum spanning bounding box would be. You can use this bounding box to finally crop your image. Finding the contours is very expensive and it isn't needed here - especially since your text is axis-aligned. You can use a combination of cv2.findNonZero and cv2.boundingRect to do what you need.

Therefore, something like this would work:

Click to copy

import numpy as np
import cv2

img = cv2.imread('ws.png') # Read in the image and convert to grayscale
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
gray = 255*(gray < 128).astype(np.uint8) # To invert the text to white
coords = cv2.findNonZero(gray) # Find all non-zero points (text)
x, y, w, h = cv2.boundingRect(coords) # Find minimum spanning bounding box
rect = img[y:y+h, x:x+w] # Crop the image - note we do this on the original image
cv2.imshow("Cropped", rect) # Show it
cv2.waitKey(0)
cv2.destroyAllWindows()
cv2.imwrite("rect.png", rect) # Save the image

The code above exactly lays out what I talked about in the beginning. We read in the image, but we also convert to grayscale as your image is in colour for some reason. The tricky part is the third line of code where I threshold below the intensity of 128 so that the dark text becomes white. This however produces a binary image, so I convert to uint8, then scale by 255. This essentially inverts the text.

Next, given this image we find all of the non-zero coordinates with cv2.findNonZero and we finally put this into cv2.boundingRect which will give you the top-left corner of the bounding box as well as the width and height. We can finally use this to crop the image. Note we do this on the original image and not the inverted one. We use simply NumPy array indexing to do the cropping for us.

Finally, we show the image to show that it works and we save it to disk.

I now get this image:

enter image description here

For the second image, a good thing to do is to remove some of the right border and bottom border. We can do that by cropping the image down to that first. Next, this image contains some very small noisy pixels. I would recommend doing a morphological opening with a very small kernel, then redo the logic we talked about above.

Therefore:

Click to copy

import numpy as np
import cv2

img = cv2.imread('pg13_gau_preview.png') # Read in the image and convert to grayscale
img = img[:-20,:-20] # Perform pre-cropping
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
gray = 255*(gray < 128).astype(np.uint8) # To invert the text to white
gray = cv2.morphologyEx(gray, cv2.MORPH_OPEN, np.ones((2, 2), dtype=np.uint8)) # Perform noise filtering
coords = cv2.findNonZero(gray) # Find all non-zero points (text)
x, y, w, h = cv2.boundingRect(coords) # Find minimum spanning bounding box
rect = img[y:y+h, x:x+w] # Crop the image - note we do this on the original image
cv2.imshow("Cropped", rect) # Show it
cv2.waitKey(0)
cv2.destroyAllWindows()
cv2.imwrite("rect.png", rect) # Save the image

Note: Output image removed due to privacy

155

answered Oct 18 '22 06:10

rayryeng

Opencv reads the image as a numpy array and it's much simpler to use numpy directly (scikit-image does the same). One possible way of doing it is to read the image as grayscale or convert to it and do the row-wise and column-wise operations as shown in the code snippet below. This will remove the columns and rows when all pixels are of pixel_value (white in this case).

Click to copy

def crop_image(filename, pixel_value=255):
    gray = cv2.imread(filename, cv2.IMREAD_GRAYSCALE)
    crop_rows = gray[~np.all(gray == pixel_value, axis=1), :]
    cropped_image = crop_rows[:, ~np.all(crop_rows == pixel_value, axis=0)]
    return cropped_image

and the output:

enter image description here

answered Oct 18 '22 05:10

mobiuscreek

Related questions
                            
                                Access Google Cloud service account credentials on Container OS inside Docker Container
                            
                                What does "/" mean in R when writing a regression formula in lm()
                            
                                what's the infix priority of type annotation (::)
                            
                                Java 9 replace Class.newInstance
                            
                                Proptypes for custom react hooks
                            
                                How to check whether Alert Dialog is open in flutter
                            
                                Solution has projects that are located outside the solution folder
                            
                                Any gotchas replacing global const char[] with constexpr string_view?
                            
                                How to create and run a development build of an application using create-react-app configuration
                            
                                What is the best way to localize a WPF application, sans LocBAML?
                            
                                What makes Ometa special? [closed]
                            
                                PHP APIs for Hotmail, Gmail and Yahoo? [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to remove whitespace from an image in OpenCV?

Tags:

python

image-processing

opencv

opencv3.0

Anthony

People also ask

2 Answers

Note: Output image removed due to privacy

rayryeng

mobiuscreek

Recent Activity

Donate For Us