What is the Sobel operator?

Tags: python, image-processing, opencv, filtering, derivative

I tried 5 different implementations of the Sobel operator in Python, one of which I implemented myself, and the results are radically different.

My question is similar to this one, but there are still differences I don't understand with the other implementations.

Is there an agreed-upon definition of the Sobel operator, and is it always synonymous with "image gradient"?

Even the definition of the Sobel kernel differs from source to source: according to Wikipedia it is [[1, 0, -1],[2, 0, -2],[1, 0, -1]], but according to other sources it is [[-1, 0, 1],[-2, 0, 2],[-1, 0, 1]].

Here is my code where I tried the different techniques:

import numpy as np
import cv2 as cv
from scipy import ndimage
from PIL import Image, ImageFilter

# Random 10x10 8-bit test image
img = np.random.randint(0, 255, [10, 10]).astype(np.uint8)

# My own implementation: convolve with a 3x3 Sobel kernel
def sobel_x(img):
    return ndimage.convolve(img, np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]))

my_sobel = sobel_x(img)
_, numpy_sobel = np.gradient(img)
opencv_sobel = cv.Sobel(img, cv.CV_8UC1, 1, 0)
ndimage_sobel = ndimage.sobel(img, axis=0, mode="constant")
pil_sobel = np.array(Image.fromarray(img).filter(ImageFilter.Kernel((3, 3), (-1, 0, 1, -2, 0, 2, -1, 0, 1), 1, 0)))

print(my_sobel)
print(numpy_sobel)
print(opencv_sobel)
print(ndimage_sobel)
print(pil_sobel)

Image: https://i.stack.imgur.com/K3JL9.png

402

asked Jun 15 '21 13:06

nexide

2 Answers

The Sobel operator estimates the derivative.

The correct definition of the Sobel operator to estimate the horizontal derivative is:

  | 1  0 -1 |
  | 2  0 -2 | / 8
  | 1  0 -1 |

The division by 8 is important to get the right magnitude. People often leave it out because they don't care about the actual derivative; they care about comparing the gradient at different places in the same image. Multiplying everything by 8 makes no difference there, so leaving out the /8 keeps things simple.
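As a quick check of the /8 factor, here is a minimal sketch that assumes SciPy's ndimage.convolve and a synthetic ramp image (both are choices made for this illustration, not part of the question's code). The ramp increases by exactly 1 per pixel along x, so a correctly scaled derivative estimate should be 1 away from the borders.

```python
import numpy as np
from scipy import ndimage

# Synthetic test image (an assumption for illustration): every row is 0, 1, 2, ...
# so the true horizontal derivative is 1 everywhere.
ramp = np.tile(np.arange(8, dtype=float), (8, 1))

# Sobel kernel for the horizontal derivative, including the 1/8 normalization.
sobel_x_kernel = np.array([[1, 0, -1],
                           [2, 0, -2],
                           [1, 0, -1]], dtype=float) / 8

# ndimage.convolve performs a true convolution (the kernel is mirrored).
dx = ndimage.convolve(ramp, sobel_x_kernel, mode="nearest")

# Ignore the border columns, where the padding affects the estimate.
print(dx[1:-1, 1:-1])
```

Without the division, the same interior values come out as 8 instead of 1.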

You will see the kernel defined with the opposite signs in some places. These are cases where the kernel is applied by correlation instead of convolution (the two differ by a mirroring of the kernel), as in OpenCV. They can also be cases where people copied the kernel without understanding it, resulting in a gradient with the wrong sign.
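The mirroring can be checked numerically. The sketch below assumes SciPy's ndimage.correlate and ndimage.convolve and a random floating-point image (chosen here only for illustration). Because mirroring the Sobel kernel along both axes is the same as negating it, correlation and convolution give results of opposite sign.

```python
import numpy as np
from scipy import ndimage

rng = np.random.default_rng(0)
img = rng.random((10, 10))  # float image, so negative results are not clipped or wrapped

kernel = np.array([[1, 0, -1],
                   [2, 0, -2],
                   [1, 0, -1]], dtype=float)

# Mirroring the kernel along both axes negates it.
mirrored = kernel[::-1, ::-1]
kernel_is_antisymmetric = np.array_equal(mirrored, -kernel)

by_convolution = ndimage.convolve(img, kernel)
by_correlation = ndimage.correlate(img, kernel)

# Correlation with a kernel equals convolution with the mirrored kernel,
# which for Sobel means the two results differ only in sign.
same_up_to_sign = np.allclose(by_correlation, -by_convolution)
print(kernel_is_antisymmetric, same_up_to_sign)
```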

But then again, the Sobel operator is mostly applied to obtain the gradient magnitude (the square root of the sum of the squares of the horizontal and vertical derivatives). In this case, reversing the signs doesn't matter any more.
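To illustrate why the sign convention drops out of the gradient magnitude, here is a small sketch. The helper name gradient_magnitude and the random test image are hypothetical, introduced only for this example; it assumes SciPy's ndimage.convolve.

```python
import numpy as np
from scipy import ndimage

SOBEL_X = np.array([[1, 0, -1],
                    [2, 0, -2],
                    [1, 0, -1]], dtype=float) / 8
SOBEL_Y = SOBEL_X.T  # transposed kernel differentiates along the other axis


def gradient_magnitude(image, sign=1.0):
    """Hypothetical helper: sqrt(gx**2 + gy**2) using Sobel kernels scaled by `sign`."""
    gx = ndimage.convolve(image, sign * SOBEL_X)
    gy = ndimage.convolve(image, sign * SOBEL_Y)
    return np.hypot(gx, gy)


rng = np.random.default_rng(1)
image = rng.random((10, 10))

mag_plus = gradient_magnitude(image, sign=1.0)
mag_minus = gradient_magnitude(image, sign=-1.0)  # both kernels with reversed signs
print(np.allclose(mag_plus, mag_minus))
```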

Note that np.gradient(img) is comparable to convolving with [1,0,-1]/2. This is another way to estimate the derivative. Sobel adds a regularization (i.e. smoothing) in the perpendicular direction.
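Both statements can be checked with NumPy alone. The sketch below uses an arbitrary 1D signal (an assumption for illustration) to compare np.gradient with a convolution by [1, 0, -1]/2, and then builds the unnormalized Sobel kernel as the outer product of a [1, 2, 1] smoothing filter and the [1, 0, -1] difference filter.

```python
import numpy as np

signal = np.array([0.0, 1.0, 4.0, 9.0, 16.0, 25.0])

central_diff = np.gradient(signal)
by_convolution = np.convolve(signal, np.array([1, 0, -1]) / 2, mode="same")

# The two agree away from the ends; np.gradient switches to one-sided
# differences at the borders, while np.convolve pads with zeros.
interior_matches = np.allclose(central_diff[1:-1], by_convolution[1:-1])

# Sobel = smoothing [1, 2, 1] across the derivative direction,
# times the difference [1, 0, -1] along it.
smooth = np.array([1, 2, 1])
diff = np.array([1, 0, -1])
sobel = np.outer(smooth, diff)
print(interior_matches)
print(sobel)
```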

You will get a better understanding of each implementation if you use a more meaningful test image. Try for example a black image with a white square in the middle. You will be able to compare the strength of the estimated gradients, their direction (I assume some libraries use a different definition of x and y axes), and you will be able to see the effect of the regularization.
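A minimal sketch of such a test image, assuming SciPy's ndimage.sobel (which omits the /8 normalization); the 11x11 size and the variable names are arbitrary choices for this example. It shows that the derivative along axis=1 responds only at the left and right edges of the square, with opposite signs, and that axis=0 gives the transposed pattern. That is worth keeping in mind when comparing against libraries that specify x and y rather than array axes.

```python
import numpy as np
from scipy import ndimage

# Black image with a white 5x5 square in the middle.
img = np.zeros((11, 11), dtype=float)
img[3:8, 3:8] = 1.0

d_axis1 = ndimage.sobel(img, axis=1)  # derivative along columns (left to right)
d_axis0 = ndimage.sobel(img, axis=0)  # derivative along rows (top to bottom)

middle_row = d_axis1[5]
print(middle_row)  # nonzero only around the left and right edges of the square
print(np.array_equal(d_axis0, d_axis1.T))
```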

160

answered Oct 21 '22 03:10

Cris Luengo

according to Wikipedia it's [[1, 0, -1],[2, 0, -2],[1, 0, -1]], but according to other sources it's [[-1, 0, 1],[-2, 0, 2],[-1, 0, 1]]

Both are used for detecting vertical edges. The difference is how these kernels mark "left" and "right" edges.

For simplicity's sake, let's consider a 1D example, and let the array be

[0, 0, 255, 255, 255]

Then, if we apply the kernels as written (without mirroring them) and pad by repeating the border values, we get

kernel [2, 0, -2] gives [0, -510, -510, 0, 0]
kernel [-2, 0, 2] gives [0, 510, 510, 0, 0]

As you can see, the abrupt increase in value is marked with negative values by the first kernel and with positive values by the second. Note that this is relevant only if you need to discriminate left from right edges; if you just want to find vertical edges, you can use either of the two kernels above and then take the absolute value.
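The numbers above can be reproduced with NumPy. This sketch assumes edge-replicating padding and correlation (the kernel is not mirrored), which is the combination that yields the values quoted; the helper name correlate_edge_padded is hypothetical.

```python
import numpy as np


def correlate_edge_padded(values, kernel):
    """Hypothetical helper: 1D correlation after repeating the border values."""
    pad = len(kernel) // 2
    padded = np.pad(values, pad, mode="edge")
    return np.correlate(padded, kernel, mode="valid")


row = np.array([0, 0, 255, 255, 255])

first = correlate_edge_padded(row, np.array([2, 0, -2]))
second = correlate_edge_padded(row, np.array([-2, 0, 2]))

print(first.tolist())   # [0, -510, -510, 0, 0]
print(second.tolist())  # [0, 510, 510, 0, 0]

# Taking the absolute value removes the left/right distinction.
print(np.array_equal(np.abs(first), np.abs(second)))
```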

answered Oct 21 '22 01:10

Daweo
