
Hough transform in MATLAB without using hough function

Tags: matlab, hough-transform

I found an implementation of the Hough transform in MATLAB at Rosetta Code, but I'm having trouble understanding it. Also I would like to modify it to show the original image and the reconstructed lines (de-Houghing).

Any help in understanding it and de-Houghing is appreciated. Thanks

1. Why is the image flipped?

theImage = flipud(theImage);

2. I can't wrap my head around the norm function. What is its purpose, and can it be avoided?

EDIT: norm is just a synonym for Euclidean distance: sqrt(width^2 + height^2)

rhoLimit = norm([width height]);

3. Can someone provide an explanation of how/why rho, theta, and houghSpace are calculated?

rho = (-rhoLimit:1:rhoLimit);
theta = (0:thetaSampleFrequency:pi);

numThetas = numel(theta);
houghSpace = zeros(numel(rho),numThetas);

4. How would I de-Hough the Hough space to recreate the lines?

Calling the function using a 10x10 image of a diagonal line created using the identity (eye) function

theImage = eye(10)
thetaSampleFrequency = 0.1
[rho,theta,houghSpace] = houghTransform(theImage,thetaSampleFrequency)

The actual function

function [rho,theta,houghSpace] = houghTransform(theImage,thetaSampleFrequency)

    %Define the hough space
    theImage = flipud(theImage);
    [width,height] = size(theImage);

    rhoLimit = norm([width height]);
    rho = (-rhoLimit:1:rhoLimit);          
    theta = (0:thetaSampleFrequency:pi);

    numThetas = numel(theta);
    houghSpace = zeros(numel(rho),numThetas);

    %Find the "edge" pixels
    [xIndicies,yIndicies] = find(theImage);

    %Preallocate space for the accumulator array
    numEdgePixels = numel(xIndicies);
    accumulator = zeros(numEdgePixels,numThetas);

    %Precalculate the cosine and sine terms to increase speed. In
    %addition to precalculating sine and cosine we are also multiplying
    %them by the proper pixel weights such that the rows will be indexed by
    %the pixel number and the columns will be indexed by the thetas.
    %Example: cosine(3,:) is 2*cosine(0 to pi)
    %         cosine(:,1) is (0 to width-1)*cosine(0)
    cosine = (0:width-1)'*cos(theta); %Matrix Outerproduct  
    sine = (0:height-1)'*sin(theta); %Matrix Outerproduct

    accumulator((1:numEdgePixels),:) = cosine(xIndicies,:) + sine(yIndicies,:);

    %Scan over the thetas and bin the rhos 
    for i = (1:numThetas)
        houghSpace(:,i) = hist(accumulator(:,i),rho);
    end

    pcolor(theta,rho,houghSpace);
    shading flat;
    title('Hough Transform');
    xlabel('Theta (radians)');
    ylabel('Rho (pixels)');
    colormap('gray');

end


asked Mar 28 '12 21:03

waspinator

1 Answer

The Hough Transform is a "voting" approach where each image point casts a vote on the existence of a certain line (not a line segment) in an image. The voting is carried out in the parameter space for a line: the polar coordinate representation of normal vectors.

We discretize the parameter space and allow each image point to suggest parameters which would be compatible with a line through the point. Each of your questions can be addressed in terms of how the parameter space is treated in code. Wikipedia has a good article with worked examples that might clarify things (if you are having any conceptual troubles).
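In code, the vote cast by a single edge pixel might look roughly like the lines below. This is only a sketch: the pixel coordinates are made up for illustration, and x and y are assumed to be zero-based, as in the question's function.

```matlab
% One edge pixel at zero-based coordinates (x, y); values chosen for illustration
x = 3;
y = 4;

theta = 0:0.1:pi;                        % same theta sampling as the question's example

% For each theta, the line through (x, y) whose normal has angle theta
% lies at signed distance rho from the origin
rhoVotes = x*cos(theta) + y*sin(theta);

% No vote can lie farther from the origin than the pixel itself
maxRadius = norm([x y]);
```

Each entry of rhoVotes is then rounded to the nearest rho bin and that bin's counter is incremented. Pixels that lie on a common line all vote for the same (rho, theta) bin, which is what produces a peak.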

For your specific questions:

1. The image is flipped with flipud, which reverses the order of the rows, so the origin becomes the bottom left corner instead of the top left. As far as I can tell this step is not technically necessary, although it does change the outcome somewhat due to discretization issues. The other implementations on Rosetta Code do not flip the image.
2. rhoLimit holds the maximum radius of an image point in polar coordinates (recall the norm of a vector is its magnitude). Here that is the length of the image diagonal, so no line through the image can have a larger |rho|. You can replace the call with sqrt(width^2 + height^2), but the value itself is still needed to bound the rho axis.
3. rho and theta are discretizations of the polar coordinate plane according to a sampling rate: rho runs from -rhoLimit to rhoLimit in steps of 1 pixel, and theta runs from 0 to pi in steps of thetaSampleFrequency. houghSpace is a matrix of zeros with one element (a vote counter) for each possible combination of the discrete rho/theta values.
4. The Hough Transform does not specify the lengths of putative lines; the peaks in the voting space just specify the polar coordinates of the normal vector of the line. You can "de-Hough" by selecting the peaks and drawing the corresponding lines, or perhaps by drawing every possible line and using the number of votes as a grayscale weight. It is not possible to re-create the original image from the Hough Transform, just the lines identified by the transform (and your thresholding scheme on the votes).
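A minimal sketch of the first de-Houghing option, assuming the rho, theta, and houghSpace outputs of the question's houghTransform. The function name deHough and the parameters imageSize and voteFraction are hypothetical, and the half-pixel tolerance assumes the rho step of 1 used in the question.

```matlab
function lineImage = deHough(rho, theta, houghSpace, imageSize, voteFraction)
    % Hypothetical helper (not part of the Rosetta Code function).
    lineImage = zeros(imageSize);
    maxVotes = max(houghSpace(:));
    if maxVotes == 0
        return;                      % no votes, so nothing to draw
    end

    % Keep every (rho, theta) bin with at least voteFraction of the top vote count
    [rhoIdx, thetaIdx] = find(houghSpace >= voteFraction * maxVotes);

    % Same convention as houghTransform: x is the zero-based row index and
    % y the zero-based column index of the flipped image
    [x, y] = ndgrid(0:imageSize(1)-1, 0:imageSize(2)-1);

    for k = 1:numel(rhoIdx)
        r = rho(rhoIdx(k));
        t = theta(thetaIdx(k));
        % Mark pixels whose x*cos(t) + y*sin(t) falls within half a rho step of r
        % (assumes a rho step of 1, as in the question)
        onLine = abs(x*cos(t) + y*sin(t) - r) <= 0.5;
        lineImage(onLine) = 1;
    end

    lineImage = flipud(lineImage);   % undo the flipud done inside houghTransform
end
```

Calling lineImage = deHough(rho, theta, houghSpace, size(theImage), 1) draws only the lines tied for the highest vote count. Showing theImage and lineImage with imagesc in two subplot panels gives the side-by-side comparison asked for. The drawn lines span the whole image because the transform carries no information about segment length.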

Following the example from the question produces the graph shown below. The placement of grid lines and the datatip cursor can be a bit misleading (though the variable values in the datatip are correct). Since this is an image of the parameter space and not the image space, the sampling rate we chose determines the number of bins in each variable. At this sampling rate, the image points are compatible with more than one possible line; in other words, our lines have subpixel resolution, in the sense that they cannot be drawn without overlap in a 10x10 image.
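To make the bin counts concrete, the lines below repeat the grid construction from houghTransform for the question's example (a 10x10 image with thetaSampleFrequency = 0.1). They only restate what the function already does.

```matlab
% Grid construction copied from houghTransform for a 10x10 image
width = 10;
height = 10;
thetaSampleFrequency = 0.1;

rhoLimit = norm([width height]);        % image diagonal, about 14.142
rho = (-rhoLimit:1:rhoLimit);           % 29 bin centers, 1 pixel apart
theta = (0:thetaSampleFrequency:pi);    % 32 bin centers, 0.1 rad apart

houghSpace = zeros(numel(rho), numel(theta));   % 29-by-32 accumulator

% Bin centers nearest the diagonal's normal (rho = 9/sqrt(2), theta = pi/4)
disp([rho(22) theta(10)]);              % about 6.858 and 0.9
```

Because the rho bins start at -rhoLimit rather than at an integer, their centers land on values such as 6.858, which is why the datatip reports that number.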

Once we have chosen a peak, such as the one corresponding to the line with normal (rho,theta) = (6.858,0.9), we can draw that line in an image however we choose. Automated peak picking, that is, thresholding to find the highly up-voted lines, is its own problem. You could ask another question about the topic on DSP, or about a particular algorithm here.

For example methods, see the code and documentation of MATLAB's houghpeaks and houghlines functions.
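If those toolbox functions are not available, one simple approach is greedy peak picking with neighborhood suppression. The sketch below is an assumption-laden illustration, not the algorithm houghpeaks uses. pickPeaks, numPeaks, and halfWidth are hypothetical names, and the sketch ignores the fact that theta = 0 and theta = pi describe the same line directions.

```matlab
function [peakRho, peakTheta, votes] = pickPeaks(rho, theta, houghSpace, numPeaks, halfWidth)
    % Hypothetical helper: repeatedly take the strongest bin, then zero out
    % its neighborhood so that nearby bins are not reported as separate lines.
    H = houghSpace;
    peakRho = zeros(1,0);
    peakTheta = zeros(1,0);
    votes = zeros(1,0);

    for k = 1:numPeaks
        [v, idx] = max(H(:));
        if v <= 0
            break;                       % no votes left
        end
        [r, c] = ind2sub(size(H), idx);
        peakRho(end+1) = rho(r);
        peakTheta(end+1) = theta(c);
        votes(end+1) = v;

        % Suppress a (2*halfWidth+1)-wide square around the peak, clipped to the matrix
        rows = max(r-halfWidth,1):min(r+halfWidth,size(H,1));
        cols = max(c-halfWidth,1):min(c+halfWidth,size(H,2));
        H(rows, cols) = 0;
    end
end
```

For the 10x10 example, something like pickPeaks(rho, theta, houghSpace, 1, 2) would return a single (rho, theta) pair to draw. Larger halfWidth values merge more neighboring bins into one reported line.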

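[Figure: pcolor plot of the Hough space for the eye(10) example, with theta (radians) on the horizontal axis and rho (pixels) on the vertical axis: https://i.stack.imgur.com/Bnude.png]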


answered Sep 23 '22 16:09

reve_etrange
