Dilemma about image cropping algorithm - is it possible?

Tags:

I am building a web application using .NET 3.5 (ASP.NET, SQL Server, C#, WCF, WF, etc) and I have run into a major design dilemma. This is a uni project btw, but it is 100% up to me what I develop.

I need to design a system whereby I can take an image and automatically crop a certain object within it, without user input. So for example, cut out the car in a picture of a road. I've given this a lot of thought, and I can't see any feasible method. I guess this thread is to discuss the issues and feasibility of achieving this goal. Eventually, I would get the dimensions of a car (or whatever it may be), and then pass this into a 3d modelling app (custom) as parameters, to render a 3d model. This last step is a lot more feasible. It's the cropping issue which is an issue. I have thought of all sorts of ideas, like getting the colour of the car and then the outline around that colour. So if the car (example) is yellow, when there is a yellow pixel in the image, trace around it. But this would fail if there are two yellow cars in a photo.

Ideally, I would like the system to be completely automated. But I guess I can't have everything my way. Also, my skills are in what I mentioned above (.NET 3.5, SQL Server, AJAX, web design) as opposed to C++ but I would be open to any solution just to see the feasibility.

I also found this patent: US Patent 7034848 - System and method for automatically cropping graphical images

Thanks

631

asked Oct 18 '08 12:10

GSS

2 Answers

This is one of the problems that needed to be solved to finish the DARPA Grand Challenge. Google video has a great presentation by the project lead from the winning team, where he talks about how they went about their solution, and how some of the other teams approached it. The relevant portion starts around 19:30 of the video, but it's a great talk, and the whole thing is worth a watch. Hopefully it gives you a good starting point for solving your problem.

103

answered Sep 20 '22 23:09

Bill the Lizard

What you are talking about is an open research problem, or even several research problems. One way to tackle this, is by image segmentation. If you can safely assume that there is one object of interest in the image, you can try a figure-ground segmentation algorithm. There are many such algorithms, and none of them are perfect. They usually output a segmentation mask: a binary image where the figure is white and the background is black. You would then find the bounding box of the figure, and use it to crop. The thing to remember is that none of the existing segmentation algorithm will give you what you want 100% of the time.

Alternatively, if you know ahead of time what specific type of object you need to crop (car, person, motorcycle), then you can try an object detection algorithm. Once again, there are many, and none of them are perfect either. On the other hand, some of them may work better than segmentation if your object of interest is on very cluttered background.

To summarize, if you wish to pursue this, you would have to read a fair number of computer vision papers, and try a fair number of different algorithms. You will also increase your chances of success if you constrain your problem domain as much as possible: for example restrict yourself to a small number of object categories, assume there is only one object of interest in an image, or restrict yourself to a certain type of scenes (nature, sea, etc.). Also keep in mind, that even the accuracy of state-of-the-art approaches to solving this type of problems has a lot of room for improvement.

And by the way, the choice of language or platform for this project is by far the least difficult part.

answered Sep 21 '22 23:09

Dima

Related questions
                            
                                Approximate a curve with a limited number of line segments and arcs of circles
                            
                                Most efficient way to select point with the most surrounding points
                            
                                Minimum Bounding Box or Convex Hull for any shape quadrilateral?
                            
                                Given an elevator with max weight and n people with x_i weights, find out minimum number of rides needed
                            
                                How do you determine the average-case complexity of this algorithm?
                            
                                How to detect in real time a "knee/elbow" (maximal curvature) in a curve
                            
                                Understanding regex string matching using Dynamic Programming
                            
                                Total water capacity in a 2D array, which represents a topographical map
                            
                                Algorithm for finding the lowest price to get through an array
                            
                                Triangle enclosing the biggest number of points
                            
                                How to memoize recursive path of length n search
                            
                                Generating All Combinations of List n Levels Deep in Java
                            
                                Algorithm for allowing concurrent walk of a graph
                            
                                searching a binary search tree for parents efficiently
                            
                                Fast generation of random derangements
                            
                                Sub-quadratic algorithm for fitting a curve with two lines
                            
                                Debug Exact Cover Pentominoes, Wikipedia example incomplete? OR... I'm misunderstanding something (includes code)
                            
                                how to calculate the minimum unfairness sum of a list
                            
                                Given an array of indicies to splice, generate the order in which each element was removed
                            
                                Programming a probability of twins reunion

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Dilemma about image cropping algorithm - is it possible?

Tags:

algorithm

image-processing

computer-vision

GSS

People also ask

2 Answers

Bill the Lizard

Dima

Recent Activity

Donate For Us