Image recognition and 3d rendering

Tags:

How hard would it be to take an image of an object (in this case of a predefined object), and develop an algorithm to cut just that object out of a photo with a background of varying complexity.

Further to this, a photo's object (say a house, car, dog - but always of one type) would need to be transformed into a 3d render. I know there are 3d rendering engines available (at a cost, free, or with some clause), but for this to work the object (subject) would need to be measured in all sorts of ways - e.g. if this is a person, we need to measure height, the curvature of the shoulder, radius of the face, length of each finger, etc.

What would the feasibility of solving this problem be? Anyone know any good links specialing in this research area? I've seen open source solutions to this problem which leaves me with the question of the ease of measuring the object while tracing around it to crop it out.

Thanks

Essentially I want to take a 2d image (typical image:which is easier than a complex photo containing multiple objects, etc.)

But effectively I want to turn that into a 3d image, so wouldn't what I want to do involve building a 3d rendering/modelling engine?

Furthermore, that link I have provided goes into 3ds max, with a few properties set, and a render is made.

391

asked Jan 09 '09 21:01

GurdeepS

2 Answers

It sounds like you want to do several things, all in the domain of computer vision.

Object Recognition (i.e. find the predefined object)
3D Reconstruction (make the 3d model from the image)
Image Segmentation (cut out just the object you are worried about from the background)

I've ranked them in order of easiest to hardest (according to my limited understanding). All together I would say it is a very complicated problem. I would look at the following Wikipedia links for more information:

Computer Vision Overview (Wikipedia)

The Eight Point Algorithm (for 3d reconstruction)

Image Segmentation

answered Oct 10 '22 00:10

Carlos Rendon

You're right this is an extremely hard set of problems, particularly that of inferring 3D information from a 2D image. Only a very limited understanding exists of how our visual system extrapolates 3D information from 2D images, one such approach is known as "Shape from Shading" and the linked google search shows how much (and consequently how little) we know.

Rob

answered Oct 10 '22 00:10

RobS

Related questions
                            
                                How to I make my AI algorithm play 9 board tic-tac-toe?
                            
                                How to get the element which are diagonal to a certain index in an array which represents a rectangle
                            
                                Finding minimum total length of line segments to connect 2N points
                            
                                Minimum cost path from (0,0) to (N,N) on 2D grid
                            
                                Fast intersection of HashSet<int> and List<int>
                            
                                Efficient way to filter groups that do not contain all types of elements
                            
                                std::accumulate using the view std::ranges::views::values
                            
                                Need help understanding this line in an FFT algorithm
                            
                                How to build a tree array into which / out of which items can be spliced, which only allows arrays of 1, 2, 4, 8, 16, or 32 items?
                            
                                How to compare two vectors for equality?
                            
                                Constant time for multiplication in Galois Field GF(4)
                            
                                Why doesn't STL's implementation of next_permutation use the binary search?
                            
                                performing floating point addition algorithmically
                            
                                Writing a Domain Specific Language for selecting rows from a table
                            
                                Finding clusters of mass in a matrix/bitmap
                            
                                Minimum number of flips to get adjacent 1's in a matrix
                            
                                Efficiently get sorted sums of a sorted list
                            
                                From an interview: Removing rows and columns in an n×n matrix to maximize the sum of remaining values
                            
                                Detecting if angle is more than 180 degrees
                            
                                How to find all partitions of a set

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Image recognition and 3d rendering

Tags:

algorithm

image-recognition

3d-rendering

GurdeepS

People also ask

2 Answers

Carlos Rendon

RobS

Recent Activity

Donate For Us