I have a set of vectors. For a vector in that set I like to find the sub set that is closeest to this vector. What algorithm can do this.

This class of algorithms is called Nearest Neighbor or K Nearest Neighbor. The cosine similarity as excepeiont says will work if direction of vector is important. If the vector represents a position in a space, then any metric for representing a distance in the space will work. For example the Euclidean distance: take the square root of the sum of squares difference in each dimension. This will give you a distance for each vector, then sort your set of vectors ascending on this distance. This process will be O(N) in time. If this is too slow for you, you might want to look at some common K Nearest Neighbour algorithms.

Algorithms for finding closest vector

3 Answers

This class of algorithms is called Nearest Neighbor or K Nearest Neighbor.

The cosine similarity as excepeiont says will work if direction of vector is important. If the vector represents a position in a space, then any metric for representing a distance in the space will work.

For example the Euclidean distance: take the square root of the sum of squares difference in each dimension. This will give you a distance for each vector, then sort your set of vectors ascending on this distance.

This process will be O(N) in time. If this is too slow for you, you might want to look at some common K Nearest Neighbour algorithms.

answered Sep 24 '22 02:09

Nick Fortescue

use the cosinus similarity (http://en.wikipedia.org/wiki/Cosine_similarity) among the vectors and then sort them.

answered Sep 26 '22 02:09

excepeiont32

If your problem relates to large amount of data:

I published a related algorithm on ddj.com, that finds the nearest line to a given point:

Accelerated Search For the Nearest Line

You would have to modify this algorithm by i.e. by converting the given vector to a number of points. This will reduce the number of possible matches drastically. The exact match has then to be checked for each possible match by

Find the cutting point of both vectors or
Get distance from vector start and end point to the possible match, as described in the article

answered Sep 24 '22 02:09

RED SOFT ADAIR

Related questions
                            
                                Accurate vectorizable implementation of acosf()
                            
                                find a smaller group of friends from the circle?
                            
                                Find the K closest points to the origin (0, 0)
                            
                                Algorithm for calculating trigonometry, logarithms or something like that. ONLY addition-subtraction
                            
                                Does Dijkstra's algorithm work with negative edges if there is no "processed" check?
                            
                                Implementing Chinese Remainder Theorem in JavaScript
                            
                                Find out how similar a set is compared to all other sets in a collection of sets
                            
                                Recommended face detection tools/SDK/etc [closed]
                            
                                Algorithm to determine the "usual" cash payment amounts for a given price
                            
                                Is there a proper algorithm to solve edge-removing problem?
                            
                                What does it mean to 'hash cons'?
                            
                                What's the most efficient method of continually deleting files older than X hours on Windows?
                            
                                A Ranking algorithm
                            
                                compact data structure like set
                            
                                What is the logic behind Fourier division algorithm?
                            
                                Selection Coloring Algorithm
                            
                                Non-Random Weighted Distribution
                            
                                Perceptual Image Downsampling
                            
                                How do you efficiently debug reference count problems in shared memory?
                            
                                Is there any algorithm for determining 3d position in such case? (images below)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Algorithms for finding closest vector

Tags:

algorithm

vector

cluster-analysis

joel

People also ask

3 Answers

Nick Fortescue

excepeiont32

RED SOFT ADAIR

Recent Activity

Donate For Us