Choose the closest k points from given n points

Q: How do you find the closest point to a set of points?

The closest pair is the minimum of the closest pairs within each half and the closest pair between the two halves. To split the point set in two, we find the x-median of the points and use that as a pivot. Finding the closest pair of points in each half is subproblem that is solved recursively.

Q: Which data structure would you use to query the K nearest points of a set on a 2D plane?

A KD Tree does the job. It's a geometric data structure that can find nearest neighbors efficiently by cutting the search space similarly to a binary search.

Tags:

You are given a set U of n points on the plane and you can compute distance between any pair of points in constant time. Choose a subset of U called C such that C has exactly k points in it and the distance between the farthest 2 points in C is as small as possible for given k. 1 < k <= n

What's the fastest way to do this besides the obvious n-choose-k solution?

949

asked Mar 30 '11 21:03

pathikrit

2 Answers

A solution is shown in Finding k points with minimum diameter and related problems - Aggarwal, 1991. The algorithm described therein has running time: O(k^2.5 n log k + n log n)

For those who have no access to the paper: the problem is called k-diameter and definied as

Find a set of k points with minimum diameter. The diameter of a set is the maximum distance between any points of the set.

I cannot really give an overview over the presented algorithm, but it includes computing the (3k - 3)th order Voronoi diagram of the points, and then solve the problem for each of the O(kn) Voronoi sets (by computing maximum independent sets in some bipartite graphs)... I guess that I am trying to say is, that it is highly non-trivial, and far beyond both an interview and this site :-)

156

answered Oct 03 '22 18:10

dcn

Since this is an interview question, here is my shot at a solution. (As dcn points out below, this is not guaranteed to return the optimal solution, though it should still be a decent heuristic. Good catch, dcn!)

Create a set S_p with a single point P.
Compute the distance between every point in S_p and every point outside of it, then add the point with the smallest max distance to S_p.
Repeat 2. until S_p has k points.
Repeat 1-3 using each point once as the initial P. Take the S_p which has the smallest max distance.

There are O(k) points in S_p, and O(n) points outside of it, so finding the point with the smallest max distance is O(nk). We repeat this k times, then repeat the whole procedure n times, for an overall complexity of O(n²k²).

We can improve on this by caching the max distance between any point in S_p and each point outside of S_p. If maxDistanceFromPointInS[pointOutsideS] is, say, an O(1) hash-table containing the current max distance between every point pointOutsideS and some point inside S_p, then every time we add a new point newPoint, we set maxDistanceFromPointInS[p] = Max(maxDistanceFromPointInS[p], distance(newPoint, p)) for all points p outside of S_p. Then finding the smallest max distance is O(n), adding a point to S_p is O(n). Repeating this k times gives us O((n+n)k) = O(nk). Finally, we repeat the whole procedure n times, for an overall complexity of O(n²k).

We could improve finding the smallest max distance to O(1) using a heap, but that would not change the overall complexity.

By the way, it took an hour to write this all up - there's no way I could have done this in an interview.

answered Oct 03 '22 20:10

BlueRaja - Danny Pflughoeft

Related questions
                            
                                OpenId support for Yii
                            
                                Is this a good approach for temporarily changing the current thread's culture?
                            
                                visualize the GnuPG web of trust
                            
                                @class for typedef enum?
                            
                                How to validate a nested model object based on the state of the parent object?
                            
                                Resize CATextLayer to fit text on iOS
                            
                                What does requestValidationMode="2.0" actually do?
                            
                                How can I use Nhibernate to retrieve data when the "WHERE IN()" have thousands of values? (too many parameters in the sql)
                            
                                What is the default generator for CMake in Windows?
                            
                                Avoid memory leaks on Android
                            
                                Which scala compiler plugins are available?
                            
                                Adding iPad support to an iPhone project: universal vs two separate apps? [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With