K Nearest-Neighbor Algorithm [closed]

Tags:

Using the KNN-algorithm, say k=5. Now I try to classify an unknown object by getting its 5 nearest neighbours. What to do, if after determining the 4 nearest neighbors, the next 2 (or more) nearest objects have the same distance? Which object of these 2 or more should be chosen as the 5th nearest neighbor?

764

asked Feb 03 '11 18:02

Gwaihir

1 Answers

Which object of these 2 or more should be chosen as the 5th nearest neighbor?

It really depends on how you want to implement it.

Most algorithms will do one of three things:

Include all equal distance points, so for this estimation, they'll use 6 points, not 5.
Use the "first" found point of the two equal distant.
Pick a random (usually with a consistent seed, so results are reproducable) point from the 2 points found.

That being said, most algorithms based on radial searching have an inherent assumption of stationarity, in which case, it really shouldn't matter which of the options above you choose. In general, any of them should, theoretically, provide reasonable defaults (especially since they're the furthest points in the approximation, and should have the lowest effective weightings).

130

answered Sep 20 '22 18:09

Reed Copsey

Related questions
                            
                                Keras - Validation Loss and Accuracy stuck at 0
                            
                                Convolutional neural network Conv1d input shape
                            
                                How to train Word2vec on very large datasets?
                            
                                How tf.gradients work in TensorFlow
                            
                                Scikit-learn : Input contains NaN, infinity or a value too large for dtype ('float64')
                            
                                Where is it best to use svm with linear kernel?
                            
                                PyTorch - How to get learning rate during training?
                            
                                How is the complexity of PCA O(min(p^3,n^3))?
                            
                                What is a loss function in simple words?
                            
                                What is a `"Python"` layer in caffe?
                            
                                Tensorflow: Attempting to use uninitialized value beta1_power
                            
                                Save and load model optimizer state
                            
                                How training and test data is split - Keras on Tensorflow
                            
                                List of all classification algorithms
                            
                                Algorithm for Hand writing recognition
                            
                                keras: what is the difference between model.predict and model.predict_proba
                            
                                Fast (< n^2) clustering algorithm
                            
                                How to get SVMs to play nicely with missing data in scikit-learn?
                            
                                What are some good machine learning programming exercises? [closed]
                            
                                How to use scikit-learn PCA for features reduction and know which features are discarded

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

K Nearest-Neighbor Algorithm [closed]

Tags:

machine-learning

classification

knn

Gwaihir

People also ask

1 Answers

Reed Copsey

Recent Activity

Donate For Us