
Nearest Neighbour - Locality Sensitive Hashing Disadvantage

Locality-sensitive hashing (LSH) seems like a great technique for k-nearest-neighbor search, with no apparent disadvantages. However, what would its disadvantages be for someone using it in industry for practical applications? In which situations does LSH fail or perform poorly? Or does it take a long time to code or tune?

jonty rhodes asked Sep 15 '25 20:09



1 Answer

This is a rather broad question, but since you are new here, I will attempt to answer.

Of course, LSH is not as perfect as you describe; please search for papers about it. This question may also help: How to understand Locality Sensitive Hashing?

There are many LSH libraries that provide automatic parameter configuration, but not for the most important parameter, R, the radius used in solving the randomized version of the R-near neighbor problem. This is a major drawback, since the user has to determine R manually for every input. In my opinion, that is a very important aspect to take into account when it comes to practical applications.
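To illustrate why R matters so much, here is a minimal sketch of a Euclidean LSH index for the randomized R-near neighbor problem. It is a toy written for this answer, not the code of any particular library: the class name `RNearNeighborLSH`, the helper `build_index`, the defaults `k=4`, `L=8`, `c=2.0`, and the choice of bucket width `w = 4 * R` are all assumptions made for illustration.

```python
import math
import random
from collections import defaultdict


class RNearNeighborLSH:
    """Toy index for randomized R-near neighbor search (Euclidean distance)."""

    def __init__(self, dim, R, c=2.0, k=4, L=8, seed=0):
        rng = random.Random(seed)
        self.R = R            # the radius the user has to choose by hand
        self.c = c            # approximation factor: accept points within c * R
        self.w = 4.0 * R      # bucket width tied to R (illustrative choice)
        # L tables, each keyed by k hashes h(x) = floor((a . x + b) / w),
        # with a drawn from a Gaussian and b uniform in [0, w).
        self.funcs = [
            [([rng.gauss(0.0, 1.0) for _ in range(dim)], rng.uniform(0.0, self.w))
             for _ in range(k)]
            for _ in range(L)
        ]
        self.tables = [defaultdict(list) for _ in range(L)]

    def _key(self, x, table_funcs):
        return tuple(
            math.floor((sum(ai * xi for ai, xi in zip(a, x)) + b) / self.w)
            for a, b in table_funcs
        )

    def add(self, x):
        # x must be a tuple of floats so it can be stored in a set later.
        for funcs, table in zip(self.funcs, self.tables):
            table[self._key(x, funcs)].append(x)

    def query(self, q):
        # Collect everything that shares a bucket with q in any table,
        # then keep only candidates that really lie within c * R of q.
        candidates = set()
        for funcs, table in zip(self.funcs, self.tables):
            candidates.update(table.get(self._key(q, funcs), []))
        return sorted(p for p in candidates if math.dist(p, q) <= self.c * self.R)


def build_index(points, R):
    """Hypothetical helper: index all points with the given radius R."""
    index = RNearNeighborLSH(dim=len(points[0]), R=R)
    for p in points:
        index.add(p)
    return index
```

With points on a unit grid, an index built with `R=0.01` returns nothing for a query such as `(3.0, 4.5)`, because its nearest neighbors are 0.5 away and the final filter only accepts points within `c * R`. Whether a larger R finds them is probabilistic and also costs more candidates per bucket, which is exactly the trade-off the user is left to tune per dataset.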

As for performance, it all depends on your input! For example, in my kd-GeRaF project, I tested LSH thoroughly and saw that it can have significant issues with accuracy and search speed. The datasets were in high-dimensional spaces, where approximate nearest neighbor search (ANNS) was performed.

gsamaras answered Sep 19 '25 14:09



