We have a database filled with OCRed data and manually typed data.
When searching with the CONTAINS command, not all of the expected results appear. For example, a search for "monkey man" doesn't return records that contain "m0nkey man" or "momkey man" in the data.
Is there a way to allow for these issues in the data?
I've had a cursory glance at Lucene.NET and Soundex but can't see these being of much use.
Thanks for any ideas
Full-Text Search in SQL Server and Azure SQL Database lets users and applications run full-text queries against character-based data in SQL Server tables.
If you are installing a new SQL Server instance, all you need to do is tick the checkbox next to the "Full-Text and Semantic Extractions for Search" option on the feature selection window in SQL Server setup. The feature will then be installed along with the other components on your server.
I believe you are looking for something called fuzzy matching.
Similar post:
SQL Fuzzy Matching
Maybe useful:
http://web.archive.org/web/20100209050309/http://anastasiosyal.com/archive/2009/01/11/18.aspx
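To give a rough idea of what fuzzy matching means for the "monkey man" case, here is a minimal sketch in Python based on Levenshtein (edit) distance. It is only an illustration, not what the linked posts implement: the function names `levenshtein` and `fuzzy_contains` and the `max_distance` threshold are made up for this example, and inside SQL Server the same logic would have to be ported to something like a user-defined function.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the number of single-character insertions, deletions,
    or substitutions needed to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,        # deletion
                current[j - 1] + 1,     # insertion
                previous[j - 1] + cost  # substitution (or match)
            ))
        previous = current
    return previous[-1]


def fuzzy_contains(text: str, phrase: str, max_distance: int = 1) -> bool:
    """Hypothetical helper: True if some run of consecutive words in text
    is within max_distance edits of phrase (case-insensitive)."""
    words = text.lower().split()
    target = phrase.lower()
    n = len(target.split())
    for start in range(len(words) - n + 1):
        window = " ".join(words[start:start + n])
        if levenshtein(window, target) <= max_distance:
            return True
    return False


# Made-up sample rows, standing in for the OCRed column.
records = [
    "the m0nkey man was seen",
    "a momkey man appeared",
    "monkey man",
    "a donkey can bray",
]
matches = [r for r in records if fuzzy_contains(r, "monkey man")]
```

With a threshold of 1, the first three sample rows match and the "donkey can" row does not, because it is two edits away. Raising `max_distance` catches more OCR errors but also lets in more false positives, so the threshold needs tuning against real data.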