Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How do I do a free text search in Azure Table Storage?

I have a solution with a Azure table storage with up to a few tusands "rows" per customer (partition key).

How do I best do a lightning fast free text search?

Because of the nature of the data I'm not able to do a hole word search (eg. a search for "zur" should match "Azure").

like image 658
Thomas Jespersen Avatar asked Oct 07 '10 18:10

Thomas Jespersen


People also ask

Where are source text documents stored in azure?

In most scenarios, the source text documents are loaded into object storage such as Azure Storage or Azure Data Lake Store, and then indexed using an external search service. In this scenario, source text documents are physically distinct from a resulting search index that's hosted on a search service.

How to search for text on images and documents with Azure Search?

To search for text on images and documents with Azure Search, here are the simple steps that we will follow: 1. Create a Storage Account The first step for our cognitive search engine is to create a storage account in Azure, in which we will store the files we want to analyze. We create our storage account by entering the requested parameters:

Is it possible to search for data in Azure storage tables?

To answer your question, you can search using the Azure Storage API, but you've got to be very intentional about what fields you want to search when structuring your Azure Storage Tables initially. The only "indexes" you have to work with are the partition and row keys.

What is full text search in Azure SQL?

Full-Text Search can scale from mobile or personal applications with relatively few and simple queries, up to complex mission-critical applications with high query volume over huge amounts of textual data. Full-Text Search is available in Premium, Standard and Basic service tiers in Azure SQL Database V12.


2 Answers

Just spotted this which may help you: Azure Library for Lucene

like image 115
Ross Scott Avatar answered Nov 15 '22 07:11

Ross Scott


We are using the following in production for our sites: We run hosted solr (based on lucene) instances on http://websolr.com and cache the results using the new azure distributed cache feature that is currently in beta. That gives us a worst case 200 ms latency for an initial search request between the Amazon datacenter where websolr.com runs and the Azure Datacenter and an average 6 - 10 ms for all cached searches. We also record common search text fragments and try to keep them fresh in the cache.

like image 35
Martin Wawrusch Avatar answered Nov 15 '22 07:11

Martin Wawrusch