What is the best full text search open source project (.NET preferred)?

Tags:

I've developed an index and search application with Lucene library. but this library has some limitation in custom ranking in my context, aside from its performance, i need scalability and access to all kinds of word frequencies and etc. is there any powerful open source full text library available?

665

asked Nov 08 '10 13:11

Ehsan

1 Answers

http://www.sphinxsearch.com

http://www.sphinxconnector.net/

Key Sphinx features are:

high indexing and searching performance;
advanced indexing and querying tools (flexible and feature-rich text tokenizer, querying language, several different ranking modes, etc);
advanced result set post-processing (SELECT with expressions, WHERE, ORDER BY, GROUP BY etc over text search results);
proven scalability up to billions of documents, terabytes of data, and thousands of queries per second;
easy integration with SQL and XML data sources, and SphinxAPI, SphinxQL, or SphinxSE search interfaces;
easy scaling with distributed searches.

To expand a bit, Sphinx:

has high indexing speed (upto 10-15 MB/sec per core on an internal benchmark);
has high search speed (upto 150-250 queries/sec per core against 1,000,000 documents, 1.2 GB of data on an internal benchmark);
has high scalability (biggest known cluster indexes over 3,000,000,000 documents, and busiest one peaks over 50,000,000 queries/day);
provides good relevance ranking through combination of phrase proximity ranking and statistical (BM25) ranking;
provides distributed searching capabilities;
provides document excerpts (snippets) generation;
provides searching from within application with SphinxAPI or SphinxQL interfaces, and from within MySQL with pluggable SphinxSE storage engine;
supports boolean, phrase, word proximity and other types of queries;
supports multiple full-text fields per document (upto 32 by default);
supports multiple additional attributes per document (ie. groups, timestamps, etc);
supports stopwords;
supports morphological word forms dictionaries;
supports tokenizing exceptions;
supports both single-byte encodings and UTF-8;
supports stemming (stemmers for English, Russian and Czech are built-in; and stemmers for French, Spanish, Portuguese, Italian, Romanian, German, Dutch, Swedish, Norwegian, Danish, Finnish, Hungarian, are available by building third party libstemmer library);
supports MySQL natively (all types of tables, including MyISAM, InnoDB, NDB, Archive, etc are supported);
supports PostgreSQL natively;
supports ODBC compliant databases (MS SQL, Oracle, etc) natively;
...has 50+ other features not listed here, refer to API and configuration manual!

126

answered Sep 26 '22 08:09

Vladislav Rastrusny

Related questions
                            
                                Pause/prevent animation for a checkbox control
                            
                                Is Microsoft Sync Framework alive?
                            
                                HttpClient PostAsync does not return
                            
                                UWP VisualTreeHelper.GetParent() returns null
                            
                                Get context connection and use it as a connection in other place
                            
                                Can not stop async TCP sever Windows service
                            
                                Why does Python.NET use the base method instead of the method from a derived class?
                            
                                Elasticsearch.net - Range Query
                            
                                How to leverage generics to populate derive class models to avoid code duplication?
                            
                                IMemoryCache, refresh cache before eviction
                            
                                How to get HttpWebRequest.AllowAutoRedirect to set the cookies when doing a GET/POST on the redrected page?
                            
                                How do I restore a file from the recycle bin using C#?
                            
                                Can I get a path for a Memory Mapped File? (.NET 4.0)
                            
                                LinqToSQL Error : Operation is not valid due to the current state of the object
                            
                                Making a reusable predicate for EntitySet<T>, IQueryable<T> and IEnumerable<T>
                            
                                What is the C# equivalent of MsgWaitForMultipleObjects?
                            
                                C# 4.0: casting dynamic to static
                            
                                How to detect if Windows is directing traffic over LAN or over WiFi in C#
                            
                                How to use distinct with group by in Linq to SQL
                            
                                How to programmatically get the current audio level?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What is the best full text search open source project (.NET preferred)?

Tags:

c#

full-text-search

lucene

Ehsan

People also ask

1 Answers

Vladislav Rastrusny

Recent Activity

Donate For Us