
Search Engine - Lucene or Solr

We need to integrate a search engine into our Product Catalog management software. The catalog is expected to have more than 4-5 million records, with relational data spread over several tables. Our dev platform is ASP.NET 3.5, and we have done some preliminary work on Lucene and found it to be good. However, we just came to know of Solr and are looking for practical tips to compare Lucene and Solr from the perspective of implementation, timeline, regular maintenance, performance, and features. Any guidance or pointers would be really helpful. Thanks.

Vikram asked Sep 09 '09 17:09




1 Answer

Lucene:

Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text search.

Solr:

Solr is an open source enterprise search server based on the Lucene Java search library, with XML/HTTP and JSON APIs, hit highlighting, faceted search, caching, replication, a web administration interface and ...

Essentially, Lucene is embedded in Solr. Lucene is purely a full-text search library, meant to be embedded into projects to give them full-text search capabilities. Solr has many more features and administration capabilities: it lets you search structured data without writing any custom code, load data from CSV files, parse user input tolerantly, run faceted searches, highlight matched text in results, and retrieve search results in a variety of formats (XML, JSON, ...). Check the Solr features page to see whether any feature is relevant to your project.
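To make the "no custom code" point a bit more concrete: because Solr is a server with an HTTP API, a client in any language (including ASP.NET) mostly just builds a URL and parses the response. Here is a rough sketch in Python of what such a request could look like. The host, the core name `catalog`, and the field names `brand`, `category` and `description` are made up for illustration; `q`, `wt`, `rows`, `facet`, `facet.field`, `hl` and `hl.fl` are standard Solr query parameters, but check the docs for your Solr version, since the URL layout has changed over the years.

```python
from urllib.parse import urlencode


def build_solr_select_url(base_url, core, query,
                          facet_fields=(), highlight_fields=(), rows=10):
    """Build a Solr /select URL asking for JSON results.

    base_url, core and all field names are hypothetical placeholders;
    only the parameter names are standard Solr query parameters.
    """
    params = [
        ("q", query),        # the search query
        ("wt", "json"),      # response format (XML is also available)
        ("rows", str(rows)), # number of results to return
    ]
    if facet_fields:
        # Faceted search: one facet.field parameter per field
        params.append(("facet", "true"))
        params.extend(("facet.field", field) for field in facet_fields)
    if highlight_fields:
        # Hit highlighting: comma-separated list of fields to highlight
        params.append(("hl", "true"))
        params.append(("hl.fl", ",".join(highlight_fields)))
    return f"{base_url.rstrip('/')}/{core}/select?{urlencode(params)}"


if __name__ == "__main__":
    # Assumed local Solr instance and an assumed "catalog" core
    print(build_solr_select_url(
        "http://localhost:8983/solr",
        "catalog",
        "name:laptop",
        facet_fields=["brand", "category"],
        highlight_fields=["description"],
        rows=5,
    ))
```

With plain Lucene you would write the equivalent indexing, querying, faceting and highlighting logic yourself against the library's API, which is the main trade-off to weigh for implementation time and maintenance.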

dcruz answered Sep 29 '22 23:09
