What is the difference between Lucene and Elasticsearch

Elasticsearch index vs. Lucene index

An Elasticsearch index is a collection of documents, much as a database is a collection of tables in the relational world.
To scale, we spread Elasticsearch indices across multiple physical nodes / servers.

To do that, we break each Elasticsearch index into smaller units called shards.

Question: How is this related to the Lucene index?
If we want to search for a specific term (for example: "Cake" or "Cookie"), we have to go over each shard and look for it (let's put aside how shards are located and replicated on each node).

Scanning every document in every shard would take a lot of time, so we need an efficient data structure for this search. This is where Lucene's index comes into play.

Each Elasticsearch shard is a Lucene index: it uses the Lucene index structure and stores statistics about terms in order to make term-based search more efficient.
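
To make the idea concrete, here is a minimal sketch in Python. It assumes three hypothetical shards, each modeled as a plain dictionary from term to document ids. The dictionary is only a stand-in for a Lucene index, and the search function is made up for illustration; it is not an Elasticsearch or Lucene API.

```python
# Each hypothetical shard holds its own term -> document ids mapping
# (a toy stand-in for the Lucene index inside a real shard).
shards = [
    {"cake": ["doc_id_1"], "cookie": ["doc_id_1"]},
    {"cake": ["doc_id_8"], "cookie": ["doc_id_6"]},
    {"spaghetti": ["doc_id_12"]},
]


def search(term, shards):
    """Ask every shard for the term and merge the per-shard hits."""
    hits = []
    for shard in shards:
        # One dictionary lookup per shard instead of scanning every document.
        hits.extend(shard.get(term.lower(), []))
    return sorted(hits)


cake_hits = search("Cake", shards)
missing_hits = search("Pizza", shards)
```

The search still visits every shard, but inside each shard the work is a single lookup by term rather than a pass over all documents.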

(!) This is quite confusing because of the word "index": an Elasticsearch shard is a portion of an Elasticsearch index, BUT it is built on the data structure of a Lucene index.

Bonus - Lucene's index as an inverted index

As the example below shows, Lucene's index stores the original document's content plus additional information, such as a term dictionary and term frequencies, which makes searching more efficient:

Term           Document                 Frequency
Cake           doc_id_1, doc_id_8       4 (2 in doc_id_1, 2 in doc_id_8)
Cookie         doc_id_1, doc_id_6       3 (2 in doc_id_1, 1 in doc_id_6)
Spaghetti      doc_id_12                1 (1 in doc_id_12)

Lucene's index belongs to the family of indexes known as inverted indexes, because it can list, for a given term, the documents that contain it.
This is the inverse of the natural relationship, in which documents list terms.
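
As a rough illustration of the table above, the following Python sketch builds a toy inverted index. The document texts are invented so that the counts match the table, and build_inverted_index is a hypothetical helper; Lucene's real on-disk format is far more elaborate.

```python
from collections import defaultdict

# Hypothetical documents; the texts are made up so the counts match the table.
docs = {
    "doc_id_1": "cake cookie cake cookie",
    "doc_id_6": "cookie",
    "doc_id_8": "cake and more cake",
    "doc_id_12": "spaghetti",
}


def build_inverted_index(documents):
    """Map each term to {doc_id: occurrences of the term in that document}."""
    index = defaultdict(dict)
    for doc_id, text in documents.items():
        for term in text.lower().split():
            index[term][doc_id] = index[term].get(doc_id, 0) + 1
    return index


index = build_inverted_index(docs)
cake_postings = index["cake"]             # documents containing "cake"
cake_total = sum(cake_postings.values())  # total frequency across documents
```

Looking up a term returns its documents directly, which is the "term lists documents" direction described above.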

(Reminder) How did we get from a shard to a term?

(1) A shard is a directory of files that contains documents.
(2) A document is a sequence of fields.
(3) A field is a named sequence of terms.
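
The three steps above can be sketched as a small in-memory model. The class names Shard, Document and Field are illustrative assumptions, not Lucene's actual classes, and a real shard lives in files on disk rather than in Python objects.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Field:
    name: str          # a field is a named ...
    terms: List[str]   # ... sequence of terms


@dataclass
class Document:
    doc_id: str
    fields: List[Field]  # a document is a sequence of fields


@dataclass
class Shard:
    documents: List[Document] = field(default_factory=list)

    def terms(self):
        """Walk shard -> document -> field -> term."""
        for doc in self.documents:
            for f in doc.fields:
                for term in f.terms:
                    yield doc.doc_id, f.name, term


shard = Shard([
    Document("doc_id_1", [
        Field("title", ["cake"]),
        Field("body", ["cake", "cookie"]),
    ]),
])
walked = list(shard.terms())
```

Walking the structure from the top visits every term together with the document and field it came from, which is the information an inverted index reorganizes by term.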
