Forward Index vs Inverted Index: Why?

Tags: solr, lucene, elasticsearch, inverted-index, forward-indexing

I was reading about inverted indexes (used by text search engines such as Solr and Elasticsearch), and as I understand it (taking "Person" as an example):

The attribute-to-Person relationship is inverted:

John -> PersonId(1), PersonId(2), PersonId(3)
London -> PersonId(1), PersonId(2), PersonId(5)

I can now search the person records for 'John who lives in London'.
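As a minimal sketch of that lookup in Python: the query becomes an intersection of the two posting lists. The dictionary layout and the `search` helper are made up for illustration; real engines such as Lucene store posting lists quite differently.

```python
# Hypothetical in-memory inverted index: attribute value -> set of person IDs.
inverted_index = {
    "John": {1, 2, 3},
    "London": {1, 2, 5},
}

def search(index, *terms):
    """Return the IDs that appear in the posting list of every term."""
    postings = [index.get(term, set()) for term in terms]
    if not postings:
        return set()
    return set.intersection(*postings)

print(sorted(search(inverted_index, "John", "London")))  # [1, 2]
```

Only persons 1 and 2 appear under both "John" and "London", so they are the only matches.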

Doesn't this solve all the problems? Why do we have the forward index (or regular database index) at all? In other words, in what cases is regular indexing useful? Please explain. Thanks.

asked Aug 01 '15 11:08

user1189332

2 Answers

The point that you're missing is that there is no real technical distinction between a forward index and an inverted index. "Forward" and "inverted" in this case are just descriptive terms to distinguish between:

A list of words contained in a document.
A list of documents containing a word.

The concept of an inverted index only makes sense if the concept of a regular (forward) index already exists. In the context of a search engine, a forward index would be the term vector: a list of the terms contained in a particular document. The inverted index would be a list of the documents containing a given term.
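To make the "same data, opposite direction" point concrete, here is a small Python sketch that builds an inverted index from a forward index. The sample documents and the `invert` function are hypothetical and only illustrate the relationship; they do not show how any particular search engine implements it.

```python
from collections import defaultdict

# Forward index: document ID -> terms it contains (hypothetical sample data).
forward_index = {
    "doc1": ["quick", "brown", "fox"],
    "doc2": ["quick", "dog"],
}

def invert(forward):
    """Build term -> sorted list of document IDs from a forward index."""
    inverted = defaultdict(set)
    for doc_id, terms in forward.items():
        for term in terms:
            inverted[term].add(doc_id)
    return {term: sorted(doc_ids) for term, doc_ids in inverted.items()}

inverted_index = invert(forward_index)
print(inverted_index["quick"])  # ['doc1', 'doc2']
print(inverted_index["fox"])    # ['doc1']
```

Both structures hold the same document/term pairs; they differ only in which side is the lookup key.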

When you understand that the terms "forward" and "inverted" are really just relative terms used to describe the nature of the index you're talking about - and that really an index is just an index - your question doesn't really make sense any more.

answered Sep 28 '22 08:09

Ant P

Here's an explanation of inverted index, from Elasticsearch:

"Elasticsearch uses a structure called an inverted index, which is designed to allow very fast full-text searches. An inverted index consists of a list of all the unique words that appear in any document, and for each word, a list of the documents in which it appears." (https://www.elastic.co/guide/en/elasticsearch/guide/current/inverted-index.html)

Inverted indexing is for fast full-text search. A regular (forward) index is less efficient to query, because the engine has to look through all entries to find a term, but it is very fast to build.

You can say this:

Forward index: fast indexing, less efficient queries
Inverted index: fast queries, slower indexing
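A toy Python sketch of that trade-off, using plain dictionaries. All function names here are hypothetical, and the sketch only illustrates the shape of the work involved, not how Solr or Elasticsearch actually index.

```python
def docs_with_term_forward(forward, term):
    """Query a forward index: every document has to be scanned."""
    return sorted(doc_id for doc_id, terms in forward.items() if term in terms)

def docs_with_term_inverted(inverted, term):
    """Query an inverted index: a single dictionary lookup."""
    return sorted(inverted.get(term, ()))

def add_document(forward, inverted, doc_id, terms):
    """Indexing: one write for the forward index, one write per distinct term for the inverted index."""
    forward[doc_id] = list(terms)
    for term in set(terms):
        inverted.setdefault(term, set()).add(doc_id)

forward, inverted = {}, {}
add_document(forward, inverted, "doc1", ["john", "london"])
add_document(forward, inverted, "doc2", ["john", "paris"])

print(docs_with_term_forward(forward, "john"))    # ['doc1', 'doc2']
print(docs_with_term_inverted(inverted, "john"))  # ['doc1', 'doc2']
```

Both queries return the same answer. The forward version touches every document, while the inverted version pays its cost earlier, when the document is added.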

But it always depends on context. As a rough comparison with MySQL: MyISAM has fast reads, while InnoDB has fast inserts/updates and slower reads.

Read more here: https://www.found.no/foundation/indexing-for-beginners-part3/

answered Sep 28 '22 07:09

schellingerht
