What is the difference between a secondary index and an inverted index in Cassandra?

When I read about these two, I thought they described the same approach. I googled but found nothing. Is the difference in the implementation? Does Cassandra maintain the secondary index itself, whereas an inverted index has to be implemented by me?

Which is faster in searching, by the way?

302

asked Oct 08 '13 13:10

fereshteh

1 Answer

The main difference is that secondary indexes in Cassandra are not distributed in the same way a manual inverted index would be. With the inbuilt secondary indexes, each node indexes the data it stores locally (using the LocalPartitioner). With manual indexing, the indexes are distributed independently of the nodes that store the values.

This means that, for the inbuilt indexes, each query must go to every node, whereas with a manual inverted index you would go to just one node (plus replicas) to query the value you are looking up. One advantage of having the index stored locally is that indexes can be updated atomically with the data. (Since Cassandra 1.2, atomic batches can be used for this instead, although they are a bit slower.)
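To make the fan-out difference concrete, here is a minimal sketch in plain Python. It is a toy model, not Cassandra code: the cluster size, the `owner` hash function, and the sample rows are all made up for illustration, and replicas are ignored.

```python
from collections import defaultdict
import hashlib

NUM_NODES = 4  # hypothetical cluster size


def owner(key):
    """Toy partitioner (an assumption): hash a key to the node that stores it."""
    digest = hashlib.md5(str(key).encode()).hexdigest()
    return int(digest, 16) % NUM_NODES


# Sample data: user id -> city, where city is the indexed column.
rows = {1: "Paris", 2: "Tokyo", 3: "Paris", 4: "Lima", 5: "Tokyo", 6: "Paris"}

# Inbuilt style: each node indexes only the rows it stores locally.
local_index = [defaultdict(set) for _ in range(NUM_NODES)]
for user_id, city in rows.items():
    local_index[owner(user_id)][city].add(user_id)

# Manual inverted index: the index entry is partitioned by the indexed value.
global_index = [defaultdict(set) for _ in range(NUM_NODES)]
for user_id, city in rows.items():
    global_index[owner(city)][city].add(user_id)


def query_local(city):
    """Ask every node, because any of them may hold matching rows."""
    result = set()
    nodes_contacted = 0
    for node in local_index:
        nodes_contacted += 1
        result |= node.get(city, set())
    return result, nodes_contacted


def query_global(city):
    """Ask only the single node that owns the index entry for this value."""
    node = global_index[owner(city)]
    return set(node.get(city, set())), 1


print(query_local("Paris"))
print(query_global("Paris"))
```

Both queries return the same set of user ids, but the local-index query touches all four nodes while the manual index touches one.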

This is why Cassandra indexes are not recommended for really high cardinality data. If you are doing a lookup on each node but there are only one or two results, it is inefficient and a manual inverted index will be better. If your lookup returns many results, then you will need to look up on each node anyway, so the inbuilt indexes work well.
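A rough back-of-the-envelope version of this argument, as a sketch. The two function names are hypothetical, and the model assumes the manual index stores only row keys (so the matching rows still have to be fetched from the nodes that own them), that in the worst case every result lives on a different node, and that replicas are ignored.

```python
def builtin_index_requests(num_nodes, num_results):
    """Inbuilt index: every node is queried, however few rows match."""
    return num_nodes


def manual_index_requests(num_nodes, num_results):
    """Manual index: one index lookup, then fetch rows from at most
    min(num_results, num_nodes) distinct nodes (worst case)."""
    return 1 + min(num_results, num_nodes)


# High cardinality: 2 matching rows on a 50-node cluster.
print(builtin_index_requests(50, 2), manual_index_requests(50, 2))

# Low cardinality: 10,000 matching rows on the same cluster.
print(builtin_index_requests(50, 10_000), manual_index_requests(50, 10_000))
```

Under these assumptions, the manual index wins clearly when only a couple of rows match, and the two approaches cost about the same once the results are spread over the whole cluster.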

A further advantage of using Cassandra's inbuilt indexing is that the indexes are updated lazily, so you don't need to do a read on every update. (See CASSANDRA-2897.) This can be a significant speed improvement for indexed tables with high write throughput.
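The idea of a lazy index update can be illustrated with a toy model. This is not how Cassandra implements it internally; the `EagerIndex` and `LazyIndex` classes are hypothetical and only show why skipping the read on write is possible if stale index entries are filtered out (and cleaned up) at query time.

```python
class EagerIndex:
    """Manual-style index: reads the old value on every write so the
    stale index entry can be removed immediately."""

    def __init__(self):
        self.rows = {}           # key -> current value
        self.index = {}          # value -> set of keys
        self.reads_on_write = 0

    def write(self, key, value):
        self.reads_on_write += 1             # read-before-write
        old = self.rows.get(key)
        if old is not None:
            self.index[old].discard(key)     # drop the stale entry now
        self.rows[key] = value
        self.index.setdefault(value, set()).add(key)

    def lookup(self, value):
        return set(self.index.get(value, set()))


class LazyIndex:
    """Lazy index: writes never read; stale entries are detected and
    removed when the index is queried."""

    def __init__(self):
        self.rows = {}
        self.index = {}
        self.reads_on_write = 0

    def write(self, key, value):
        self.rows[key] = value
        self.index.setdefault(value, set()).add(key)   # old entry stays for now

    def lookup(self, value):
        hits = set()
        for key in list(self.index.get(value, set())):
            if self.rows.get(key) == value:
                hits.add(key)
            else:
                self.index[value].discard(key)         # clean up stale entry
        return hits


eager, lazy = EagerIndex(), LazyIndex()
for idx in (eager, lazy):
    idx.write(1, "a")
    idx.write(1, "b")   # overwrite: the entry under "a" is now stale

print(eager.reads_on_write, lazy.reads_on_write)
print(eager.lookup("a"), lazy.lookup("a"))
print(eager.lookup("b"), lazy.lookup("b"))
```

Both indexes return the same lookup results, but the lazy one never reads on the write path, which is where the saving for write-heavy tables would come from.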

122

answered Sep 19 '22 23:09

Richard

Tags:

indexing

search

cassandra

inverted-index
