Solr loads entire index into memory

Tags:

I am using solr for data similar to name:age:sex:balance:nextbalance:interest

I have 30 M records totaling to 4G on disk. I am retrieving by age:23 which is only 50 records. I have indexed="true" in the schema xml. Solr seems to load the entire index on disk into memory (4G). Isnt it supposed to retrieve only the 40 odd records into memory ?

795

asked Mar 14 '12 14:03

Hari

3 Answers

Maybe this is document cache. You need to specify the size of it. Can you please check the following in solrconfig.xml?

<!-- documentCache caches Lucene Document objects (the stored fields for each document).
  -->
<documentCache
  class="solr.LRUCache"
  size="16384"
  initialSize="16384"/>

126

answered Oct 01 '22 00:10

stzoannos

I think it depends on how you configure the cache (what it does and doesn't keep in memory). Loading the entire index into memory can give you huge performance boosts in terms of the time needed to retrieve results, regardless of the query.

Details on configuring cache, and details on performance factors:

https://cwiki.apache.org/confluence/display/SOLR/SolrPerformanceFactors

answered Oct 01 '22 00:10

jefflunt

Fields that are stored but not indexed, are saved on disk but not in RAM. However, 100% of the records are indeed indexed in RAM and those indexes contain all of the indexed fields. But inverted indexes are rather efficient for that.

However, when you do queries then SOLR does retrieve the entire set of stored (but not indexed) field contents into RAM for the records which match. This is usually considered to be desirable caching behavior because it means that search results can be transmitted sooner which reduces the overall query turnaround time. As usual with SOLR, you can configure caching behavior in many ways to match your RAM budget and database needs. Have a look at the possibilities in solrconfig.xml.

Note that this is a complex area and you probably will find it difficult to fully understand caching if Google is your main info source. This is an area where it is better to learn from one of the books on SOLR.

answered Oct 01 '22 01:10

Michael Dillon

Related questions
                            
                                Java - Is it common practice to use a hashtable (eg HashMap) to map objects to themselves?
                            
                                How to simplify a class with lot's of copy-pasted error handling code?
                            
                                Updating contents of a jsp page without refreshing
                            
                                Efficient data structure with two keys
                            
                                Sort List<List<String>> by list value
                            
                                Ivy Custom Resolvers for Git or TFS
                            
                                Java array, add item into next empty index
                            
                                How to intercept super class constructor argument?
                            
                                Enabling scroll bars when JTextArea exceeds certain amount of lines
                            
                                How to check Gradle version number in build file?
                            
                                Eclipse + GAE: Properties changes and therefor problems with deploy
                            
                                JSlider not updating?
                            
                                Are ZipFile InputStreams thread safe?
                            
                                missing feature in lucene 4.0 snapshot
                            
                                Hibernate : @SecondaryTable doesn't work
                            
                                In GWT is there a way to create a KeyPressEvent for the entire view instead of a single input element?
                            
                                Initializing an array of custom type
                            
                                Serialization - What is the advantage of using ObjectStreamField [] serialPersistentFields?
                            
                                Wicket Label not updated / remains invisible
                            
                                Is it bad to have public variables in a non-static class?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Solr loads entire index into memory

Tags:

java

indexing

solr