Folks,
I am trying to reduce the memory usage of my Elasticsearch deployment (single-node cluster).
I can see 3GB of JVM heap space being used. To optimize, I first need to understand the bottleneck, but I have a limited understanding of how the JVM heap usage is split.
Field data looks to consume 1.5GB, and the filter cache and query cache combined consume less than 0.5GB, which adds up to 2GB at most.
Can someone help me understand where Elasticsearch eats up the rest of the 1GB?
Theory. As a Java application, Elasticsearch requires some logical memory (heap) to be allocated from the system's physical memory. This should be up to half of the physical RAM, capping at 32GB. Setting a higher heap size is usually a response to expensive queries and larger data storage.
Elasticsearch clusters and JVM heap size: the ideal heap size is somewhere below 32GB, as heap sizes above 32GB become less efficient (the JVM can no longer use compressed object pointers). What these recommendations mean is that on a 64GB node, we dedicate 32GB to the Elasticsearch heap and leave 32GB to the operating system on the host (or container) that runs your cluster.
Overview. The heap size is the amount of RAM allocated to the Java Virtual Machine of an Elasticsearch node. As a general rule, you should set -Xms and -Xmx to the SAME value, which should be 50% of your total available RAM, subject to a maximum of (approximately) 31GB.
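For instance, on a hypothetical machine with 8GB of RAM, a sketch of how you might pin the heap (the 4g value is illustrative; older 1.x/2.x releases read ES_HEAP_SIZE, and ES_JAVA_OPTS works as an alternative):
$ export ES_HEAP_SIZE=4g                   # 1.x/2.x: sets both -Xms and -Xmx to 4g
$ export ES_JAVA_OPTS="-Xms4g -Xmx4g"      # alternative: pass the JVM flags explicitly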
I can't tell for your exact setup, but in order to know what's going on in your heap, you can use the jvisualvm tool (bundled with the JDK) together with Marvel or the BigDesk plugin (my preference) and the _cat APIs to analyze what's going on.
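For a quick first look at overall heap usage before digging deeper, something like the following _cat/nodes call (heap.current, heap.percent and heap.max are standard column names) gives a one-line summary per node:
$ curl 'localhost:9200/_cat/nodes?v&h=name,heap.current,heap.percent,heap.max'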
As you've rightly noticed, the heap hosts three main caches, namely:
the field data cache (indices.fielddata.cache.size), which in your case seems to be around 50% of the heap, probably due to the fielddata circuit breaker (see the checks below)
the filter cache
the query cache
There is a nice mindmap available here (kudos to Igor Kupczyński) that summarizes the roles of these caches. That leaves more or less ~30% of the heap (1GB in your case) for all the other object instances that ES needs to create in order to function properly (more about this later).
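If you want to verify the circuit-breaker theory and see exactly how much heap field data is holding, the node stats API can be queried along these lines (a sketch; the breaker metric is only available on reasonably recent versions):
$ curl 'localhost:9200/_nodes/stats/breaker?pretty'
$ curl 'localhost:9200/_nodes/stats/indices/fielddata?fields=*&pretty'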
Here is how I proceeded on my local environment. First, I started my node fresh (with Xmx1g) and waited for green status. Then I started jvisualvm and hooked it onto my Elasticsearch process. I took a heap dump from the Sampler tab so I could compare it later on with another dump. My heap initially looked like this (only 1/3 of the max heap allocated so far):
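If you prefer the command line over the Sampler tab, a comparable dump can be taken with the stock JDK tools (es-pid below is a placeholder for your Elasticsearch process id):
$ jps -l                                        # find the Elasticsearch PID
$ jmap -dump:live,format=b,file=es-heap.hprof <es-pid>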
I also checked that my field data and filter caches were empty:
Just to make sure, I also ran /_cat/fielddata and, as you can see, there's no heap used by field data yet since the node has just started.
$ curl 'localhost:9200/_cat/fielddata?bytes=b&v'
id host ip node total
TMVa3S2oTUWOElsBrgFhuw iMac.local 192.168.1.100 Tumbler 0
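As a side note, once field data does start to accumulate, the same endpoint can break the usage down per field (fields=* is a standard parameter of _cat/fielddata), which helps spot the offending field:
$ curl 'localhost:9200/_cat/fielddata?bytes=b&v&fields=*'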
This is the initial situation. Now, we need to warm this all up a bit, so I started my back- and front-end apps to put some pressure on the local ES node.
After a while, my heap looked like this, so its size had more or less increased by 300MB (139MB -> 452MB; not much, but I ran this experiment on a small dataset).
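If you can't attach a profiler, a rough way to follow the same growth is the JVM section of the node stats, which reports heap usage, memory pools and GC counts:
$ curl 'localhost:9200/_nodes/stats/jvm?pretty'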
My caches have also grown a bit to a few megabytes:
$ curl 'localhost:9200/_cat/fielddata?bytes=b&v'
id host ip node total
TMVa3S2oTUWOElsBrgFhuw iMac.local 192.168.1.100 Tumbler 9066424
At this point, I took another heap dump to gain insight into how the heap had evolved. I computed the retained sizes of the objects and compared them with the first dump I took just after starting the node. The comparison looks like this:
Among the objects that increased in retained size, the usual suspects are maps, of course, and any cache-related entities. But we can also find classes such as NIOFSDirectory, which are used to read Lucene segment files from the filesystem. As you can see, the heap hosts the three main caches, but it is also the place where all the other Java objects that the Elasticsearch process needs, and that are not necessarily cache-related, reside.
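To get an idea of how much heap the Lucene segments themselves account for, the _cat/segments API exposes a size.memory column; for example:
$ curl 'localhost:9200/_cat/segments?v&h=index,segment,size,size.memory'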
So if you want to control your heap usage, you obviously have no control over the internal objects that ES needs to function properly, but you can definitely influence the sizing of your caches. If you look up the documentation for the cache settings listed above, you'll get a precise idea of which knobs you can tune.
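As a rough illustration only (the percentages are arbitrary, and the setting names depend on your version: indices.cache.filter.size is the 1.x name, newer releases use indices.queries.cache.size), you could cap the caches in elasticsearch.yml:
indices.fielddata.cache.size: 30%
indices.cache.filter.size: 10%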
Also, tuning caches might not be the only option; maybe you need to rewrite some of your queries to be more memory-friendly, change your analyzers, or change some field types in your mapping, etc. It's hard to tell in your case without more information, but this should give you some leads.
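For example, if field data on one field dominates your heap, a common mitigation when (re)creating an index is to store that field as doc_values so the values live on disk instead of the heap. A sketch for a 1.x-style mapping, where myindex, mytype and status are made-up names:
$ curl -XPUT 'localhost:9200/myindex' -d '{
  "mappings": {
    "mytype": {
      "properties": {
        "status": { "type": "string", "index": "not_analyzed", "doc_values": true }
      }
    }
  }
}'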
Go ahead and launch jvisualvm the same way I did here, and watch how your heap grows while your app (searching + indexing) is hitting ES; you should quickly gain some insight into what's going on in there.