If I execute the Python interpreter it needs roughly 111 MByte: <pre class="prettyprint"><code>>>> import psutil >>> psutil.Process().memory_info() pmem(rss=19451904, vms=111677440, shared=6905856, text=4096, lib=0, data=12062720, dirty=0) </code></pre> After importing django it uses 641 MByte <pre class="prettyprint"><code>>>> import django >>> django.setup() >>> psutil.Process().memory_info() pmem(rss=188219392, vms=641904640, shared=27406336, text=4096, lib=0, data=284606464, dirty=0) </code></pre> And the WSGI process (which has already executed some http requests) 919 MByte: <pre class="prettyprint"><code>>>> psutil.Process(13843).memory_info() pmem(rss=228777984, vms=919306240, shared=16076800, text=610304, lib=0, data=485842944, dirty=0) </code></pre> I think that's too much. What can I do to investigate this in more detail? What occupies the memory? Background: From time to time memory on the server is running low and the oom-killer terminates processes.

You're looking at the wrong attribute: <ul> <li> <code>rss</code> is the Resident Set Size, which is the actual physical memory the process is using</li> <li> <code>vms</code> is the Virtual Memory Size which is the virtual memory that process is using</li> </ul> Kernels allow a process to get a different view of the memory where the process thinks like it is the only program running in the system, that's why the virtual address space is for. While in reality kernel uses memory management to synchronize memory usage between processes. Also note that, the shared libraries between processes play a part in memory consumption as well. Regarding your OOM incident, see which process is being killed and see what's the process was doing. For example, Linux uses <code>/proc/PID/oom_score</code> to keep track of each processes OOM score to find which process to kill in OOM situations -- higher value indicates a higher probability of selection. Linux sets this value based on different heuristics e.g. number of children, how long it's running, CPU usage, niceness and so on. And you can tweak this for the process by writing to <code>/proc/PID/oom_score_adj</code>. But don't influence the OOM score, try to debug the actual problem in the process. A memory profiler like <code>valgrind</code> might be helpful in this regard.

What uses the memory of my python process? (RSS vs VMS)

Tags:

python

django

memory-profiling

If I execute the Python interpreter it needs roughly 111 MByte:

>>> import psutil
>>> psutil.Process().memory_info()
pmem(rss=19451904, vms=111677440, shared=6905856, text=4096, lib=0, data=12062720, dirty=0)

After importing django it uses 641 MByte

>>> import django
>>> django.setup()
>>> psutil.Process().memory_info()
pmem(rss=188219392, vms=641904640, shared=27406336, text=4096, lib=0, data=284606464, dirty=0)

And the WSGI process (which has already executed some http requests) 919 MByte:

>>> psutil.Process(13843).memory_info()
pmem(rss=228777984, vms=919306240, shared=16076800, text=610304, lib=0, data=485842944, dirty=0)

I think that's too much.

What can I do to investigate this in more detail? What occupies the memory?

Background: From time to time memory on the server is running low and the oom-killer terminates processes.

416

asked Dec 03 '19 15:12

guettli

1 Answers

You're looking at the wrong attribute:

rss is the Resident Set Size, which is the actual physical memory the process is using
vms is the Virtual Memory Size which is the virtual memory that process is using

Kernels allow a process to get a different view of the memory where the process thinks like it is the only program running in the system, that's why the virtual address space is for. While in reality kernel uses memory management to synchronize memory usage between processes. Also note that, the shared libraries between processes play a part in memory consumption as well.

Regarding your OOM incident, see which process is being killed and see what's the process was doing. For example, Linux uses /proc/PID/oom_score to keep track of each processes OOM score to find which process to kill in OOM situations -- higher value indicates a higher probability of selection. Linux sets this value based on different heuristics e.g. number of children, how long it's running, CPU usage, niceness and so on. And you can tweak this for the process by writing to /proc/PID/oom_score_adj.

But don't influence the OOM score, try to debug the actual problem in the process. A memory profiler like valgrind might be helpful in this regard.

101

answered Nov 04 '22 06:11

heemayl

Related questions
                            
                                Replacing more than one substring value with pandas str.replace
                            
                                Documentation says to use a confidence parameter, but it throws an error
                            
                                Counting Consecutive Duplicates For By Group
                            
                                How to add additional field to django oscar product field in dashboard
                            
                                Occasional 'temporary failure in name resolution' while connecting to AWS Aurora cluster
                            
                                Stuck understanding ResNet's Identity block and Convolutional blocks
                            
                                How to Naturally Sort Pathlib objects in Python?
                            
                                Passing current_user from Flask-Login to Plotly Dash app
                            
                                Panel + Param: FileInput widget and @param.depends interaction
                            
                                Python Connect to AWS Aurora Serverless MySQL Using SQLAlchemy
                            
                                How to set proxy AUTHENTICATION username:password using Python/Selenium
                            
                                How do you get Visual Studio Code to use different Python interpreter?
                            
                                Difference between nn.MaxPool2d vs.nn.functional.max_pool2d?
                            
                                How to style/format point markers in Plotly 3D scatterplot?
                            
                                Passing `training=true` when using Tensorflow 2's Keras Functional API
                            
                                Can python load definitions from a C header file?
                            
                                Count number of cells in the image
                            
                                No module named 'pyarrow._orc'
                            
                                How to separate files using dask groupby on a column
                            
                                How to access arrays?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With