I'd like to utilise Prometheus to monitor occurences of system OOM killer on Debian / Ubuntu. The particular case is that sometimes Redis is killed because of OOM and the already existing low-memory-available alert is not triggered because it happens too fast. But I'd like to make the solution as smart and universal as possible and also not to spend a lot of time on it, so let us not focus on Redis itself. The ideas I have so far:
I'd like to ask for your suggestions and opinions. Thanks!
The node_vmstat_oom_kill metric from the node exporter will tell you this.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With