I use Flask and Gunicorn to deploy production-scale machine learning models, but with 4 Gunicorn workers the memory usage is very high, almost 4x what the app uses without Gunicorn. Can Gunicorn be made to use less memory?
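Set your worker count to at most (2*CPU)+1, where CPU is the number of cores. This is the suggested maximum number of concurrent requests when using workers and threads. Each worker is a separate process that loads its own copy of the model, so memory grows roughly in proportion to the number of workers; running fewer workers is the most direct way to reduce it.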
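As a rough illustration of the (2*CPU)+1 rule, here is a minimal sketch of a `gunicorn.conf.py` that caps the number of worker processes and makes up the remaining concurrency with threads, which share their worker's memory. The helper `suggest_concurrency` and the cap `max_workers=2` are hypothetical names and values chosen for this example; `workers` and `threads` are the standard Gunicorn settings. Whether threads suit your model depends on how thread-safe your inference code is, so treat this as a starting point rather than a recommendation.

```python
# gunicorn.conf.py (sketch; the helper and the cap are assumptions, not Gunicorn APIs)
import multiprocessing


def suggest_concurrency(cpu_count, max_workers):
    """Split the (2*CPU)+1 concurrency target between processes and threads."""
    target = 2 * cpu_count + 1
    # Never run more worker processes than the cap or the target itself.
    workers = max(1, min(max_workers, target))
    # Floor division keeps workers * threads at or below the target.
    threads = max(1, target // workers)
    return workers, threads


# Hypothetical cap of 2 processes, so at most 2 copies of the model are loaded.
workers, threads = suggest_concurrency(multiprocessing.cpu_count(), max_workers=2)
```

You could then start the server with something like `gunicorn -c gunicorn.conf.py app:app`, where `app:app` is assumed to be your Flask module and application object.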