Mysql connection pooling question: is it worth it?

Tags:

I recall hearing that the connection process in mysql was designed to be very fast compared to other RDBMSes, and that therefore using a library that provides connection pooling (SQLAlchemy) won't actually help you that much if you enable the connection pool.

Does anyone have any experience with this?

I'm leery of enabling it because of the possibility that if some code does something stateful to a db connection and (perhaps mistakenly) doesn't clean up after itself, that state which would normally get cleaned up upon closing the connection will instead get propagated to subsequent code that gets a recycled connection.

833

asked Jan 01 '09 19:01

ʞɔıu

2 Answers

There's no need to worry about residual state on a connection when using SQLA's connection pool, unless your application is changing connectionwide options like transaction isolation levels (which generally is not the case). SQLA's connection pool issues a connection.rollback() on the connection when its checked back in, so that any transactional state or locks are cleared.

It is possible that MySQL's connection time is pretty fast, especially if you're connecting over unix sockets on the same machine. If you do use a connection pool, you also want to ensure that connections are recycled after some period of time as MySQL's client library will shut down connections that are idle for more than 8 hours automatically (in SQLAlchemy this is the pool_recycle option).

You can quickly do some benching of connection pool vs. non with a SQLA application by changing the pool implementation from the default of QueuePool to NullPool, which is a pool implementation that doesn't actually pool anything - it connects and disconnects for real when the proxied connection is acquired and later closed.

191

answered Sep 20 '22 10:09

zzzeek

Even if the connection part of MySQL itself is pretty slick, presumably there's still a network connection involved (whether that's loopback or physical). If you're making a lot of requests, that could get significantly expensive. It will depend (as is so often the case) on exactly what your application does, of course - if you're doing a lot of work per connection, then that will dominate and you won't gain a lot.

When in doubt, benchmark - but I would by-and-large trust that a connection pooling library (at least, a reputable one) should work properly and reset things appropriately.

answered Sep 19 '22 10:09

Jon Skeet

Related questions
                            
                                Parse pandas (multi)index to datetime
                            
                                Visual Studio - "The environment IronPython|2.7-32 appears to be incorrectly configured or missing"
                            
                                Blob.generate_signed_url() failing to AttributeError
                            
                                Python Unit Test : How to unit test the module which contains database operations?
                            
                                Use keras layer in tensorflow code
                            
                                Python async/await downloading a list of urls
                            
                                How to fix issues with E402?
                            
                                platform.linux_distribution() deprecated - what are the alternatives?
                            
                                Deleting elements of a list based on a condition
                            
                                What does the asterisk in the output of `reveal_type` mean?
                            
                                Are nested format specifications legal?
                            
                                How to schedule a task in asyncio so it runs at a certain date?
                            
                                Zero occurrences/frequency using value_counts() in PANDAS
                            
                                Seaborn: Avoid plotting missing values (line plot)
                            
                                Pandas - group by column and transform the data to numpy array
                            
                                Converting a Python function with a callback to an asyncio awaitable
                            
                                pip3 setup.py install_requires PEP 508 git URL for private repo
                            
                                How can I customize python syntax highlighting in VS code?
                            
                                Is it possible to call Black as an API?
                            
                                Python's requests triggers Cloudflare's security while urllib does not

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Mysql connection pooling question: is it worth it?

Tags:

python

mysql

sqlalchemy

connection-pooling

ʞɔıu

People also ask

2 Answers

zzzeek

Jon Skeet

Recent Activity

Donate For Us