How do cursors work in Python's DB-API?

I have been using Python with RDBMSs (MySQL and PostgreSQL), and I have noticed that I do not really understand how to use a cursor.

Usually, one has a script connect to the DB via a client DB-API module (like psycopg2 or MySQLdb):

connection = psycopg2.connect(host='otherhost', etc)

And then one creates a cursor:

cursor = connection.cursor()

And then one can issue queries and commands:

cursor.execute("SELECT * FROM etc")

Now, where is the result of the query, I wonder? Is it on the server? Or a little on my client and a little on the server? And then, if we need to access some results, we fetch them:

rows = cursor.fetchone()

rows = cursor.fetchmany()

Now let's say I do not retrieve all the rows and decide to execute another query. What will happen to the previous results? Is there an overhead?

Also, should I create a cursor for every form of command and continuously reuse it for those same commands somehow? I heard psycopg2 can somehow optimize commands that are executed many times but with different values. How, and is it worth it?

Thx

410

asked Jan 17 '09 23:01

Nicholas Leonard

2 Answers

ya, i know it's months old :P

DB-API's cursor appears to be closely modeled after SQL cursors. As far as resource (row) management is concerned, DB-API does not specify whether the client must retrieve all the rows up front or DECLARE an actual SQL cursor. As long as the fetchXXX interfaces do what they're supposed to, DB-API is happy.
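To make the fetchXXX contract concrete, here is a minimal sketch. It assumes the standard-library sqlite3 module as a stand-in for psycopg2 or MySQLdb (so it runs without a server), and the table `t` and its values are made up for illustration. It also shows what happens on this driver when you execute again before fetching everything: the pending rows are simply dropped. Where the rows were buffered in the meantime is driver-specific, which is the point made above.

```python
import sqlite3

# sqlite3 is only a stand-in here; the table and its contents are hypothetical.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE t (n INTEGER)")
cur.executemany("INSERT INTO t (n) VALUES (?)", [(i,) for i in range(10)])
conn.commit()

cur.execute("SELECT n FROM t ORDER BY n")
first = cur.fetchone()    # one row as a tuple, or None when no rows are left
batch = cur.fetchmany(3)  # a list of up to 3 further rows

# Rows 4..9 were never fetched. Executing again on the same cursor
# replaces the pending result set with the new one.
cur.execute("SELECT n FROM t WHERE n >= 8 ORDER BY n")
after_reexecute = cur.fetchall()  # all remaining rows of the new query

conn.close()
```

If you need two result sets open at the same time, use two cursors rather than re-executing on one.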

As far as psycopg2 cursors are concerned (as you may well know), "unnamed DB-API cursors" will fetch the entire result set, AFAIK buffered in memory by libpq. "Named DB-API cursors" (a psycopg2 concept that may not be portable) will request the rows on demand, as the fetchXXX methods are called.
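A rough sketch of the named-cursor variant, assuming psycopg2 2.5 or later (for the `with` support on cursors). The helper `stream_rows`, the cursor name `"stream_cursor"`, and the default batch size are my own choices for illustration, not anything psycopg2 prescribes.

```python
def stream_rows(connection, query, batch_size=2000):
    """Yield rows one at a time from a psycopg2 named (server-side) cursor."""
    # A cursor created with name= is a server-side cursor: execute() declares
    # it on the server instead of pulling the whole result set to the client.
    # "stream_cursor" is an arbitrary, hypothetical name.
    with connection.cursor(name="stream_cursor") as cur:
        cur.itersize = batch_size  # rows requested per round trip while iterating
        cur.execute(query)
        for row in cur:
            yield row


# Usage (assumes a reachable PostgreSQL server; DSN and table are hypothetical):
#   import psycopg2
#   conn = psycopg2.connect("dbname=test")
#   for row in stream_rows(conn, "SELECT * FROM big_table"):
#       ...
```

Named cursors need an open transaction, so this assumes the connection is not in autocommit mode.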

As cited by "unbeknown", executemany can be used to optimize multiple runs of the same command. However, it does not cover the need for prepared statements: when repeated executions of a statement with different parameter sets are not directly sequential, executemany() offers no advantage over execute(). DB-API does "provide" driver authors with the ability to cache executed statements, but its implementation (what is the scope or lifetime of the statement?) is undefined, so it's impossible to set expectations across DB-API implementations.
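For reference, the two call shapes look like this. Again a sketch using sqlite3 as a stand-in, with a made-up `points` table; note that sqlite3 uses `?` placeholders where psycopg2 uses `%s`. Whether the single executemany() call is actually faster than the loop depends entirely on the driver.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE points (x INTEGER, y INTEGER)")  # hypothetical table

params = [(1, 2), (3, 4), (5, 6)]

# One call with a sequence of parameter sets: the driver is free to
# optimize this, but DB-API does not promise that it will.
cur.executemany("INSERT INTO points (x, y) VALUES (?, ?)", params)

# The same statement issued once per parameter set.
for x, y in params:
    cur.execute("INSERT INTO points (x, y) VALUES (?, ?)", (x + 100, y + 100))

conn.commit()
cur.execute("SELECT COUNT(*) FROM points")
total = cur.fetchone()[0]
conn.close()
```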

If you are loading lots of data into PostgreSQL, I would strongly recommend trying to find a way to use COPY.
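One way to reach COPY from psycopg2 is the cursor's copy_expert() method, which takes a COPY statement and a file-like object. The sketch below is only that: `copy_rows` is a hypothetical helper, and the table and column names are pasted straight into the SQL, so they must come from trusted code, never from user input.

```python
import csv
import io


def copy_rows(cursor, table, columns, rows):
    """Bulk-load rows through COPY ... FROM STDIN using a psycopg2 cursor."""
    # Serialize the rows as CSV into an in-memory buffer.
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(rows)
    buf.seek(0)
    # table and columns are interpolated as-is: trusted identifiers only.
    sql = "COPY {} ({}) FROM STDIN WITH (FORMAT csv)".format(
        table, ", ".join(columns)
    )
    cursor.copy_expert(sql, buf)


# Usage (assumes an open psycopg2 connection `conn` and an existing table):
#   with conn.cursor() as cur:
#       copy_rows(cur, "points", ["x", "y"], [(1, 2), (3, 4)])
#   conn.commit()
```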

122

answered Sep 19 '22 13:09

jwp

Assuming you're using PostgreSQL, the cursors probably are just implemented using the database's native cursor API. You may want to look at the source code for pg8000, a pure Python PostgreSQL DB-API module, to see how it handles cursors. You might also like to look at the PostgreSQL documentation for cursors.

answered Sep 18 '22 13:09

kquinn

Tags:

performance

python

psycopg2

rdbms

cursors
