python script hangs when calling cursor.fetchall() with large data set

Tags:

mysql-connector

I have a query that returns over 125K rows.

The goal is to write a script the iterates through the rows, and for each, populate a second table with data processed from the result of the query.

To develop the script, I created a duplicate database with a small subset of the data (4126 rows)

On the small database, the following code works:

import os
import sys
import random

import mysql.connector

cnx = mysql.connector.connect(user='dbuser', password='thePassword',
                          host='127.0.0.1',
                          database='db')
cnx_out = mysql.connector.connect(user='dbuser', password='thePassword',
                          host='127.0.0.1',
                          database='db')

ins_curs = cnx_out.cursor()

curs = cnx.cursor(dictionary=True)
#curs = cnx.cursor(dictionary=True,buffered=True) #fail

with open('sql\\getRawData.sql') as fh:
    sql = fh.read()

curs.execute(sql, params=None, multi=False)
result = curs.fetchall()  #<=== script stops at this point
print len(result) #<=== this line never executes

print curs.column_names

curs.close()
cnx.close()
cnx_out.close()
sys.exit()

The line curs.execute(sql, params=None, multi=False) succeeds on both the large and small databases. If I use curs.fetchone() in a loop, I can read all records.

If I alter the line:

curs = cnx.cursor(dictionary=True)

to read:

curs = cnx.cursor(dictionary=True,buffered=True)

The script hangs at curs.execute(sql, params=None, multi=False).

I can find no documentation on any limits to fetchall(), nor can I find any way to increase the buffer size, and no way to tell how large a buffer I even need.

There are no exceptions raised.

How can I resolve this?

570

asked Apr 10 '15 01:04

1 Answers

I was having this same issue, first on a query that returned ~70k rows and then on one that only returned around 2k rows (and for me RAM was also not the limiting factor). I switched from using mysql.connector (i.e. the mysql-connector-python package) to MySQLdb (i.e. the mysql-python package) and then was able to fetchall() on large queries with no problem. Both packages seem to follow the python DB API, so for me MySQLdb was a drop-in replacement for mysql.connector, with no code changes necessary beyond the line that sets up the connection. YMMV if you're leveraging something specific about mysql.connector.

Pragmatically speaking, if you don't have a specific reason to be using mysql.connector the solution to this is just to switch to a package that works better!

100

answered Sep 30 '22 03:09

mactyr

Related questions
                            
                                How can I convert from scatter size to data coordinates in matplotlib?
                            
                                utf-8 character in user path prevents module from being imported
                            
                                Python : Setting cookie into another website
                            
                                Get Binary Representation of PIL Image Without Saving
                            
                                How to Cross Validate Properly
                            
                                Feature space reduction for tag prediction
                            
                                SymPy cannot solve polynomial equation of 4th order
                            
                                how to save pygame camera as video output
                            
                                Buffer size for reading UDP packets in Python
                            
                                Write numpy array to wave file in buffers using wave (not scipy.io.wavfile) module
                            
                                Combining Tkinter and win32ui makes Python crash on exit
                            
                                How can python do imports after I clear sys.path - Import precedence
                            
                                Django Rest Framework and Stripe, best practice?
                            
                                Python Script Output Does Not Appear in CMD
                            
                                zeromq: unicode not allowed, use send_unicode
                            
                                Send data from Java to Python and then return the response [closed]
                            
                                How to unit test Django rest framework requests?
                            
                                How to use dronekit with SITL vagrant VM from dev Wiki
                            
                                DNS timeout on 'requests' python library
                            
                                how to partially apply arbitrary argument of a function?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

python script hangs when calling cursor.fetchall() with large data set

Tags:

python

mysql-connector

Vorpal Swordsman

People also ask

1 Answers

mactyr

Recent Activity

Donate For Us