For further processing I need the result set of a MySQL query as a dataframe. The SQL table contains about 2 million rows and 12 columns (data size = 180 MiB). I'm running OS X 10.9 with 8 GB memory. Is it normal that pandas.read_sql takes more than 20 seconds to return the dataframe? And how can I use a chunk size option like the one in pandas.read_csv?
Edit: Python 2.7.6, pandas 0.13.1
Reading SQL queries into Pandas dataframes is a common task, and one that can be very slow. Depending on the database being used, this may be hard to get around, but for those of us using Postgres we can speed this up considerably using the COPY command.
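A minimal sketch of that approach, assuming a psycopg2 connection and a placeholder table name (my_table) and connection string: the query result is streamed out with COPY ... TO STDOUT as CSV into an in-memory buffer and then parsed with pandas.read_csv.

```python
import io

import pandas as pd
import psycopg2

# Placeholder connection parameters -- adjust for your own database.
conn = psycopg2.connect("dbname=mydb user=myuser host=localhost")

query = "SELECT * FROM my_table"

buf = io.StringIO()
with conn.cursor() as cur:
    # Stream the query result as CSV straight into the in-memory buffer.
    cur.copy_expert("COPY ({0}) TO STDOUT WITH CSV HEADER".format(query), buf)
conn.close()

# Rewind the buffer and let pandas parse the CSV text.
buf.seek(0)
df = pd.read_csv(buf)
```

This bypasses the row-by-row conversion done by the DB-API cursor, which is usually where most of the time goes.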
Overall, pandas outperformed Postgres, often running five to ten times faster on the larger datasets. The only cases where Postgres performed better were for smaller datasets, typically less than a thousand rows.
The pandas read_sql() function is used to read a SQL query or database table into a DataFrame.
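A minimal usage sketch, assuming a MySQL database reachable through SQLAlchemy; the connection string, driver (pymysql), and table name are placeholders:

```python
import pandas as pd
from sqlalchemy import create_engine

# Placeholder connection string (driver, credentials, host, database).
engine = create_engine("mysql+pymysql://user:password@localhost/mydb")

# Run a query and load the full result set into a DataFrame.
df = pd.read_sql("SELECT * FROM my_table", engine)
print(df.shape)
```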
The pandas documentation shows that read_sql()/read_sql_query() takes about 10 times as long to read a file compared to read_hdf(), and about 3 times as long as read_csv().
read_sql() now has a chunksize argument (see the documentation).
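A sketch of reading in chunks, again assuming a SQLAlchemy engine and placeholder table name and chunk size. With chunksize set, read_sql returns an iterator of DataFrames instead of a single frame:

```python
import pandas as pd
from sqlalchemy import create_engine

# Placeholder connection string -- adjust for your own database.
engine = create_engine("mysql+pymysql://user:password@localhost/mydb")

# With chunksize set, read_sql yields DataFrames of up to 100000 rows each,
# so the whole result set never has to fit in memory at once.
for chunk in pd.read_sql("SELECT * FROM my_table", engine, chunksize=100000):
    # Replace this with your own per-chunk processing.
    print(len(chunk))
```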