Export a very large SQL file to CSV with Python or R

I have a large SQL file (20 GB) that I would like to convert to CSV. I plan to load the file into Stata for analysis. I have enough RAM to hold the entire file (my computer has 32 GB of RAM).

The problem is that the Python solutions I have found online so far (using sqlite3) seem to require more RAM than my system has in order to read the SQL and write the CSV.

Here is the code:

import sqlite3
import pandas as pd

con = sqlite3.connect('mydata.sql')
query = 'select * from mydata'
data = pd.read_sql(query, con)
data.to_csv('export.csv')
con.close()

The SQL file contains about 15 variables that can be timestamps, strings or numerical values. Nothing really fancy.

I think one possible solution would be to read the SQL and write the CSV file one line at a time. However, I have no idea how to do that (in either R or Python).

Any help is really appreciated!

370

asked Nov 01 '15 20:11

ℕʘʘḆḽḘ

1 Answer

You can read the SQL database in batches and write each batch to the file instead of reading the whole database at once. Credit to "How to add pandas data to an existing csv file?" for how to append to an existing CSV file.

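import sqlite3
import pandas as pd

# Create a connection and get a cursor
connection = sqlite3.connect('mydata.sql')
cursor = connection.cursor()
# Execute the query; rows are only fetched when requested below
cursor.execute('select * from mydata')
# The first item of each cursor.description entry is the column name
columns = [description[0] for description in cursor.description]

# Open the file; newline='' avoids blank lines between rows on Windows
with open('output.csv', 'w', newline='') as f:
    # Write the header row once
    pd.DataFrame(columns=columns).to_csv(f, index=False)
    # Get data in batches
    while True:
        # Read up to 1000 rows
        rows = cursor.fetchmany(1000)
        # We are done if there are no data
        if not rows:
            break
        # Write the batch; index=False drops the per-batch 0..999 row index
        pd.DataFrame(rows, columns=columns).to_csv(f, header=False, index=False)

# Clean up
cursor.close()
connection.close()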
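
The same batch idea can also be sketched without pandas, using only the standard library's csv module. This is a minimal sketch, not a definitive implementation: the helper name export_query_to_csv and its parameters are hypothetical, and it assumes the file really is a SQLite database that sqlite3 can open.

```python
import csv
import sqlite3


def export_query_to_csv(db_path, query, csv_path, batch_size=1000):
    """Stream the rows returned by `query` into a CSV file in batches.

    Hypothetical helper for illustration; returns the number of data rows written.
    """
    connection = sqlite3.connect(db_path)
    try:
        cursor = connection.execute(query)
        # The first item of each cursor.description entry is the column name
        header = [column[0] for column in cursor.description]
        rows_written = 0
        # newline='' lets the csv module control line endings itself
        with open(csv_path, "w", newline="", encoding="utf-8") as f:
            writer = csv.writer(f)
            writer.writerow(header)
            while True:
                # At most batch_size rows are fetched per iteration
                rows = cursor.fetchmany(batch_size)
                if not rows:
                    break
                writer.writerows(rows)
                rows_written += len(rows)
        return rows_written
    finally:
        connection.close()
```

With the names from the question, the call would be export_query_to_csv('mydata.sql', 'select * from mydata', 'export.csv'). If you prefer to stay in pandas, pd.read_sql also accepts a chunksize argument and then yields DataFrames one batch at a time.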

116

answered Sep 29 '22 12:09

Till Hoffmann

Tags: python, sql, r, export, csv
