How to write Pandas dataframe to sqlite with Index

Tags:

I have a list of stockmarket data pulled from Yahoo in a pandas DataFrame (see format below). The date is serving as the index in the DataFrame. I want to write the data (including the index) out to a SQLite database.

             AAPL     GE Date 2009-01-02  89.95  14.76 2009-01-05  93.75  14.38 2009-01-06  92.20  14.58 2009-01-07  90.21  13.93 2009-01-08  91.88  13.95

Based on my reading of the write_frame code for Pandas, it does not currently support writing the index. I've attempted to use to_records instead, but ran into the issue with Numpy 1.6.2 and datetimes. Now I'm trying to write tuples using .itertuples, but SQLite throws an error that the data type isn't supported (see code and result below). I'm relatively new to Python, Pandas and Numpy, so it is entirely possible I'm missing something obvious. I think I'm running into a problem trying to write a datetime to SQLite, but I think I might be overcomplicating this.

I think I may be able to fix the issue by upgrading to Numpy 1.7 or the development version of Pandas, which has a fix posted on GitHub. I'd prefer to develop using release versions of software - I'm new to this and I don't want stability issues confusing matters further.

Is there a way to accomplish this using Python 2.7.2, Pandas 0.10.0, and Numpy 1.6.2? Perhaps cleaning the datetimes somehow? I'm in a bit over my head, any help would be appreciated.

Code:

import numpy as np import pandas as pd from pandas import DataFrame, Series import sqlite3 as db  # download data from yahoo all_data = {}  for ticker in ['AAPL', 'GE']:     all_data[ticker] = pd.io.data.get_data_yahoo(ticker, '1/1/2009','12/31/2012')  # create a data frame price = DataFrame({tic: data['Adj Close'] for tic, data in all_data.iteritems()})  # get output ready for database export output = price.itertuples() data = tuple(output)  # connect to a test DB with one three-column table titled "Demo" con = db.connect('c:/Python27/test.db') wildcards = ','.join(['?'] * 3) insert_sql = 'INSERT INTO Demo VALUES (%s)' % wildcards con.executemany(insert_sql, data)

Result:

--------------------------------------------------------------------------- InterfaceError                            Traceback (most recent call last) <ipython-input-15-680cc9889c56> in <module>() ----> 1 con.executemany(insert_sql, data)  InterfaceError: Error binding parameter 0 - probably unsupported type.

883

asked Jan 21 '13 02:01

jmatthewhouse

1 Answers

In recent pandas the index will be saved in the database (you used to have to reset_index first).

Following the docs (setting a SQLite connection in memory):

import sqlite3 # Create your connection. cnx = sqlite3.connect(':memory:')

Note: You can also pass a SQLAlchemy engine here (see end of answer).

We can save price2 to cnx:

price2.to_sql(name='price2', con=cnx)

We can retrieve via read_sql:

p2 = pd.read_sql('select * from price2', cnx)

However, when stored (and retrieved) dates are unicode rather than Timestamp. To convert back to what we started with we can use pd.to_datetime:

p2.Date = pd.to_datetime(p2.Date) p = p2.set_index('Date')

We get back the same DataFrame as prices:

In [11]: p2 Out[11]:  <class 'pandas.core.frame.DataFrame'> DatetimeIndex: 1006 entries, 2009-01-02 00:00:00 to 2012-12-31 00:00:00 Data columns: AAPL    1006  non-null values GE      1006  non-null values dtypes: float64(2)

You can also use a SQLAlchemy engine:

from sqlalchemy import create_engine e = create_engine('sqlite://')  # pass your db url  price2.to_sql(name='price2', con=cnx)

This allows you to use read_sql_table (which can only be used with SQLAlchemy):

pd.read_sql_table(table_name='price2', con=e) #         Date   AAPL     GE # 0 2009-01-02  89.95  14.76 # 1 2009-01-05  93.75  14.38 # 2 2009-01-06  92.20  14.58 # 3 2009-01-07  90.21  13.93 # 4 2009-01-08  91.88  13.95

answered Sep 24 '22 04:09

Andy Hayden

Related questions
                            
                                Dynamically add/create subplots in matplotlib
                            
                                Setting a value in a nested Python dictionary given a list of indices and value
                            
                                How can I get argparse in Python 2.6?
                            
                                raise LinAlgError("SVD did not converge") LinAlgError: SVD did not converge in matplotlib pca determination
                            
                                How to unpack a Series of tuples in Pandas?
                            
                                assertRaises in python unit-test not catching the exception [duplicate]
                            
                                NameError: name 'true' is not defined [closed]
                            
                                How take a random row from a PySpark DataFrame?
                            
                                Python scatter plot. Size and style of the marker
                            
                                Django tests - patch object in all tests
                            
                                PostgreSQL - how to run VACUUM from code outside transaction block?
                            
                                Why does Python raise TypeError rather than SyntaxError?
                            
                                Slicing a dictionary by keys that start with a certain string
                            
                                SQLAlchemy - How to make "django choices" using SQLAlchemy?
                            
                                Print file age in seconds using Python
                            
                                Opening sqlite3 database from python in read-only mode
                            
                                Python Regex Engine - "look-behind requires fixed-width pattern" Error
                            
                                HSV to RGB Color Conversion
                            
                                Python lightweight database wrapper for SQLite
                            
                                How to add an image in Tkinter?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to write Pandas dataframe to sqlite with Index

Tags:

python

sqlite

pandas

jmatthewhouse

People also ask

1 Answers

Andy Hayden

Recent Activity

Donate For Us