I am using this example to upload a CSV file into an SQLite database.
This is my code:
from numpy import genfromtxt
from time import time
from datetime import datetime
from sqlalchemy import Column, Integer, Float, Date, String, VARCHAR
from sqlalchemy.ext.declarative import declarative_base
from sqlalchemy import create_engine
from sqlalchemy.orm import sessionmaker
def Load_Data(file_name):
    data = genfromtxt(file_name, delimiter=',')  # skiprows=1, converters={0: lambda s: str(s)})
    return data.tolist()
Base = declarative_base()
class cdb1(Base):
    # Tell SQLAlchemy what the table name is and if there are any table-specific arguments it should know about
    __tablename__ = 'cdb1'
    __table_args__ = {'sqlite_autoincrement': True}
    # Tell SQLAlchemy the name of each column and its attributes:
    id = Column(Integer, primary_key=True, nullable=False)
    name = Column(VARCHAR(40))
    shack = Column(VARCHAR)
    db = Column(Integer)
    payments = Column(Integer)
    status = Column(VARCHAR)
if __name__ == "__main__":
    t = time()
    print 'creating database'
    # Create the database
    engine = create_engine('sqlite:///cdb.db')
    Base.metadata.create_all(engine)
    # Create the session
    session = sessionmaker()
    session.configure(bind=engine)
    s = session()
    try:
        file_name = 'client_db.csv'
        data = Load_Data(file_name)
        for i in data:
            record = cdb1(**{
                'name': i[0],
                'shack': i[1],
                'db': i[2],
                'payments': i[3],
                'status': i[4]
            })
            s.add(record)  # Add all the records
        s.commit()  # Attempt to commit all the records
    except:
        s.rollback()  # Rollback the changes on error
        print 'error in reading'
    finally:
        s.close()  # Close the connection
    print "Time elapsed: " + str(time() - t) + " s."  # 0.091s
And these are the first few rows of the CSV file:
Name,Shack,DB,Payments,Status
Loyiso Dwala,I156,13542,37,LightsOnly ON
Attwell Fayo,I157,13077,32,LightsON
David Mbhele,G25,13155,33,LightsON
The DB is created OK, but only some of the data is captured in the attributes: the 'payments' and 'db' columns are populated correctly, but everything else comes out as NULL.
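(A likely cause, noted here for context: numpy's genfromtxt defaults to a float dtype, so non-numeric fields are read back as nan. A minimal sketch of reading the file with inferred dtypes instead, assuming the same client_db.csv layout shown above:)

from numpy import genfromtxt

# Sketch only: dtype=None lets genfromtxt infer a type per column, and names=True
# consumes the header row, so the text columns survive as strings instead of nan.
data = genfromtxt('client_db.csv', delimiter=',', dtype=None, names=True, encoding='utf-8')
print(data['Name'][:3])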
UPDATED CORRECT CODE (using a pandas DataFrame):
from numpy import genfromtxt
from time import time
from datetime import datetime
from sqlalchemy import Column, Integer, Float, Date, String, VARCHAR
from sqlalchemy.ext.declarative import declarative_base
from sqlalchemy import create_engine
from sqlalchemy.orm import sessionmaker
import csv
import pandas as pd
#def Load_Data(file_name):
#    data = csv.reader(file_name, delimiter=',')  # skiprows=1, converters={0: lambda s: str(s)})
#    return data.tolist()
Base = declarative_base()
class cdb1(Base):
    # Tell SQLAlchemy what the table name is and if there are any table-specific arguments it should know about
    __tablename__ = 'cdb1'
    __table_args__ = {'sqlite_autoincrement': True}
    # Tell SQLAlchemy the name of each column and its attributes:
    id = Column(Integer, primary_key=True, nullable=False)
    Name = Column(VARCHAR(40))
    Shack = Column(VARCHAR)
    DB = Column(Integer)
    Payments = Column(Integer)
    Status = Column(VARCHAR)
engine = create_engine('sqlite:///cdb.db')
Base.metadata.create_all(engine)
file_name = 'client_db.csv'
df = pd.read_csv(file_name)
df.to_sql(con=engine, index_label='id', name=cdb1.__tablename__, if_exists='replace')
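To confirm the load worked (a quick sketch, assuming the cdb.db file and cdb1 table names used above), the table can be read straight back with pandas:

import pandas as pd
from sqlalchemy import create_engine

engine = create_engine('sqlite:///cdb.db')
# Read a few rows back from the freshly written table.
check = pd.read_sql_query('SELECT * FROM cdb1 LIMIT 3', engine)
print(check)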
One of the key aspects of any data science workflow is the sourcing, cleaning, and storing of raw data in a form that can be used upstream. This process is commonly referred to as “Extract-Transform-Load,” or ETL for short.
You can read a CSV file into a DataFrame using the read_csv() function (this function should be familiar to you, but you can run help(pd.read_csv) in the console to refresh your memory!). Then, you can call the .to_sql() method on the DataFrame to load it into a SQL table in a database.
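As a rough end-to-end sketch of that extract/transform/load flow (assuming the same client_db.csv and cdb.db names used elsewhere in this post; the whitespace-stripping step is only an illustrative transform, not something from the original code):

import pandas as pd
from sqlalchemy import create_engine

# Extract: read the raw CSV into a DataFrame.
df = pd.read_csv('client_db.csv')

# Transform: an illustrative cleanup step -- strip stray whitespace from the text columns.
for col in ('Name', 'Shack', 'Status'):
    df[col] = df[col].str.strip()

# Load: write the cleaned frame into the SQLite database.
engine = create_engine('sqlite:///cdb.db')
df.to_sql('cdb1', con=engine, index_label='id', if_exists='replace')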
Are you familiar with pandas DataFrames? They're really simple to use (and debug):
pandas.read_csv(file_name)
In [5]: pandas.read_csv('/tmp/csvt.csv')
Out[5]:
           Name Shack     DB  Payments         Status
0  Loyiso Dwala   I156  13542        37  LightsOnly ON
1  Attwell Fayo   I157  13077        32       LightsON
2  David Mbhele    G25  13155        33       LightsON
For inserting the DataFrame's data into a table, you can simply use pandas.DataFrame.to_sql.
So your main code will end up looking something like this:
engine = create_engine('sqlite:///cdb.db')
Base.metadata.create_all(engine)
file_name = 'client_db.csv'
df = pandas.read_csv(file_name)
df.to_sql(con=engine, index_label='id', name=cdb1.__tablename__, if_exists='replace')
You should read further in the documentation link I added, and set the function parameters to suit your purpose (especially look at if_exists, index, index_label, and dtype).
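As an illustration of those parameters (a sketch only, assuming the same engine and cdb1 table defined above; the dtype mapping is just one plausible choice, not something from the original post):

from sqlalchemy import Integer, VARCHAR

df.to_sql(
    name=cdb1.__tablename__,   # target table name
    con=engine,
    if_exists='append',        # keep the table created by Base.metadata.create_all
    index=True,                # write the DataFrame index...
    index_label='id',          # ...as the 'id' primary-key column
    dtype={                    # explicit SQL types per column
        'Name': VARCHAR(40),
        'Shack': VARCHAR,
        'DB': Integer,
        'Payments': Integer,
        'Status': VARCHAR,
    },
)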