 

basic pyodbc bulk insert


In a Python script, I need to run a query on one data source and insert each row from that query into a table on a different data source. I'd normally do this with a single insert/select statement using a T-SQL linked server join, but I don't have a linked server connection to this particular data source.

I'm having trouble finding a simple pyodbc example of this. Here's how I'd do it, but I'm guessing executing an insert statement inside a loop is pretty slow.

result = ds1Cursor.execute(selectSql)

for row in result:
    insertSql = "insert into TableName (Col1, Col2, Col3) values (?, ?, ?)"
    ds2Cursor.execute(insertSql, row[0], row[1], row[2])
    ds2Cursor.commit()

Is there a better bulk way to insert records with pyodbc? Or is this a relatively efficient way to do it anyway? I'm using SQL Server 2012 and the latest pyodbc and Python versions.
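For comparison, here is a minimal sketch that batches the inserts with `cursor.executemany` and commits once at the end instead of once per row. The table and column names are the placeholders from the question, and the `chunked` helper is my own addition, not part of pyodbc:

```python
import itertools

def chunked(iterable, size):
    # Yield successive lists of at most `size` items from `iterable`.
    it = iter(iterable)
    while True:
        batch = list(itertools.islice(it, size))
        if not batch:
            return
        yield batch

def copy_rows(ds1_cursor, ds2_conn, select_sql, batch_size=1000):
    # `ds1_cursor` / `ds2_conn` are assumed to be open pyodbc objects.
    insert_sql = "insert into TableName (Col1, Col2, Col3) values (?, ?, ?)"
    ds2_cursor = ds2_conn.cursor()
    for batch in chunked(ds1_cursor.execute(select_sql), batch_size):
        ds2_cursor.executemany(insert_sql, [tuple(row) for row in batch])
    ds2_conn.commit()  # one commit at the end instead of one per row
```

This still sends one round trip per batch rather than per row, which is usually the bulk of the speedup over the loop above.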

Zip184 asked May 03 '16 15:05

People also ask

How do I make inserts into SQL Server 100x faster with Pyodbc?

Usually, to speed up the inserts with pyodbc, I tend to use the feature `cursor.fast_executemany = True`, which significantly speeds up the inserts.
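A hedged sketch of that approach: the connection is assumed to be an open pyodbc connection to the target server, the table and column names are placeholders, and `fast_executemany` requires a reasonably recent pyodbc with the Microsoft ODBC driver:

```python
def fast_insert(conn, rows):
    # `conn` is assumed to be an open pyodbc connection to the target server.
    cursor = conn.cursor()
    cursor.fast_executemany = True  # pack parameters into bulk ODBC calls
    sql = "insert into TableName (Col1, Col2, Col3) values (?, ?, ?)"
    cursor.executemany(sql, [tuple(r) for r in rows])
    conn.commit()
```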

Can Pyodbc execute multiple queries?

The API in the pyodbc connector (or pymysql) doesn't allow multiple statements in a SQL call.

What is bulk insert?

A Bulk insert is a process or method provided by a database management system to load multiple rows of data into a database table. Bulk insert may refer to: Transact-SQL BULK INSERT statement. PL/SQL BULK COLLECT and FORALL statements. MySQL LOAD DATA INFILE statement.


1 Answer

Here's a function that can do a bulk insert into a SQL Server database.

import contextlib
import pyodbc

def bulk_insert(table_name, file_path):
    # Note: WITH (FORMAT = 'CSV') requires SQL Server 2017 or later,
    # and the file path must be visible to the SQL Server instance.
    string = "BULK INSERT {} FROM '{}' WITH (FORMAT = 'CSV');"
    with contextlib.closing(pyodbc.connect("MYCONN")) as conn:
        with contextlib.closing(conn.cursor()) as cursor:
            cursor.execute(string.format(table_name, file_path))
        conn.commit()

This definitely works, as long as the file path is accessible from the SQL Server machine itself rather than just the client.

UPDATE: I've noticed in the comments, as well as in my own regular coding, that pyodbc is better supported than pypyodbc.

NEW UPDATE: removed conn.close(), since the with statement handles that automatically.

Naufal answered Sep 20 '22 17:09