Multi-row UPSERT (INSERT or UPDATE) from Python

Tags:

I am currently executing the simply query below with python using pyodbc to insert data in SQL server table:

import pyodbc

table_name = 'my_table'
insert_values = [(1,2,3),(2,2,4),(3,4,5)]

cnxn = pyodbc.connect(...)
cursor = cnxn.cursor()
cursor.execute(
    ' '.join([
        'insert into',
        table_name,
        'values',
        ','.join(
            [str(i) for i in insert_values]
        )
    ])
)
cursor.commit()

This should work as long as there are no duplicate keys (let's assume the first column contains the key). However for data with duplicate keys (data already existing in the table) it will raise an error. How can I, in one go, insert multiple rows in a SQL server table using pyodbc such that data with duplicate keys simply gets updated.

Note: There are solutions proposed for single rows of data, however, I would like to insert multiple rows at once (avoid loops)!

784

asked May 02 '18 13:05

Mosy

1 Answers

This can be done using MERGE. Let's say you have a key column ID, and two columns col_a and col_b (you need to specify column names in update statements), then the statement would look like this:

MERGE INTO MyTable as Target
USING (SELECT * FROM 
       (VALUES (1, 2, 3), (2, 2, 4), (3, 4, 5)) 
       AS s (ID, col_a, col_b)
      ) AS Source
ON Target.ID=Source.ID
WHEN NOT MATCHED THEN
INSERT (ID, col_a, col_b) VALUES (Source.ID, Source.col_a, Source.col_b)
WHEN MATCHED THEN
UPDATE SET col_a=Source.col_a, col_b=Source.col_b;

You can give it a try on rextester.com/IONFW62765.

Basically, I'm creating a Source table "on-the-fly" using the list of values, which you want to upsert. When you then merge the Source table with the Target, you can test the MATCHED condition (Target.ID=Source.ID) on each row (whereas you would be limited to a single row when just using a simple IF <exists> INSERT (...) ELSE UPDATE (...) condition).

In python with pyodbc, it should probably look like this:

import pyodbc

insert_values = [(1, 2, 3), (2, 2, 4), (3, 4, 5)]
table_name = 'my_table'
key_col = 'ID'
col_a = 'col_a'
col_b = 'col_b'

cnxn = pyodbc.connect(...)
cursor = cnxn.cursor()
cursor.execute(('MERGE INTO {table_name} as Target '
                'USING (SELECT * FROM '
                '(VALUES {vals}) '
                'AS s ({k}, {a}, {b}) '
                ') AS Source '
                'ON Target.ID=Source.ID '
                'WHEN NOT MATCHED THEN '
                'INSERT ({k}, {a}, {b}) VALUES (Source.{k}, Source.{a}, Source.{b}) '
                'WHEN MATCHED THEN '
                'UPDATE SET {k}=Source.{a}, col_b=Source.{b};'
                .format(table_name=table_name,
                        vals=','.join([str(i) for i in insert_values]),
                        k=key_col,
                        a=col_a,
                        b=col_b)))
cursor.commit()

You can read up more on MERGE in the SQL Server docs.

157

answered Oct 13 '22 23:10

ksbg

Related questions
                            
                                Python: how to remove a value from a dict if it exactly matches the key?
                            
                                Pandas groupby results on the same plot
                            
                                python gRPC requiring a context argument
                            
                                Remove 'seconds' and 'minutes' from a Pandas dataframe column
                            
                                Sorting categorical labels in seaborn chart
                            
                                Pandas DataFrame Bar Plot - Plot Bars Different Colors From Specific Colormap
                            
                                Validating upload file type in Django
                            
                                Python convert string to float error with negative numbers
                            
                                Comparing NumPy arange and custom range function for producing ranges with decimal increments
                            
                                Keras- Embedding layer
                            
                                How to chain multiple re.sub() commands in Python [duplicate]
                            
                                ImportError: cannot import name 'BlobService' when using Azure Backend
                            
                                Multiprocessing managers and custom classes
                            
                                matplotlib generic colormap from tab10
                            
                                Is there an efficient way of creating a list of tuples using a range?
                            
                                tabula-py ImportError: cannot import name 'read_pdf'
                            
                                Change column type from string to date in Pyspark
                            
                                How to use `cv2.imshow` correctly for the float image returned by `cv2.distanceTransform`?
                            
                                Django 2.0 ModelForm dateField not displaying as a widget
                            
                                CNN with keras, accuracy not improving

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Multi-row UPSERT (INSERT or UPDATE) from Python

Tags:

python

sql-server

pyodbc

Mosy

People also ask

1 Answers

ksbg

Recent Activity

Donate For Us