I am hitting some performance bottlenecks with my C# client inserting bulk data into a SQL Server 2005 database and I'm looking for ways in which to speed up the process. I am already using the SqlClient.SqlBulkCopy (which is based on TDS) to speed up the data transfer across the wire which helped a lot, but I'm still looking for more. I have a simple table that looks like this: <pre class="prettyprint"><code> CREATE TABLE [BulkData]( [ContainerId] [int] NOT NULL, [BinId] [smallint] NOT NULL, [Sequence] [smallint] NOT NULL, [ItemId] [int] NOT NULL, [Left] [smallint] NOT NULL, [Top] [smallint] NOT NULL, [Right] [smallint] NOT NULL, [Bottom] [smallint] NOT NULL, CONSTRAINT [PKBulkData] PRIMARY KEY CLUSTERED ( [ContainerIdId] ASC, [BinId] ASC, [Sequence] ASC )) </code></pre> I'm inserting data in chunks that average about 300 rows where ContainerId and BinId are constant in each chunk and the Sequence value is 0-n and the values are pre-sorted based on the primary key. The %Disk time performance counter spends a lot of time at 100% so it is clear that disk IO is the main issue but the speeds I'm getting are several orders of magnitude below a raw file copy. Does it help any if I: <ol> <li>Drop the Primary key while I am doing the inserting and recreate it later</li> <li>Do inserts into a temporary table with the same schema and periodically transfer them into the main table to keep the size of the table where insertions are happening small</li> <li>Anything else?</li> </ol> -- Based on the responses I have gotten, let me clarify a little bit: Portman: I'm using a clustered index because when the data is all imported I will need to access data sequentially in that order. I don't particularly need the index to be there while importing the data. Is there any advantage to having a nonclustered PK index while doing the inserts as opposed to dropping the constraint entirely for import? Chopeen: The data is being generated remotely on many other machines (my SQL server can only handle about 10 currently, but I would love to be able to add more). It's not practical to run the entire process on the local machine because it would then have to process 50 times as much input data to generate the output. Jason: I am not doing any concurrent queries against the table during the import process, I will try dropping the primary key and see if that helps.

Here's how you can disable/enable indexes in SQL Server: <pre class="prettyprint"><code>--Disable Index ALTER INDEX [IX_Users_UserID] SalesDB.Users DISABLE GO --Enable Index ALTER INDEX [IX_Users_UserID] SalesDB.Users REBUILD</code></pre> Here are some resources to help you find a solution: Some bulk loading speed comparisons Use SqlBulkCopy to Quickly Load Data from your Client to SQL Server Optimizing Bulk Copy Performance Definitely look into NOCHECK and TABLOCK options: Table Hints (Transact-SQL) INSERT (Transact-SQL)

What's the fastest way to bulk insert a lot of data in SQL Server (C# client)

Tags:

c#

sql

sql-server

sql-server-2005

I am hitting some performance bottlenecks with my C# client inserting bulk data into a SQL Server 2005 database and I'm looking for ways in which to speed up the process.

I am already using the SqlClient.SqlBulkCopy (which is based on TDS) to speed up the data transfer across the wire which helped a lot, but I'm still looking for more.

I have a simple table that looks like this:

 CREATE TABLE [BulkData](  [ContainerId] [int] NOT NULL,  [BinId] [smallint] NOT NULL,  [Sequence] [smallint] NOT NULL,  [ItemId] [int] NOT NULL,  [Left] [smallint] NOT NULL,  [Top] [smallint] NOT NULL,  [Right] [smallint] NOT NULL,  [Bottom] [smallint] NOT NULL,  CONSTRAINT [PKBulkData] PRIMARY KEY CLUSTERED   (   [ContainerIdId] ASC,   [BinId] ASC,   [Sequence] ASC ))

I'm inserting data in chunks that average about 300 rows where ContainerId and BinId are constant in each chunk and the Sequence value is 0-n and the values are pre-sorted based on the primary key.

The %Disk time performance counter spends a lot of time at 100% so it is clear that disk IO is the main issue but the speeds I'm getting are several orders of magnitude below a raw file copy.

Does it help any if I:

Drop the Primary key while I am doing the inserting and recreate it later
Do inserts into a temporary table with the same schema and periodically transfer them into the main table to keep the size of the table where insertions are happening small
Anything else?

-- Based on the responses I have gotten, let me clarify a little bit:

Portman: I'm using a clustered index because when the data is all imported I will need to access data sequentially in that order. I don't particularly need the index to be there while importing the data. Is there any advantage to having a nonclustered PK index while doing the inserts as opposed to dropping the constraint entirely for import?

Chopeen: The data is being generated remotely on many other machines (my SQL server can only handle about 10 currently, but I would love to be able to add more). It's not practical to run the entire process on the local machine because it would then have to process 50 times as much input data to generate the output.

Jason: I am not doing any concurrent queries against the table during the import process, I will try dropping the primary key and see if that helps.

979

asked Aug 23 '08 12:08

Andrew

1 Answers

Here's how you can disable/enable indexes in SQL Server:

--Disable Index ALTER INDEX [IX_Users_UserID] SalesDB.Users DISABLE GO --Enable Index ALTER INDEX [IX_Users_UserID] SalesDB.Users REBUILD

Here are some resources to help you find a solution:

Some bulk loading speed comparisons

Use SqlBulkCopy to Quickly Load Data from your Client to SQL Server

Optimizing Bulk Copy Performance

Definitely look into NOCHECK and TABLOCK options:

Table Hints (Transact-SQL)

INSERT (Transact-SQL)

194

answered Oct 02 '22 21:10

JohnB

Related questions
                            
                                How to specify an Order or Sort using the C# driver for MongoDB?
                            
                                C# - Insert a variable number of spaces into a string? (Formatting an output file)
                            
                                A 'Binding' can only be set on a DependencyProperty of a DependencyObject
                            
                                glob pattern matching in .NET
                            
                                Practical usage of virtual functions in c#
                            
                                What is the purpose of the extra braces in Switch case?
                            
                                Memory Leak in C#
                            
                                DataGridView.Clear()
                            
                                Is double Multiplication Broken in .NET? [duplicate]
                            
                                How to store/retrieve RSA public/private key
                            
                                Using SignalR with ElastiCache fails
                            
                                Deploy a .NET Windows Service with Amazon Elastic Beanstalk with no Web Application
                            
                                Named arguments and generic type inference in C# 4.0
                            
                                Why does trying to understand delegates feel like trying to understand the nature of the universe?
                            
                                What is the meaning/reason for the generated entries in web.config>configuration>runtime>assemblyBinding?
                            
                                The best approach to create new window in WPF using MVVM
                            
                                INotifyPropertyChanged and Auto-Properties
                            
                                Graph database for .NET [closed]
                            
                                Pain-free local development while also referencing NuGet packages
                            
                                TCPClient vs Socket in C#

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With