Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Difference between INSERT and COPY

As per the documentation,

Loading large number of rows using COPY is always faster than using INSERT, even if PREPARE is used and multiple insertions are batched into a single transaction.

Why COPY is faster than INSERT (multiple insertion are batched into single transaction) ?

like image 980
user3756488 Avatar asked Aug 17 '15 05:08

user3756488


Video Answer


1 Answers

Quite a number of reasons, actually, but the main ones are:

  • Typically, client applications wait for confirmation of one INSERT's success before sending the next. So there's a round-trip delay for each INSERT, scheduling delays, etc. (PgJDBC supports pipelineing INSERTs in batches, but I'm not aware of any other clients that do).

  • Each INSERT has to go through the whole executor. Use of a prepared statement bypasses the need to run the parser, rewriter and planner, but there's still executor state to set up and tear down for each row. COPY does some setup once, and has an extremely low overhead for each row, especially where no triggers are involved.

The first point is the most significant. It's all about network round-trips and rescheduling delays.

like image 180
Craig Ringer Avatar answered Nov 06 '22 06:11

Craig Ringer