I have created a table in my database with name 'con' which has two columns with the name 'date' and 'kgs'. I am trying to extract data from this 'hi.rpt' file copied on this location 'H:Sir\data\reporting\hi.rpt' and want to store values in the table 'con' in my database. I have tried this code in pgadmin When I run: <pre class="prettyprint"><code>COPY con (date,kgs) FROM 'H:Sir\data\reporting\hi.rpt' WITH DELIMITER ',' CSV HEADER date AS 'Datum/Uhrzeit' kgs AS 'Summe' </code></pre> I get the error: <pre class="prettyprint"> ERROR: syntax error at or near "date" LINE 5: date AS 'Datum/Uhrzeit' ^ ********** Error ********** ERROR: syntax error at or near "date" SQL state: 42601 Character: 113 </pre> "hi.rpt" file from which i am reading the data look like this: <pre class="prettyprint"> Datum/Uhrzeit,Sta.,Bez.,Unit,TBId,Batch,OrderNr,Mat1,Total1,Mat2,Total2,Mat3,Total3,Mat4,Total4,Mat5,Total5,Mat6,Total6,Summe 41521.512369(04.09.13 12:17:48),TB01,TB01,005,300,9553,,2,27010.47,0,0.00,0,0.00,3,1749.19,0,0.00,0,0.00,28759.66 41521.547592(04.09.13 13:08:31),TB01,TB01,005,300,9570,,2,27057.32,0,0.00,0,0.00,3,1753.34,0,0.00,0,0.00,28810.66 </pre> Is it possible to extract only two data values from 20 different type of data that i have in this 'hi.rpt' file or not? or is there only a mistake in the syntax that i have written? What is the correct way to write it?

I don't know where you got that syntax, but <code>COPY</code> doesn't take a list of column aliases like that. See the help: <pre class="prettyprint"><code>COPY table_name [ ( column_name [, ...] ) ] FROM { 'filename' | PROGRAM 'command' | STDIN } [ [ WITH ] ( option [, ...] ) ] </code></pre> (<code>AS</code> isn't one of the listed options; to see the full output run <code>\d copy</code> in psql, or look at the manual for the <code>copy</code> command online). There is no mapping facility in <code>COPY</code> that lets you read only some columns of the input CSV. It'd be really useful, but nobody's had the time/interest/funding to implement it yet. It's really only one of many data transform/filtering tasks people want anyway. PostgreSQL expects the column-list given in <code>COPY</code> to be in the same order, left-to-right, as what's in the CSV file, and have the same number of entries as the CSV file has columns. So if you write: <pre class="prettyprint"><code>COPY con (date,kgs) </code></pre> then PostgreSQL will expect an input CSV with exactly two columns. It'll use the first csv column for the <code>"date"</code> table column and the second csv column for the <code>"kgs"</code> table column. It doesn't care what the CSV headers are, they're ignored if you specify <code>WITH (FORMAT CSV, HEADER ON)</code>, or treated as normal data rows if you don't specify <code>HEADER</code>. PostgreSQL 9.4 adds <code>FROM PROGRAM</code> to <code>COPY</code>, so you could run a shell command to read the file and filter it. A simple Python or Perl script would do the job. If it's a small file, just open a copy in the spreadsheet of your choice as a csv file, delete the unwanted columns, and save it, so only the <code>date</code> and <code>kgs</code> columns remain. Alternately, <code>COPY</code> to a staging table that has all the same columns as the <code>CSV</code>, then do an <code>INSERT INTO ... SELECT</code> to transfer just the wanted data into the real target table.

COPY only some columns from an input CSV?

Tags:

postgresql

I have created a table in my database with name 'con' which has two columns with the name 'date' and 'kgs'. I am trying to extract data from this 'hi.rpt' file copied on this location 'H:Sir\data\reporting\hi.rpt' and want to store values in the table 'con' in my database.

I have tried this code in pgadmin

When I run:

COPY con (date,kgs) 
FROM 'H:Sir\data\reporting\hi.rpt'
WITH DELIMITER ','
CSV HEADER 
    date AS 'Datum/Uhrzeit'
    kgs  AS 'Summe'

I get the error:

ERROR:  syntax error at or near "date"
LINE 5:    date AS 'Datum/Uhrzeit' 
           ^
********** Error **********
ERROR: syntax error at or near "date"
SQL state: 42601
Character: 113

"hi.rpt" file from which i am reading the data look like this:

Datum/Uhrzeit,Sta.,Bez.,Unit,TBId,Batch,OrderNr,Mat1,Total1,Mat2,Total2,Mat3,Total3,Mat4,Total4,Mat5,Total5,Mat6,Total6,Summe
41521.512369(04.09.13 12:17:48),TB01,TB01,005,300,9553,,2,27010.47,0,0.00,0,0.00,3,1749.19,0,0.00,0,0.00,28759.66
41521.547592(04.09.13 13:08:31),TB01,TB01,005,300,9570,,2,27057.32,0,0.00,0,0.00,3,1753.34,0,0.00,0,0.00,28810.66

Is it possible to extract only two data values from 20 different type of data that i have in this 'hi.rpt' file or not?

or is there only a mistake in the syntax that i have written? What is the correct way to write it?

294

asked Jun 30 '14 05:06

user3732694

1 Answers

I don't know where you got that syntax, but COPY doesn't take a list of column aliases like that. See the help:

COPY table_name [ ( column_name [, ...] ) ]
    FROM { 'filename' | PROGRAM 'command' | STDIN }
    [ [ WITH ] ( option [, ...] ) ]

(AS isn't one of the listed options; to see the full output run \d copy in psql, or look at the manual for the copy command online).

There is no mapping facility in COPY that lets you read only some columns of the input CSV. It'd be really useful, but nobody's had the time/interest/funding to implement it yet. It's really only one of many data transform/filtering tasks people want anyway.

PostgreSQL expects the column-list given in COPY to be in the same order, left-to-right, as what's in the CSV file, and have the same number of entries as the CSV file has columns. So if you write:

COPY con (date,kgs)

then PostgreSQL will expect an input CSV with exactly two columns. It'll use the first csv column for the "date" table column and the second csv column for the "kgs" table column. It doesn't care what the CSV headers are, they're ignored if you specify WITH (FORMAT CSV, HEADER ON), or treated as normal data rows if you don't specify HEADER.

PostgreSQL 9.4 adds FROM PROGRAM to COPY, so you could run a shell command to read the file and filter it. A simple Python or Perl script would do the job.

If it's a small file, just open a copy in the spreadsheet of your choice as a csv file, delete the unwanted columns, and save it, so only the date and kgs columns remain.

Alternately, COPY to a staging table that has all the same columns as the CSV, then do an INSERT INTO ... SELECT to transfer just the wanted data into the real target table.

102

answered Sep 22 '22 02:09

Craig Ringer

Related questions
                            
                                Convert recursive function to view
                            
                                Postgres 9.1 - Numbering groups of rows
                            
                                ALTER query very slow on tiny table in PostgreSQL
                            
                                Query furthest children in Adjacency List
                            
                                Incrementing a sequence in PostgreSQL based on a foreign key
                            
                                Reloading postgreSQL without breaking current connection?
                            
                                Autocomplete getting data from a huge table
                            
                                How to pass BigInteger from java to Postgres?
                            
                                Very slow lexicographic ordering in PostgreSQL?
                            
                                PostgreSQL development workflow
                            
                                psycopg2.ProgrammingError: syntax error at or near "\"
                            
                                postgres - estimate index size for timestamp column
                            
                                Indexed ORDER BY with LIMIT 1
                            
                                Get ID before saving to database
                            
                                Reusing pure Python functions between PL/Python functions
                            
                                Rails joins with limit on association
                            
                                Postgres Query JSON Array that contains something
                            
                                Shouldn't this PostgreSQL function return zero rows?
                            
                                Understanding "LOG: execute S_1: BEGIN " in PostgreSQL
                            
                                Are constraints executed before or after customized trigger?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With