 

Unload data from Postgres to S3

I'm trying to unload table data from a PostgreSQL database into Amazon S3.

I'm aware that Redshift has an UNLOAD command for S3. Since Redshift is based on PostgreSQL, I tried using the same command against my PostgreSQL database, but was unsuccessful.

Can someone help me with unloading table data from PostgreSQL into S3 periodically?

Firstname asked Mar 10 '17


2 Answers

Redshift is based on PostgreSQL, but there isn't a 1:1 feature correspondence. If you want to load data from a PostgreSQL DB into Redshift through S3, you should:

  1. Unload your data from PostgreSQL to a CSV file. To do that, use psql's \copy command. See also this question here.
  2. Copy the CSV file to S3. There are different ways to do that, but check the documentation here.
  3. Use the COPY command to load the data from S3 into Redshift.
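A minimal sketch of those three steps as a shell script. All the table, bucket, host, and role names below are placeholders you would replace with your own; it assumes the AWS CLI is configured and the Redshift cluster has an IAM role that can read the bucket:

```shell
#!/bin/sh
# Sketch only: table, bucket, hosts, and the IAM role ARN are placeholders.

# 1. Export the table from PostgreSQL to CSV.
#    \copy runs client-side, so it needs no superuser rights on the server.
psql "host=my-pg-host dbname=mydb user=mypguser" \
  -c "\copy mytable TO '/tmp/mytable.csv' WITH (FORMAT csv, HEADER)"

# 2. Upload the CSV to S3.
aws s3 cp /tmp/mytable.csv s3://my-bucket/exports/mytable.csv

# 3. Load it into Redshift with COPY (run against the Redshift endpoint).
psql "host=myhost.redshift.amazonaws.com port=5439 dbname=mydb user=myrsuser" \
  -c "COPY redshift_schema.redshift_table
      FROM 's3://my-bucket/exports/mytable.csv'
      IAM_ROLE 'arn:aws:iam::123456789012:role/MyRedshiftRole'
      FORMAT CSV IGNOREHEADER 1;"
```

Since you want this to run periodically, a crontab entry such as `0 2 * * * /path/to/export_to_s3.sh` would run the export nightly.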
cpard answered Oct 23 '22


On Redshift you can create a table to receive the data:

CREATE TABLE redshift_schema.redshift_table (...);

Then create a foreign data wrapper, a server, and a foreign table in PostgreSQL RDS that mirrors the Redshift table:

CREATE EXTENSION postgres_fdw;

-- give the wrapper a Redshift-specific name on top of the postgres_fdw handler
CREATE FOREIGN DATA WRAPPER redshift_fdw
HANDLER postgres_fdw_handler
VALIDATOR postgres_fdw_validator;

CREATE SERVER redshift_server_mydb
FOREIGN DATA WRAPPER redshift_fdw
OPTIONS (dbname 'mydb', port '5439', connect_timeout '200000', host 'myhost.redshift.amazonaws.com');

CREATE USER MAPPING FOR mypguser
SERVER redshift_server_mydb
OPTIONS (user 'myrsuser', password 'mypassword');

IMPORT FOREIGN SCHEMA redshift_schema 
LIMIT TO (redshift_table) 
FROM SERVER redshift_server_mydb
INTO postgresql_schema;

Now, in PostgreSQL, you can (in a function if you like) read and write the Redshift table (SELECT, INSERT, UPDATE, DELETE) through the foreign table, without using dblink:

INSERT INTO postgresql_schema.redshift_table
SELECT *
FROM postgresql_schema.postgresql_table;

Now when you look at the Redshift table, all the data is there, and you can UNLOAD the table to S3 as required.
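The final UNLOAD step, run against the Redshift cluster, would look something like this (the bucket name and IAM role ARN are placeholders):

```sql
-- Sketch only: bucket and IAM role ARN are placeholders.
UNLOAD ('SELECT * FROM redshift_schema.redshift_table')
TO 's3://my-bucket/exports/redshift_table_'
IAM_ROLE 'arn:aws:iam::123456789012:role/MyRedshiftRole'
FORMAT AS CSV
HEADER
ALLOWOVERWRITE;
```

Note that the `TO` value is a prefix, not a single file name: Redshift writes one or more file parts under it, in parallel by default.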

S Wright answered Oct 23 '22