Is there a sequence number generation function in redshift ? Or a function that takes combination of values and gives out a numerical hash key ?

There is no concept of sequences (as seen in Oracle) at the moment. You have a few options: <ul> <li>Number tables</li> <li>RANK() or ROW_NUMBER() window functions over the whole set. Note that this can have some negative performance implications if you have a multi-node cluster.</li> <li>Columns defined as IDENTITY(seed, step). Note that IDENTITY sequence may be 'sparse' (e.g. have gaps in the sequence).</li> </ul>

sequence number generation function in AWS redshift

3 Answers

Here is another way to generate 1 million numbers

with seq_0_9 as (
select 0 as num
union all select 1 as num
union all select 2 as num
union all select 3 as num
union all select 4 as num
union all select 5 as num
union all select 6 as num
union all select 7 as num
union all select 8 as num
union all select 9 as num
), seq_0_999 as (
select a.num + b.num * 10 + c.num * 100 as num
from seq_0_9 a, seq_0_9 b, seq_0_9 c
)
select a.num + b.num * 1000 as num
from seq_0_999 a, seq_0_999 b
order by num

118

answered Nov 15 '22 10:11

Charles Lee

There is no concept of sequences (as seen in Oracle) at the moment.

You have a few options:

Number tables
RANK() or ROW_NUMBER() window functions over the whole set. Note that this can have some negative performance implications if you have a multi-node cluster.
Columns defined as IDENTITY(seed, step). Note that IDENTITY sequence may be 'sparse' (e.g. have gaps in the sequence).

answered Nov 15 '22 09:11

Joe Harris

I am new to Redshift, and I found this article looking for a common sequence, that is not supported on Amazon database. I found this solution I will report with a complete example using ROW_NUMBER.

I have schemas sta and dim. In sta I have staging tables, while in dim I have dimension tables I want to populate with ids. I have a source of information that has fields trk_key, name containing for instance some publishers.

CREATE TABLE sta.publisher (
        trk_key VARCHAR(20),
        name VARCHAR(225)
);
CREATE TABLE dim.publisher (
        id SMALLINT,
        trk_key VARCHAR(20),
        name VARCHAR(255),
        PRIMARY KEY (id)
);

First I truncate sta.publisher table and load there a csv file. Then I launch the following query

-- This query is idempotent:
-- it will insert a publisher found in sta.publisher table only if
-- it is not already in dim.publisher table.
INSERT INTO dim.publisher
SELECT
        -- Generate id using max id found in dim.publisher.
        -- Start with id=1 if dim.publisher is empty.
        (
                SELECT NVL(MAX(id), 0)
                FROM dim.publisher
        ) + ROW_NUMBER() OVER() AS id,
        trk_key,
        name
FROM sta.publisher
        -- Only insert record if trk_key is not found in dim.publisher table.
        WHERE trk_key NOT IN (
                SELECT trk_key
                FROM dim.publisher
        )

answered Nov 15 '22 09:11

Gianluca Casati

Related questions
                            
                                How to delete aws iot things and policies?
                            
                                Can't use custom Request Headers on AWS API Gateway with CORS
                            
                                Delete image from Amazon S3 Storage
                            
                                aws "Cannot create enum from " + regionName + " value!"
                            
                                How to use LISTAGG in AWS Athena?
                            
                                AWS EBS block size
                            
                                Python in AWS Lambda: "module 'requests' has no attribute 'get'"
                            
                                How to use put and get data from elasticache redis of AWS with golang
                            
                                Changing the auto-hibernate settings on a AWS Cloud9 EC2 Instance
                            
                                InvalidClientTokenID error when running Terraform Plan/Apply
                            
                                How to modify the multiple object's ACL in S3 bucket?
                            
                                Deploy NodeJS with Elastic Beanstalk permission problem
                            
                                ReactJS: Render a private image asset from S3
                            
                                How to change the default output format in AWS CLI?
                            
                                AWS Cognito AdminLinkProviderForUser - User Pool Account and Facebook
                            
                                Amazon Linux 2 OpenVPN client package unavailable?
                            
                                Elastic BeanStalk app deploy post hook not executing my command
                            
                                Error when logging into ECR with Docker login: "Error saving credentials... not implemented"
                            
                                AWS Autoscaling and Elastic load balancing
                            
                                In Amazon S3, what permissions do I need to get HEAD on an object?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

sequence number generation function in AWS redshift

Tags:

amazon-web-services

amazon-redshift

user3279189

People also ask

3 Answers

Charles Lee

Joe Harris

Gianluca Casati

Recent Activity

Donate For Us