Count distinct multiple columns in redshift

Question

I am trying to count rows which have a distinct combination of 2 columns in Amazon redshift. The query I am using is -

select count(distinct col1, col2)
from schemaname.tablename
where some filters

It is throwing me this error -

Amazon Invalid operation: function count(character varying, bigint) does not exist`

I tried casting bigint to char but it didn't work.

Zaynul Abadin Tuhin · Accepted Answer

you can use sub-query and count

select count(*) from (
  select distinct col1, col2 
 from schemaname.tablename
  where some filter
) as t

Mariusz Sakowski · Answer

A little late to the party but anyway: you can also try to concatenate columns using || operator. It might be inefficient so I wouldn't use it in prod code, but for ad-hoc analysis should be fine.

select count(distinct col1 || '_' || col2)
from schemaname.tablename
where some filters

Note separator choice might matter, i.e. both 'foo' || '_' || 'bar_baz' and 'foo_bar' || '_' || 'baz' yield 'foo_bar_baz' and are thus equal. In some cases this might be concern, in some it's so insignificant you can skip separator completely.

Deepak · Answer

You can use

select col1,col2,count(*) from schemaname.tablename
where -- your filter
group by col1,col2

Count distinct multiple columns in redshift

Tags:

sql

amazon-redshift

Janusz01

3 Answers

Zaynul Abadin Tuhin

Mariusz Sakowski

Deepak

Recent Activity

Donate For Us

Count distinct multiple columns in redshift

Tags:

sql

amazon-redshift

Janusz01

3 Answers

Zaynul Abadin Tuhin

Mariusz Sakowski

Deepak

Related questions

Recent Activity

Donate For Us