Calculate percentage of group using GROUP BY

Tags:

google-bigquery

I am doing a GROUP BY and COUNT(*) on a dataset, and I would like to calculate the percentage of each group over the total.

For example, in this query, I would like to know how much the count() for each state represents over the total ( select count() from publicdata:samples.natality ):

Click to copy

SELECT state, count(*)
FROM [publicdata:samples.natality]
GROUP by state

There are several ways to do it in SQL, but I haven't found a way to do it in Bigquery, does anyone know?

Thanks!

209

asked Jun 05 '13 07:06

inaki

1 Answers

Check ratio_to_report, one of the recently announced window functions:

Click to copy

SELECT state, ratio * 100 AS percent FROM (
 SELECT state, count(*) AS total, RATIO_TO_REPORT(total) OVER() AS ratio
 FROM [publicdata:samples.natality]
 GROUP by state
)

state   percent
AL      1.4201828131159113   
AK      0.23521048665998198  
AZ      1.3332896746620975   
AR      0.7709591206172346   
CA      10.008298605982642

158

answered Oct 16 '22 05:10

Felipe Hoffa

Related questions
                            
                                BigQuery select * except nested column
                            
                                BigQuery - remove unused column from schema
                            
                                BigQuery - NULL values
                            
                                BigQuery - Export query results to local file/Google storage
                            
                                Google App Engine: Using Big Query on datastore?
                            
                                Bigquery - json_extract all elements from an array
                            
                                Copy table structure alone in Bigquery
                            
                                Google BigQuery - how to drop table with bq command?
                            
                                Count number of GCP log entries during a specified time
                            
                                Export from Google BigQuery into CloudSQL?
                            
                                BIGQUERY SELECT list expression references column CHANNEL_ID which is neither grouped nor aggregated at [10:13]
                            
                                Default values for columns in Big Query Tables
                            
                                How to extract all the keys in a JSON object with BigQuery
                            
                                Best Practice to migrate data from MySQL to BigQuery
                            
                                Avoid correlated subqueries error in BigQuery
                            
                                How can I change the project in BigQuery
                            
                                BigQuery - how to compare a "date" column (using legacy SQL)?
                            
                                Oops! used a reserved word to name a column
                            
                                How to convert Timestamp to Date Data Type in Google Bigquery
                            
                                What is the difference between NUMERIC and FLOAT in BigQuery?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With