I'm storing UUID v4 values in a PostgreSQL v9.4 table, under column "id". When I create the table, is there any difference in subsequent write or read performance depending on whether I define the "id" column as VARCHAR(36), CHAR(36), or the UUID data type? Thanks!


Performance difference between UUID, CHAR, and VARCHAR in a PostgreSQL table?

3 Answers

Use uuid. PostgreSQL has the native type for a reason.

It stores the uuid internally as a 128-bit binary field. Your other proposed options store it as hexadecimal, which is very inefficient in comparison.
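To make the size difference concrete, here is a minimal sketch. It uses Python's standard `uuid` module purely as an illustration (the question involves no Python, and the variable names are hypothetical). It compares the 16 raw bytes of a UUID with the 36-character text form that CHAR(36) or VARCHAR(36) would hold.

```python
import uuid

# A random version-4 UUID, like the ones described in the question.
u = uuid.uuid4()

binary_form = u.bytes  # the 128-bit value as 16 raw bytes
text_form = str(u)     # 32 hex digits plus 4 hyphens

print(len(binary_form))  # 16
print(len(text_form))    # 36
```

The text form is more than twice as long before any per-value storage overhead is counted, and every index on the column has to carry it as well.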

Not only that, but:

uuid does a simple bytewise sort for ordering. text, char and varchar consider collations and locales, which is nonsensical for a uuid.
There is only one canonical representation of a uuid. The same is not true for text etc.; you have to consider upper- vs lower-case hex, the presence or absence of braces and hyphens ({...-...}), etc.
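The representation point can be sketched as follows, again using Python's standard `uuid` module only as a stand-in for a typed column. The example value is arbitrary and the names are hypothetical. Four different spellings of the same identifier are distinct as plain strings, but collapse to one value once parsed as a UUID.

```python
import uuid

# Several spellings that all denote the same UUID (arbitrary example value).
spellings = [
    "a0eebc99-9c0b-4ef8-bb6d-6bb9bd380a11",
    "A0EEBC99-9C0B-4EF8-BB6D-6BB9BD380A11",
    "{a0eebc99-9c0b-4ef8-bb6d-6bb9bd380a11}",
    "a0eebc999c0b4ef8bb6d6bb9bd380a11",
]

# Compared as plain strings, these are four distinct values...
distinct_as_text = len(set(spellings))

# ...but parsed as UUIDs they are a single value.
distinct_as_uuid = len({uuid.UUID(s) for s in spellings})

print(distinct_as_text, distinct_as_uuid)  # 4 1
```

With a text column, an equality lookup or unique constraint would treat these as four different ids unless the application normalizes them first.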

There's just no question. Use uuid.

The only other type that makes any sense is bytea, which at least can be used to store the 16 bytes of the uuid directly. This is what I'd do if I were using systems that couldn't cope with data types outside the basic set, like a really dumb ORM of some kind.
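If the bytea route is taken, the application has to convert between the UUID and its 16 raw bytes itself. A minimal sketch of that round trip, assuming Python's standard `uuid` module on the client side (the example value and variable names are hypothetical):

```python
import uuid

original = uuid.UUID("a0eebc99-9c0b-4ef8-bb6d-6bb9bd380a11")

# The 16 raw bytes are what a bytea column would hold.
raw = original.bytes

# Reading the value back: rebuild the UUID from those bytes.
restored = uuid.UUID(bytes=raw)

print(len(raw))              # 16
print(restored == original)  # True
```

This keeps the compact 16-byte storage, but the database no longer knows the value is a UUID, so validation and formatting become the application's job.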

122

answered Oct 12 '22 01:10

Craig Ringer

UUID would be the fastest because it's only 128 bits (16 bytes), and comparisons are done on the binary value rather than on text.

CHAR(36) and VARCHAR(36) seem to perform about the same, and both are slower (see http://www.depesz.com/2010/03/02/charx-vs-varcharx-vs-varchar-vs-text/).

Text types are stored as variable-length values, so the server has extra work to determine where each value ends, whereas a uuid is always exactly 16 bytes.

Text comparison is also slower than a fixed-width binary comparison: comparing two 16-byte UUIDs is much faster than comparing two 36-character strings.

Use native UUID for performance.

answered Oct 11 '22 23:10

Abdullah Nehir

The index size is perhaps the most notable difference: almost 86% larger for VARCHAR.

From a performance perspective I didn't notice significant differences in PostgreSQL 9.5.

answered Oct 12 '22 01:10

johnlemon


Tags:

sql

sqldatatypes

postgresql

database-performance

Pensierinmusica
