The UUID specification defines 4 predefined namespaces which it describes as "potentially interesting" - meaning among other things, "if other people have generated UUIDs in this namespace you can verify them": <ul> <li> <code>6ba7b810-9dad-11d1-80b4-00c04fd430c8</code> for DNS</li> <li> <code>6ba7b811-9dad-11d1-80b4-00c04fd430c8</code> for URL</li> <li> <code>6ba7b812-9dad-11d1-80b4-00c04fd430c8</code> for ISO OID</li> <li> <code>6ba7b814-9dad-11d1-80b4-00c04fd430c8</code> for X.500 DN</li> </ul> Where did these come from? Specifically; <ul> <li>If I'm generating my own namespace UUID do I need to avoid anything in particular?</li> <li>I'm aware how big the UUID space is, but does this have any implication on collisions?</li> <li>Why have they chosen the 4th octet to increase as a kind of UUID 'version number'?</li> <li>Do my questions imply that I'm missing something fundamental about UUIDs?</li> </ul>

First, to be clear, this whole discussion is limited to version 3 & 5 UUIDs. In my (anecdotal) experience, version 4 (random) UUIDs are most commonly used. 4122's namespaced UUID generation algorithm ambiguously begins: <blockquote> Allocate a UUID to use as a "name space ID" </blockquote> There is no other mention of "name space ID" allocation, and neither I nor python have found any standardized spaces beyond the four listed in RFC 4122. So the answer to your first question, <blockquote> <ul> <li>If I'm generating my own namespace UUID do I need to avoid anything in particular?</li> </ul> </blockquote> You only need to avoid the four standard namespaces. <hr> The next question, <blockquote> <ul> <li>I'm aware how big the UUID space is, but does this have any implication on collisions?</li> </ul> </blockquote> Has two parts: <ol> <li> Will UUIDs within your namespace collide? Verbatim from 4122: <blockquote> The UUIDs generated from two different names in [your] namespace should be different (with very high probability). </blockquote> </li> <li> Will your namespace UUID collide with other namespaces? I couldn't find a direct answer, since there's no standard for "name space ID" allocation, but the argument in section 4.1.1 seems relevant: <blockquote> Interoperability, in any form, with variants other than the one defined here is not guaranteed, and is not likely to be an issue in practice. </blockquote> </li> </ol> <hr> <blockquote> <ul> <li>Why have they chosen the 4th octet to increase as a kind of UUID 'version number'?</li> </ul> </blockquote> This one's a bit of a mystery. Luckily, we have a spec for UUIDs, so we can mine them for some insight. Note that the (0-index) 8th octet starts with <code>8</code> in all cases, so we're dealing with RFC 4122 variant UUIDs. Phew. Now check octet 6 for the version: <code>1</code>, we're dealing with version 1 time-based UUIDs. This answer has a handy algorithm for extracting python datetimes from version 1 UUIDs. Applying the algorithm yields a time in February 4th, 1998. I have yet to find meaning in this date. Incrementing the 3rd octet adds the smallest encodable time interval (100ns) to the date. <hr> <blockquote> <ul> <li>Do my questions imply that I'm missing something fundamental about UUIDs?</li> </ul> </blockquote> Nope. There is very little discussion of UUID namespaces, since random UUIDs are so easy.

Where do UUID namespaces come from?

Tags:

language-agnostic

uuid

standards

The UUID specification defines 4 predefined namespaces which it describes as "potentially interesting" - meaning among other things, "if other people have generated UUIDs in this namespace you can verify them":

6ba7b810-9dad-11d1-80b4-00c04fd430c8 for DNS
6ba7b811-9dad-11d1-80b4-00c04fd430c8 for URL
6ba7b812-9dad-11d1-80b4-00c04fd430c8 for ISO OID
6ba7b814-9dad-11d1-80b4-00c04fd430c8 for X.500 DN

Where did these come from?

Specifically;

If I'm generating my own namespace UUID do I need to avoid anything in particular?
I'm aware how big the UUID space is, but does this have any implication on collisions?
Why have they chosen the 4th octet to increase as a kind of UUID 'version number'?
Do my questions imply that I'm missing something fundamental about UUIDs?

475

asked Oct 11 '11 10:10

Gareth

1 Answers

First, to be clear, this whole discussion is limited to version 3 & 5 UUIDs. In my (anecdotal) experience, version 4 (random) UUIDs are most commonly used.

4122's namespaced UUID generation algorithm ambiguously begins:

Allocate a UUID to use as a "name space ID"

There is no other mention of "name space ID" allocation, and neither I nor python have found any standardized spaces beyond the four listed in RFC 4122.

So the answer to your first question,

If I'm generating my own namespace UUID do I need to avoid anything in particular?

You only need to avoid the four standard namespaces.

The next question,

I'm aware how big the UUID space is, but does this have any implication on collisions?

Has two parts:

Will UUIDs within your namespace collide? Verbatim from 4122:

The UUIDs generated from two different names in [your] namespace should be different (with very high probability).
Will your namespace UUID collide with other namespaces? I couldn't find a direct answer, since there's no standard for "name space ID" allocation, but the argument in section 4.1.1 seems relevant:

Interoperability, in any form, with variants other than the one defined here is not guaranteed, and is not likely to be an issue in practice.

Why have they chosen the 4th octet to increase as a kind of UUID 'version number'?

This one's a bit of a mystery. Luckily, we have a spec for UUIDs, so we can mine them for some insight.

Note that the (0-index) 8th octet starts with 8 in all cases, so we're dealing with RFC 4122 variant UUIDs. Phew.

Now check octet 6 for the version: 1, we're dealing with version 1 time-based UUIDs.

This answer has a handy algorithm for extracting python datetimes from version 1 UUIDs. Applying the algorithm yields a time in February 4th, 1998. I have yet to find meaning in this date. Incrementing the 3rd octet adds the smallest encodable time interval (100ns) to the date.

Do my questions imply that I'm missing something fundamental about UUIDs?

Nope. There is very little discussion of UUID namespaces, since random UUIDs are so easy.

122

answered Sep 29 '22 10:09

hurrymaplelad

Related questions
                            
                                Do all programming languages have boolean short-circuit evaluation?
                            
                                What is the purpose of null?
                            
                                How to check if line segment intersects a rectangle?
                            
                                How can a developer learn about web design? [closed]
                            
                                Good examples, articles, books for understanding dynamic programming [closed]
                            
                                How Could One Implement the K-Means++ Algorithm?
                            
                                How do I display the binary representation of a float or double?
                            
                                Have you ever restricted yourself to using a subset of language features?
                            
                                What are hashtables and hashmaps and their typical use cases?
                            
                                Why do good programmers sometimes silently swallow exceptions? [closed]
                            
                                How long does code last?
                            
                                Code Golf: The wave
                            
                                Why are compilers so stupid?
                            
                                What is a good network graph library for language X?
                            
                                Best practices for internationalizing web applications?
                            
                                Linear Time Voting Algorithm. I don't get it
                            
                                Optimizing Conway's 'Game of Life'
                            
                                Reducing the number of arguments to a constructor
                            
                                Difference between a LinkedList and a Binary Search Tree
                            
                                How to begin with augmented reality? [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With