Union/find algorithm without union by rank for disjoint-set forests data structure

Tags: algorithm, time-complexity, data-structures, amortized-analysis, disjoint-sets

Here's a breakdown of the union/find algorithm for disjoint-set forests on Wikipedia:

Barebone disjoint-set forests... (O(n) per operation)
- ... with union by rank ... (now improved to O(log(n)))
  - ... with path compression (now improved to amortized O(α(n)), where α is the inverse Ackermann function; effectively O(1))
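To make the three levels above concrete, here is a minimal sketch of a disjoint-set forest in which both optimizations can be switched off independently. The class name `DisjointSet` and the flags `use_rank` and `use_compression` are illustrative assumptions, not names from Wikipedia or any library.

```python
class DisjointSet:
    """Disjoint-set forest; each optimization can be toggled (illustrative sketch)."""

    def __init__(self, n, use_rank=True, use_compression=True):
        self.parent = list(range(n))  # every element starts as its own root
        self.rank = [0] * n           # the extra per-node field; unused if use_rank is False
        self.use_rank = use_rank
        self.use_compression = use_compression

    def find(self, x):
        # First pass: walk up to the root.
        root = x
        while self.parent[root] != root:
            root = self.parent[root]
        # Second pass (path compression): repoint every node on the path at the root.
        if self.use_compression:
            while self.parent[x] != root:
                nxt = self.parent[x]
                self.parent[x] = root
                x = nxt
        return root

    def union(self, a, b):
        ra, rb = self.find(a), self.find(b)
        if ra == rb:
            return False              # already in the same set
        if self.use_rank:
            # Union by rank: hang the shallower tree under the deeper one.
            if self.rank[ra] < self.rank[rb]:
                ra, rb = rb, ra
            if self.rank[ra] == self.rank[rb]:
                self.rank[ra] += 1
        # Without rank, the root of b always goes under the root of a.
        self.parent[rb] = ra
        return True


ds = DisjointSet(6)
ds.union(0, 1)
ds.union(2, 3)
ds.union(1, 3)
same = ds.find(0) == ds.find(2)      # elements 0 and 2 are now connected
separate = ds.find(0) != ds.find(4)  # element 4 was never merged
```

Passing `use_rank=False` gives the variant the question asks about: no `rank` comparisons, only path compression in `find`.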

Implementing union by rank necessitates that each node keeps a rank field for comparison purposes. My question is, is union by rank worth this additional space? What happens if I skip union by rank and just do path compression instead? Is it good enough? What is the amortized complexity now?

A comment was made implying that union by rank without path compression (amortized O(log(n)) complexity) is sufficient for most practical applications. That is correct. What I'm asking is the other way around: what if you skip union by rank and ONLY do path compression instead?

In a sense, path compression is an extra step to improve union by rank, and that's why that extra step can be omitted without disastrous consequence. But is union by rank a necessary intermediate step to path compression? Can I skip it and go straight to path compression, or will that be catastrophic?

It was also pointed out that without union by rank, repeated unions could create a linked-list-like structure. This means that a single find with path compression could take O(n) in the worst case. That find would of course also speed up future operations, so what I'm more interested in is how this plays out when amortized over many operations.
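The linked-list scenario can be sketched as follows, assuming a naive `link` that always hangs one root under another with no rank check. The helper names `find`, `link`, and `depth` are made up for this illustration.

```python
def find(parent, x):
    """Return the root of x and compress the path (iterative, no recursion)."""
    root = x
    while parent[root] != root:
        root = parent[root]
    while parent[x] != root:
        nxt = parent[x]
        parent[x] = root
        x = nxt
    return root


def link(parent, child_root, new_root):
    """Naive link without rank: child_root always goes under new_root."""
    parent[child_root] = new_root


def depth(parent, x):
    """Number of parent pointers between x and its root."""
    d = 0
    while parent[x] != x:
        x = parent[x]
        d += 1
    return d


n = 8
parent = list(range(n))
# Adversarial order: the current root i is always hung under the fresh node i + 1,
# so the forest degenerates into the chain 0 -> 1 -> ... -> n - 1.
for i in range(n - 1):
    link(parent, i, i + 1)

depth_before = depth(parent, 0)                       # n - 1: one find walks the whole chain
find(parent, 0)                                       # the single expensive find
depth_after = max(depth(parent, v) for v in range(n)) # 1: every node now points at the root
```

The one expensive `find` pays for itself here: after it, every node on the chain is a direct child of the root.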

631

asked Feb 24 '10 02:02

polygenelubricants

1 Answer

I googled for "without union by rank" and the second link that came up was this one:

...We close this section with an analysis of union–find with path compression but without union by rank...

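The union-find data structure with path compression but without union by rank processes m find and n-1 link operations in time O((m+n) log n).

Spread over the m + n - 1 operations, that is O(log n) amortized per operation, so skipping union by rank is not catastrophic.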
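As a rough sanity check of that bound (not a proof, and not necessarily the worst-case input), the sketch below counts parent-pointer hops for one specific workload: n-1 naive links that build a chain, followed by m finds cycling through all elements. The function name `run_chain_workload` and the chosen values of n and m are assumptions made for illustration.

```python
import math


def run_chain_workload(n, m):
    """Build a chain with n - 1 naive links, then run m finds with path compression.

    Returns the total number of parent-pointer hops made while walking to roots.
    """
    parent = list(range(n))
    for i in range(n - 1):
        parent[i] = i + 1            # naive link: root i goes under the fresh node i + 1

    hops = 0
    for k in range(m):
        x = k % n                    # cycle through the elements
        root = x
        while parent[root] != root:  # walk up, counting each pointer followed
            root = parent[root]
            hops += 1
        while parent[x] != root:     # path compression
            nxt = parent[x]
            parent[x] = root
            x = nxt
    return hops


n, m = 1000, 5000
hops = run_chain_workload(n, m)
bound = (m + n) * math.log2(n)       # the quoted bound, with the hidden constant taken as 1
print(hops, round(bound))
```

On this workload the first find costs n - 1 hops and every later find costs at most one, so the total stays far below (m+n) log n. Other operation sequences can come closer to the bound.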

181

answered Sep 29 '22 20:09

jkff
