I have two collections (they happen to be arrays, but it doesn't really matter, I think): <code>L</code> and <code>R</code>. They are both sorted and now I want to compare them. I want to end up with two collections: one for each input array containing the items which were not in the other. I could just take the first item from <code>L</code> and then search <code>R</code> and, if there isn't a match, add it to my "unique" collection (<code>Lu</code>). But that's extremely inefficient, and I am expecting to have some very large collections to process in the near future. I though about possibly "playing hopscotch": <ul> <li> Step 1: Take two lists, <code>L</code> and <code>R</code>, and compare the head of each list ( <code>l :: L</code> and <code>r :: R</code>): <ul> <li>Branch 1: if <code>l</code> < <code>r</code>, then add <code>l</code> to <code>Lu</code> and recurse, passing in <code>L</code> and <code>r :: R</code></li> <li>Branch 2: if <code>l</code> > <code>r</code>, then add <code>r</code> to <code>Ru</code> and recurse, passing in <code>l :: L</code> and <code>R</code></li> <li>Branch 3: if <code>l</code> = <code>r</code>, then recurse, passing in <code>L</code> and <code>R</code></li> </ul> </li> <li>Step 2: return <code>Lu</code> and <code>Ru</code></li> </ul> I can write this function, but before I put in the effort I was wondering if a function already exists which can do this for me. It seems like a not-to-uncommon scenario, and I'd always rather use an existing solution to rolling my own. (Also, if there's a more recognizable name for this algorithm, I'd like to know what it's called.)

(I wrote the question above about 2 hours ago. Since then, I found the answer on my own. The following is what I discovered.) In set theory, the "list" of items in L but not in R is known as "the relative complement of R in L", also known as "set-theoretic difference of L and R" (See Wikipedia's Complement (set theory) article) <img src="https://upload.wikimedia.org/wikipedia/commons/thumb/5/5a/Venn0010.svg/250px-Venn0010.svg.png" alt=""> F#, being a mathematical language, has this concept baked right in to it's Core library. First, you need to build your collections as sets: <pre class="prettyprint lang-fsharp prettyprint-override"><code>// example arrays: let arr1 = [| 1; 2; 3 |] let arr2 = [| 2; 3; 4 |] // build the L and R sets let L = set arr1 let R = set arr2 </code></pre> Now you can call the "difference" function and quickly get the relative complement for each array: <pre class="prettyprint lang-fsharp prettyprint-override"><code>let Lu = Set.difference L R |> Set.toArray let Ru = Set.difference R L |> Set.toArray </code></pre> <pre class="prettyprint lang-none prettyprint-override"><code>> val Lu : int [] = [|1|] > val Ru : int [] = [|4|] </code></pre> There's also a shorter syntax. The Set type has overloaded the minus operator. <code>Set.difference</code> just subtracts the second parameter from the first, so you can actually just use the following: <pre class="prettyprint lang-fsharp prettyprint-override"><code>let Lu = L - R |> Set.toArray let Ru = R - L |> Set.toArray </code></pre> <pre class="prettyprint lang-none prettyprint-override"><code>> val Lu : int [] = [|1|] > val Ru : int [] = [|4|] </code></pre>

Comparing two lists for unique items in each

Tags:

f#

list-comparison

I have two collections (they happen to be arrays, but it doesn't really matter, I think): L and R. They are both sorted and now I want to compare them. I want to end up with two collections: one for each input array containing the items which were not in the other.

I could just take the first item from L and then search R and, if there isn't a match, add it to my "unique" collection (Lu). But that's extremely inefficient, and I am expecting to have some very large collections to process in the near future.

I though about possibly "playing hopscotch":

Step 1: Take two lists, L and R, and compare the head of each list ( l :: L and r :: R):
- Branch 1: if l < r, then add l to Lu and recurse, passing in L and r :: R
- Branch 2: if l > r, then add r to Ru and recurse, passing in l :: L and R
- Branch 3: if l = r, then recurse, passing in L and R
Step 2: return Lu and Ru

I can write this function, but before I put in the effort I was wondering if a function already exists which can do this for me. It seems like a not-to-uncommon scenario, and I'd always rather use an existing solution to rolling my own.

(Also, if there's a more recognizable name for this algorithm, I'd like to know what it's called.)

808

asked Jan 27 '14 19:01

JDB

1 Answers

(I wrote the question above about 2 hours ago. Since then, I found the answer on my own. The following is what I discovered.)

In set theory, the "list" of items in L but not in R is known as "the relative complement of R in L", also known as "set-theoretic difference of L and R"

(See Wikipedia's Complement (set theory) article)

F#, being a mathematical language, has this concept baked right in to it's Core library. First, you need to build your collections as sets:

// example arrays:
let arr1 = [| 1; 2; 3 |]
let arr2 = [| 2; 3; 4 |]

// build the L and R sets
let L = set arr1
let R = set arr2

Now you can call the "difference" function and quickly get the relative complement for each array:

let Lu = Set.difference L R |> Set.toArray
let Ru = Set.difference R L |> Set.toArray

> val Lu : int [] = [|1|]
> val Ru : int [] = [|4|]

There's also a shorter syntax. The Set type has overloaded the minus operator. Set.difference just subtracts the second parameter from the first, so you can actually just use the following:

let Lu = L - R |> Set.toArray
let Ru = R - L |> Set.toArray

> val Lu : int [] = [|1|]
> val Ru : int [] = [|4|]

192

answered Sep 24 '22 11:09

JDB

Related questions
                            
                                How to indent F# code in Visual Studio 2008 in #light mode
                            
                                F# quotations object graph
                            
                                Something similar to yield break in F#
                            
                                Is this an F# structured printf bug?
                            
                                Implementing the Haskell-MaybeMonad in F# - how can we get this lazy?
                            
                                How to write a new type that is compatible with Array.sum?
                            
                                F# AsyncWaitOne and AsyncReadToEnd
                            
                                How to use TryScan in F# properly
                            
                                F# compiler directive for 'symbol not defined'
                            
                                Is this a candidate for computational expressions?
                            
                                How do I write an enumeration in F# without explicitly assigning number literals?
                            
                                fst and 3-tuple in fsharp
                            
                                Practical limitations with assemblies not marked as CLS compliant?
                            
                                Performance of List.permute
                            
                                How to define and use % as a prefix operator?
                            
                                FsCheck and NUnit integration
                            
                                Converting OCaml to F#: Is there a simple way to simulate OCaml top-level #trace in F#
                            
                                Using active patterns within discrimated union type declarations
                            
                                F# MSIL obfuscation
                            
                                F#: How to create a Deedle Frame with SQL data source

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Comparing two lists for unique items in each

Tags:

f#

list-comparison

JDB

People also ask

1 Answers

JDB

Recent Activity

Donate For Us