I have a hash that uses array as its key. When I change the array, the hash can no longer get the corresponding key and value: <pre class="prettyprint"><code>1.9.3p194 :016 > a = [1, 2] => [1, 2] 1.9.3p194 :017 > b = { a => 1 } => {[1, 2]=>1} 1.9.3p194 :018 > b[a] => 1 1.9.3p194 :019 > a.delete_at(1) => 2 1.9.3p194 :020 > a => [1] 1.9.3p194 :021 > b => {[1]=>1} 1.9.3p194 :022 > b[a] => nil 1.9.3p194 :023 > b.keys.include? a => true </code></pre> What am I doing wrong? Update: OK. Use a.clone is absolutely one way to deal with this problem. What if I want to change "a" but still use "a" to retrieve the corresponding value (since "a" is still one of the keys) ?

The #rehash method will recalculate the hash, so after the key changes do: <pre class="prettyprint"><code>b.rehash </code></pre>

TL;DR: consider <code>Hash#compare_by_indentity</code> <h3>You need to decide if you want the hash to work by array value or array identity.</h3> By default arrays <code>.hash</code> and <code>.eql?</code> by value, which is why changing the value confuses ruby. Consider this variant of your example: <pre class="prettyprint"><code>pry(main)> a = [1, 2] pry(main)> a1 = [1] pry(main)> a.hash => 4266217476190334055 pry(main)> a1.hash => -2618378812721208248 pry(main)> h = {a => '12', a1 => '1'} => {[1, 2]=>"12", [1]=>"1"} pry(main)> h[a] => "12" pry(main)> a.delete_at(1) pry(main)> a => [1] pry(main)> a == a1 => true pry(main)> a.hash => -2618378812721208248 pry(main)> h[a] => "1" </code></pre> See what happened there? As you discovered, it fails to match on the <code>a</code> key because the <code>.hash</code> value under which it stored it is outdated [BTW, you can't even rely on that! A mutation might result in same hash (rare) or different hash that lands in the same bucket (not so rare).] But instead of failing by returning <code>nil</code>, it matched on the <code>a1</code> key. See, <code>h[a]</code> doesn't care at all about the identity of <code>a</code> vs <code>a1</code> (the traitor!). It compared the current value you supply — <code>[1]</code> with the value of <code>a1</code> being <code>[1]</code> and found a match. That's why using <code>.rehash</code> is just band-aid. It will recompute the <code>.hash</code> values for all keys and move them to the correct buckets, but it's error-prone, and may also cause trouble: <pre class="prettyprint"><code>pry(main)> h.rehash => {[1]=>"1"} pry(main)> h => {[1]=>"1"} </code></pre> Oh oh. The two entries collapsed into one, since they now have the same value (and which wins is hard to predict). <h3>Solutions</h3> One sane approach is embracing lookup by value, which requires the value to never change. <code>.freeze</code> your keys. Or use <code>.clone</code>/<code>.dup</code> when building the hash, and feel free to mutate the original arrays — but accept that <code>h[a]</code> will lookup the current value of <code>a</code> against the values preserved from build time. The other, which you seem to want, is deciding you care about identity — lookup by <code>a</code> should find <code>a</code> whatever its current value, and it shouldn't matter if many keys had or now have the same value. How? <ul> <li><code>Object</code> hashes by identity. (Arrays don't because types that <code>.==</code> by value tend to also override <code>.hash</code> and <code>.eql?</code> to be by value.) So one option is: don't use arrays as keys, use some custom class (which may hold an array inside).</li> <li> But what if you want it to behave directly like a hash of arrays? You could subclass Hash, or Array but it's a lot of work to make everything work consistently. Luckily, Ruby has a builtin way: <code>h.compare_by_identity</code> switches a hash to work by identity (with no way to undo, AFAICT). If you do this before you insert anything, you can even have distinct keys with equal values, with no confusion: <pre class="prettyprint"><code>[39] pry(main)> x = [1] => [1] [40] pry(main)> y = [1] => [1] [41] pry(main)> h = Hash.new.compare_by_identity => {} [42] pry(main)> h[x] = 'x' => "x" [44] pry(main)> h[y] = 'y' => "y" [45] pry(main)> h => {[1]=>"x", [1]=>"y"} [46] pry(main)> x.push(7) => [1, 7] [47] pry(main)> y.push(7) => [1, 7] [48] pry(main)> h => {[1, 7]=>"x", [1, 7]=>"y"} [49] pry(main)> h[x] => "x" [50] pry(main)> h[y] => "y" </code></pre> Beware that such hashes are counter-intuitive if you try to put there e.g. strings, because we're really used to strings hashing by value. </li> </ul>

You should use <code>a.clone</code> as key <pre class="prettyprint"><code>irb --> a = [1, 2] ==> [1, 2] irb --> b = { a.clone => 1 } ==> {[1, 2]=>1} irb --> b[a] ==> 1 irb --> a.delete_at(1) ==> 2 irb --> a ==> [1] irb --> b ==> {[1, 2]=>1} # STILL UNCHANGED irb --> b[a] ==> nil # Trivial, since a has changed irb --> b.keys.include? a ==> false # Trivial, since a has changed </code></pre> Using <code>a.clone</code> will make sure that the key is unchanged even when we change <code>a</code> later on.

Hash use array as key in ruby

Tags:

ruby

hash

I have a hash that uses array as its key. When I change the array, the hash can no longer get the corresponding key and value:

Click to copy

1.9.3p194 :016 > a = [1, 2]
 => [1, 2] 
1.9.3p194 :017 > b = { a => 1 }
 => {[1, 2]=>1} 
1.9.3p194 :018 > b[a]
 => 1 
1.9.3p194 :019 > a.delete_at(1)
 => 2 
1.9.3p194 :020 > a
 => [1] 
1.9.3p194 :021 > b
 => {[1]=>1} 
1.9.3p194 :022 > b[a]
 => nil 
1.9.3p194 :023 > b.keys.include? a
 => true

What am I doing wrong?

Update: OK. Use a.clone is absolutely one way to deal with this problem. What if I want to change "a" but still use "a" to retrieve the corresponding value (since "a" is still one of the keys) ?

399

asked Aug 29 '12 11:08

Eagle

4 Answers

The #rehash method will recalculate the hash, so after the key changes do:

Click to copy

b.rehash

answered Oct 30 '22 23:10

steenslag

TL;DR: consider Hash#compare_by_indentity

You need to decide if you want the hash to work by array value or array identity.

By default arrays .hash and .eql? by value, which is why changing the value confuses ruby. Consider this variant of your example:

Click to copy

pry(main)> a = [1, 2]
pry(main)> a1 = [1]
pry(main)> a.hash
=> 4266217476190334055
pry(main)> a1.hash
=> -2618378812721208248
pry(main)> h = {a => '12', a1 => '1'}
=> {[1, 2]=>"12", [1]=>"1"}
pry(main)> h[a]
=> "12"
pry(main)> a.delete_at(1)
pry(main)> a
=> [1]
pry(main)> a == a1
=> true
pry(main)> a.hash
=> -2618378812721208248
pry(main)> h[a]
=> "1"

See what happened there? As you discovered, it fails to match on the a key because the .hash value under which it stored it is outdated [BTW, you can't even rely on that! A mutation might result in same hash (rare) or different hash that lands in the same bucket (not so rare).]

But instead of failing by returning nil, it matched on the a1 key.
See, h[a] doesn't care at all about the identity of a vs a1 (the traitor!). It compared the current value you supply — [1] with the value of a1 being [1] and found a match.

That's why using .rehash is just band-aid. It will recompute the .hash values for all keys and move them to the correct buckets, but it's error-prone, and may also cause trouble:

Click to copy

pry(main)> h.rehash
=> {[1]=>"1"}
pry(main)> h
=> {[1]=>"1"}

Oh oh. The two entries collapsed into one, since they now have the same value (and which wins is hard to predict).

Solutions

One sane approach is embracing lookup by value, which requires the value to never change. .freeze your keys. Or use .clone/.dup when building the hash, and feel free to mutate the original arrays — but accept that h[a] will lookup the current value of a against the values preserved from build time.

The other, which you seem to want, is deciding you care about identity — lookup by a should find a whatever its current value, and it shouldn't matter if many keys had or now have the same value.
How?

Object hashes by identity. (Arrays don't because types that .== by value tend to also override .hash and .eql? to be by value.) So one option is: don't use arrays as keys, use some custom class (which may hold an array inside).
But what if you want it to behave directly like a hash of arrays? You could subclass Hash, or Array but it's a lot of work to make everything work consistently. Luckily, Ruby has a builtin way: h.compare_by_identity switches a hash to work by identity (with no way to undo, AFAICT). If you do this before you insert anything, you can even have distinct keys with equal values, with no confusion:

Click to copy
```
[39] pry(main)> x = [1]
=> [1]
[40] pry(main)> y = [1]
=> [1]
[41] pry(main)> h = Hash.new.compare_by_identity
=> {}
[42] pry(main)> h[x] = 'x'
=> "x"
[44] pry(main)> h[y] = 'y'
=> "y"
[45] pry(main)> h
=> {[1]=>"x", [1]=>"y"}
[46] pry(main)> x.push(7)
=> [1, 7]
[47] pry(main)> y.push(7)
=> [1, 7]
[48] pry(main)> h
=> {[1, 7]=>"x", [1, 7]=>"y"}
[49] pry(main)> h[x]
=> "x"
[50] pry(main)> h[y]
=> "y"
```
Beware that such hashes are counter-intuitive if you try to put there e.g. strings, because we're really used to strings hashing by value.

answered Oct 31 '22 01:10

Beni Cherniavsky-Paskin

Hashes use their key objects' hash codes (a.hash) to group them. Hash codes often depend on the state of the object; in this case, the hash code of a changes when an element has been removed from the array. Since the key has already been inserted into the hash, a is filed under its original hash code.

This means you can't retrieve the value for a in b, even though it looks alright when you print the hash.

answered Oct 31 '22 00:10

waldrumpus

You should use a.clone as key

Click to copy

irb --> a = [1, 2]
==> [1, 2]

irb --> b = { a.clone => 1 }
==> {[1, 2]=>1}

irb --> b[a]
==> 1

irb --> a.delete_at(1)
==> 2

irb --> a
==> [1]

irb --> b
==> {[1, 2]=>1} # STILL UNCHANGED

irb --> b[a]
==> nil # Trivial, since a has changed

irb --> b.keys.include? a
==> false # Trivial, since a has changed

Using a.clone will make sure that the key is unchanged even when we change a later on.

answered Oct 31 '22 00:10

Kulbir Saini

Related questions
                            
                                alias_method on ActiveRecord::Base results in NameError
                            
                                Why do I get a string encoding issue "\xE2" from ASCII-8BIT to UTF-8?
                            
                                sum(&:x) not working any more
                            
                                Why does default rake task run specs in Rails app?
                            
                                Is it acceptable practice to patch Ruby's base classes, such as Fixnum?
                            
                                Creating a Capistrano task that performs different tasks based on role
                            
                                Mocha Mock Carries To Another Test
                            
                                Problem Signing Out with Devise on my App
                            
                                How can I find a devise user by it's session id?
                            
                                Retrieving a Braintree customer's subscriptions
                            
                                Short way to include CSS and javascript tags with HAML
                            
                                How to run "bundle exec jekyll new ."
                            
                                Best way to add tests to an existing Rails project?
                            
                                rails 3:how to generate models for existing database tables
                            
                                what does "class << self" mean in Rails? [duplicate]
                            
                                sleep until condition is true in ruby
                            
                                What am I doing wrong with DateTime.strptime?
                            
                                Retry after Exception in Groovy
                            
                                Find lowest value in a hash
                            
                                Cucumber "puts" in After hook not outputting anything

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Hash use array as key in ruby

Tags:

ruby

hash

Eagle

People also ask

4 Answers

steenslag

You need to decide if you want the hash to work by array value or array identity.

Solutions

Beni Cherniavsky-Paskin

waldrumpus

Kulbir Saini

Recent Activity

Donate For Us