Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Overriding GetHashCode in VB without checked/unchecked keyword support?

So I'm trying to figure out how to correctly override GetHashCode() in VB for a large number of custom objects. A bit of searching leads me to this wonderful answer.

Except there's one problem: VB lacks both the checked and unchecked keyword in .NET 4.0. As far as I can tell, anyways. So using Jon Skeet's implementation, I tried creating such an override on a rather simple class that has three main members: Name As String, Value As Int32, and [Type] As System.Type. Thus I come up with:

Public Overrides Function GetHashCode() As Int32     Dim hash As Int32 = 17      hash = hash * 23 + _Name.GetHashCode()     hash = hash * 23 + _Value     hash = hash * 23 + _Type.GetHashCode()     Return hash End Function 

Problem: Int32 is too small for even a simple object such as this. The particular instance I tested has "Name" as a simple 5-character string, and that hash alone was close enough to Int32's upper limit, that when it tried to calc the second field of the hash (Value), it overflowed. Because I can't find a VB equivalent for granular checked/unchecked support, I can't work around this.

I also do not want to remove Integer overflow checks across the entire project. This thing is maybe....40% complete (I made that up, TBH), and I have a lot more code to write, so I need these overflow checks in place for quite some time.

What would be the "safe" version of Jon's GetHashCode version for VB and Int32? Or, does .NET 4.0 have checked/unchecked in it somewhere that I'm not finding very easily on MSDN?


EDIT:
Per the linked SO question, one of the unloved answers at the very bottom provided a quasi-solution. I say quasi because it feels like it's....cheating. Beggars can't be choosers, though, right?

Translated from from C# into a more readable VB and aligned to the object described above (Name, Value, Type), we get:

Public Overrides Function GetHashCode() As Int32     Return New With { _         Key .A = _Name, _         Key .B = _Value, _         Key .C = _Type      }.GetHashCode() End Function 

This triggers the compiler apparently to "cheat" by generating an anonymous type, which it then compiles outside of the project namespace, presumably with integer overflow checks disabled, and allows the math to take place and simply wrap around when it overflows. It also seems to involve box opcodes, which I know to be performance hits. No unboxing, though.

But this raises an interesting question. Countless times, I've seen it stated here and elsewhere that both VB and C# generate the same IL code. This is clearly not the case 100% of the time...Like the use of C#'s unchecked keyword simply causes a different opcode to get emitted. So why do I continue to see the assumption that both produce the exact same IL keep getting repeated?  </rhetorical-question>

Anyways, I'd rather find a solution that can be implemented within each object module. Having to create Anonymous Types for every single one of my objects is going to look messy from an ILDASM perspective. I'm not kidding when I say I have a lot of classes implemented in my project.


EDIT2: I did open up a bug on MSFT Connect, and the gist of the outcome from the VB PM was that they'll consider it, but don't hold your breath: https://connect.microsoft.com/VisualStudio/feedback/details/636564/checked-unchecked-keywords-in-visual-basic

A quick look at the changes in .NET 4.5 suggests they've not considered it yet, so maybe .NET 5?

My final implementation, which fits the constraints of GetHashCode, while still being fast and unique enough for VB is below, derived from the "Rotating Hash" example on this page:

'// The only sane way to do hashing in VB.NET because it lacks the '// checked/unchecked keywords that C# has. Public Const HASH_PRIME1 As Int32 = 4 Public Const HASH_PRIME2 As Int32 = 28 Public Const INT32_MASK As Int32 = &HFFFFFFFF  Public Function RotateHash(ByVal hash As Int64, ByVal hashcode As Int32) As Int64     Return ((hash << HASH_PRIME1) Xor (hash >> HASH_PRIME2) Xor hashcode) End Function 

I also think the "Shift-Add-XOR" hash may also apply, but I haven't tested it.

like image 344
Kumba Avatar asked Jan 11 '11 04:01

Kumba


1 Answers

Use Long to avoid the overflow:

Dim hash As Long = 17 '' etc.. Return CInt(hash And &H7fffffffL) 

The And operator ensures no overflow exception is thrown. This however does lose one bit of "precision" in the computed hash code, the result is always positive. VB.NET has no built-in function to avoid it, but you can use a trick:

Imports System.Runtime.InteropServices  Module NoOverflows     Public Function LongToInteger(ByVal value As Long) As Integer         Dim cast As Caster         cast.LongValue = value         Return cast.IntValue     End Function      <StructLayout(LayoutKind.Explicit)> _     Private Structure Caster         <FieldOffset(0)> Public LongValue As Long         <FieldOffset(0)> Public IntValue As Integer     End Structure End Module 

Now you can write:

Dim hash As Long = 17 '' etc.. Return NoOverflows.LongToInteger(hash) 
like image 178
Hans Passant Avatar answered Oct 04 '22 16:10

Hans Passant