Optimize C# Code Fragment

Tags:

c#

unchecked

micro-optimization

I'm profiling some C# code. The method below is one of the most expensive ones. For the purpose of this question, assume that micro-optimization is the right thing to do. Is there an approach to improve performance of this method?

Changing the type of the input parameter p to ulong[] would create a macro inefficiency.

static ulong Fetch64(byte[] p, int ofs = 0)
{
    unchecked
    {
        ulong result = p[0 + ofs] + 
            ((ulong) p[1 + ofs] <<  8) + 
            ((ulong) p[2 + ofs] << 16) + 
            ((ulong) p[3 + ofs] << 24) + 
            ((ulong) p[4 + ofs] << 32) + 
            ((ulong) p[5 + ofs] << 40) + 
            ((ulong) p[6 + ofs] << 48) + 
            ((ulong) p[7 + ofs] << 56);
        return result;
    }
}

asked Nov 12 '11 01:11

Eric J.

2 Answers

Why not use BitConverter? I've got to believe that Microsoft has spent some time tuning that code. Plus it deals with endian issues.

Here's how BitConverter turns a byte[] into a long/ulong (ToUInt64 converts the bytes as a signed long and then casts the result to unsigned):

[SecuritySafeCritical]
public static unsafe long ToInt64(byte[] value, int startIndex)
{
  if (value == null)
  {
    ThrowHelper.ThrowArgumentNullException(ExceptionArgument.value);
  }
  if (((uint) startIndex) >= value.Length)
  {
    ThrowHelper.ThrowArgumentOutOfRangeException(ExceptionArgument.startIndex, ExceptionResource.ArgumentOutOfRange_Index);
  }
  if (startIndex > (value.Length - 8))
  {
    ThrowHelper.ThrowArgumentException(ExceptionResource.Arg_ArrayPlusOffTooSmall);
  }
  fixed (byte* numRef = &(value[startIndex]))
  {
    if ((startIndex % 8) == 0)
    {
      return *(((long*) numRef));
    }
    if (IsLittleEndian)
    {
      int num  = ((numRef[0] | (numRef[1] << 8)) | (numRef[2] << 0x10)) | (numRef[3] << 0x18);
      int num2 = ((numRef[4] | (numRef[5] << 8)) | (numRef[6] << 0x10)) | (numRef[7] << 0x18);
      return ((long) ((uint) num)) | (((long) num2) << 0x20);
    }
    int num3 = (((numRef[0] << 0x18) | (numRef[1] << 0x10)) | (numRef[2] << 8)) | numRef[3];
    int num4 = (((numRef[4] << 0x18) | (numRef[5] << 0x10)) | (numRef[6] << 8)) | numRef[7];
    return ((long) ((uint) num4)) | (((long) num3) << 0x20);
  }
}

I suspect that doing the conversion one 32-bit word at a time is for 32-bit efficiency. With no 64-bit registers on a 32-bit CPU, dealing with 64-bit ints is a lot more expensive.

If you know for sure you're targeting 64-bit hardware, it might be faster to do the conversion in one fell swoop.
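As a rough sketch of what "one fell swoop" could look like, the method below reads all eight bytes with a single 64-bit load through a pointer. The names Fetch64Sketch and Fetch64Unsafe are made up for illustration. It assumes a little-endian CPU that tolerates unaligned 64-bit loads (x86/x64 does) and a project compiled with unsafe code enabled; it has not been benchmarked against the original.

```csharp
using System;

static class Fetch64Sketch
{
    // Hypothetical helper (not from the question): reads p[ofs..ofs+7] as one ulong.
    // Assumption: little-endian hardware, so the result matches the shift-and-add version.
    public static unsafe ulong Fetch64Unsafe(byte[] p, int ofs = 0)
    {
        if (p == null) throw new ArgumentNullException(nameof(p));
        // One explicit range check replaces the eight per-element bounds checks.
        if (ofs < 0 || ofs > p.Length - 8) throw new ArgumentOutOfRangeException(nameof(ofs));

        fixed (byte* ptr = &p[ofs])
        {
            // Single 64-bit load, regardless of alignment.
            return *(ulong*)ptr;
        }
    }
}
```

For the bytes 01 02 03 04 05 06 07 08 at offset 0, this returns 0x0807060504030201 on little-endian hardware, the same value the original Fetch64 produces.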

answered Nov 03 '22 00:11

Nicholas Carey

Try using a for loop instead of unrolling it by hand. You may be able to save time on bounds checks.
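A minimal sketch of the loop form, for comparison with the unrolled version. The names Fetch64LoopSketch and Fetch64Loop are hypothetical, and whether the JIT actually drops any bounds checks for this indexing pattern is an assumption to verify with a profiler, not a given.

```csharp
static class Fetch64LoopSketch
{
    // Hypothetical loop variant of the question's Fetch64 (little-endian byte order).
    public static ulong Fetch64Loop(byte[] p, int ofs = 0)
    {
        ulong result = 0;
        // Start at the most significant byte and shift it up as lower bytes are merged in.
        for (int i = 7; i >= 0; i--)
        {
            result = (result << 8) | p[ofs + i];
        }
        return result;
    }
}
```

Walking from the highest byte down means each iteration needs only a fixed shift by 8 rather than a shift amount that depends on the index.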

Try BitConverter.ToUInt64 - http://msdn.microsoft.com/en-us/library/system.bitconverter.touint64.aspx if that is what you are looking for.
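A sketch of a drop-in wrapper around BitConverter.ToUInt64 might look like the following. The wrapper name Fetch64ViaBitConverter is made up for illustration. BitConverter reads in the machine's native byte order, so the sketch swaps the bytes on big-endian hardware to keep the little-endian result of the original Fetch64; it assumes BinaryPrimitives from System.Buffers.Binary is available (.NET Core 2.1 or later, or the System.Memory package).

```csharp
using System;
using System.Buffers.Binary;

static class Fetch64BitConverterSketch
{
    // Hypothetical wrapper: same little-endian result as the question's Fetch64.
    public static ulong Fetch64ViaBitConverter(byte[] p, int ofs = 0)
    {
        // Throws if p is null or fewer than 8 bytes remain at ofs.
        ulong value = BitConverter.ToUInt64(p, ofs);

        // BitConverter uses native byte order; reverse on big-endian machines.
        return BitConverter.IsLittleEndian
            ? value
            : BinaryPrimitives.ReverseEndianness(value);
    }
}
```

Whether this beats the hand-written shifts depends on the runtime version, so it is worth measuring both in the actual workload.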

answered Nov 02 '22 22:11

Alexei Levenkov
