I have a collection of objects and I am trying to clone this collection and trying to understand performance implication of different approaches. The object in the collection has about 20 properties all strings, ints, floats (this objects doesn't have any nested objects inside of it). The two approaches are: <ol> <li> Create DeepClone() method: <pre class="prettyprint"><code>public static class ExtensionMethods { public static T DeepClone<T>(this T a) { using (var stream = new MemoryStream()) { var formatter = new BinaryFormatter(); formatter.Serialize(stream, a); stream.Position = 0; return (T)formatter.Deserialize(stream); } } </code></pre> } </li> <li> Manually write "copy" code where i am looping through the collection and "new"ing a new object and then manually setting all of the 20 properties. something like this <pre class="prettyprint"><code> public MyObject Copy(MyObject myObj) { var obj = new MyObject(); obj.Prop1 = myObj.Prop1; obj.Prop2 = myObj.Prop2; return obj; </code></pre> } </li> </ol> I am getting very inconsistent results so I wanted to get peoples feedback on: <ol> <li>Should one be much faster that the other? I would have thought choice two but my tests don't seem to support this so I am trying to figure out if I am doing something wrong.</li> <li>Is there any way to do this even faster?</li> </ol>

In my previous role, we investigated this very issue, as we were caching objects and wanted to clone them before handing them out from the cache. We did some detailed benchmarking and found that property setting was always at least an order of magnitude faster then the <code>BinaryFormatter</code> approach, although obviously required a hand-rolled implementation as opposed to the much simpler <code>BinaryFormatter</code> approach. For deep object graphs, the difference became more pronounced IIRC. In the end, we settled on a three pronged approach to "cloning": <ul> <li>if the type was immutable (which we would signify with a marker interface, <code>IImutable</code>, but you could equally use an attribute or somesuch), we would "clone" by returning the original instance. Since we knew no-one could mutate it, it was safe to keep returning the same instance. Obviously this was the fastest type of "clone", although clearly not really a clone at all.</li> <li>If the type implemented our own <code>IDeepCloneable<T></code> interface (which would be like your second example - but generic) we'd use that. [This generic interface would inherit from a non-generic equivalent <code>IDeepCloneable</code>]</li> <li>Failing that, we'd fall back to your first example, the <code>BinaryFormatter</code>.</li> </ul> I mention the "immutable" approach because depending on what you are doing, sometimes the best way is to redesign the classes you need to clone so they don't need to be cloned at all. If they are essentially read-only once created, this is easy; but even if not, sometimes the builder/immutable-type approach is useful (see <code>Uri</code> vs. <code>UriBuilder</code> in the framework. The former is essentially immutable, while the latter can be used to build and/or mutate instances of the former).

Performance of DeepClone (using binary serialization) versus manually setting properties

Tags:

performance

c#

clone

I have a collection of objects and I am trying to clone this collection and trying to understand performance implication of different approaches.

The object in the collection has about 20 properties all strings, ints, floats (this objects doesn't have any nested objects inside of it). The two approaches are:

Create DeepClone() method:

public static class ExtensionMethods
{
    public static T DeepClone<T>(this T a)
    {
       using (var stream = new MemoryStream())
       {
           var formatter = new BinaryFormatter();
           formatter.Serialize(stream, a);
           stream.Position = 0;
          return (T)formatter.Deserialize(stream);
       }
   }

}

Manually write "copy" code where i am looping through the collection and "new"ing a new object and then manually setting all of the 20 properties. something like this
```
 public MyObject Copy(MyObject myObj)
{
 var obj = new MyObject();
 obj.Prop1 = myObj.Prop1;
 obj.Prop2 = myObj.Prop2;
 return obj;
```
}

I am getting very inconsistent results so I wanted to get peoples feedback on:

Should one be much faster that the other? I would have thought choice two but my tests don't seem to support this so I am trying to figure out if I am doing something wrong.
Is there any way to do this even faster?

408

asked Feb 23 '12 05:02

leora

2 Answers

Well, first of all BinaryFormatter route must definitely be slower, since it uses reflection to get/set properties. The most common method is using the IClonable interface in conjunction with a copy constructor.

class A : ICloneable
{
    private readonly int _member;

    public A(int member)
    {
        _member = member;
    }

    public A(A a)
    {
        _member = a._member;
    }

    public object Clone()
    {
        return new A(this);
    }
}

Of course strictly speaking you only need the copy constructor, which should be the fastest method. If your objects are simple, you should try using in-built MemberwiseClone function.

class A : ICloneable
{
    private readonly int _member;

    public A(int member)
    {
        _member = member;
    }

    public object Clone()
    {
        return MemberwiseClone();
    }
}

Meanwhile, I wrote some test code to see if MemberwiseClone() was severely faster or slower than using a copy constructor. You can find it here. I found that MemberwiseClone is actually much slower than doing a CopyConstructor, at least on small classes. Note that using the BinaryFormatter is insanely slow.

122

answered Oct 29 '22 01:10

Gleno

In my previous role, we investigated this very issue, as we were caching objects and wanted to clone them before handing them out from the cache.

We did some detailed benchmarking and found that property setting was always at least an order of magnitude faster then the BinaryFormatter approach, although obviously required a hand-rolled implementation as opposed to the much simpler BinaryFormatter approach. For deep object graphs, the difference became more pronounced IIRC.

In the end, we settled on a three pronged approach to "cloning":

if the type was immutable (which we would signify with a marker interface, IImutable, but you could equally use an attribute or somesuch), we would "clone" by returning the original instance. Since we knew no-one could mutate it, it was safe to keep returning the same instance. Obviously this was the fastest type of "clone", although clearly not really a clone at all.
If the type implemented our own IDeepCloneable<T> interface (which would be like your second example - but generic) we'd use that. [This generic interface would inherit from a non-generic equivalent IDeepCloneable]
Failing that, we'd fall back to your first example, the BinaryFormatter.

I mention the "immutable" approach because depending on what you are doing, sometimes the best way is to redesign the classes you need to clone so they don't need to be cloned at all. If they are essentially read-only once created, this is easy; but even if not, sometimes the builder/immutable-type approach is useful (see Uri vs. UriBuilder in the framework. The former is essentially immutable, while the latter can be used to build and/or mutate instances of the former).

answered Oct 28 '22 23:10

Rob Levine

Related questions
                            
                                Create 2 FileStream on the same file in the same process
                            
                                Changing default connection string for Membership, Roles, etc
                            
                                Send mouse & keyboard events
                            
                                How do I store the LinkedIn API AccessToken so I don't have to re-enter credentials every time I use the LinkedIn API?
                            
                                Building and running Monodevelop solution in OS X Terminal
                            
                                Windows 8 - BeginAnimation?
                            
                                Using Readline() and ReadKey() Simultaneously
                            
                                location of .snk file and management of it
                            
                                IIS7 stops working after 5 requests
                            
                                What is the Java equivalent for the following C# code?
                            
                                Canonicalize URL to lowercase without breaking file system or culture?
                            
                                How can one free memory used by heavy WPF Controls in a deterministic way?
                            
                                How can I create a dynamic Select on an IEnumerable<T> at runtime?
                            
                                how to secure dll functions from being used outside of my application?
                            
                                DllNotFoundException with DllImport in Mono on Mac: wrong architecture
                            
                                Lazily creating isolated storage
                            
                                Is there any common way to save application settings in more advanced way, than plain .settings file?
                            
                                .NET Remoting, passing objects into methods
                            
                                How many models of Asynchronous development in .NET?
                            
                                Are there hooks in ASP.NET MVC prior to layout execution and post body render?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With