Suppose I have some classes <code>foo < handle</code>, and <code>bar < foo</code>, <code>baz < foo</code>, and maybe <code>qux < foo</code>. There are a couple ways I can store an array of these objects: <ul> <li>As a cell array: <code>A = {foo bar baz qux} % A(1) would be a cell, A{1} gives me a foo object</code></li> <li>Starting with R2011a, I can make <code>foo <</code><code>matlab.mixin.Heterogeneous</code>, and then build an array directy: <code>A = [foo bar baz qux] % A(1) directly gives me a foo object</code></li> </ul> The way I see it, from a maintenance perspective it would be better to use the second method rather than the first, this way it removes ambiguity about how to access <code>A</code>. Namely, when we need to dereference elements of the cell array (cell <code>A(1)</code> vs <code>foo</code> object <code>A{1}</code>, which lives inside <code>A(1)</code>). But is there any kind of memory or performance penalty (or benefit) to using one syntax vs the other?

I did a small experiment (source) on the memory and running time of the cell array, containers.Map and a Heterogeneous array. In my method I preallocated each array with N=65535 elements (the max array size for Map and Heterogeneous array), then began assigning each element a uint32, and measured the time and memory. My Heterogeneous Class was a simple class with a single public property, and a constructor which assigned that property. The containers.Map had uint32 key/value pairs. <pre class="prettyprint"><code>Maps took 9.17917e-01 seconds. Cells took 5.81220e-02 seconds. Heterogeneous array took 4.95336e+00 seconds. **Name** **Size** **Bytes** **Class** map 65535x1 112 containers.Map cellArr 65535x1 7602060 cell hArr 1x65535 262244 SomeHeterogeneousClass </code></pre> Immediately note that the size of the mapArray is not accurate. It is hidden behind the containers.Map class implementation, most likley the 112 bytes reported is the memory assigned to the map itself, excluding the data. I approximate the true size to be at minimum (112+65535*(sizeof(uint32)*2)) = 524392 bytes. This value is almost exactly double the hArr size, which makes me think it is quite accurate, since the map must store twice as much data (for key AND value) as the hArr. The results are straightforward: <ul> <li>Time: cell Array < Map < Heterogeneous Array</li> <li>Memory: Heterogeneous Array < Map < cell Array</li> </ul> I repeated the experiment with N=30 to test for small arrays, the results were similar. God only knows why cells take up so much memory and Heterogeneous arrays are so slow.

What's better for performance, cell arrays of objects or heterogeneous arrays?

Q: What are the advantages of cell array over normal array in Matlab?

Cell arrays contain data in cells that you access by numeric indexing. Common applications of cell arrays include storing separate pieces of text and storing heterogeneous data from spreadsheets. For example, store temperature data for three cities over time in a cell array.

Q: Are Matlab cell arrays slow?

I've found that the performance of cell Arrays, multi-dimensional arrays, and structure arrays are all far slower than the use of multiple independent variables.

Q: What are heterogeneous arrays?

A heterogeneous array is an array of objects that differ in their specific class, but all objects derive from or are instances of a common superclass. The common superclass forms the root of the hierarchy of classes that you can combine into heterogeneous arrays.

Q: What is the difference between a cell array and an array in Matlab?

Array = a single variable (of any data type) that contains multiple content elements. Cell array = a specific type of array in MATLAB; an array of class cell. This is the "everything" container in MATLAB -- it's essentially a meta data type, or a "container" data type. You can put anything inside a cell.

Tags:

performance

oop

matlab

heterogeneous

Suppose I have some classes foo < handle, and bar < foo, baz < foo, and maybe qux < foo. There are a couple ways I can store an array of these objects:

As a cell array: A = {foo bar baz qux} % A(1) would be a cell, A{1} gives me a foo object
Starting with R2011a, I can make foo <matlab.mixin.Heterogeneous, and then build an array directy: A = [foo bar baz qux] % A(1) directly gives me a foo object

The way I see it, from a maintenance perspective it would be better to use the second method rather than the first, this way it removes ambiguity about how to access A. Namely, when we need to dereference elements of the cell array (cell A(1) vs foo object A{1}, which lives inside A(1)).

But is there any kind of memory or performance penalty (or benefit) to using one syntax vs the other?

642

asked Dec 09 '14 17:12

Dang Khoa

1 Answers

I did a small experiment (source) on the memory and running time of the cell array, containers.Map and a Heterogeneous array. In my method I preallocated each array with N=65535 elements (the max array size for Map and Heterogeneous array), then began assigning each element a uint32, and measured the time and memory. My Heterogeneous Class was a simple class with a single public property, and a constructor which assigned that property. The containers.Map had uint32 key/value pairs.

Maps took 9.17917e-01 seconds.
Cells took 5.81220e-02 seconds.
Heterogeneous array took 4.95336e+00 seconds.

**Name**     **Size**         **Bytes**       **Class**   
map          65535x1           112          containers.Map              
cellArr      65535x1           7602060      cell               
hArr         1x65535           262244       SomeHeterogeneousClass

Immediately note that the size of the mapArray is not accurate. It is hidden behind the containers.Map class implementation, most likley the 112 bytes reported is the memory assigned to the map itself, excluding the data. I approximate the true size to be at minimum (112+65535*(sizeof(uint32)*2)) = 524392 bytes. This value is almost exactly double the hArr size, which makes me think it is quite accurate, since the map must store twice as much data (for key AND value) as the hArr.

The results are straightforward:

Time: cell Array < Map < Heterogeneous Array
Memory: Heterogeneous Array < Map < cell Array

I repeated the experiment with N=30 to test for small arrays, the results were similar. God only knows why cells take up so much memory and Heterogeneous arrays are so slow.

answered Sep 24 '22 04:09

Gouda

Related questions
                            
                                SQL Server query takes longer with parameter than with constant string
                            
                                Improve string parse performance
                            
                                Extreme slowdown Cloud vs VPS (Amazon, Jelastic)
                            
                                multiple resource files versus single resource file
                            
                                Why does native browser sort function work slower than quicksort?
                            
                                Looking for a quick way to speed up my code
                            
                                Fastest way to count array values above a threshold in numpy
                            
                                Android layout: running second layout pass
                            
                                overloaded array subscript [] operator slow
                            
                                Constant SQL Server 80% CPU Utilization
                            
                                Best gcc optimization switches for hyperthreading
                            
                                Speeding up wilcox.test in R
                            
                                Python defaultdict(list) de/serialization performance
                            
                                Why is my Entity Framework query with Single slow?
                            
                                What do the statistics (usr, sys, cusr, csys, and CPU) outputted by Perl's prove command mean?
                            
                                Efficiently computing floating-point arithmetic hundreds of thousands of times in Bash
                            
                                HHVM fastcgi + Nginx performance fluctuations
                            
                                Is it efficient to return an array in php?
                            
                                Animating scrollTop in fixed height overflow without jank
                            
                                Python multiprocessing queues slower than pool.map

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With