How to avoid the loop to reduce the computation time of this code?

Tags:

vectorization

matrix

matlab

runtime

How can I avoid the loop to reduce the computation time of this code (one solution to my previous question)?

I want to find the column vectors of A(1:3,:) whose corresponding values in M(4,:) are not all contained in a single vector of the cell array X (and, obviously, are not equal to one of these vectors). I am looking for a solution that stays fast when X is very large.

M = [1007  1007  4044  1007  4044  1007  5002 5002 5002 622 622;
      552   552   300   552   300   552   431  431  431 124 124; 
     2010  2010  1113  2010  1113  2010  1100 1100 1100  88  88;
        7    12    25    15    12    30     2   10   55  32  12];

Here I define A directly:

A = [1007  4044  5002  622;
      552   300   431  124;
     2010  1113  1100   88];

A contains the unique column vectors of M(1:3,:).

X = {[2 5 68 44],[2 10 55 9 17],[1 55 6 7 8 9],[32 12]};

[~, ~, subs] = unique(M(1:3,:)','rows');

A4 = accumarray(subs(:),M(4,:).',[],@(x) {x});

%// getting a mask of which columns we want
idxC(length(A4)) = false;
for ii = 1:length(A4)
    idxC(ii) = ~any(cellfun(@(x) all(ismember(A4{ii},x)), X));
end

Displaying the columns we want:

out = A(:,idxC)

Results:

>> out

out =

    1007        4044
     552         300
    2010        1113

The column vector [5002;431;1100] was eliminated because [2;10;55] is contained in X{2} = [2 10 55 9 17].

The column vector [622;124;88] was eliminated because [32 12] = X{4}.

Another example, with the same X:

M = [1007  4044  1007  4044  1007  5002 5002 5002 622 622  1007  1007  1007;
      552   300   552   300   552   431  431  431 124 124   552    11    11;
     2010  1113  2010  1113  2010  1100 1100 1100  88  88  2010    20    20;
       12    25    15    12    30     2   10   55  32  12     7    12     7];

X = {[2 5 68 44],[2 10 55 9 17],[1 55 6 7 8 9],[32 12]};

A = [1007  4044  5002  622  1007;
      552   300   431  124    11;
     2010  1113  1100   88    20];

Results (with scmg's answer):

If A is sorted according to its first row, I get the correct result:

out =

         1007        1007        4044
           11         552         300
           20        2010        1113

If I do not sort the matrix A, I get a wrong result:

out =

        4044        5002         622
         300         431         124
        1113        1100          88

The column vector A(:,4) = [622;124;88] should be eliminated because [32 12] = X{4}.

The column vector [5002;431;1100] should be eliminated because [2;10;55] is contained in X{2} = [2 10 55 9 17].

asked May 22 '15 20:05

bzak

2 Answers

Ben Voigt's answer is great, but a loop written as for A4i = A4{ii} causes issues: a for loop iterates over the columns of its argument, so with a column vector the body runs only once, with the whole vector as the loop variable:

%row vector
for i = 1:3
    disp('foo');
end

    foo
    foo
    foo

%column vector
for i = (1:3).'
    disp('foo');
end

    foo

Use for A4i = A4{ii}.' instead (note the transpose) and the loop will visit one element at a time.
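To make the effect on the subset test concrete, here is a small sketch. The values are hypothetical, chosen only for illustration (they are not from the question), and the variable names are mine. It shows that looping over the column vector silently accepts a vector that is not a subset, while looping over the transposed vector rejects it:

```matlab
% Hypothetical values for illustration only.
A4i_col = [2; 10; 99];        % column vector, the shape accumarray returns here
xj      = [2 10 55 9 17];     % 99 is missing, so A4i_col is NOT a subset of xj

% Loop over the column vector: the body runs once with the whole vector.
nIter = 0;
issubsetCol = true;
for a = A4i_col
    nIter = nIter + 1;
    % ~ismember(a, xj) is [0; 0; 1]. An "if" on an array is taken only when
    % every element is nonzero, so this branch is skipped.
    if ~ismember(a, xj)
        issubsetCol = false;
    end
end

% Loop over the transposed (row) vector: one scalar per iteration.
issubsetRow = true;
for a = A4i_col.'
    if ~ismember(a, xj)
        issubsetRow = false;
        break;                % first missing value is enough
    end
end
```

After running this, nIter is 1, issubsetCol is still true (the wrong answer) and issubsetRow is false (the right one).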

Now, if we look at the output :

A(:,idxC) =

    4044        5002
     300         431
    1113        1100

As you can see, the final result is not what we expected.

Because unique sorts its output, subs is numbered not by order of appearance in A but by position in C (which is sorted):

subs = 2 2 3 2 3 2 4 4 4 1 1

Therefore you should index into the matrix returned by unique, rather than A, to get your final output.

Enter

[C, ~, subs] = unique(M(1:3,:)','rows'); 
%% rather than [~, ~, subs] = unique(M(1:3,:)','rows');

Then, to get the final output, enter

>> out = C(idxC,:).'
out =

        1007        4044
         552         300
        2010        1113
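If you would rather keep A in its own column order, one possible alternative (my own sketch, not something proposed in the question) is to translate the mask from the order of C to the order of A with ismember(..., 'rows'). The names found, loc and keepA are made up for this example. The data is the first example from the question:

```matlab
M = [1007 1007 4044 1007 4044 1007 5002 5002 5002 622 622;
      552  552  300  552  300  552  431  431  431 124 124;
     2010 2010 1113 2010 1113 2010 1100 1100 1100  88  88;
        7   12   25   15   12   30    2   10   55  32  12];
A = [1007 4044 5002 622;
      552  300  431 124;
     2010 1113 1100  88];
X = {[2 5 68 44],[2 10 55 9 17],[1 55 6 7 8 9],[32 12]};

[C, ~, subs] = unique(M(1:3,:).', 'rows');        % rows of C are sorted
A4 = accumarray(subs(:), M(4,:).', [], @(x) {x}); % values grouped per row of C

% Mask in the order of C (same test as in the question).
idxC = false(numel(A4), 1);
for ii = 1:numel(A4)
    idxC(ii) = ~any(cellfun(@(x) all(ismember(A4{ii}, x)), X));
end

% loc(k) is the row of C that equals column k of A.
[found, loc] = ismember(A.', C, 'rows');
keepA = false(1, size(A, 2));
keepA(found) = idxC(loc(found));                  % mask in the order of A
out = A(:, keepA);
```

Here loc is [2 3 4 1], and out contains the columns [1007;552;2010] and [4044;300;1113], without having to sort A first.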

answered Oct 23 '22 12:10

Ikaros

In this case, you should not be trying to eliminate loops. The vectorization is actually hurting you badly.

In particular (giving a name to your anonymous lambda)

issubset = @(x) all(ismember(A4{ii},x))

is ridiculously inefficient, because it doesn't short-circuit. Replace that with a loop.

Same for

any(cellfun(issubset, X))

Use an approach similar to this instead:

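idxC = true(size(A4));
NX = numel(X);
for ii = 1:length(A4)
    for jj = 1:NX
        xj = X{jj};
        issubset = true;
        % transpose: for iterates over columns, and A4{ii} is a column vector
        for A4i = A4{ii}.'
            if ~ismember(A4i, xj)
                issubset = false;
                break;   % one missing value is enough to reject xj
            end
        end
        if issubset
            idxC(ii) = false;
            break;       % a containing vector was found; skip the rest of X
        end
    end
end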

The two break statements, and especially the second one, trigger an early exit that potentially saves you a huge amount of computation.
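For completeness, here is a rough end-to-end sketch of this early-exit approach on the second example from the question. Two things in it are my assumptions rather than part of the loop above: it reads the result from C (the sorted unique rows returned by unique) instead of A, and it forces each group into a row vector with (:).' so the inner for visits one value at a time. The name vals is made up for this example.

```matlab
M = [1007 4044 1007 4044 1007 5002 5002 5002 622 622 1007 1007 1007;
      552  300  552  300  552  431  431  431 124 124  552   11   11;
     2010 1113 2010 1113 2010 1100 1100 1100  88  88 2010   20   20;
       12   25   15   12   30    2   10   55  32  12    7   12    7];
X = {[2 5 68 44],[2 10 55 9 17],[1 55 6 7 8 9],[32 12]};

[C, ~, subs] = unique(M(1:3,:).', 'rows');        % rows of C are sorted
A4 = accumarray(subs(:), M(4,:).', [], @(x) {x}); % values grouped per row of C

idxC = true(numel(A4), 1);
for ii = 1:numel(A4)
    vals = A4{ii}(:).';          % row vector, so the for loop is element-wise
    for jj = 1:numel(X)
        xj = X{jj};
        issubset = true;
        for v = vals
            if ~ismember(v, xj)
                issubset = false;
                break;           % stop at the first value missing from xj
            end
        end
        if issubset
            idxC(ii) = false;
            break;               % no need to test the remaining cells of X
        end
    end
end

out = C(idxC, :).';
```

On this data, out holds the columns [1007;11;20], [1007;552;2010] and [4044;300;1113], which is the result the question marks as correct.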

answered Oct 23 '22 13:10

Ben Voigt
