Strcmp for cell arrays of unequal length in MATLAB

Tags: matlab, strcmp, cell-array

Is there an easy way to find a smaller cell array of strings within a larger one? I've got two lists, one with unique elements, and one with repeating elements. I want to find whole occurrences of the specific pattern of the smaller array within the larger. I'm aware that strcmp will compare two cell arrays, but only if they're equal in length. My first thought was to step through subsets of the larger array using a loop, but there's got to be a better solution.

For example, in the following:

smallcellarray={'string1',...
                'string2',...
                'string3'};
largecellarray={'string1',...
                'string2',...
                'string3',...
                'string1',...
                'string2',...
                'string1',...
                'string2',...
                'string3'};

index=myfunction(largecellarray,smallcellarray)

would return

index=[1 1 1 0 0 1 1 1]
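
For reference, the loop-based idea mentioned in the question could look roughly like the sketch below. It is only a baseline for comparison, assuming both inputs are row cell arrays of strings; the variable names `window` and `k` are illustrative and not from the original post.

```matlab
% Example data from the question
smallcellarray = {'string1','string2','string3'};
largecellarray = {'string1','string2','string3','string1','string2', ...
                  'string1','string2','string3'};

nSmall = numel(smallcellarray);
nLarge = numel(largecellarray);
index  = zeros(1, nLarge);

% Slide a window of length nSmall over the larger array
for k = 1:(nLarge - nSmall + 1)
    window = largecellarray(k:k+nSmall-1);
    % strcmp compares equal-sized cell arrays element by element
    if all(strcmp(window, smallcellarray))
        index(k:k+nSmall-1) = 1;   % mark the whole occurrence
    end
end
% index is [1 1 1 0 0 1 1 1]
```

This makes one strcmp call per window position, which is the cost the vectorized answers avoid.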

asked Jun 30 '10 19:06

Doresoom

2 Answers

You can use the function ISMEMBER to find, for each cell in largecellarray, the index of the matching entry in smallcellarray (0 where there is no match), then use the function STRFIND (which works on numeric row vectors as well as strings) to find the starting indices of the smaller array within the larger:

>> nSmall = numel(smallcellarray);
>> [~, matchIndex] = ismember(largecellarray,...  %# Find the index of the 
                                smallcellarray);    %#   smallcellarray entry
                                                    %#   that each entry of
                                                    %#   largecellarray matches
>> startIndices = strfind(matchIndex,1:nSmall)  %# Starting indices where the
                                                %#   vector [1 2 3] occurs in
startIndices =                                  %#   matchIndex

     1     6
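
To make the intermediate step concrete, here is a small self-contained sketch using the data from the question. It shows what matchIndex contains before STRFIND is applied. It assumes, as the question states, that smallcellarray has no repeated entries; with repeats, ISMEMBER reports only the lowest matching index and the pattern 1:nSmall would never be found.

```matlab
smallcellarray = {'string1','string2','string3'};
largecellarray = {'string1','string2','string3','string1','string2', ...
                  'string1','string2','string3'};

% Second output of ismember: position in smallcellarray of each entry
% of largecellarray, or 0 if the entry does not appear there
[~, matchIndex] = ismember(largecellarray, smallcellarray);
% matchIndex is [1 2 3 1 2 1 2 3]

% A whole occurrence of smallcellarray shows up as the run 1,2,...,nSmall
nSmall = numel(smallcellarray);
startIndices = strfind(matchIndex, 1:nSmall);
% startIndices is [1 6]
```

Any string in largecellarray that is absent from smallcellarray becomes a 0 in matchIndex, so it can never be part of a match.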

Then it's a matter of building the vector index from these starting indices. Here's one way you could create this vector:

>> nLarge = numel(largecellarray);
>> endIndices = startIndices+nSmall;  %# Get the indices immediately after
                                      %#   where the vector [1 2 3] ends
>> index = zeros(1,nLarge+1);         %# Initialize index to zero, with one
                                      %#   extra entry for a final end marker
>> index(startIndices) = 1;           %# Mark the start index with a 1
>> index(endIndices) = index(endIndices)-1;  %# Subtract 1 one index after each
                                      %#   end (back-to-back matches cancel to 0
                                      %#   instead of overwriting a start mark)
>> index = cumsum(index(1:nLarge))    %# Take the cumulative sum, dropping the
                                      %#   extra entry at the end
index =

     1     1     1     0     0     1     1     1

Another way to create it using the function BSXFUN is given by Amro. Yet another way to create it is:

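index = cumsum([startIndices; ones(nSmall-1,numel(startIndices))],1);  %# Each column lists the indices of one
                                                                       %#   match (dim 1 keeps this correct
                                                                       %#   when nSmall is 1)
index = ismember(1:numel(largecellarray),index);  %# Logical vector, true at matched positions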
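
Putting the pieces together, the `myfunction` call from the question could be sketched as a function file along the following lines. This is one possible packaging of the ISMEMBER/STRFIND approach, not a definitive implementation. It assumes smallcellarray has unique entries, and it subtracts the end markers rather than assigning them so that back-to-back occurrences are still marked correctly.

```matlab
function index = myfunction(largecellarray, smallcellarray)
%MYFUNCTION Mark whole occurrences of smallcellarray within largecellarray.
%   Returns a 1-by-numel(largecellarray) vector of 0s and 1s.
    nSmall = numel(smallcellarray);
    nLarge = numel(largecellarray);

    % Position in smallcellarray of each large entry (0 if absent)
    [~, matchIndex] = ismember(largecellarray, smallcellarray);

    % strfind needs a row vector; find runs equal to 1:nSmall
    startIndices = strfind(matchIndex(:).', 1:nSmall);
    endIndices   = startIndices + nSmall;   % one past each match

    index = zeros(1, nLarge + 1);           % extra slot for a final end marker
    index(startIndices) = 1;
    index(endIndices)   = index(endIndices) - 1;
    index = cumsum(index(1:nLarge));        % 1 inside matches, 0 elsewhere
end
```

With the question's data, `myfunction(largecellarray, smallcellarray)` gives `[1 1 1 0 0 1 1 1]`.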

answered Sep 18 '22 14:09

gnovice

Here's my version (based on the answers of both @yuk and @gnovice):

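%# S is the small cell array and L the large one (both row cell arrays).
%# GRP2IDX requires the Statistics Toolbox.
g = grp2idx([S L])';                             %# Map each string to an integer label
idx = strfind(g(numel(S)+1:end),g(1:numel(S)));  %# Start indices of S's labels within L's labels
idx = bsxfun(@plus,idx',0:numel(S)-1);           %# Expand each start into its full index range

index = zeros(size(L));
index(idx(:)) = 1;                               %# Mark every index covered by a match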
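
If the Statistics Toolbox is not available, the same labeling idea can be sketched with the third output of UNIQUE, which also assigns one integer per distinct string. The version below is an illustrative variant rather than the answer's own code; the names `nS` and `starts` are introduced here, and S and L are filled with the question's data.

```matlab
S = {'string1','string2','string3'};
L = {'string1','string2','string3','string1','string2', ...
     'string1','string2','string3'};

% Third output of unique: an integer label for every element of [S L]
[~, ~, g] = unique([S L]);
g  = g(:).';                 % force a row vector for strfind
nS = numel(S);

% Find the label pattern of S inside the labels of L
starts = strfind(g(nS+1:end), g(1:nS));

% Expand each start index into the full range it covers
idx = bsxfun(@plus, starts(:), 0:nS-1);

index = zeros(size(L));
index(idx(:)) = 1;
% index is [1 1 1 0 0 1 1 1]
```

Because the labels come from the combined list, this variant does not depend on S having unique entries. Using `starts(:)` also keeps the BSXFUN call valid when there are no matches.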

answered Sep 19 '22 14:09

Amro
