Matching the occurrence and pattern of characters of String2 in String1

Tags: java, string, algorithm

I was asked this question in a phone interview for a summer internship, and tried to come up with an O(n*m) solution in Java (although it wasn't correct either).

I have a function that takes two strings, say "common" and "cmn". It should return True because 'c', 'm', 'n' occur in that order in "common". But if the arguments were "common" and "omn", it should return False: even though the characters occur in that order, 'o' also appears after 'm', which fails the pattern-match condition.

I have worked on it using HashMaps and ASCII arrays, but haven't found a convincing solution yet! From what I have read so far, could it be related to the Boyer-Moore or Levenshtein distance algorithms?

Hoping for respite at stackoverflow! :)

Edit: Some of the answers talk about reducing the word length or creating a hash set. As I understand it, this question cannot be solved with hash sets, because each occurrence/repetition of a character in the first string has its own significance. For "common", the PASS conditions are "con", "cmn", "cm", "cn", "mn", "on", "co". FAIL conditions that may seem otherwise are "com", "omn", "mon", "om". These are FALSE/FAIL because "o" occurs before as well as after "m". Another example: "google", "ole" would PASS, but "google", "gol" would fail because "o" also appears before a "g"!


asked May 03 '11 02:05

MadTest

3 Answers

I think it's quite simple. Run through the pattern and for every character get the index of its last occurrence in the string. The index must always increase, otherwise return false. So in pseudocode:

index = -1
foreach c in pattern
    checkindex = string.lastIndexOf(c)
    if checkindex == -1                   //not found
        return false
    if checkindex < index
        return false
    if string.firstIndexOf(c) < index     //characters in the wrong order
        return false
    index = checkindex
return true

Edit: you could further improve the code by passing index as the starting index to the lastIndexOf method. Then you wouldn't have to compare checkindex with index and the algorithm would be faster.

Updated: Fixed a bug in the algorithm. Additional condition added to consider the order of the letters in the pattern.
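As a rough illustration, the pseudocode above might translate to Java along these lines. This is only a sketch: the class name OrderedPattern and the method name matches are made up here, and Java's indexOf stands in for the pseudocode's firstIndexOf. The separate checkindex < index comparison is left out because the first-index check already implies it. Note that Java's lastIndexOf(ch, fromIndex) searches backward from fromIndex, so the start-index shortcut mentioned in the edit does not carry over directly.

```java
// Hypothetical names; a sketch of the pseudocode above, not a reference implementation.
class OrderedPattern {
    // True when every pattern character occurs in s and its first occurrence
    // is not before the last occurrence of the previous pattern character.
    static boolean matches(String s, String pattern) {
        int prevLast = -1; // last index of the previous pattern character
        for (int i = 0; i < pattern.length(); i++) {
            char c = pattern.charAt(i);
            int first = s.indexOf(c);
            if (first == -1) {
                return false; // character not found at all
            }
            if (first < prevLast) {
                return false; // c also occurs before the previous character ends
            }
            prevLast = s.lastIndexOf(c);
        }
        return true;
    }

    public static void main(String[] args) {
        System.out.println(matches("common", "cmn")); // true
        System.out.println(matches("common", "omn")); // false
    }
}
```

Each pattern character triggers two scans of s, so this runs in O(n*m). Like the pseudocode, it rejects a pattern character that does not occur in s, which is an assumption the question itself does not settle.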

answered Oct 29 '22 04:10

raymi

An excellent question. After a couple of hours of research I think I have found the solution. First of all, let me try explaining the question in a different way.

Requirement:

Let's consider the same example, 'common' (mainString) and 'cmn' (subString). First we need to be clear that any character can repeat within the mainString and also the subString, and since it's the pattern that we are concentrating on, the index of each character plays a great role too. So we need to know:

Index of the character (lowest and highest)

Let's keep this on hold and go ahead and check the patterns a bit more. For the word common, we need to find whether the particular pattern cmn is present or not. The different patterns possible with common are (precedence applies):

c -> o
c -> m
c -> n
o -> m
o -> o
o -> n
m -> m
m -> o
m -> n
o -> n

At any moment this precedence and comparison must be valid. Since the precedence plays a huge role, we need to have the index of each unique character instead of storing the different patterns.

Solution

The first part of the solution is to create a hash table with the following criteria:

Create a hash table with each character of the mainString as the key
Each entry for a unique key in the hash table will store two indices, i.e. lowerIndex and higherIndex
Loop through the mainString and, for every new character, add a new entry whose lowerIndex and higherIndex are both the current index of the character in mainString.
If a collision occurs (the character is already in the table), update the higherIndex entry with the current index; do this until the end of the string

The second and main part is the pattern matching:

Set Flag to True
Loop through the subString and, for every character as the key, retrieve the details from the hash table.
Do the same for the very next character.

Just before the loop increment, verify two conditions:

If higherIndex(current character) > higherIndex(next character) Then
   Pattern Fails, Flag <- False, Terminate Loop
   // This condition is applicable for almost all the cases for pattern matching

Else If higherIndex(current character) > lowerIndex(next character) Then
   Pattern Fails, Flag <- False, Terminate Loop
   // This case is explicitly for cases in which patterns like 'mon' appear

Display the Flag

N.B.: Since I am not so well versed in Java, I did not submit the code. But someone can try implementing my idea.
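One possible Java rendering of the hash-table idea above is sketched here. The names IndexTable and matches are invented for illustration, and each table entry is a two-element int array holding the lowest and highest index. Instead of two separate conditions, the sketch uses a single comparison, the highest index of the current character against the lowest index of the next one. That covers both conditions listed above and rejects interleaved inputs such as "abab" with "ab". Treating a character that is missing from mainString as a failure is an assumption.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical names; a sketch of the hash-table idea, not the answerer's own code.
class IndexTable {
    static boolean matches(String mainString, String subString) {
        // Per character: range[0] = lowest index, range[1] = highest index.
        Map<Character, int[]> table = new HashMap<>();
        for (int i = 0; i < mainString.length(); i++) {
            char c = mainString.charAt(i);
            int[] range = table.get(c);
            if (range == null) {
                table.put(c, new int[] {i, i}); // first time this character is seen
            } else {
                range[1] = i;                   // seen before: move the highest index
            }
        }

        int[] current = null;
        for (int i = 0; i < subString.length(); i++) {
            int[] next = table.get(subString.charAt(i));
            if (next == null) {
                return false; // assumption: a character absent from mainString fails
            }
            if (current != null && current[1] > next[0]) {
                return false; // current character still occurs after next one starts
            }
            current = next;
        }
        return true;
    }

    public static void main(String[] args) {
        System.out.println(matches("common", "con")); // true
        System.out.println(matches("common", "mon")); // false
    }
}
```

Building the table takes one pass over mainString and the check takes one pass over subString, so the whole thing is O(n + m) with expected constant-time lookups.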

answered Oct 29 '22 03:10

NirmalGeo

I had solved this question myself in an inefficient manner, but it does give accurate results! I would appreciate it if anyone can work out an efficient algorithm from this!

Create a function "check" which takes 2 strings as arguments, s and p. Check each character of p in s. The order of appearance of each character of p should be verified as true in s.

Take character 0 from string p and traverse through the string s to find the index of its first occurrence.
Traverse through the filled ASCII array to find any value greater than the index of first occurrence. If found, return False.
Traverse further to find the last occurrence, and update the ASCII array.
Take character 1 from string p and traverse through the string s to find the index of its first occurrence in string s.
Traverse through the filled ASCII array to find any value greater than the index of first occurrence. If found, return False.
Traverse further to find the last occurrence, and update the ASCII array.

As can be observed, this is a brute-force method... I guess O(N^3), or more precisely O(m * n * 256) for a pattern of length m and a string of length n, since the third loop always scans the whole 256-entry array.

public class Interview
{
    public static void main(String[] args)
    {
        if (check("google", "oge"))
            System.out.println("yes");
        else
            System.out.println("sorry!");
    }

    // Returns false if some character of p occurs in s before an occurrence
    // of a character that precedes it in p. Assumes all character codes are < 256.
    public static boolean check(String s, String p)
    {
        int[] asciiArr = new int[256]; // latest index recorded in s for each matched character

        for (int pIndex = 0; pIndex < p.length(); pIndex++) // Loop 1: over p
        {
            for (int sIndex = 0; sIndex < s.length(); sIndex++) // Loop 2: over s
            {
                if (p.charAt(pIndex) == s.charAt(sIndex))
                {
                    asciiArr[s.charAt(sIndex)] = sIndex; // record the position under the character's code

                    for (int ascIndex = 0; ascIndex < 256; ascIndex++) // Loop 3: over the ASCII array
                    {
                        // a recorded position later than this one means the order is violated
                        if (asciiArr[ascIndex] > sIndex)
                            return false;
                    }
                }
            }
        }
        return true;
    }
}
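On the request for something more efficient: one way to get down to a single pass over s is to give every pattern character a rank (its position in p) and require that the ranks met while scanning s never decrease. The sketch below is an illustration under stated assumptions, not a tested interview answer. The class name RankScan is invented, the pattern is assumed to contain no repeated characters (the sketch throws otherwise), and every pattern character is required to occur in s, which differs from the code above.

```java
import java.util.Arrays;

// Hypothetical names; a sketch of a single-pass variant of the array approach.
class RankScan {
    static boolean check(String s, String p) {
        // rank[c] = position of c in p, or -1 when c is not a pattern character.
        int[] rank = new int[Character.MAX_VALUE + 1];
        Arrays.fill(rank, -1);
        for (int i = 0; i < p.length(); i++) {
            char c = p.charAt(i);
            if (rank[c] != -1) {
                // assumption: the pattern has distinct characters
                throw new IllegalArgumentException("repeated pattern character: " + c);
            }
            rank[c] = i;
        }

        boolean[] seen = new boolean[p.length()];
        int distinctSeen = 0;
        int maxRank = -1;
        for (int i = 0; i < s.length(); i++) {
            int r = rank[s.charAt(i)];
            if (r == -1) {
                continue;          // not a pattern character, ignore it
            }
            if (r < maxRank) {
                return false;      // an earlier pattern character shows up again
            }
            maxRank = r;
            if (!seen[r]) {
                seen[r] = true;
                distinctSeen++;
            }
        }
        return distinctSeen == p.length(); // assumption: every pattern character must occur
    }

    public static void main(String[] args) {
        System.out.println(check("google", "ole")); // true
        System.out.println(check("google", "gol")); // false
    }
}
```

The scan touches each character of s and p once, so the running time is O(n + m), plus the one-off cost of filling the rank array.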

answered Oct 29 '22 03:10

MadTest
