A friend of mine just had his interview at Google and got rejected because he couldn't give a solution to this question. I have my own interview in a couple of days and can't seem to figure out a way to solve it. Here's the question: <blockquote> You are given a pattern, such as [a b a b]. You are also given a string, example "redblueredblue". I need to write a program that tells whether the string follows the given pattern or not. A few examples: Pattern: [a b b a] String: catdogdogcat returns 1 Pattern: [a b a b] String: redblueredblue returns 1 Pattern: [a b b a] String: redblueredblue returns 0 </blockquote> I thought of a few approaches, like getting the number of unique characters in the pattern and then finding that many unique substrings of the string then comparing with the pattern using a hashmap. However, that turns out to be a problem if the substring of a is a part of b. It'd be really great if any of you could help me out with it. :) UPDATE: Added Info: There can be any number of characters in the pattern (a-z). Two characters won't represent the same substring. Also, a character can't represent an empty string.

The simplest solution that I can think of is to divide the given string into four parts and compare the individual parts. You don't know how long <code>a</code> or <code>b</code> is, but both <code>a</code>s are of the same length as well as <code>b</code>s are. So the number of ways how to divide the given string is not very large. Example: pattern = <code>[a b a b]</code>, given string = <code>redblueredblue</code> (14 characters in total) <ol> <li> <code>|a|</code> (length of <code>a</code>) = 1, then that makes 2 characters for <code>a</code>s and 12 characters is left for <code>b</code>s, i.e. <code>|b|</code> = 6. Divided string = <code>r edblue r edblue</code>. Whoa, this matches right away!</li> <li>(just out of curiosity) <code>|a| = 2, |b| = 5</code> -> divided string = <code>re dblue re dblue</code> -> match</li> </ol> Example 2: pattern = <code>[a b a b]</code>, string = <code>redbluebluered</code> (14 characters in total) <ol> <li> <code>|a| = 1, |b| = 6</code> -> divided string = <code>r edblue b luered</code> -> no match</li> <li> <code>|a| = 2, |b| = 5</code> -> divided string = <code>re dblue bl uered</code> -> no match</li> <li> <code>|a| = 3, |b| = 4</code> -> divided string = <code>red blue blu ered</code> -> no match</li> </ol> The rest is not needed to be checked because if you switched <code>a</code> for <code>b</code> and vice versa, the situation is identical. What is the pattern that has [a b c a b c] ?

Don't you just need to translate the pattern to a regexp using backreferences, i.e. something like this (Python 3 with the "re" module loaded): <pre class="prettyprint"><code>>>> print(re.match('(.+)(.+)\\2\\1', 'catdogdogcat')) <_sre.SRE_Match object; span=(0, 12), match='catdogdogcat'> >>> print(re.match('(.+)(.+)\\1\\2', 'redblueredblue')) <_sre.SRE_Match object; span=(0, 14), match='redblueredblue'> >>> print(re.match('(.+)(.+)\\2\\1', 'redblueredblue')) None </code></pre> The regexp looks pretty trivial to generate. If you need to support more than 9 backrefs, you can use named groups - see the Python regexp docs.

Check if the given string follows the given pattern

Tags:

A friend of mine just had his interview at Google and got rejected because he couldn't give a solution to this question.

I have my own interview in a couple of days and can't seem to figure out a way to solve it.

Here's the question:

You are given a pattern, such as [a b a b]. You are also given a string, example "redblueredblue". I need to write a program that tells whether the string follows the given pattern or not.

A few examples:

Pattern: [a b b a] String: catdogdogcat returns 1

Pattern: [a b a b] String: redblueredblue returns 1

Pattern: [a b b a] String: redblueredblue returns 0

I thought of a few approaches, like getting the number of unique characters in the pattern and then finding that many unique substrings of the string then comparing with the pattern using a hashmap. However, that turns out to be a problem if the substring of a is a part of b.

It'd be really great if any of you could help me out with it. :)

UPDATE:

Added Info: There can be any number of characters in the pattern (a-z). Two characters won't represent the same substring. Also, a character can't represent an empty string.

473

asked Nov 02 '14 18:11

shashankg77

2 Answers

The simplest solution that I can think of is to divide the given string into four parts and compare the individual parts. You don't know how long a or b is, but both as are of the same length as well as bs are. So the number of ways how to divide the given string is not very large.

Example: pattern = [a b a b], given string = redblueredblue (14 characters in total)

|a| (length of a) = 1, then that makes 2 characters for as and 12 characters is left for bs, i.e. |b| = 6. Divided string = r edblue r edblue. Whoa, this matches right away!
(just out of curiosity) |a| = 2, |b| = 5 -> divided string = re dblue re dblue -> match

Example 2: pattern = [a b a b], string = redbluebluered (14 characters in total)

|a| = 1, |b| = 6 -> divided string = r edblue b luered -> no match
|a| = 2, |b| = 5 -> divided string = re dblue bl uered -> no match
|a| = 3, |b| = 4 -> divided string = red blue blu ered -> no match

The rest is not needed to be checked because if you switched a for b and vice versa, the situation is identical.

What is the pattern that has [a b c a b c] ?

183

answered Oct 25 '22 22:10

zegkljan

Don't you just need to translate the pattern to a regexp using backreferences, i.e. something like this (Python 3 with the "re" module loaded):

>>> print(re.match('(.+)(.+)\\2\\1', 'catdogdogcat'))
<_sre.SRE_Match object; span=(0, 12), match='catdogdogcat'>

>>> print(re.match('(.+)(.+)\\1\\2', 'redblueredblue'))
<_sre.SRE_Match object; span=(0, 14), match='redblueredblue'>

>>> print(re.match('(.+)(.+)\\2\\1', 'redblueredblue'))
None

The regexp looks pretty trivial to generate. If you need to support more than 9 backrefs, you can use named groups - see the Python regexp docs.

answered Oct 25 '22 23:10

EricM

Related questions
                            
                                Animate a bezier path drawn in drawRect() Swift
                            
                                Browser performance tests through selenium
                            
                                getting the key index in a Python OrderedDict?
                            
                                urllib HTTPS request: <urlopen error unknown url type: https>
                            
                                ClassNotFoundException: Didn't find class "android.os.PersistableBundle" Otto Android 5.0
                            
                                What do two plus signs in a git diff mean?
                            
                                How to detect if a user clicks browser back button in Angularjs
                            
                                What does regex "\\p{Z}" mean?
                            
                                What is the definition of "feature" in neural network?
                            
                                Local site testing with BrowserStack and self-signed certificates
                            
                                RESTful routing best practice when referencing current_user from route?
                            
                                How can I write a 3 byte unicode character as string literal

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With