What's a regular expression that validates if a string is pandigital (containing all digits from 1 to 9 exactly once)? For example: <pre class="prettyprint"><code>123456789 891364572 </code></pre> But not: <pre class="prettyprint"><code>11234556789 25896471 </code></pre> I know how to do this without regex but I was unable to form a regex for it. Thanks. This is not homework.

Short and sweet, using a negative lookahead: <pre class="prettyprint"><code>/^(?!.*([1-9]).*\1)[1-9]{9}$/ </code></pre> <ul> <li> <code>[1-9]</code> is the character class for nonzero digits - equivalent to <code>[123456789]</code> </li> <li> <code>.*</code> matches any string of any length.</li> <li> <code>.*([1-9]).*\1.*</code> matches any string with that contains at least two occurrences of the same nonzero digit <ul> <li>a nonzero digit is matched and captured by <code>([1-9])</code> </li> <li>a repeat of that nonzero digit is matched by <code>\1</code>, a back-reference to the first captured match.</li> <li>the <code>.*</code> matches the arbitrary padding before, and between the nonzero digit and its repeat.</li> </ul> </li> <li> <code>(?!<pattern>)</code> matches any position where the contained pattern doesn't match. This is a negative lookahead, as it only matches a position in the string, and doesn't consume any of it - just looks ahead to compare it with the contained pattern.</li> <li> <code>[1-9]{9}</code> matches nine nonzeo digits. <ul> <li> <code><pattern>{9}</code> means match the preceding pattern 9 times.</li> </ul> </li> <li> <code>^<pattern>$</code> matches any string that exactly matches the contained pattern (rather than contains a substring that matches the pattern) <ul> <li> <code>^</code> matches the position at the beginning of a string OR the beginning of a line</li> <li> <code>$</code> matches the position at the end of a string OR the end of a line</li> </ul> </li> </ul> So combined, we check to make sure that it's not repeating any digits, then we check that it's only digits. Since it's 9 digits long, and none repeat, all must show up exactly once. That's the pigeonhole principle at work! The syntax for your specific regular expression engine may vary. The above is a PCRE (supported in Perl, Ruby, and a bunch of different other languages). Posix regular expressions have slightly different syntax. Not all engines support negative lookaheads, but most support back-references. Neither are part of the definition of formal theoretic regular expressions, but are very convenient.

Regex is not exactly the best tool for the job here, but here you go: <pre class="prettyprint"><code>^(?=[^1]*1[^1]*$)(?=[^2]*2[^2]*$)(?=[^3]*3[^3]*$)(?=[^4]*4[^4]*$)(?=[^5]*5[^5]*$)(?=[^6]*6[^6]*$)(?=[^7]*7[^7]*$)(?=[^8]*8[^8]*$)(?=[^9]*9[^9]*$)[1-9]+$ </code></pre> <code>(?= )</code> is a look-ahead. It does not actually fit the description of regular expressions, as it does not describe a regular language.

Pandigital Regex?

Tags:

regex

What's a regular expression that validates if a string is pandigital (containing all digits from 1 to 9 exactly once)?

For example:

Click to copy

123456789
891364572

But not:

Click to copy

11234556789
25896471

I know how to do this without regex but I was unable to form a regex for it.

Thanks.

This is not homework.

244

asked Apr 17 '09 01:04

Bai Li

2 Answers

Short and sweet, using a negative lookahead:

Click to copy

/^(?!.*([1-9]).*\1)[1-9]{9}$/

[1-9] is the character class for nonzero digits - equivalent to [123456789]
.* matches any string of any length.
.*([1-9]).*\1.* matches any string with that contains at least two occurrences of the same nonzero digit
- a nonzero digit is matched and captured by ([1-9])
- a repeat of that nonzero digit is matched by \1, a back-reference to the first captured match.
- the .* matches the arbitrary padding before, and between the nonzero digit and its repeat.
(?!<pattern>) matches any position where the contained pattern doesn't match. This is a negative lookahead, as it only matches a position in the string, and doesn't consume any of it - just looks ahead to compare it with the contained pattern.
[1-9]{9} matches nine nonzeo digits.
- <pattern>{9} means match the preceding pattern 9 times.
^<pattern>$ matches any string that exactly matches the contained pattern (rather than contains a substring that matches the pattern)
- ^ matches the position at the beginning of a string OR the beginning of a line
- $ matches the position at the end of a string OR the end of a line

So combined, we check to make sure that it's not repeating any digits, then we check that it's only digits. Since it's 9 digits long, and none repeat, all must show up exactly once. That's the pigeonhole principle at work!

The syntax for your specific regular expression engine may vary. The above is a PCRE (supported in Perl, Ruby, and a bunch of different other languages). Posix regular expressions have slightly different syntax. Not all engines support negative lookaheads, but most support back-references. Neither are part of the definition of formal theoretic regular expressions, but are very convenient.

answered Oct 31 '22 02:10

rampion

Regex is not exactly the best tool for the job here, but here you go:

Click to copy

^(?=[^1]*1[^1]*$)(?=[^2]*2[^2]*$)(?=[^3]*3[^3]*$)(?=[^4]*4[^4]*$)(?=[^5]*5[^5]*$)(?=[^6]*6[^6]*$)(?=[^7]*7[^7]*$)(?=[^8]*8[^8]*$)(?=[^9]*9[^9]*$)[1-9]+$

(?= ) is a look-ahead. It does not actually fit the description of regular expressions, as it does not describe a regular language.

answered Oct 31 '22 00:10

Markus Jarderot

Related questions
                            
                                Insert spaces into a string using Ruby [closed]
                            
                                Invalid group reference in python 2.7+
                            
                                Java remove escaped double-quote
                            
                                regular expression to detect ISO language code
                            
                                Regex whitespace word boundary
                            
                                How to sanitize a String for regexp
                            
                                Finding HTML tags in string
                            
                                Is this the shortest javascript regex to find all uppercase consonants?
                            
                                Removing consecutive duplicates words out of text using Regex and displaying the new text
                            
                                Javascript: Named Capture Groups
                            
                                Replace keys in template string with object properties
                            
                                Why does this RegExp exec cause an infinite loop?
                            
                                C# - Getting file names starting with a specific format in a directory
                            
                                How to use regular expression in the WHERE clause of query in Laravel?
                            
                                regular expression add double quotes around values and keys in javascript
                            
                                Regular expression exclude double character
                            
                                Regular expression is not working with single character
                            
                                Best way to fetch a varying HTML tag
                            
                                How do I tokenize input using Java's Scanner class and regular expressions?
                            
                                Regex to parse define() contents, possible?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandigital Regex?

Tags:

regex

Bai Li

People also ask

2 Answers

rampion

Markus Jarderot

Recent Activity

Donate For Us