Regex to find 3 out of 4 conditions

Tags: regex, passwords

My client has requested that passwords on their system must follow a specific set of validation rules, and I'm having great difficulty coming up with a "nice" regular expression.

The rules I have been given are...

Minimum of 8 characters
Allow any character
Must have at least one instance from three of the four following character types...
1. Upper case character
2. Lower case character
3. Numeric digit
4. "Special Character"

When I pressed for more detail, I was told that "Special Characters" are literally everything else (including spaces).

I can easily check for at least one instance for all four, using the following...

^(?=.*?[A-Z])(?=.*?[a-z])(?=.*?\d)(?=.*?[^a-zA-Z0-9]).{8,}$

The following works, but it's horrible and messy...

^((?=.*?[A-Z])(?=.*?[a-z])(?=.*?\d)|(?=.*?[A-Z])(?=.*?[a-z])(?=.*?[^a-zA-Z0-9])|(?=.*?[A-Z])(?=.*?\d)(?=.*?[^a-zA-Z0-9])|(?=.*?[a-z])(?=.*?\d)(?=.*?[^a-zA-Z0-9])).{8,}$

So you don't have to work it out yourself, the above is checking for (1,2,3|1,2,4|1,3,4|2,3,4) which are the 4 possible combinations of the 4 groups (where the number relates to the "types" in the set of rules).
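As a quick sanity check of the pattern above, here is a minimal JavaScript sketch that runs it against a handful of passwords. The sample passwords and the `samples`/`results` names are made up for illustration; only the regex itself comes from the question.

```javascript
// The "3 of 4" pattern from the question, copied verbatim.
const threeOfFour = /^((?=.*?[A-Z])(?=.*?[a-z])(?=.*?\d)|(?=.*?[A-Z])(?=.*?[a-z])(?=.*?[^a-zA-Z0-9])|(?=.*?[A-Z])(?=.*?\d)(?=.*?[^a-zA-Z0-9])|(?=.*?[a-z])(?=.*?\d)(?=.*?[^a-zA-Z0-9])).{8,}$/;

// Hypothetical sample passwords with the result the rules call for.
const samples = [
  ["Abcdefg1", true],   // groups 1,2,3
  ["Abcdefg!", true],   // groups 1,2,4
  ["ABCDEFG1 ", true],  // groups 1,3,4 (a space counts as "special")
  ["abcdefg1_", true],  // groups 2,3,4 (underscore counts as "special")
  ["Abcd1!", false],    // all four groups, but shorter than 8 characters
  ["abcdefgh1", false], // only groups 2,3
  ["ABCDEFGH", false],  // only group 1
];

const results = samples.map(([password, expected]) => ({
  password,
  expected,
  actual: threeOfFour.test(password),
}));

for (const r of results) {
  console.log(JSON.stringify(r.password), r.actual === r.expected ? "ok" : "MISMATCH");
}
```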

Is there a "nicer", cleaner or easier way of doing this?

(Please note, this is going to be used in an <asp:RegularExpressionValidator> control in an ASP.NET website, so it needs to be a valid regex for both .NET and JavaScript.)

763

asked Mar 05 '14 17:03

freefaller

1 Answer

It's not much of a better solution, but you can reduce [^a-zA-Z0-9] to [\W_], since a word character is any letter, digit or the underscore character. I don't think you can avoid the alternation when trying to do this in a single regex. I think you pretty much have the best solution.
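The claimed equivalence of the two character classes can be checked mechanically. The following is a minimal JavaScript sketch (the variable names are illustrative) that compares both classes for every UTF-16 code unit. It only says something about JavaScript's default `\W`; in .NET `\w` is Unicode-aware by default, so a letter such as `é` would presumably be treated differently by the two classes on the server side, which is worth testing separately.

```javascript
// Class used in the question and the shorter class suggested in the answer.
const original = /[^a-zA-Z0-9]/;
const shorter = /[\W_]/;

// Collect every UTF-16 code unit on which the two classes disagree.
const mismatches = [];
for (let code = 0; code <= 0xffff; code++) {
  const ch = String.fromCharCode(code);
  if (original.test(ch) !== shorter.test(ch)) {
    mismatches.push(code);
  }
}

console.log(mismatches.length); // 0 in JavaScript (no "u" or "i" flag)
```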

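One slight optimization: the two alternatives that require a digit, a special character and either a lower-case or an upper-case letter (informally, \d*[a-z]\w_*|\d*[A-Z]\w_*) can be merged into one alternative that requires a digit, a special character and any letter (\d*[a-zA-Z]\w_*), so I could remove one of the alternation sets. If exactly 3 out of 4 were required this wouldn't work, but since a password containing all four types (\d*[A-Z][a-z]\w_*) is implicitly allowed, it works.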
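To see that merging the two alternatives does not change the result, one can compare the four-alternative pattern from the question with the three-alternative pattern on one password per combination of character types. This is a minimal JavaScript sketch under a few assumptions: the representative characters (`A`, `b`, `7`, `#`) are arbitrary, and the answer's pattern is wrapped in `^(?:...)$` to imitate a validator that requires the whole input to match.

```javascript
// Four-alternative pattern from the question.
const question = /^((?=.*?[A-Z])(?=.*?[a-z])(?=.*?\d)|(?=.*?[A-Z])(?=.*?[a-z])(?=.*?[^a-zA-Z0-9])|(?=.*?[A-Z])(?=.*?\d)(?=.*?[^a-zA-Z0-9])|(?=.*?[a-z])(?=.*?\d)(?=.*?[^a-zA-Z0-9])).{8,}$/;

// Three-alternative pattern from the answer, anchored for a whole-string match.
const answerSource = String.raw`(?=.{8,})((?=.*\d)(?=.*[a-z])(?=.*[A-Z])|(?=.*\d)(?=.*[a-zA-Z])(?=.*[\W_])|(?=.*[a-z])(?=.*[A-Z])(?=.*[\W_])).*`;
const answer = new RegExp("^(?:" + answerSource + ")$");

// One arbitrary representative per character type: upper, lower, digit, special.
const representatives = ["A", "b", "7", "#"];

const disagreements = [];
for (let mask = 1; mask < 16; mask++) {
  // Pick the types selected by the bit mask and cycle them up to 8 characters.
  const chosen = representatives.filter((_, i) => (mask & (1 << i)) !== 0);
  const password = Array.from({ length: 8 }, (_, i) => chosen[i % chosen.length]).join("");
  const expected = chosen.length >= 3;
  if (question.test(password) !== expected || answer.test(password) !== expected) {
    disagreements.push(password);
  }
}

console.log(disagreements.length); // 0: both patterns accept exactly the 3-of-4 and 4-of-4 cases
```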

(?=.{8,})((?=.*\d)(?=.*[a-z])(?=.*[A-Z])|(?=.*\d)(?=.*[a-zA-Z])(?=.*[\W_])|(?=.*[a-z])(?=.*[A-Z])(?=.*[\W_])).*

Extended version:

(?=.{8,})(
  (?=.*\d)(?=.*[a-z])(?=.*[A-Z])|
  (?=.*\d)(?=.*[a-zA-Z])(?=.*[\W_])|
  (?=.*[a-z])(?=.*[A-Z])(?=.*[\W_])
).*
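The spaced-out layout is meant for reading. In .NET the embedded whitespace is only ignored when the `RegexOptions.IgnorePatternWhitespace` option is in effect, and JavaScript's `RegExp` has no free-spacing flag at all. One way to keep the readable layout on the JavaScript side is to assemble the pattern from parts, as in this minimal sketch (the `alternatives`, `source` and `compact` names are illustrative):

```javascript
// One entry per alternative, mirroring the extended layout.
const alternatives = [
  String.raw`(?=.*\d)(?=.*[a-z])(?=.*[A-Z])`,      // digit, lower, upper
  String.raw`(?=.*\d)(?=.*[a-zA-Z])(?=.*[\W_])`,   // digit, any letter, special
  String.raw`(?=.*[a-z])(?=.*[A-Z])(?=.*[\W_])`,   // lower, upper, special
];

// Length check, then the alternation, then the rest of the input.
const source = "(?=.{8,})(" + alternatives.join("|") + ").*";

// The single-line pattern from the answer, for comparison.
const compact = String.raw`(?=.{8,})((?=.*\d)(?=.*[a-z])(?=.*[A-Z])|(?=.*\d)(?=.*[a-zA-Z])(?=.*[\W_])|(?=.*[a-z])(?=.*[A-Z])(?=.*[\W_])).*`;

console.log(source === compact); // true: the assembled pattern is the same string
```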

Because of the fourth condition specified by the OP, the [\W_] set counts even unprintable characters, such as newlines, as special characters. If this is unacceptable, then replace the set that contains \W with a more specific set of special characters.
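As an illustration of narrowing the special-character set, the sketch below swaps `[\W_]` for a class covering only printable ASCII punctuation and the space character. That class is an assumption chosen for the example, not something the OP specified, and the patterns are wrapped in `^(?:...)$` to imitate a whole-string match.

```javascript
// Pattern from the answer, with [\W_] as the "special" class.
const looseSource = String.raw`(?=.{8,})((?=.*\d)(?=.*[a-z])(?=.*[A-Z])|(?=.*\d)(?=.*[a-zA-Z])(?=.*[\W_])|(?=.*[a-z])(?=.*[A-Z])(?=.*[\W_])).*`;

// Assumed replacement: space plus printable ASCII punctuation only.
const printableSpecial = String.raw`[\x20-\x2F\x3A-\x40\x5B-\x60\x7B-\x7E]`;

// Swap every occurrence of [\W_] for the narrower class.
const strictSource = looseSource.split(String.raw`[\W_]`).join(printableSpecial);

const loose = new RegExp("^(?:" + looseSource + ")$");
const strict = new RegExp("^(?:" + strictSource + ")$");

const withTab = "abcdefg1\t"; // lower case, digit, and a tab character
console.log(loose.test(withTab));      // true: the tab counts as "special"
console.log(strict.test(withTab));     // false: the tab is not in the narrower class
console.log(strict.test("abcdefg1!")); // true: "!" is printable punctuation
```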

108

answered Sep 19 '22 07:09

Daniel Gimenez

