Regular Expression for alphanumeric and underscores

Tags:

regex

To match a string that contains only those characters (or an empty string), try

"^[a-zA-Z0-9_]*$"

This works for .NET regular expressions, and probably a lot of other languages as well.

Breaking it down:

^ : start of string
[ : beginning of character group
a-z : any lowercase letter
A-Z : any uppercase letter
0-9 : any digit
_ : underscore
] : end of character group
* : zero or more of the given characters
$ : end of string

If you don't want to allow empty strings, use + instead of *.
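As a minimal sketch of the pattern above in Python (the helper name `is_word_string` and its `allow_empty` parameter are hypothetical, chosen only for illustration): Python's `$` also matches just before a trailing newline, so this version relies on `fullmatch` instead of writing `^` and `$` explicitly.

```python
import re

# Explicit character class; fullmatch() requires the whole string to match,
# which plays the role of ^ and $ in the pattern above.
ONE_OR_MORE = re.compile(r"[a-zA-Z0-9_]+")   # rejects the empty string
ZERO_OR_MORE = re.compile(r"[a-zA-Z0-9_]*")  # accepts the empty string


def is_word_string(text: str, allow_empty: bool = False) -> bool:
    """Return True if text contains only ASCII letters, digits, or underscores."""
    pattern = ZERO_OR_MORE if allow_empty else ONE_OR_MORE
    return pattern.fullmatch(text) is not None


print(is_word_string("abc_123"))             # only allowed characters
print(is_word_string("abc-123"))             # contains a dash
print(is_word_string(""))                    # empty, + quantifier
print(is_word_string("", allow_empty=True))  # empty, * quantifier
```

Switching between `+` and `*` is the only difference between the two compiled patterns.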

As others have pointed out, some regex languages have a shorthand form for [a-zA-Z0-9_]. In the .NET regex language, you can turn on ECMAScript behavior and use \w as a shorthand (yielding ^\w*$ or ^\w+$). Note that in other languages, and by default in .NET, \w is somewhat broader, and will match other sorts of Unicode characters as well (thanks to Jan for pointing this out). So if you're really intending to match only those characters, using the explicit (longer) form is probably best.
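The difference between the broader `\w` and the explicit class can be checked directly. The sketch below uses Python rather than .NET, on the assumption that Python's `re` module is a fair stand-in: for `str` patterns its `\w` is Unicode-aware by default, and the `re.ASCII` flag narrows it to `[a-zA-Z0-9_]`. The sample string is made up for illustration.

```python
import re

text = "façade_1"  # illustrative input containing a non-ASCII letter

# Default behaviour for str patterns: \w also matches Unicode letters.
unicode_match = re.fullmatch(r"\w+", text)

# With re.ASCII, \w is limited to [a-zA-Z0-9_].
ascii_match = re.fullmatch(r"\w+", text, re.ASCII)

# The explicit class never depends on flags.
explicit_match = re.fullmatch(r"[a-zA-Z0-9_]+", text)

print(unicode_match is not None)
print(ascii_match is not None)
print(explicit_match is not None)
```

If only ASCII letters, digits, and underscores should pass, the explicit class (or `\w` with an ASCII-only mode) is the safer choice.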

There's a lot of verbosity in here, and I'm deeply against it, so, my conclusive answer would be:

/^\w+$/

\w is equivalent to [A-Za-z0-9_], which is pretty much what you want (unless Unicode enters the mix, in which case \w matches more than that in some regex flavors).

Using the + quantifier you'll match one or more characters. If you want to accept an empty string too, use * instead.

You want to check that each character matches your requirements, which is why we use:

[A-Za-z0-9_]

And you can even use the shorthand version:

\w

Which is equivalent (in some regex flavors, so make sure you check before you use it). Then, to indicate that the entire string must match, you use:

^

To indicate the string must start with that character class, then use

$

To indicate the string must end with that character class. Then use

\w+ or \w*

To indicate "1 or more", or "0 or more". Putting it all together, we have:

^\w*$

Um...question: Does it need to have at least one character or no? Can it be an empty string?

^[A-Za-z0-9_]+$

This matches one or more uppercase or lowercase letters, digits, or underscores. If the string can be zero length, just replace the + with *:

^[A-Za-z0-9_]*$

Edit:

If diacritics need to be included (such as the cedilla in ç), you would need to use the word character class \w. In flavors where \w is Unicode-aware, it matches everything the patterns above do, plus letters with diacritics:

^\w+$

^\w*$
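One caveat worth checking when relying on `\w` for diacritics, sketched here in Python under the assumption that its `re` module is representative: a letter such as ç can be stored as one precomposed code point or as a base letter plus a combining mark, and Python's `\w` does not match the combining mark on its own. Normalizing to NFC first is one way to handle this. The sample string is made up for illustration.

```python
import re
import unicodedata

decomposed = "c\u0327a"  # 'c' + COMBINING CEDILLA + 'a'
composed = unicodedata.normalize("NFC", decomposed)  # 'ç' as a single code point + 'a'

# The precomposed form consists only of word characters.
print(re.fullmatch(r"\w+", composed) is not None)

# The combining mark in the decomposed form is not a word character.
print(re.fullmatch(r"\w+", decomposed) is not None)
```

Other regex flavors may treat combining marks differently, so it is worth testing against the engine actually in use.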
