I was wondering if someone out there could help me with a regex in C#. I think it's fairly simple but I've been wracking my brain over it and not quite sure why I'm having such a hard time. :) I've found a few examples around but I can't seem to manipulate them to do what I need. I just need to match ANY alphanumeric+dashes subdomain string that is not "www", and just up to the "." Also, ideally, if someone were to type "www.subdomain.domain.com" I would like the www to be ignored if possible. If not, it's not a huge issue. In other words, I would like to match: <ul> <li> (test).domain.com</li> <li> (test2).domain.com</li> <li> (wwwasdf).domain.com</li> <li> (asdfwww).domain.com</li> <li> (w).domain.com</li> <li> (wwwwww).domain.com</li> <li> (asfd-12345-www-bananas).domain.com</li> <li>www.(subdomain).domain.com</li> </ul> And I don't want to match: <ul> <li> (www).domain.com</li> </ul> It seems to me like it should be easy, but I'm having troubles with the "not match" part. For what it's worth, this is for use in the IIS 7 URL Rewrite Module, to rewrite for all non-www subdomains. Thanks!

Is the remainder of the domain name constant, like <code>.domain.com</code>, as in your examples? Try this: <pre class="prettyprint"><code>\b(?!www\.)(\w+(?:-\w+)*)(?=\.domain\.com\b) </code></pre> Explanation: <ul> <li><code>\w+(?:-\w+)*</code> matches a generic domain-name component as you described (but a little more rigorously).</li> <li><code>(?=\.domain\.com\b)</code> makes sure it's the first subdomain (i.e., the last one before the actual domain name).</li> <li><code>\b(?!www\.)</code> makes sure it isn't <code>www.</code> (without the <code>\b</code>, it could skip over the first <code>w</code> and match just the <code>ww.</code>).</li> </ul> In my tests, this regex matches precisely the parts you highlighted in your examples, and does not match the <code>www.</code> in either of the last two examples. <hr> EDIT: Here's another version which matches the whole name, capturing the pieces in different groups: <pre class="prettyprint"><code>^((?:\w+(?:-\w+)*\.)*)((?!www\.)\w+(?:-\w+)*)(\.domain\.com)$ </code></pre> In most cases, group <code>$1</code> will contain an empty string because there's nothing before the subdomain name, but here's how it breaks down <code>www.subdomain.domain.com</code>: <pre class="prettyprint"><code>$1: "www." $2: "subdomain" $3: ".domain.com" </code></pre>

Regex for ANY string except "www"? (subdomain)

2 Answers

Is the remainder of the domain name constant, like .domain.com, as in your examples? Try this:

\b(?!www\.)(\w+(?:-\w+)*)(?=\.domain\.com\b)

Explanation:

\w+(?:-\w+)* matches a generic domain-name component as you described (but a little more rigorously).
(?=\.domain\.com\b) makes sure it's the first subdomain (i.e., the last one before the actual domain name).
\b(?!www\.) makes sure it isn't www. (without the \b, it could skip over the first w and match just the ww.).

In my tests, this regex matches precisely the parts you highlighted in your examples, and does not match the www. in either of the last two examples.

EDIT: Here's another version which matches the whole name, capturing the pieces in different groups:

^((?:\w+(?:-\w+)*\.)*)((?!www\.)\w+(?:-\w+)*)(\.domain\.com)$

In most cases, group $1 will contain an empty string because there's nothing before the subdomain name, but here's how it breaks down www.subdomain.domain.com:

$1: "www."
$2: "subdomain"
$3: ".domain.com"

185

answered Sep 17 '22 23:09

Alan Moore

^www\.

And invert the logic for this bit, so if it matches, then your string does not meet your requirements.

answered Sep 19 '22 23:09

mopsled

Related questions
                            
                                Adding a file to a bucket on Amazon S3 using C#
                            
                                StreamReader and binary data
                            
                                How to cut a sub-part of an image using Emgu CV (or OpenCV)?
                            
                                WinForm ScrollViewer
                            
                                C# Deleting a .ZIP file after unzipping
                            
                                How to start a thread on a specific core?
                            
                                Fans-only content in facebook with asp.net C# sdk
                            
                                How to sort a string array by numeric style?
                            
                                Why can't I access the HttpContext from the Controller initializer?
                            
                                Method GetPrice() cannot be translated into a store expression
                            
                                Select nodes Linq to Xml C#
                            
                                How do I convert IEnumerable<Enum> to Enum in C#?
                            
                                Mousewheel bubbling up in winforms?
                            
                                Verifying datetime fluent nhibernate mappings
                            
                                WPF Dashed Border Control
                            
                                Task.ContinueWith not working how I expected
                            
                                In C# is ID / OK ever to be used?
                            
                                How to pass a subset of a collection to a C# method?
                            
                                Translation of this C# code to VB.NET
                            
                                Create an Index Based Class in c# .Net

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Regex for ANY string except "www"? (subdomain)

Tags:

string

c#

regex

asp.net

rewrite

trnelson

People also ask

2 Answers

Alan Moore

mopsled

Recent Activity

Donate For Us