I'm using the regex <pre class="prettyprint"><code>System.Text.RegularExpressions.Regex.Replace(stringToSplit, "([A-Z])", " $1").Trim() </code></pre> to split strings by capital letter, for example: 'MyNameIsSimon' becomes 'My Name Is Simon' I find this incredibly useful when working with enumerations. What I would like to do is change it slightly so that strings are only split if the next letter is a lowercase letter, for example: 'USAToday' would become 'USA Today' Can this be done? EDIT: Thanks to all for responding. I may not have entirely thought this through, in some cases 'A' and 'I' would need to be ignored but this is not possible (at least not in a meaningful way). In my case though the answers below do what I need. Thanks!

<pre class="prettyprint"> ((?<=[a-z])[A-Z]|[A-Z](?=[a-z])) </pre> or its Unicode-aware cousin <pre class="prettyprint"> ((?<=\p{Ll})\p{Lu}|\p{Lu}(?=\p{Ll})) </pre> when replaced globally with <pre class="prettyprint"><code>" $1" </code></pre> handles <pre class="prettyprint"> TodayILiveInTheUSAWithSimon USAToday IAmSOOOBored </pre> yielding <pre class="prettyprint"> Today I Live In The USA With Simon USA Today I Am SOOO Bored </pre> In a second step you'd have to trim the string.

Regular expression, split string by capital letter but ignore TLA

Tags:

.net

regex

I'm using the regex

System.Text.RegularExpressions.Regex.Replace(stringToSplit, "([A-Z])", " $1").Trim()

to split strings by capital letter, for example:

'MyNameIsSimon' becomes 'My Name Is Simon'

I find this incredibly useful when working with enumerations. What I would like to do is change it slightly so that strings are only split if the next letter is a lowercase letter, for example:

'USAToday' would become 'USA Today'

Can this be done?

EDIT: Thanks to all for responding. I may not have entirely thought this through, in some cases 'A' and 'I' would need to be ignored but this is not possible (at least not in a meaningful way). In my case though the answers below do what I need. Thanks!

284

asked Jul 08 '09 12:07

Simon

1 Answers

 ((?<=[a-z])[A-Z]|[A-Z](?=[a-z]))

or its Unicode-aware cousin

 ((?<=\p{Ll})\p{Lu}|\p{Lu}(?=\p{Ll}))

when replaced globally with

" $1"

handles

 TodayILiveInTheUSAWithSimon USAToday IAmSOOOBored

yielding

  Today I Live In The USA With Simon USA Today I Am SOOO Bored

In a second step you'd have to trim the string.

140

answered Sep 30 '22 09:09

Tomalak

Related questions
                            
                                Why isn't this causing an infinite loop of events?
                            
                                PowerShell And StringBuilder
                            
                                How do i use Activator.CreateInstance with strings?
                            
                                Looping through dictionary object
                            
                                What is the default buffer size for StreamWriter
                            
                                Is there a universal config file for .Net Standard 2.0 Class Library?
                            
                                Best sorting algorithms for C# / .NET in different scenarios
                            
                                Generating random, unique values C#
                            
                                Why does Add-Migration sometimes create duplicate migrations?
                            
                                How to tell if a "ZipArchiveEntry" is directory?
                            
                                Can't decide between Task<IActionResult>, IActionResult and ActionResult<Thing>
                            
                                Localizing enum descriptions attributes
                            
                                F# and "enterprise-level" reporting [closed]
                            
                                Class not registered error when creating Excel workbook in C#
                            
                                What's the difference between .NET CoreCLR, CoreRT, Roslyn and LLILC
                            
                                "Error Creating Window Handle"
                            
                                Writing string at the same position using Console.Write in C# 2.0
                            
                                Running .net based application without .NET Framework
                            
                                Proper way to Mock repository objects for unit tests using Moq and Unity
                            
                                Attaching to a child process automatically in Visual Studio during Debugging

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With