I'm trying to read a log file and extract some machine/setting information using regular expressions. Here is a sample from the log: <pre class="prettyprint"><code>... COMPUTER INFO: Computer Name: TESTCMP02 Windows User Name: testUser99 Time Since Last Reboot: 405 Minutes Processor: (2 processors) Intel(R) Xeon(R) CPU 5160 @ 3.00GHz OS Version: 5.1 .number 2600:Service Pack 2 Memory: RAM: 48% used, 3069.6 MB total, 1567.3 MB free ServerTimeOffSet: -146 Seconds Use Local Time for Log: True INITIAL SETTINGS: Command Line: /SKIPUPDATES Remote Online: True INI File: c:\demoapp\system\DEMOAPP.INI DatabaseName: testdb SQL Server: 10.254.58.1 SQL UserName: SQLUser ODBC Source: TestODBC Dynamic ODBC (not defined): True ... </code></pre> I would like to capture each 'block' of data, using the header as one group, and the data as a second (i.e. "COMPUTER INFO", "Computer Name:.......") and repeat this for each block. The expression if have so far is <pre class="prettyprint"><code>(?s)(\p{Lu}{1,} \p{Lu}{1,}:\r\n)(.*\r\n\r\n) </code></pre> This pulls out the block into the groups like it should, which is great. But I need to have it repeat the capture, which I can't seem to get. I've tried several grouping expressions, including: <pre class="prettyprint"><code>(?s)(?:(\p{Lu}{1,} \p{Lu}{1,}:\r\n)(.*\r\n\r\n))* </code></pre> which would seem to be correct, but I get back lots of NULL result groups with empty group item values. I'm using the .Net RegEx class to apply the expressions, can anyone help me out here?

It's not possible to have repeated groups. The group will contain the last match. You'll need to break this into two problems. First, find each section: <pre class="prettyprint"><code>new Regex(@"(?>^[A-Z\s]+:\s*$)\s*(?:(?!^\S).)*", RegexOptions.Singleline | RegexOptions.Multiline); </code></pre> And then, within each match, use another regex to match each field/value into groups: <pre class="prettyprint"><code>new Regex(@"^\s+(?<name>[^:]*):\s*(?<value>.*)$", RegexOptions.Multiline); </code></pre> <hr> The code to use this would look something like this: <pre class="prettyprint"><code>Regex sectionRegex = new Regex(@"(?>^[A-Z\s]+:\s*$)\s*(?:(?!^\S).)*", RegexOptions.Singleline | RegexOptions.Multiline); Regex nameValueRegex = new Regex(@"^\s+(?<name>[^:]*):\s*(?<value>.*)$", RegexOptions.Multiline); MatchCollection sections = sectionRegex.Matches(logData); foreach (Match section in sections) { MatchCollection nameValues = nameValueRegex.Matches(section.ToString()); foreach (Match nameValue in nameValues) { string name = nameValue.Groups["name"].Value; string value = nameValue.Groups["value"].Value; // OK, do something here. } } </code></pre>

<pre class="prettyprint"><code>((?<header>[^:]+:)(?<content>[^\r\n]+)?\r\n)+ </code></pre> or, if you have empty lines between items: <pre class="prettyprint"><code>(((?<header>[^:]+:)(?<content>[^\r\n]+)?\r\n)|\r\n)+ </code></pre>

Regular Expression - Repeating Groups

Tags:

.net

regex

I'm trying to read a log file and extract some machine/setting information using regular expressions. Here is a sample from the log:

...
COMPUTER INFO:
 Computer Name:                 TESTCMP02
 Windows User Name:             testUser99
 Time Since Last Reboot:        405 Minutes
 Processor:                     (2 processors) Intel(R) Xeon(R) CPU            5160  @ 3.00GHz
 OS Version:                    5.1 .number 2600:Service Pack 2
 Memory:                        RAM: 48% used, 3069.6 MB total, 1567.3 MB free
 ServerTimeOffSet:              -146 Seconds 
 Use Local Time for Log:        True

INITIAL SETTINGS:
 Command Line:                  /SKIPUPDATES
 Remote Online:                 True
 INI File:                      c:\demoapp\system\DEMOAPP.INI
 DatabaseName:                  testdb
 SQL Server:                    10.254.58.1
 SQL UserName:                  SQLUser
 ODBC Source:                   TestODBC
 Dynamic ODBC (not defined):    True
...

I would like to capture each 'block' of data, using the header as one group, and the data as a second (i.e. "COMPUTER INFO", "Computer Name:.......") and repeat this for each block. The expression if have so far is

(?s)(\p{Lu}{1,} \p{Lu}{1,}:\r\n)(.*\r\n\r\n)

This pulls out the block into the groups like it should, which is great. But I need to have it repeat the capture, which I can't seem to get. I've tried several grouping expressions, including:

(?s)(?:(\p{Lu}{1,} \p{Lu}{1,}:\r\n)(.*\r\n\r\n))*

which would seem to be correct, but I get back lots of NULL result groups with empty group item values. I'm using the .Net RegEx class to apply the expressions, can anyone help me out here?

484

asked Nov 06 '09 18:11

Jason

2 Answers

It's not possible to have repeated groups. The group will contain the last match.

You'll need to break this into two problems. First, find each section:

new Regex(@"(?>^[A-Z\s]+:\s*$)\s*(?:(?!^\S).)*", RegexOptions.Singleline | RegexOptions.Multiline);

And then, within each match, use another regex to match each field/value into groups:

new Regex(@"^\s+(?<name>[^:]*):\s*(?<value>.*)$", RegexOptions.Multiline);

The code to use this would look something like this:

Regex sectionRegex = new Regex(@"(?>^[A-Z\s]+:\s*$)\s*(?:(?!^\S).)*", RegexOptions.Singleline | RegexOptions.Multiline);
Regex nameValueRegex = new Regex(@"^\s+(?<name>[^:]*):\s*(?<value>.*)$", RegexOptions.Multiline);
MatchCollection sections = sectionRegex.Matches(logData);
foreach (Match section in sections)
{
    MatchCollection nameValues = nameValueRegex.Matches(section.ToString());
    foreach (Match nameValue in nameValues)
    {
        string name = nameValue.Groups["name"].Value;
        string value = nameValue.Groups["value"].Value;
        // OK, do something here.
    }
}

answered Sep 17 '22 18:09

Jeremy Stein

((?<header>[^:]+:)(?<content>[^\r\n]+)?\r\n)+

or, if you have empty lines between items:

(((?<header>[^:]+:)(?<content>[^\r\n]+)?\r\n)|\r\n)+

answered Sep 18 '22 18:09

Victor Hurdugaci

Related questions
                            
                                Heterogeneous Dictionary, but typed?
                            
                                How to avoid XML injection
                            
                                Is it possible to reuse a .NET WinForms Form object?
                            
                                How can I receive the "scroll box" type scroll events from a DataGridView?
                            
                                Updating a databound ComboBox
                            
                                Remove empty xmlns="" after Xml Serialization
                            
                                Do you think generic properties would be useful in .NET?
                            
                                Returning a nested generic Expression<Func<T, bool>>
                            
                                Single assembly from multiple projects
                            
                                Get the exact time for a remote server
                            
                                How can I force all derived classes to implement an abstract method or property?
                            
                                (.net) CriticalFinalizerObject - What does it really do?
                            
                                Interactive .NET Charting tools? [closed]
                            
                                How to access controls that are inside a TabControl tab?
                            
                                Merge and Update Two Lists in C#
                            
                                Does BinaryFormatter apply any compression?
                            
                                Mercurial workflow question (how to handle Config files)
                            
                                Location Coordinates On Computer Showing X=-32000, Y=-32000
                            
                                Memory usage, SortedList vs List problem
                            
                                How do you use an UpdatePanel properly?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With