I'm trying to use a regex for group matching. I want to extract two strings from one big string. The input string looks something like this: <pre class="prettyprint"><code>tХB:Username!Username@Username.tcc.domain.com Connected tХB:Username!Username@Username.tcc.domain.com WEBMSG #Username :this is a message tХB:Username!Username@Username.tcc.domain.com Status: visible </code></pre> The <code>Username</code> can be anything. Same goes for the end part <code>this is a message</code>. What I want to do is extract the Username that comes after the pound sign <code>#</code>. Not from any other place in the string, since that can vary aswell. I also want to get the message from the string that comes after the semicolon <code>:</code>. I tried that with the following regex. But it never outputs any results. <pre class="prettyprint"><code>regex rgx("WEBMSG #([a-zA-Z0-9]) :(.*?)"); smatch matches; for(size_t i=0; i<matches.size(); ++i) { cout << "MATCH: " << matches[i] << endl; } </code></pre> I'm not getting any matches. What is wrong with my regex?

Your regular expression is incorrect because neither capture group does what you want. The first is looking to match a single character from the set <code>[a-zA-Z0-9]</code> followed by <code><space>:</code>, which works for single character usernames, but nothing else. The second capture group will always be empty because you're looking for zero or more characters, but also specifying the match should not be greedy, which means a zero character match is a valid result. Fixing both of these your <code>regex</code> becomes <pre class="prettyprint"><code>std::regex rgx("WEBMSG #([a-zA-Z0-9]+) :(.*)"); </code></pre> But simply instantiating a <code>regex</code> and a <code>match_results</code> object does not produce matches, you need to apply a <code>regex</code> algorithm. Since you only want to match part of the input string the appropriate algorithm to use in this case is <code>regex_search</code>. <pre class="prettyprint"><code>std::regex_search(s, matches, rgx); </code></pre> Putting it all together <pre class="prettyprint"><code> std::string s{R"( tХB:Username!Username@Username.tcc.domain.com Connected tХB:Username!Username@Username.tcc.domain.com WEBMSG #Username :this is a message tХB:Username!Username@Username.tcc.domain.com Status: visible )"}; std::regex rgx("WEBMSG #([a-zA-Z0-9]+) :(.*)"); std::smatch matches; if(std::regex_search(s, matches, rgx)) { std::cout << "Match found\n"; for (size_t i = 0; i < matches.size(); ++i) { std::cout <</pre> Live demo

Regex grouping matches with C++ 11 regex library

Tags:

I'm trying to use a regex for group matching. I want to extract two strings from one big string.

The input string looks something like this:

tХB:[email protected] Connected tХB:[email protected] WEBMSG #Username :this is a message tХB:[email protected] Status: visible

The Username can be anything. Same goes for the end part this is a message.

What I want to do is extract the Username that comes after the pound sign #. Not from any other place in the string, since that can vary aswell. I also want to get the message from the string that comes after the semicolon :.

I tried that with the following regex. But it never outputs any results.

regex rgx("WEBMSG #([a-zA-Z0-9]) :(.*?)"); smatch matches;  for(size_t i=0; i<matches.size(); ++i) {     cout << "MATCH: " << matches[i] << endl; }

I'm not getting any matches. What is wrong with my regex?

798

asked Mar 28 '15 18:03

Vivendi

1 Answers

Your regular expression is incorrect because neither capture group does what you want. The first is looking to match a single character from the set [a-zA-Z0-9] followed by <space>:, which works for single character usernames, but nothing else. The second capture group will always be empty because you're looking for zero or more characters, but also specifying the match should not be greedy, which means a zero character match is a valid result.

Fixing both of these your regex becomes

std::regex rgx("WEBMSG #([a-zA-Z0-9]+) :(.*)");

But simply instantiating a regex and a match_results object does not produce matches, you need to apply a regex algorithm. Since you only want to match part of the input string the appropriate algorithm to use in this case is regex_search.

std::regex_search(s, matches, rgx);

Putting it all together

    std::string s{R"( tХB:[email protected] Connected tХB:[email protected] WEBMSG #Username :this is a message tХB:[email protected] Status: visible )"};      std::regex rgx("WEBMSG #([a-zA-Z0-9]+) :(.*)");     std::smatch matches;      if(std::regex_search(s, matches, rgx)) {         std::cout << "Match found\n";          for (size_t i = 0; i < matches.size(); ++i) {             std::cout << i << ": '" << matches[i].str() << "'\n";         }     } else {         std::cout << "Match not found\n";     }

Live demo

157

answered Sep 21 '22 17:09

Praetorian

Related questions
                            
                                Close an open h5py data file
                            
                                Warning: Attempt to present <UIAlertController: 0x7facd3946920> on <...> which is already presenting (null)
                            
                                Android sdk installation. Error: Invalid content was found starting with element 'd:skin'. No child element is expected at this point
                            
                                Testing spring security with Postman
                            
                                How to set font size of SKLabelNode to fit in fixed size (Swift)
                            
                                how to get pandas get_dummies to emit N-1 variables to avoid collinearity?
                            
                                Trying to have a grid of card with angular material
                            
                                how to make <svg> 100% width
                            
                                iOS9 App has black bars on top and bottom
                            
                                HTML5 video how to play two videos in one video element
                            
                                How do I fix the directory not found for option -F error [duplicate]
                            
                                UI Testing Xcode 7- can't access element within subview

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With