Can you please tell me the difference between <code>\z</code> and <code>\Z</code> as well as <code>\a</code> and <code>\A</code> in Perl with a simple example ?

<code>\z</code> only matches the very end of the string. <code>\Z</code> also matches the very end of the string, but if the string ends with a newline, then <code>\Z</code> also matches immediately before the newline. So, for example, these five are true: <pre class="prettyprint"><code>'foo' =~ m/foo\z/ 'foo' =~ m/foo\Z/ "foo\n" =~ m/foo\Z/ "foo\n" =~ m/foo\n\z/ "foo\n" =~ m/foo\n\Z/ </code></pre> whereas this one is false: <pre class="prettyprint"><code>"foo\n" =~ m/foo\z/ </code></pre> They both differ from <code>$</code> in that they are not affected by the <code>/m</code> "multiline" flag, which allows <code>$</code> to match at the end of any line. <code>\a</code> denotes the alert (bell) character; it doesn't have any additional special meaning in a regex. <code>\A</code> matches only at the start of a string. Like <code>\z</code> and <code>\Z</code>, and unlike <code>^</code>, it's not affected by the <code>/m</code> "multiline" flag. All of this is documented in <code>perlre</code>, the Perl regular expressions manual page: http://perldoc.perl.org/perlre.html.

<ul> <li> <code>\A</code> matches zero characters at position 0.</li> <li> <code>\z</code> matches zero characters at the end of the string.</li> <li> <code>\Z</code> matches zero characters at the end of the string and at a trailing line feed.</li> </ul> <ul> <li> <code>^</code> without <code>/m</code> is the same as <code>\A</code>.</li> <li> <code>^</code> with <code>/m</code> matches zero characters at position 0 and after a line feed.</li> <li> <code>$</code> without <code>/m</code> is the same as <code>\Z</code>.</li> <li> <code>$</code> with <code>/m</code> matches zero characters at the end of the string and at a line feed.</li> </ul> <ul> <li> <code>\a</code> matches the BEL/BELL character. <ul> <li>It is equivalent to <code>\x07</code> on an ASCII-based machine.</li> <li>It is equivalent to <code>\x2F</code> on an EBCDIC-based machine.</li> </ul> </li> </ul> The following indicates the positions at which the relevant regex patterns will match (<code>␊</code> indicates a line feed): <pre class="prettyprint lang-none prettyprint-override"><code>\A \A is not affected by /m ^ ^ without /m &equiv; \A ^/m ^/m ^/m ^ with /m &equiv; \A|(?<=\n) | | | 0123456789012 | | | v v v abc␊def␊ghi␊ ^ ^ ^^ | | || 0123456789012 | | ||___ | | | | $/m $/m $/m $/m $ with /m &equiv; \z|(?=\n) $ $ $ without /m &equiv; \z|(?=\n\z) \Z \Z \Z is not affected by /m &equiv; \z|(?=\n\z) \z \z is not affected by /m </code></pre> This is documented in perlre.

Difference between \z and \Z and \a and \A in Perl

Video Answer

2 Answers

\z only matches the very end of the string.

\Z also matches the very end of the string, but if the string ends with a newline, then \Z also matches immediately before the newline.

So, for example, these five are true:

Click to copy

'foo' =~ m/foo\z/
'foo' =~ m/foo\Z/
"foo\n" =~ m/foo\Z/
"foo\n" =~ m/foo\n\z/
"foo\n" =~ m/foo\n\Z/

whereas this one is false:

Click to copy

"foo\n" =~ m/foo\z/

They both differ from $ in that they are not affected by the /m "multiline" flag, which allows $ to match at the end of any line.

\a denotes the alert (bell) character; it doesn't have any additional special meaning in a regex.

\A matches only at the start of a string. Like \z and \Z, and unlike ^, it's not affected by the /m "multiline" flag.

All of this is documented in perlre, the Perl regular expressions manual page: http://perldoc.perl.org/perlre.html.

191

answered Oct 13 '22 19:10

ruakh

\A matches zero characters at position 0.
\z matches zero characters at the end of the string.
\Z matches zero characters at the end of the string and at a trailing line feed.

^ without /m is the same as \A.
^ with /m matches zero characters at position 0 and after a line feed.
$ without /m is the same as \Z.
$ with /m matches zero characters at the end of the string and at a line feed.

\a matches the BEL/BELL character.
- It is equivalent to \x07 on an ASCII-based machine.
- It is equivalent to \x2F on an EBCDIC-based machine.

The following indicates the positions at which the relevant regex patterns will match (␊ indicates a line feed):

Click to copy

\A                       \A is not affected by /m
^                        ^ without /m             ≡ \A
^/m ^/m ^/m              ^ with /m                ≡ \A|(?<=\n)
|   |   |
0123456789012
|   |   |
v   v   v
abc␊def␊ghi␊
   ^   ^   ^^
   |   |   ||
0123456789012
   |   |   ||___
   |   |   |    |
   $/m $/m $/m  $/m      $ with /m                ≡ \z|(?=\n)
           $    $        $ without /m             ≡ \z|(?=\n\z)
           \Z   \Z       \Z is not affected by /m ≡ \z|(?=\n\z)
                \z       \z is not affected by /m

This is documented in perlre.

answered Oct 13 '22 17:10

ikegami

Related questions
                            
                                Regex Extract html Body
                            
                                Validate class/method names with regex
                            
                                How can I replace white space in filename of uploaded file
                            
                                Space character in regex is not recognised
                            
                                Error using split() with curly braces "{"
                            
                                how to ignore blank lines and comment lines using awk
                            
                                get id video vimeo with regexp preg_match
                            
                                R regex find last occurrence of delimiter
                            
                                Regex to trim spaces from the end of unknown length strings of words
                            
                                Extracting year from string in python
                            
                                Allowing Only Certain Characters In PHP
                            
                                How to replace a variable within a string with PHP?
                            
                                Percent Symbol in CodeIgniter URI
                            
                                RegEx Pattern that matches positive or negative values (e.g "1.2", "-2.8", "7.8", -22.8")
                            
                                PHP regex digit length only 5 or 9
                            
                                Java how to replace backslash? [duplicate]
                            
                                Match a string containing a comma (eg 1,5)
                            
                                RegEx.IsMatch() vs. String.ToUpper().Contains() performance
                            
                                Remove <p> tags - Regular Expression (Regex)
                            
                                Regex to strip a variable number of periods from username in email address?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Difference between \z and \Z and \a and \A in Perl

Tags:

regex

perl

user3597719

People also ask

Video Answer

2 Answers

ruakh

ikegami

Recent Activity

Donate For Us