Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

C# System.RegEx matches LF when it should not

Tags:

c#

.net

regex

The following returns true

Regex.IsMatch("FooBar\n", "^([A-Z]([a-z][A-Z]?)+)$");

so does

Regex.IsMatch("FooBar\n", "^[A-Z]([a-z][A-Z]?)+$");

The RegEx is in SingleLine mode by default, so $ should not match \n. \n is not an allowed character.

This is to match a single ASCII PascalCaseWord (yes, it will match a trailing Cap)

Doesn't work with any combinations of RegexOptions.Multiline | RegexOptions.Singleline

What am I doing wrong?

like image 925
CodeScrubber Avatar asked Jun 15 '17 19:06

CodeScrubber


People also ask

What C is used for?

C programming language is a machine-independent programming language that is mainly used to create many types of applications and operating systems such as Windows, and other complicated programs such as the Oracle database, Git, Python interpreter, and games and is considered a programming foundation in the process of ...

Is C language easy?

C is a general-purpose language that most programmers learn before moving on to more complex languages. From Unix and Windows to Tic Tac Toe and Photoshop, several of the most commonly used applications today have been built on C. It is easy to learn because: A simple syntax with only 32 keywords.

What is the full name of C?

In the real sense it has no meaning or full form. It was developed by Dennis Ritchie and Ken Thompson at AT&T bell Lab. First, they used to call it as B language then later they made some improvement into it and renamed it as C and its superscript as C++ which was invented by Dr.

Is C programming hard?

C is more difficult to learn than JavaScript, but it's a valuable skill to have because most programming languages are actually implemented in C. This is because C is a “machine-level” language. So learning it will teach you how a computer works and will actually make learning new languages in the future easier.


1 Answers

In .NET regex, the $ anchor (as in PCRE, Python, PCRE, Perl, but not JavaScript) matches the end of line, or the position before the final newline ("\n") character in the string.

See this documentation:

$   The match must occur at the end of the string or line, or before \n at the end of the string or line. For more information, see End of String or Line.

No modifier can redefine this in .NET regex (in PCRE, you can use D PCRE_DOLLAR_ENDONLY modifier).

You must be looking for \z anchor: it matches only at the very end of the string:

\z   The match must occur at the end of the string only. For more information, see End of String Only.

A short test in C#:

Console.WriteLine(Regex.IsMatch("FooBar\n", @"^[A-Z]([a-z][A-Z]?)+$"));  // => True
Console.WriteLine(Regex.IsMatch("FooBar\n", @"^[A-Z]([a-z][A-Z]?)+\z")); // => False
like image 153
Wiktor Stribiżew Avatar answered Sep 30 '22 01:09

Wiktor Stribiżew