restrict 1 word as case sensitive and other as case insensitive in python regex | (pipe)

Tags:

regex

I understand what | (the pipe special character) means in a Python regex: it matches either the first or the second alternative.

Example: a|b matches either a or b.

My question: what if I want to match a case-sensitively and b case-insensitively in the example above?

Example:

import re

s = "Welcome to PuNe, Maharashtra"

result1 = re.search("punnee|MaHaRaShTrA", s)
result2 = re.search("pune|maharashtra", s)
result3 = re.search("PuNe|MaHaRaShTrA", s)
result4 = re.search("P|MaHaRaShTrA", s)

I want to match Pune exactly as it is written in the string s above, i.e. PuNe, but match Maharashtra regardless of case. How can I search for one word case-sensitively and the other case-insensitively, so that result1, result2, result3 and result4 all return a match instead of None?

I tried:

result1 = re.search("pune|MaHaRaShTrA", s, re.IGNORECASE)

But this ignores case for both words.

How can I make one word case-sensitive and the other case-insensitive?

asked Jul 04 '17 09:07

Harsha Biyani

1 Answer

In Python 3.6 and later, you may use the inline modifier groups:

>>> s = "Welcome to PuNe, Maharashtra"
>>> print(re.findall(r"PuNe|(?i:MaHaRaShTrA)",s))
['PuNe', 'Maharashtra']
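
To illustrate, here is a minimal sketch that applies the same inline group to the four patterns from the question (assuming Python 3.6 or later). The `patterns` and `results` names are only for illustration; the first alternative stays case-sensitive while the second ignores case, so every search should return a match.

```python
import re

s = "Welcome to PuNe, Maharashtra"

# The first alternative is matched case-sensitively;
# only the (?i:...) group ignores case.
patterns = [
    r"punnee|(?i:MaHaRaShTrA)",
    r"pune|(?i:maharashtra)",
    r"PuNe|(?i:MaHaRaShTrA)",
    r"P|(?i:MaHaRaShTrA)",
]

results = [re.search(p, s) for p in patterns]
for p, m in zip(patterns, results):
    print(p, "->", m.group(0) if m else None)
```

The first two patterns fall through to the case-insensitive alternative because `punnee` and lowercase `pune` do not occur in `s`; the last two match at `PuNe` first, since `re.search` returns the leftmost match.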

See the relevant Python re documentation:

(?aiLmsux-imsx:...)
(Zero or more letters from the set 'a', 'i', 'L', 'm', 's', 'u', 'x', optionally followed by '-' followed by one or more letters from the 'i', 'm', 's', 'x'.) The letters set or remove the corresponding flags: re.A (ASCII-only matching), re.I (ignore case), re.L (locale dependent), re.M (multi-line), re.S (dot matches all), re.U (Unicode matching), and re.X (verbose), for the part of the expression. (The flags are described in Module Contents.)

The letters 'a', 'L' and 'u' are mutually exclusive when used as inline flags, so they can’t be combined or follow '-'. Instead, when one of them appears in an inline group, it overrides the matching mode in the enclosing group. In Unicode patterns (?a:...) switches to ASCII-only matching, and (?u:...) switches to Unicode matching (default). In byte pattern (?L:...) switches to locale depending matching, and (?a:...) switches to ASCII-only matching (default). This override is only in effect for the narrow inline group, and the original matching mode is restored outside of the group.

New in version 3.6.

Changed in version 3.7: The letters 'a', 'L' and 'u' also can be used in a group.
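
The '-' form described in the quoted documentation can also be used the other way round. As a rough sketch (again assuming Python 3.6+), the whole pattern is compiled with re.I and the flag is switched off for one group only; the variable names here are illustrative.

```python
import re

s = "Welcome to PuNe, Maharashtra"

# re.I applies to the whole pattern; (?-i:...) switches it off for one group.
pattern = re.compile(r"(?-i:PuNe)|maharashtra", re.I)
found = pattern.findall(s)
print(found)

# The case-sensitive group rejects a differently cased spelling.
missed = pattern.findall("welcome to pune")
print(missed)
```

This variant may read better when most alternatives should ignore case and only a few must match exactly.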

Unfortunately, Python re versions before 3.6 did not support these groups, nor did they support alternating on and off inline modifiers.
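
For those older versions, one possible workaround (not part of the approach above, just a commonly used substitute) is to spell out both cases of each letter in a character class. The `case_insensitive` helper below is hypothetical and only meant as a sketch:

```python
import re

def case_insensitive(word):
    # Hypothetical helper: turn "Ab" into "[aA][bB]" so that the word
    # matches in any case without using the re.I flag.
    parts = []
    for ch in word:
        if ch.isalpha():
            parts.append("[{}{}]".format(ch.lower(), ch.upper()))
        else:
            parts.append(re.escape(ch))
    return "".join(parts)

s = "Welcome to PuNe, Maharashtra"

# "PuNe" stays a plain, case-sensitive literal.
pattern = "PuNe|" + case_insensitive("MaHaRaShTrA")
matches = re.findall(pattern, s)
print(matches)
```

The resulting pattern is longer and harder to read, so the inline group is preferable wherever it is available.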

If you can use the PyPI regex module, you may use a (?i:...) construct:

import regex
s = "Welcome to PuNe, Maharashtra"
print(regex.findall(r"PuNe|(?i:MaHaRaShTrA)",s))

See the online Python demo.

answered Oct 01 '22 12:10

Wiktor Stribiżew
