Regular expression to match a dot

People also ask

How do you match a dot in regex?

in regex is a metacharacter, it is used to match any character. To match a literal dot in a raw Python string ( r"" or r'' ), you need to escape it, so r"\." Unless the regular expression is stored inside a regular python string, in which case you need to use a double \ ( \\ ) instead.

Does regex match dot space?

Yes, the dot regex matches whitespace characters when using Python's re module. What is this? The dot matches all characters in the string --including whitespaces. You can see that there are many whitespace characters ' ' among the matched characters.

What does ?! Mean in regex?

It's a negative lookahead, which means that for the expression to match, the part within (?!...) must not match. In this case the regex matches http:// only when it is not followed by the current host name (roughly, see Thilo's comment). Follow this answer to receive notifications.

How do you escape a dot in regex?

(dot) metacharacter, and can match any single character (letter, digit, whitespace, everything). You may notice that this actually overrides the matching of the period character, so in order to specifically match a period, you need to escape the dot by using a slash \.

A . in regex is a metacharacter, it is used to match any character. To match a literal dot in a raw Python string (r"" or r''), you need to escape it, so r"\."

In your regex you need to escape the dot "\." or use it inside a character class "[.]", as it is a meta-character in regex, which matches any character.

Also, you need \w+ instead of \w to match one or more word characters.

Now, if you want the test.this content, then split is not what you need. split will split your string around the test.this. For example:

>>> re.split(r"\b\w+\.\w+@", s)
['blah blah blah ', 'gmail.com blah blah']

You can use re.findall:

>>> re.findall(r'\w+[.]\w+(?=@)', s)   # look ahead
['test.this']
>>> re.findall(r'(\w+[.]\w+)@', s)     # capture group
['test.this']

"In the default mode, Dot (.) matches any character except a newline. If the DOTALL flag has been specified, this matches any character including a newline." (python Doc)

So, if you want to evaluate dot literaly, I think you should put it in square brackets:

>>> p = re.compile(r'\b(\w+[.]\w+)')
>>> resp = p.search("blah blah blah [email protected] blah blah")
>>> resp.group()
'test.this'

to escape non-alphanumeric characters of string variables, including dots, you could use re.escape:

import re

expression = 'whatever.v1.dfc'
escaped_expression = re.escape(expression)
print(escaped_expression)

output:

whatever\.v1\.dfc

you can use the escaped expression to find/match the string literally.

Related questions
                            
                                What does "three dots" in Python mean when indexing what looks like a number?
                            
                                Is "x < y < z" faster than "x < y and y < z"?
                            
                                Creating hidden arguments with Python argparse
                            
                                Threading in a PyQt application: Use Qt threads or Python threads?
                            
                                What is the difference between setUp() and setUpClass() in Python unittest?
                            
                                What is the most pythonic way to check if an object is a number?
                            
                                Revert the `--no-site-packages` option with virtualenv
                            
                                Reading in environment variables from an environment file
                            
                                How to programmatically generate markdown output in Jupyter notebooks?
                            
                                Creating functions in a loop
                            
                                Matplotlib connect scatterplot points with line - Python
                            
                                Convert pandas timezone-aware DateTimeIndex to naive timestamp, but in certain timezone
                            
                                Python Requests package: Handling xml response
                            
                                How to migrate back from initial migration in Django 1.7?
                            
                                Creating a zero-filled pandas data frame
                            
                                Reading a binary file with python
                            
                                How to read keyboard-input?
                            
                                Converting strings to floats in a DataFrame
                            
                                Replace and overwrite instead of appending
                            
                                Using List/Tuple/etc. from typing vs directly referring type as list/tuple/etc

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Regular expression to match a dot

Tags:

python

regex

People also ask

Recent Activity

Donate For Us