regex to match any character or none?

Tags:

python

regex

I have the two following peices of strings;

line1 = [16/Aug/2016:06:13:25 -0400] "GET /file/ HTTP/1.1" 302 random stuff ignore

line2 = [16/Aug/2016:06:13:25 -0400] "" 400 random stuff ignore

I'm trying to grab these two parts;

"GET /file/ HTTP/1.1" 302
"" 400

Basically any character in between the two "" or nothing in between "". So far I've tried this;

regex_example = re.search("\".+?\" [0-9]{3}", line1)
print regex_example.group()

This will work with line1, but give an error for line2. This is due to the '.' matching any character, but giving an error if no character exists.

Is there any way for it to match any character or nothing in between the two ""?

879

asked Aug 16 '16 19:08

user1165419

2 Answers

Use .*? instead of .+?.

+ means "1 or more"

* means "0 or more"

Regex101 Demo

If you want a more efficient regex, use a negated character class [^"] instead of a lazy quantifier ?. You should also use the raw string flag r and \d for digits.

r'"[^"]*" \d{3}'

149

answered Nov 14 '22 02:11

4castle

You can use:

import re

lines = ['[16/Aug/2016:06:13:25 -0400] "GET /file/ HTTP/1.1" 302 random stuff ignore', '[16/Aug/2016:06:13:25 -0400] "" 400 random stuff ignore']

rx = re.compile(r'''
        "[^"]*" # ", followed by anything not a " and a "
        \       # a space
        \d+     # at least one digit
        ''', re.VERBOSE)

matches = [m.group(0) \
            for line in lines \
            for m in rx.finditer(line)]

print(matches)
# ['"GET /file/ HTTP/1.1" 302', '"" 400']

See a demo on ideone.com.

answered Nov 14 '22 02:11

Jan

Related questions
                            
                                Does Python's `in` keyword perform a linear search? [duplicate]
                            
                                Python SimpleHTTPServer
                            
                                Python load list in list from text file
                            
                                Why does this pickle reach maximum recursion depth without recursion?
                            
                                Use Python to create 2D coordinate
                            
                                Sorting a list of tuples with multiple conditions
                            
                                Implementation of Luhn Formula
                            
                                how to get all mysql tuple result and convert to json
                            
                                virtualenv can't find python2
                            
                                fast XORing bytes in python 3 [duplicate]
                            
                                How do you assert something is not true in Python?
                            
                                Custom user in django raises ValueError
                            
                                No module named flask while running uWSGI
                            
                                The number of calendar weeks in a year?
                            
                                Theano Shared Variables on Python
                            
                                Python : Replacing Values in netcdf file using netCDF4
                            
                                difference between ways to generate index list in python
                            
                                Python Numpy generate coordinates for X and Y values in a certain range
                            
                                Trouble connecting to phantomJs webdriver using python and selenium
                            
                                Parametrize class tests with pytest

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With