Regex Matching Error

Tags:

regex

I am new to Python (I dont have any programming training either), so please keep that in mind as I ask my question.

I am trying to search a retrieved webpage and find all links using a specified pattern. I have done this successfully in other scripts, but I am getting an error that says

raise error, v # invalid expression
sre_constants.error: multiple repeat

I have to admit I do not know why, but again, I am new to Python and Regular Expressions. However, even when I don't use patterns and use a specific link (just to test the matching), I do not believe I return any matches (nothing is sent to the window when I print match.group(0). The link I tested is commented out below.

Any ideas? It usually is easier for me to learn by example, but any advice you can give is greatly appreciated!

Brock

import urllib2
from BeautifulSoup import BeautifulSoup
import re

url = "http://forums.epicgames.com/archive/index.php?f-356-p-164.html"
page = urllib2.urlopen(url).read()
soup = BeautifulSoup(page)

pattern = r'<a href="http://forums.epicgames.com/archive/index.php?t-([0-9]+).html">(.?+)</a> <i>((.?+) replies)'
#pattern = r'href="http://forums.epicgames.com/archive/index.php?t-622233.html">Gears of War 2: Horde Gameplay</a> <i>(20 replies)'

for match in re.finditer(pattern, page, re.S):
    print match(0)

236

asked Aug 12 '09 21:08

Btibert3

2 Answers

That means your regular expression has an error.

(.?+)</a> <i>((.?+)

What does ?+ mean? Both ? and + are meta characters that does not make sense right next to each other. Maybe you forgot to escape the '?' or something.

133

answered Sep 23 '22 02:09

Unknown

You need to escape the literal '?' and the literal '(' and ')' that you are trying to match.

Also, instead of '?+', I think you're looking for the non-greedy matching provided by '+?'.

retracile

Related questions
                            
                                "NULL identity key" error using SQLAlchemy's base automap to reflect a postgres DB using IDENTITY columns
                            
                                tf.data: Parallelize loading step
                            
                                HuggingFace BERT `inputs_embeds` giving unexpected result
                            
                                How can I build a setup.py to compile C++ extension using Python, pybind11 and Mingw-w64?
                            
                                Predicting radius of circle with Neural Network
                            
                                Matlab numerictype/reinterpretcast equivalent in python?
                            
                                Finding a Pattern in a Grid Python [duplicate]
                            
                                How can I do a seq2seq task with PyTorch Transformers if I am not trying to be autoregressive?
                            
                                Gunicorn (with Flask) parameters for Google Cloud Run (GCR) - what to put in Dockerfile? [closed]
                            
                                Geopandas: how to plot countries/cities?
                            
                                Compile NumPy with MKL on Windows - DLL load failed
                            
                                How to interrupt Python I/O operations when threading?
                            
                                How can I properly mimic this encryption method to produce the proper value for the encryptedPwd field?
                            
                                Accelerate the loop
                            
                                How to cancel previous request in FastAPI
                            
                                Testing GUI code: should I use a mocking library?
                            
                                How to find out if there is data to be read from stdin on Windows in Python?
                            
                                Best way to organize the folders containing the SQLAlchemy models [closed]
                            
                                Find cpu-hogging plugin in multithreaded python
                            
                                Are there any examples on python-purple floating around?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With