How to use regex to parse a number from HTML?

Tags: python, regex

I want to write a simple regular expression in Python that extracts a number from HTML. The HTML sample is as follows:

Your number is <b>123</b>

Now, how can I extract "123", i.e. the contents of the first bold text after the string "Your number is"?

204

asked Jun 23 '12 16:06

Saqib

1 Answer

import re

# Capture the digits inside the <b>...</b> that directly follows "Your number is".
m = re.search(r"Your number is <b>(\d+)</b>",
              "xxx Your number is <b>123</b>  fdjsk")
if m:
    print(m.group(1))
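The pattern above matches only the exact markup `<b>123</b>`. If the real page might use an uppercase tag, attributes on the tag, or stray whitespace, a slightly more tolerant variant could look like the sketch below. The helper name `extract_number` and the second sample string are made up for illustration, and the sketch assumes the number is the entire content of the bold element.

```python
import re

# Hypothetical helper, not part of the original answer.
# Assumption: the number is the whole content of the first <b> element
# after the phrase, possibly with whitespace or attributes on the tag.
NUMBER_RE = re.compile(
    r"Your number is\s*<b\b[^>]*>\s*(\d+)\s*</b>",
    re.IGNORECASE,
)


def extract_number(html):
    """Return the captured number as an int, or None if nothing matches."""
    m = NUMBER_RE.search(html)
    return int(m.group(1)) if m else None


print(extract_number("xxx Your number is <b>123</b>  fdjsk"))
print(extract_number('Your number is <B class="n"> 42 </B>'))
print(extract_number("no number here"))
```

A regex like this is fine for a small, predictable snippet; for arbitrary or nested HTML, a real parser is usually the safer choice.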

124

answered Sep 30 '22 13:09

Yevgen Yampolskiy
