Python - Using regex to find multiple matches and print them out [duplicate]

Tags: python, regex

I need to extract the contents of the forms in an HTML source file. I did some searching and found a very good method for that, but the problem is that it prints only the first match. How can I loop over the matches and output the contents of every form, not just the first one?

    import re

    line = 'bla bla bla<form>Form 1</form> some text...<form>Form 2</form> more text?'
    # re.search stops at the first match, so only one form is reported
    matchObj = re.search('<form>(.*?)</form>', line, re.S)
    print(matchObj.group(1))
    # Output: Form 1
    # I need it to output the content of every form it found, not just the first one...

300

asked Oct 11 '11 11:10

Stan

1 Answer

Do not use regular expressions to parse HTML.
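As a rough illustration of that advice, here is a minimal sketch that uses the standard library's html.parser module instead of a regular expression. The class name FormTextCollector and its attributes are hypothetical names chosen for this example, and it assumes you only want the text inside each form, not the nested markup.

```python
from html.parser import HTMLParser


class FormTextCollector(HTMLParser):
    """Collect the text inside each <form> element (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.forms = []     # text of every completed form
        self._depth = 0     # greater than 0 while inside a <form>
        self._chunks = []   # text pieces of the form currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "form":
            self._depth += 1

    def handle_endtag(self, tag):
        if tag == "form" and self._depth:
            self._depth -= 1
            if self._depth == 0:
                # The outermost form just closed: store its collected text.
                self.forms.append("".join(self._chunks))
                self._chunks = []

    def handle_data(self, data):
        # Keep text only while we are inside a form.
        if self._depth:
            self._chunks.append(data)


line = 'bla bla bla<form>Form 1</form> some text...<form>Form 2</form> more text?'
collector = FormTextCollector()
collector.feed(line)
collector.close()
print(collector.forms)  # ['Form 1', 'Form 2']
```

Unlike the regex capture group, this keeps only text nodes, so tags nested inside a form (inputs, labels) are dropped rather than returned as raw HTML.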

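But if you ever need to find all regexp matches in a string, use the re.findall function. It returns a list of all non-overlapping matches; when the pattern contains a single capturing group, as here, the list holds the contents of that group.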

    import re

    line = 'bla bla bla<form>Form 1</form> some text...<form>Form 2</form> more text?'
    matches = re.findall('<form>(.*?)</form>', line, re.DOTALL)
    print(matches)
    # Output: ['Form 1', 'Form 2']
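Since the question asks how to loop over the matches, re.finditer may also be worth a look: it yields one match object per match, so each iteration can call group(), start(), and so on. The following is a minimal sketch on the same sample string; the names form_pattern and contents are made up for this example.

```python
import re

line = 'bla bla bla<form>Form 1</form> some text...<form>Form 2</form> more text?'

# Compile once; re.DOTALL (the same flag as re.S) lets "." match newlines too.
form_pattern = re.compile(r'<form>(.*?)</form>', re.DOTALL)

contents = []
for match in form_pattern.finditer(line):
    # group(1) is the text captured between the tags;
    # start(1) is the offset of that text in the original string.
    print(match.start(1), match.group(1))
    contents.append(match.group(1))
```

findall is the shorter choice when only the captured strings are needed; finditer helps when the position or several groups of each match matter.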

142

answered Sep 29 '22 09:09

Petr Viktorin
