<p>I want to extract the data between <code><tr></code> tags from an html page. I used the following code.But i didn't get any result. The html between the <code><tr></code> tags is in multiple lines</p> <pre class="prettyprint"><code>category =re.findall('<tr>(.*?)</tr>',data); </code></pre> <p>Please suggest a fix for this problem.</p>

<p>just to clear up the issue. Despite all those links to <code>re.M</code> it wouldn't work here as simple skimming of the its explanation would reveal. You'd need <code>re.S</code>, if you wouldn't try to parse html, of course:</p> <pre class="prettyprint"><code>>>> doc = """<table border="1"> <tr> <td>row 1, cell 1</td> <td>row 1, cell 2</td> </tr> <tr> <td>row 2, cell 1</td> <td>row 2, cell 2</td> </tr> </table>""" >>> re.findall('<tr>(.*?)</tr>', doc, re.S) ['\n <td>row 1, cell 1</td>\n <td>row 1, cell 2</td>\n ', '\n <td>row 2, cell 1</td>\n <td>row 2, cell 2</td>\n '] >>> re.findall('<tr>(.*?)</tr>', doc, re.M) [] </code></pre>

matching multiple line in python regular expression

Tags:

python

I want to extract the data between <tr> tags from an html page. I used the following code.But i didn't get any result. The html between the <tr> tags is in multiple lines

category =re.findall('<tr>(.*?)</tr>',data);

Please suggest a fix for this problem.

989

asked Feb 04 '10 12:02

Sreejith Sasidharan

1 Answers

just to clear up the issue. Despite all those links to re.M it wouldn't work here as simple skimming of the its explanation would reveal. You'd need re.S, if you wouldn't try to parse html, of course:

>>> doc = """<table border="1">
    <tr>
        <td>row 1, cell 1</td>
        <td>row 1, cell 2</td>
    </tr>
    <tr>
        <td>row 2, cell 1</td>
        <td>row 2, cell 2</td>
    </tr>
</table>"""

>>> re.findall('<tr>(.*?)</tr>', doc, re.S)
['\n        <td>row 1, cell 1</td>\n        <td>row 1, cell 2</td>\n    ', 
 '\n        <td>row 2, cell 1</td>\n        <td>row 2, cell 2</td>\n    ']
>>> re.findall('<tr>(.*?)</tr>', doc, re.M)
[]

158

answered Sep 22 '22 01:09

SilentGhost

Related questions
                            
                                Django filter if an optional key in json exists
                            
                                len() of a numpy array in python [duplicate]
                            
                                Django- getting a list of foreign key objects
                            
                                pandas apply function with arguments
                            
                                Find new coordinates of a point after image resize
                            
                                pip installing eyeD3 module. Failed to find libmagic
                            
                                Performance, load and stress testing in Django
                            
                                psycopg2: Update multiple rows in a table with values from a tuple of tuples
                            
                                Download Kaggle Dataset by using Python
                            
                                Write Python DataFrame as CSV into Azure Blob
                            
                                How to install discord.py rewrite?
                            
                                Python urllib3 error - ImportError: cannot import name UnrewindableBodyError
                            
                                What is the difference between keras.backend.max vs keras.backend.argmax?
                            
                                Interact with Jupyter Notebooks via API
                            
                                Split .tfrecords file into many .tfrecords files
                            
                                How to create a keras layer with a custom gradient in TF2.0?
                            
                                What is the difference between tf-nightly and tensorflow in PyPI?
                            
                                How to make black background in cv2.putText with Python OpenCV
                            
                                How to use norm.ppf()?
                            
                                Plotly: How to change the colorscheme of a plotly express scatterplot?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With