How do I use BeautifulSoup to replace a tag with its contents?

Tags:

python

beautifulsoup

How would I use BeautifulSoup to remove only a tag? The method I found deletes the tag and all other tags and content inside it. I want to remove only the tag and leave everything inside it untouched, e.g.

change this:

<div>
<p>dvgbkfbnfd</p>
<div>
<span>dsvdfvd</span>
</div>
<p>fvjdfnvjundf</p>
</div>

to this:

<p>dvgbkfbnfd</p>
<span>dsvdfvd</span>
<p>fvjdfnvjundf</p>

473

asked May 11 '12 17:05

Blainer

1 Answer

I've voted to close as a duplicate, but in case it's of use, reapplying slacy's answer from the top related question on the right gives you this solution:

# Written for Python 2 and BeautifulSoup 3 (the old "BeautifulSoup" module)
from BeautifulSoup import BeautifulSoup

html = '''
<div>
<p>dvgbkfbnfd</p>
<div>
<span>dsvdfvd</span>
</div>
<p>fvjdfnvjundf</p>
</div>
'''

soup = BeautifulSoup(html)
# Replace every <div> with its own children, so the contents are kept
for match in soup.findAll('div'):
    match.replaceWithChildren()

print soup

... which produces the output:

<p>dvgbkfbnfd</p>

<span>dsvdfvd</span>

<p>fvjdfnvjundf</p>
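
For anyone on Python 3, here is a rough equivalent of the snippet above. It is only a sketch and assumes the beautifulsoup4 package (imported as bs4) is installed, where the method is called unwrap() and replaceWithChildren survives only as a legacy alias. The helper name unwrap_tags and the choice of the "html.parser" backend are mine, not part of the original answer.

```python
from bs4 import BeautifulSoup  # assumption: beautifulsoup4 is installed

html = """
<div>
<p>dvgbkfbnfd</p>
<div>
<span>dsvdfvd</span>
</div>
<p>fvjdfnvjundf</p>
</div>
"""


def unwrap_tags(markup, tag_name):
    """Return markup with every <tag_name> removed but its children kept.

    Hypothetical helper for illustration; not part of the bs4 API.
    """
    soup = BeautifulSoup(markup, "html.parser")
    # find_all() builds the full list of matches up front, so it is safe
    # to modify the tree while looping over the results.
    for match in soup.find_all(tag_name):
        match.unwrap()  # drop the tag itself, keep everything inside it
    return str(soup)


result = unwrap_tags(html, "div")
print(result)
```

The whitespace that sat between the removed tags is left behind, so the printed result contains some blank lines. By contrast, decompose() removes the tag together with everything inside it, which is probably the behaviour described in the question.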

167

answered Nov 14 '22 20:11

Mark Longair
