<p>Can <code><script></code> tags and all of their contents be removed from HTML with BeautifulSoup, or do I have to use Regular Expressions or something else?</p>

<pre class="prettyprint"><code>>>> from bs4 import BeautifulSoup >>> soup = BeautifulSoup('<script>a</script>baba<script>b</script>', 'html.parser') >>> for s in soup.select('script'): >>> s.extract() >>> soup baba </code></pre>

<p>Updated answer for those who might need for future reference: The correct answer is. <code>decompose()</code>. You can use different ways but <code>decompose</code> works in place.</p> <p>Example usage:</p> <pre class="prettyprint"><code>soup = BeautifulSoup('<p>This is a slimy text and <i> I am slimer</i></p>') soup.i.decompose() print str(soup) #prints '<p>This is a slimy text and</p>' </code></pre> <p>Pretty useful to get rid of detritus like <code><script></code>, <code><img></code> and so forth.</p>

Can I remove script tags with BeautifulSoup?

2 Answers

>>> from bs4 import BeautifulSoup >>> soup = BeautifulSoup('<script>a</script>baba<script>b</script>', 'html.parser') >>> for s in soup.select('script'): >>>    s.extract() >>> soup baba

answered Oct 28 '22 15:10

Fábio Diniz

Updated answer for those who might need for future reference: The correct answer is. decompose(). You can use different ways but decompose works in place.

Example usage:

soup = BeautifulSoup('<p>This is a slimy text and <i> I am slimer</i></p>') soup.i.decompose() print str(soup) #prints '<p>This is a slimy text and</p>'

Pretty useful to get rid of detritus like <script>, <img> and so forth.

answered Oct 28 '22 13:10

Abhishek Dujari

Related questions
                            
                                Why is an MD5 hash created by Python different from one created using echo and md5sum in the shell?
                            
                                Why do I get a SyntaxError for a Unicode escape in my file path?
                            
                                float64 with pandas to_csv
                            
                                Numpy index slice without losing dimension information
                            
                                Django - "no module named django.core.management"
                            
                                Python CSV error: line contains NULL byte
                            
                                Why does Popen.communicate() return b'hi\n' instead of 'hi'?
                            
                                Get row-index values of Pandas DataFrame as list? [duplicate]
                            
                                Python pickle error: UnicodeDecodeError
                            
                                Where is my Django installation?
                            
                                Assert that a method was called in a Python unit test
                            
                                How to change fonts in matplotlib (python)?
                            
                                How do I close a tkinter window?
                            
                                How do I test if int value exists in Python Enum without using try/catch?
                            
                                Display rows with one or more NaN values in pandas dataframe
                            
                                Why does `True == False is False` evaluate to False? [duplicate]
                            
                                Elegant way to check if a nested key exists in a dict?
                            
                                Two way/reverse map [duplicate]
                            
                                Analyze audio using Fast Fourier Transform
                            
                                matplotlib colorbar for scatter

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Can I remove script tags with BeautifulSoup?

Tags:

python

html

beautifulsoup

Sam

People also ask

2 Answers

Fábio Diniz

Abhishek Dujari

Recent Activity

Donate For Us