I modified an html file by removing some of the tags using <code>beautifulsoup</code>. Now I want to write the results back in a html file. My code: <pre class="prettyprint"><code>from bs4 import BeautifulSoup from bs4 import Comment soup = BeautifulSoup(open('1.html'),"html.parser") [x.extract() for x in soup.find_all('script')] [x.extract() for x in soup.find_all('style')] [x.extract() for x in soup.find_all('meta')] [x.extract() for x in soup.find_all('noscript')] [x.extract() for x in soup.find_all(text=lambda text:isinstance(text, Comment))] html =soup.contents for i in html: print i html = soup.prettify("utf-8") with open("output1.html", "wb") as file: file.write(html) </code></pre> Since I used soup.prettify, it generates html like this: <pre class="prettyprint"><code> BATAM.TRIBUNNEWS.COM, BINTAN - Tradisi pedang pora mewarnai serah terima jabatan pejabat di <a href="http://batam.tribunnews.com/tag/polres/" title="Polres"> Polres </a> <a href="http://batam.tribunnews.com/tag/bintan/" title="Bintan"> Bintan </a> , Senin (3/10/2016). </code></pre> I want to get the result like <code>print i</code> does: <pre class="prettyprint"><code>BATAM.TRIBUNNEWS.COM, BINTAN - Tradisi pedang pora mewarnai serah terima jabatan pejabat di <a href="http://batam.tribunnews.com/tag/polres/" title="Polres">Polres</a> <a href="http://batam.tribunnews.com/tag/bintan/" title="Bintan">Bintan</a>, Senin (3/10/2016). Empat perwira baru Senin itu diminta cepat bekerja. Tumpukan pekerjaan rumah sudah menanti di meja masing masing. </code></pre> How can I get a result the same as <code>print i</code> (ie. so the tag and its content appear on the same line)? Thanks.

Just convert the <code>soup</code> instance to string and write: <pre class="prettyprint"><code>with open("output1.html", "w") as file: file.write(str(soup)) </code></pre>

How to write the output to html file with Python BeautifulSoup

Tags:

python

html

beautifulsoup

I modified an html file by removing some of the tags using beautifulsoup. Now I want to write the results back in a html file. My code:

from bs4 import BeautifulSoup from bs4 import Comment  soup = BeautifulSoup(open('1.html'),"html.parser")  [x.extract() for x in soup.find_all('script')] [x.extract() for x in soup.find_all('style')] [x.extract() for x in soup.find_all('meta')] [x.extract() for x in soup.find_all('noscript')] [x.extract() for x in soup.find_all(text=lambda text:isinstance(text, Comment))] html =soup.contents for i in html:     print i  html = soup.prettify("utf-8") with open("output1.html", "wb") as file:     file.write(html)

Since I used soup.prettify, it generates html like this:

<p>     <strong>      BATAM.TRIBUNNEWS.COM, BINTAN     </strong>     - Tradisi pedang pora mewarnai serah terima jabatan pejabat di     <a href="http://batam.tribunnews.com/tag/polres/" title="Polres">      Polres     </a>     <a href="http://batam.tribunnews.com/tag/bintan/" title="Bintan">      Bintan     </a>     , Senin (3/10/2016).    </p>

I want to get the result like print i does:

<p><strong>BATAM.TRIBUNNEWS.COM, BINTAN</strong> - Tradisi pedang pora mewarnai serah terima jabatan pejabat di <a href="http://batam.tribunnews.com/tag/polres/" title="Polres">Polres</a> <a href="http://batam.tribunnews.com/tag/bintan/" title="Bintan">Bintan</a>, Senin (3/10/2016).</p> <p>Empat perwira baru Senin itu diminta cepat bekerja. Tumpukan pekerjaan rumah sudah menanti di meja masing masing.</p>

How can I get a result the same as print i (ie. so the tag and its content appear on the same line)? Thanks.

601

asked Nov 10 '16 14:11

Kim Hyesung

1 Answers

Just convert the soup instance to string and write:

with open("output1.html", "w") as file:     file.write(str(soup))

168

answered Sep 22 '22 07:09

alecxe

Related questions
                            
                                What is the use of Python's basic optimizations mode? (python -O)
                            
                                String function to strip the last comma
                            
                                How to extract dictionary single key-value pair in variables
                            
                                How to get the union of two lists using list comprehension? [duplicate]
                            
                                Pytest monkeypatch isn't working on imported function
                            
                                unexpected results converting timezones in python
                            
                                what's the inverse of the quantile function on a pandas Series?
                            
                                Simple Subquery with OuterRef
                            
                                Escaping dollar sign in ipython notebook
                            
                                The view didn't return an HttpResponse object. It returned None instead
                            
                                How to remove all characters before a specific character in Python?
                            
                                Keras Conv2D and input channels
                            
                                Reduce list of Python objects to dict of object.id -> object
                            
                                What's the Python version for “Code against an interface, not an object”?
                            
                                How to delete a directory created with tempfile.mkdtemp?
                            
                                Python return statement error " 'return' outside function"
                            
                                Set vs. frozenset performance
                            
                                How do I write to the console in Google App Engine?
                            
                                TypeError:exceptions must be old-style classes or derived from BaseException, not str
                            
                                Exposing python jupyter on LAN

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With