How to extract innerHTML from tag using BeautifulSoup in Python

Tags:

python-3.x

beautifulsoup

I am trying to extract the innerHTML from a tag using the following code:

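from urllib.request import urlopen
from bs4 import BeautifulSoup

theurl = "http://na.op.gg/summoner/userName=Darshan"
thepage = urlopen(theurl)
soup = BeautifulSoup(thepage, "html.parser")
rank = soup.findAll('span', {"class": "tierRank"})  # a list of matching tags, not a string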

However, I am getting [<span class="tierRank">Master</span>] instead. What I want to show is only the value "Master".

Using soup.get_text instead of soup.findAll doesn't work.

I tried adding .text and .string to the end of the last line, but that did not work either.

asked Apr 19 '18 01:04

Naveen Manoharan

1 Answer

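soup.findAll('span',{"class":"tierRank"}) returns a list of elements that match <span class="tierRank">, not a single element. That is why adding .text or .string to the end of that line fails: those attributes belong to an individual element, not to the list.

1. You want the first element from that list.
2. You want the innerHTML from that element, which can be accessed by the decode_contents() method.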

All together:

rank = soup.findAll('span',{"class":"tierRank"})[0].decode_contents()

This will store "Master" in rank.
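To see the difference between the inner HTML and the plain text, here is a minimal sketch that runs against an inline HTML string rather than the live page. SAMPLE_HTML is a made-up stand-in (the real page's markup may differ), and the second span with nested markup is added only for illustration. The sketch also guards against an empty result list, since indexing [0] raises an IndexError when nothing matches.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for the downloaded page; the real markup may differ.
SAMPLE_HTML = (
    '<div>'
    '<span class="tierRank">Master</span>'
    '<span class="tierRank"><b>Gold</b> 2</span>'
    '</div>'
)

soup = BeautifulSoup(SAMPLE_HTML, "html.parser")

# find_all is the current name for findAll; both return a list-like ResultSet.
spans = soup.find_all("span", {"class": "tierRank"})

# Guard against an empty result before indexing into it.
first = spans[0] if spans else None

# decode_contents() gives the markup inside the tag (the "innerHTML").
rank_html = first.decode_contents() if first is not None else None
# get_text() gives only the text, with any nested tags stripped.
rank_text = first.get_text(strip=True) if first is not None else None

# With nested markup the two results differ.
nested_html = spans[1].decode_contents()
nested_text = spans[1].get_text()

print(rank_html, rank_text, nested_html, nested_text, sep=" | ")
```

When the span contains only text, as in the question, both approaches give the same string; decode_contents() matters only when the nested tags themselves are wanted.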

answered Sep 29 '22 14:09

Matt Morgan
