I am struggling with the syntax required to grab some hrefs in a td. The table, tr and td elements don't have any classes or IDs.
If I wanted to grab the anchor in this example, what would I need?
<tr><td><a>...
Thanks
Step-by-step approach. Step 1: Import the BeautifulSoup module for parsing and the requests module for fetching the website. Step 2: Request the URL with the requests.get() method.
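For instance, a minimal sketch of those two steps, assuming the modern requests and bs4 packages and a placeholder URL:

    import requests
    from bs4 import BeautifulSoup

    # Step 1: the imports above pull in the parsing and HTTP modules.
    # Step 2: fetch the page; the URL below is a placeholder, not a real endpoint.
    url = "https://example.com/some-table-page"
    response = requests.get(url)
    soup = BeautifulSoup(response.text, "html.parser")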
find returns the first element that matches the search; find_all scans the entire document and returns a list of all matches.
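A quick illustration of the difference (a sketch in bs4 syntax; the older BeautifulSoup 3 names are find and findAll):

    from bs4 import BeautifulSoup

    html = "<tr><td><a href='one'>1</a></td><td><a href='two'>2</a></td></tr>"
    soup = BeautifulSoup(html, "html.parser")

    first = soup.find("a")             # first matching tag only
    every = soup.find_all("a")         # list of every matching tag
    print(first["href"])               # one
    print([a["href"] for a in every])  # ['one', 'two']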
It is not a real HTML parser but uses regular expressions to dive through tag soup. It is therefore more forgiving in some cases and worse in others. It is not uncommon that lxml/libxml2 parses and fixes broken HTML better, but BeautifulSoup has superior support for encoding detection.
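To see the difference in practice, you can feed the same broken markup to both libraries and compare how each repairs it (a sketch assuming lxml and bs4 are both installed):

    import lxml.html
    from bs4 import BeautifulSoup

    broken = "<table><tr><td><a href='foo'>link"  # unclosed tags

    # lxml/libxml2 builds a tree and closes the dangling tags itself
    tree = lxml.html.fromstring(broken)
    print(lxml.html.tostring(tree))

    # BeautifulSoup also accepts the tag soup, but may repair it differently
    soup = BeautifulSoup(broken, "html.parser")
    print(soup.prettify())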
As per the docs, you first make a parse tree:
    import BeautifulSoup
    html = "<html><body><tr><td><a href='foo'/></td></tr></body></html>"
    soup = BeautifulSoup.BeautifulSoup(html)
and then you search in it, for example for <a> tags whose immediate parent is a <td>:
    for ana in soup.findAll('a'):
        if ana.parent.name == 'td':
            print ana["href"]
Something like this?
    from BeautifulSoup import BeautifulSoup
    soup = BeautifulSoup(html)
    anchors = [td.find('a') for td in soup.findAll('td')]
That should find the first "a" inside each "td" in the HTML you provide. You can tweak td.find to be more specific, or use findAll if you have several links inside each td.
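For example, a sketch in the same old BeautifulSoup style as above that flattens every link from every td into one list:

    from BeautifulSoup import BeautifulSoup
    soup = BeautifulSoup(html)
    # one entry per <a>, across all <td> elements
    anchors = [a for td in soup.findAll('td') for a in td.findAll('a')]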
UPDATE: re Daniele's comment, if you want to make sure you don't have any Nones in the list, then you could modify the list comprehension thus:
    from BeautifulSoup import BeautifulSoup
    soup = BeautifulSoup(html)
    anchors = [a for a in (td.find('a') for td in soup.findAll('td')) if a]
Which basically just adds a check to see if you have an actual element returned by td.find('a').