Beautiful Soup find children for particular div

Tags:

I have am trying to parse a webpage that looks like this with Python->Beautiful Soup: enter image description here

I am trying to extract the contents of the highlighted td div. Currently I can get all the divs by

alltd = soup.findAll('td')      for td in alltd:     print td

But I am trying to narrow the scope of that to search the tds in the class "tablebox" which still will probably return 30+ but is more managable a number than 300+.

How can I extract the contents of the highlighted td in picture above?

947

asked Nov 02 '12 19:11

Nick

1 Answers

It is useful to know that whatever elements BeautifulSoup finds within one element still have the same type as that parent element - that is, various methods can be called.

So this is somewhat working code for your example:

soup = BeautifulSoup(html) divTag = soup.find_all("div", {"class": "tablebox"})  for tag in divTag:     tdTags = tag.find_all("td", {"class": "align-right"})     for tag in tdTags:         print tag.text

This will print all the text of all the td tags with the class of "align-right" that have a parent div with the class of "tablebox".

109

answered Sep 25 '22 15:09

Bo Milanovich

Related questions
                            
                                Python - What is the process to create pdf reports with charts from a DB?
                            
                                Longest word chain from a list of words
                            
                                When to use weak references in Python?
                            
                                Python: How exactly can you take a string, split it, reverse it and join it back together again?
                            
                                All example concurrent.futures code is failing with "BrokenProcessPool"
                            
                                Django: AppRegistryNotReady()
                            
                                Spyder 5 missing dependencies - spyder_kernels version error [closed]
                            
                                What does the ** maths operator do in Python?
                            
                                What is the best way to do automatic attribute assignment in Python, and is it a good idea?
                            
                                Automatically import models on Django shell launch
                            
                                Heroku & Django: "OSError: No such file or directory: '/app/{myappname}/static'"
                            
                                How can I pass parameters to a RequestHandler?
                            
                                How to activate different anaconda environment from powershell
                            
                                How do I set the content-type for POST requests in python-requests library?
                            
                                No module named 'tqdm'
                            
                                Using monotonically_increasing_id() for assigning row number to pyspark dataframe
                            
                                Read a file on App Engine with Python?
                            
                                Use fnmatch.filter to filter files by more than one possible file extension
                            
                                Python: Iterating through a dictionary gives me "int object not iterable"
                            
                                Can Pylint error checking be customized?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Beautiful Soup find children for particular div

Tags:

python

parsing

beautifulsoup

Nick

People also ask

1 Answers

Bo Milanovich

Recent Activity

Donate For Us