beautiful soup getting tag.id

Tags:

I'm attempting to get a list of div ids from a page. When I print out the attributes, I get the ids listed.

for tag in soup.find_all(class_="bookmark blurb group") :   print(tag.attrs)

results in:

Click to copy

{'id': 'bookmark_8199633', 'role': 'article', 'class': ['bookmark', 'blurb', 'group']} {'id': 'bookmark_7744613', 'role': 'article', 'class': ['bookmark', 'blurb', 'group']} {'id': 'bookmark_7338591', 'role': 'article', 'class': ['bookmark', 'blurb', 'group']} {'id': 'bookmark_7338535', 'role': 'article', 'class': ['bookmark', 'blurb', 'group']} {'id': 'bookmark_4530078', 'role': 'article', 'class': ['bookmark', 'blurb', 'group']}

So I know there ARE ids. However, when I print out tag.id instead, I just get a list of "None". What am I doing wrong here?

738

asked Jul 25 '14 18:07

klreeher

2 Answers

You can access tag’s attributes by treating the tag like a dictionary (documentation):

Click to copy

for tag in soup.find_all(class_="bookmark blurb group") :     print tag.get('id')

The reason tag.id didn't work is that it is equivalent to tag.find('id'), which results into None since there is no id tag found (documentation).

168

answered Sep 23 '22 20:09

alecxe

This solution lists all tags with ids in a page , It might be helpful too.

Click to copy

tags = page_soup.find_all() for tag in tags:     if 'id' in tag.attrs:         print(tag.name,tag['id'],sep='->')

answered Sep 20 '22 20:09

Thunder

Related questions
                            
                                Open tor browser with selenium
                            
                                PEP 257 docstring trim in standard library?
                            
                                How to avoid floating point errors? [duplicate]
                            
                                Python's in (__contains__) operator returns a bool whose value is neither True nor False
                            
                                Pandas merge giving error "Buffer has wrong number of dimensions (expected 1, got 2)"
                            
                                Read excel sheet with multiple header using Pandas
                            
                                Do Python lambda functions help in reducing the execution times?
                            
                                Renaming file extension using pathlib (python 3)
                            
                                Easiest way to serialize a simple class object with simplejson?
                            
                                how to convert Python 3 to Python 2 code? [closed]
                            
                                Unwanted RST TCP packet with Scapy
                            
                                Changing variable name in Spyder
                            
                                PermissionError: [WinError 32] The process cannot access the file because it is being used by another process
                            
                                numpy testing assert array NOT equal
                            
                                Where are Pip installation logs?
                            
                                Add class to Django label_tag() output
                            
                                copy.deepcopy vs pickle
                            
                                expanding (adding a row or column) a scipy.sparse matrix
                            
                                Alembic --autogenerate producing empty migration
                            
                                'is' operator behaves differently when comparing strings with spaces

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

beautiful soup getting tag.id

Tags:

python

html

html-parsing

beautifulsoup

klreeher

People also ask

2 Answers

alecxe

Thunder

Recent Activity

Donate For Us