How to select tags by attribute value with Beautiful Soup

Tags:

python

beautifulsoup

I have the following HTML fragment:

>>> a
<div class="headercolumn">
<h2>
<a class="results" data-name="result-name" href="/xxy> my text</a>
</h2>

I am trying to select header column only if attribute data-name="result-name"

I've tried:

>>> a.select('a["data-name="result-name""]')

This gives:

ValueError: Unsupported or invalid CSS selector:

How can I get this working?

494

asked Jul 27 '14 18:07

user1592380

2 Answers

You can simply do this :

soup = BeautifulSoup(html)
results = soup.findAll("a", {"data-name" : "result-name"})

Source : How to find tags with only certain attributes - BeautifulSoup

154

answered Nov 15 '22 08:11

Azwr

html = """
<div class="headercolumn">
<h2>
<a class="results" data-name="result-name" href="/xxy> my text</a>
</h2>
"""

from bs4 import BeautifulSoup
soup = BeautifulSoup(html)
for d in soup.findAll("div",{"class":"headercolumn"}):
    print d.a.get("data-name")
    print d.select("a.results")

result-name
[<a class="results" data-name="result-name" href="/xxy&gt; my text&lt;/a&gt;&lt;/h2&gt;"></a>]

answered Nov 15 '22 07:11

Padraic Cunningham

Related questions
                            
                                How to unpack optional items from a tuple? [duplicate]
                            
                                Python - convert set-cookies response to dict of cookies
                            
                                Is there an easy way to unpack a tuple while using enumerate in loop?
                            
                                Kivy. Text provider error
                            
                                Python convex hull with scipy.spatial.Delaunay, how to eleminate points inside the hull?
                            
                                Why do I get E127 from this vimscript?
                            
                                Fourier smoothing of data set
                            
                                Python: case where x==y and x.__eq__y() return different things. Why?
                            
                                "python manage.py syncdb" not creating tables
                            
                                excluding url pattern from django app..is it possible?
                            
                                List of arguments with argparse
                            
                                How to detect if all the rows of a non-square matrix are orthogonal in python
                            
                                Django 'DateField' object has no attribute 'is_hidden'
                            
                                Pass extra values along with urls to scrapy spider
                            
                                add label to subplot in matplotlib
                            
                                pycrypto installation: configure error: cannot run C compiled programs
                            
                                py.test: hide stacktrace lines from unittest module
                            
                                HMAC signing requests in Python
                            
                                Stream multiple files into a readable object in Python
                            
                                Start python script with cron and output print to a file [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With