Extracting an attribute value with beautifulsoup

.find_all() returns a list of all matching elements, so:

input_tag = soup.find_all(attrs={"name" : "stainfo"})

input_tag is a list (probably containing only one element). Depending on what exactly you want, either index into that list:

output = input_tag[0]['value']

or use the .find() method, which returns only the first matching element:

input_tag = soup.find(attrs={"name": "stainfo"})
output = input_tag['value']
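
To try both calls without fetching a page, here is a minimal sketch that parses an inline HTML string. The form markup and the value "abc123" are made up for illustration. Note that the tag's name attribute has to go through attrs={...}, because the name keyword of find() and find_all() refers to the tag name itself. The sketch also guards against find() returning None when nothing matches.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for the real page.
html = '<form><input type="hidden" name="stainfo" value="abc123"/></form>'
soup = BeautifulSoup(html, "html.parser")

# find_all() always returns a list-like result, even for a single match.
matches = soup.find_all("input", attrs={"name": "stainfo"})
first_value = matches[0]["value"] if matches else None

# find() returns the first matching tag, or None when nothing matches.
tag = soup.find("input", attrs={"name": "stainfo"})
single_value = tag["value"] if tag is not None else None

# A lookup that matches nothing yields None rather than raising.
missing = soup.find("input", attrs={"name": "no-such-field"})

print(first_value, single_value, missing)
```

Checking for None before subscripting avoids a TypeError when the element is absent from the page.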

In Python 3.x, simply call get(attr_name) on each tag object returned by find_all:

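from bs4 import BeautifulSoup

# Read the whole XML file into a string.
with open('conf//test1.xml', 'r') as xmlFile:
    xmlData = xmlFile.read()

# html.parser lowercases tag names, so <repeatingElement> is matched as 'repeatingelement'.
xmlSoup = BeautifulSoup(xmlData, 'html.parser')

repElemList = xmlSoup.find_all('repeatingelement')

for repElem in repElemList:
    print("Processing repElem...")
    # get() returns None instead of raising KeyError if the attribute is missing.
    repElemID = repElem.get('id')
    repElemName = repElem.get('name')

    print("Attribute id = %s" % repElemID)
    print("Attribute name = %s" % repElemName)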

Run against an XML file conf//test1.xml that looks like:

<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<root>
    <singleElement>
        <subElementX>XYZ</subElementX>
    </singleElement>
    <repeatingElement id="11" name="Joe"/>
    <repeatingElement id="12" name="Mary"/>
</root>

this prints:

Processing repElem...
Attribute id = 11
Attribute name = Joe
Processing repElem...
Attribute id = 12
Attribute name = Mary
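
The difference between get() and subscript access matters when an attribute may be absent. The following is a small sketch, assuming a shortened variant of the XML above in which the second element has no name attribute; the "unknown" fallback is an arbitrary choice for illustration.

```python
from bs4 import BeautifulSoup

# Shortened, hypothetical variant of the XML: the second element lacks "name".
xml = '<root><repeatingElement id="11" name="Joe"/><repeatingElement id="12"/></root>'
soup = BeautifulSoup(xml, "html.parser")

# html.parser lowercases tag names, so search for the lowercase form.
elems = soup.find_all("repeatingelement")

# get() returns None (or a supplied default) for a missing attribute.
names = [e.get("name") for e in elems]
names_with_default = [e.get("name", "unknown") for e in elems]

# Subscript access raises KeyError for a missing attribute.
try:
    elems[1]["name"]
    raised = False
except KeyError:
    raised = True

print(names, names_with_default, raised)
```

Use get() when an attribute is optional, and subscript access when a missing attribute should be treated as an error.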

If you want to retrieve the values of an attribute from several matching tags, you can use find_all and a list comprehension to get everything you need:

from urllib.request import urlopen
from bs4 import BeautifulSoup

# Download the page and read the raw markup.
with urlopen("http://58.68.130.147") as f:
    s = f.read()

soup = BeautifulSoup(s, "html.parser")

# Match every tag whose name attribute is "stainfo".
inputTags = soup.find_all(attrs={"name": "stainfo"})
### You may be able to do find_all("input", attrs={"name": "stainfo"})

# Index by the attribute you want ("value"), not by the value of the name attribute.
output = [x["value"] for x in inputTags]

print(output)
### This will print a list of the values.
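
A list comprehension like this raises KeyError as soon as one matching tag has no value attribute. Here is a hedged sketch of one way to skip such tags, using an inline form invented for illustration in place of the downloaded page:

```python
from bs4 import BeautifulSoup

# Hypothetical form: three inputs named "stainfo", one of them without a value.
html = """
<form>
  <input name="stainfo" value="first"/>
  <input name="stainfo" value="second"/>
  <input name="stainfo"/>
  <input name="other" value="ignored"/>
</form>
"""
soup = BeautifulSoup(html, "html.parser")

# Keep only the tags that actually carry a "value" attribute.
values = [
    tag["value"]
    for tag in soup.find_all("input", attrs={"name": "stainfo"})
    if tag.has_attr("value")
]

print(values)
```

Filtering with has_attr() keeps the result a plain list of strings, whereas using get() inside the comprehension would leave None entries in it.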

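In my case the element looked like this:

<input id="color" value="Blue"/>

Its value can be fetched with the snippet below:

import requests
from bs4 import BeautifulSoup

page = requests.get("https://www.abcd.com")
soup = BeautifulSoup(page.content, 'html.parser')
# Look the element up by its id, then read the attribute like a dictionary key.
colorName = soup.find(id='color')
print(colorName['value'])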
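
Separating the parsing from the download makes the lookup easy to try offline. The helper below is a sketch only: the function name attribute_by_id and its signature are hypothetical, not part of BeautifulSoup.

```python
from bs4 import BeautifulSoup


def attribute_by_id(html, element_id, attribute):
    """Return the named attribute of the element with the given id, or None.

    Hypothetical helper for illustration; None is returned both when the
    element is missing and when it lacks the attribute.
    """
    soup = BeautifulSoup(html, "html.parser")
    tag = soup.find(id=element_id)
    return tag.get(attribute) if tag is not None else None


# The element from the answer above, as an inline string.
sample = '<input id="color" value="Blue"/>'
print(attribute_by_id(sample, "color", "value"))
```

The same function can be given page.content from a requests response once the lookup behaves as expected on a sample string.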
