Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Python Beautifulsoup Getting Attribute Value

I'm having difficulty getting the proper syntax to extract the value of an attribute in Beautifulsoup with HTML 5.0.

So I've isolated the occurrence of a tag in my soup using the proper syntax where there is an HTML 5 issue:

tags = soup.find_all(attrs={"data-topic":"recUpgrade"})

Taking just tags[1]:

date = tags[1].find(attrs={"data-datenews":True})

and date here is:

<span class="invisible" data-datenews="2018-05-25 06:02:19" data-idnews="2736625" id="horaCompleta"></span>

But now I want to extract the date time "2018-05-25 06:02:19". Can't get the syntax.

Insight/help please.

like image 568
Windstorm1981 Avatar asked Mar 06 '23 03:03

Windstorm1981


1 Answers

You can access the attrs using key-value pair

Ex:

from bs4 import BeautifulSoup
s = """<span class="invisible" data-datenews="2018-05-25 06:02:19" data-idnews="2736625" id="horaCompleta"></span>"""
soup = BeautifulSoup(s, "html.parser")
print(soup.span["data-datenews"])

Output:

2018-05-25 06:02:19
like image 153
Rakesh Avatar answered Mar 16 '23 23:03

Rakesh