Getting attribute's value using BeautifulSoup

Tags:

I'm writing a python script which will extract the script locations after parsing from a webpage. Lets say there are two scenarios :

<script type="text/javascript" src="http://example.com/something.js"></script>

and

<script>some JS</script>

I'm able to get the JS from the second scenario, that is when the JS is written within the tags.

But is there any way, I could get the value of src from the first scenario (i.e extracting all the values of src tags within script such as http://example.com/something.js)

Here's my code

#!/usr/bin/python

import requests 
from bs4 import BeautifulSoup

r  = requests.get("http://rediff.com/")
data = r.text
soup = BeautifulSoup(data)
for n in soup.find_all('script'):
    print n

Output : Some JS

Output Needed : http://example.com/something.js

695

asked Sep 11 '13 05:09

aditya.gupta

2 Answers

It will get all the src values only if they are present. Or else it would skip that <script> tag

from bs4 import BeautifulSoup
import urllib2
url="http://rediff.com/"
page=urllib2.urlopen(url)
soup = BeautifulSoup(page.read())
sources=soup.findAll('script',{"src":True})
for source in sources:
 print source['src']

I am getting following two src values as result

http://imworld.rediff.com/worldrediff/js_2_5/ws-global_hm_1.js
http://im.rediff.com/uim/common/realmedia_banner_1_5.js

I guess this is what you want. Hope this is useful.

answered Nov 07 '22 22:11

Venkateshwaran Selvaraj

Get 'src' from script node.

import requests 
from bs4 import BeautifulSoup

r  = requests.get("http://rediff.com/")
data = r.text
soup = BeautifulSoup(data)
for n in soup.find_all('script'):
    print "src:", n.get('src') <====

answered Nov 07 '22 23:11

rajpy

Related questions
                            
                                change matplotlib axis settings
                            
                                python about multiple %s in a string
                            
                                Problems Opening Firefox
                            
                                Creating my own "integer" object in Python
                            
                                Django auth.user with unique email
                            
                                Deleting Elements from an array
                            
                                Python - how to delete hidden signs from string?
                            
                                Replacing Filename characters with python
                            
                                Adding a string to a list using augmented assignment
                            
                                Admin interface for SQLAlchemy?
                            
                                How can I use xdotool from within a python module/script?
                            
                                How to convert (inherit) parent to child class?
                            
                                How can I pass configuration variable values into the pyodbc connect command?
                            
                                Get file size from "Content-Length" value from a file in python 3.2
                            
                                How to write to CSV and not overwrite past text
                            
                                Python printing without commas
                            
                                Python - Splitting List That Contains Strings and Integers
                            
                                Sending a Dictionary using Sockets in Python?
                            
                                Python: categorising a list by orders of magnitude
                            
                                Filtering Characters from a String [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Getting attribute's value using BeautifulSoup

Tags:

python

beautifulsoup

python-2.7

aditya.gupta

People also ask

2 Answers

Venkateshwaran Selvaraj

rajpy

Recent Activity

Donate For Us