Getting form "action" from BeautifulSoup result

I'm writing a Python parser to automate a task on a website, but I'm not very familiar with the "re" (regex) module and can't get it to work.

import urllib2
import BeautifulSoup

req = urllib2.Request(tl2)
req.add_unredirected_header('User-Agent', ua)
try:
    # urlopen() is the call that raises URLError, so it belongs inside the try
    response = urllib2.urlopen(req)
    html = response.read()
except urllib2.URLError, e:
    print "Error while reading data. Are you connected to the interwebz?!", e
    raise SystemExit(1)  # stop here, since html was never set

soup = BeautifulSoup.BeautifulSoup(html)
form = soup.find('form', id='form_product_page')
pret = form.prettify()

print pret

Result:

<form id="form_product_page" name="form_1362737440" action="/download/791055/164084/" method="get">
<input id="nojssubmit" type="submit" value="Download" />
</form>

That part works and is all I need to start with. Now I'm wondering how to extract the "action" attribute from the "form" tag. That is the only thing I need from the BeautifulSoup result.

I've tried form = soup.find('form', id='form_product_page').parent.get('action'), but the result was None. What I want to extract is, for example, "/download/791055/164084/". This value is different for every URL.

Variables (example):
tl2 = http://example.com
ua = Mozilla Firefox / 14.04

625

asked May 04 '14 23:05

sensation

1 Answer

You can do it in one step:

action = soup.find('form', id='form_product_page').get('action')
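
As a rough illustration of why the .parent attempt returned None, here is a minimal sketch. It assumes the newer bs4 package (BeautifulSoup 4) on Python 3 rather than the BeautifulSoup 3 import used in the question, and it parses a hard-coded copy of the form instead of a live page. The wrapping <div> is a made-up addition so that the form has a parent element to inspect.

```python
from bs4 import BeautifulSoup

# Sample markup modelled on the output in the question; the wrapping <div>
# is an assumption added so that the form has a visible parent element.
html = """
<div class="wrapper">
<form id="form_product_page" name="form_1362737440" action="/download/791055/164084/" method="get">
<input id="nojssubmit" type="submit" value="Download" />
</form>
</div>
"""

soup = BeautifulSoup(html, "html.parser")
form = soup.find("form", id="form_product_page")

# find() returns None when nothing matches, so guard before using the tag.
if form is None:
    raise SystemExit("form_product_page not found")

# .parent is the enclosing <div>, which has no "action" attribute,
# so .get() falls back to its default of None.
parent_action = form.parent.get("action")

# Asking the form tag itself returns the attribute value.
action = form.get("action")

print(parent_action)
print(action)
```

In other words, .parent moves up to whatever element contains the form, and that element has no "action" attribute. Calling .get('action') directly on the form tag is enough. form['action'] also works, but it raises KeyError if the attribute is missing, whereas .get() returns None.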

179

answered Oct 18 '22 03:10

Casimir et Hippolyte

Tags:

python

regex

beautifulsoup

web-scraping
