replace characters not working in python [duplicate]

Tags:

I am using beautiful soup and I am writing a crawler and have the following code in it:

  print soup.originalEncoding                 #self.addtoindex(page, soup)                   links=soup('a')             for link in links:                  if('href' in dict(link.attrs)):                                        link['href'].replace('..', '')                     url=urljoin(page, link['href'])                     if url.find("'") != -1:                         continue                     url = url.split('?')[0]                     url = url.split('#')[0]                     if url[0:4] == 'http':                         newpages.add(url)         pages = newpages

The link['href'].replace('..', '') is supposed to fix links that come out as ../contact/orderform.aspx, ../contact/requestconsult.aspx, etc. However, it is not working. Links still have the leading ".." Is there something I am missing?

567

asked Aug 26 '11 18:08

sdiener

1 Answers

string.replace() returns the string with the replaced values. It doesn't modify the original so do something like this:

link['href'] = link['href'].replace("..", "")

114

answered Oct 06 '22 16:10

joel goldstick

Related questions
                            
                                How can I select all checkboxes from a form using pure JavaScript (without JS frameworks)?
                            
                                onServiceConnected() not called
                            
                                Add different class to even and odd divs
                            
                                C: Multiple scanf's, when I enter in a value for one scanf it skips the second scanf [duplicate]
                            
                                Implementing a Countdown Timer in Objective-c?
                            
                                Is there an easier way to do boolean conversions?
                            
                                Accessing multiple property files with @PropertyResource in Spring
                            
                                How to turn off icon gloss effect in Xcode 5
                            
                                Modify output from Python Pandas describe
                            
                                what is the use of $this->uri->segment(3) in codeigniter pagination
                            
                                Is it possible to create multiple PendingIntents with the same requestCode and different extras?
                            
                                Method to return the equation of a straight line given two points

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With