
Automatically extracting feed links (Atom, RSS, etc.) from webpages [closed]

I have a huge list of URLs, and my task is to feed them to a Python script that should output the feed URLs for each page, if there are any. Is there an API, library, or existing code that can help?

Max asked Dec 10 '22 05:12



1 Answer

There's feedfinder:

>>> import feedfinder
>>>
>>> feedfinder.feed('scripting.com')
'http://scripting.com/rss.xml'
>>>
>>> feedfinder.feeds('delong.typepad.com')
['http://delong.typepad.com/sdj/atom.xml', 
 'http://delong.typepad.com/sdj/index.rdf', 
 'http://delong.typepad.com/sdj/rss.xml']
>>>
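
If feedfinder isn't available, the basic idea can be sketched with the standard library alone. This is not feedfinder's implementation, just a minimal sketch of feed autodiscovery under one assumption: that pages advertise their feeds with `<link rel="alternate" type="...">` tags. The names `FeedLinkParser`, `find_feed_links`, `feeds_for_url` and the `FEED_TYPES` set are made up for this example, and pages that only link to their feed from the body text won't be caught.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen

# MIME types treated as feeds here (an assumption; extend as needed).
FEED_TYPES = {
    "application/rss+xml",
    "application/atom+xml",
    "application/rdf+xml",
}


class FeedLinkParser(HTMLParser):
    """Collects hrefs of <link rel="alternate"> tags with a feed MIME type."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.feeds = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attr = {name: (value or "") for name, value in attrs}
        rels = attr.get("rel", "").lower().split()
        mime = attr.get("type", "").strip().lower()
        href = attr.get("href", "").strip()
        if "alternate" in rels and mime in FEED_TYPES and href:
            url = urljoin(self.base_url, href)  # resolve relative hrefs
            if url not in self.feeds:
                self.feeds.append(url)


def find_feed_links(html, base_url):
    """Return the feed URLs advertised in an HTML string, in document order."""
    parser = FeedLinkParser(base_url)
    parser.feed(html)
    return parser.feeds


def feeds_for_url(url, timeout=10):
    """Fetch a page and return its advertised feed URLs (needs network access)."""
    with urlopen(url, timeout=timeout) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        html = resp.read().decode(charset, errors="replace")
        return find_feed_links(html, resp.geturl())
```

For a big list of URLs you'd call `feeds_for_url` in a loop and wrap each call in a `try`/`except` so one dead site doesn't stop the whole run.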
David Johnstone answered Dec 11 '22 19:12
