python requests link headers

Tags:

python

hyperlink

header

python-requests

I'm trying to find the best way to capture the links listed in response headers, exactly like this one, and I'm using the Python requests module. The Link Headers section of the Requests documentation is here: docs.python-requests.org/en/latest/user/advanced/

But in my case, the response headers contain links like the ones below:

{'content-length': '12276', 'via': '1.1 varnish-v4', 'links': '<http://justblahblahblah.com/link8.html>;rel="last">,<http://justblahblahblah.com/link2.html>;rel="next">', 'vary': 'Accept-Encoding, Origin'}

Please notice the > after "last", which does not appear in the Requests examples. I just can't figure out how to handle this.

asked Aug 31 '15 13:08

user1819085

2 Answers

The requests library already provides a way to access the Link header:

response.links

It returns a dictionary of the parsed Link header values, keyed by each link's rel, so the URL you need can be read with

response.links['next']['url']
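As a minimal sketch of the above, the snippet below builds a Response object by hand (so it runs without a network call) and reads a well-formed Link header through response.links. The header values are illustrative, reusing the URLs from the question. One caveat worth checking against your own responses: response.links reads the header named Link, so a header named links, as in the question, is not picked up.

```python
import requests

# Build a Response by hand so the sketch runs without a network call.
response = requests.Response()
response.headers["Link"] = (
    '<http://justblahblahblah.com/link8.html>; rel="last", '
    '<http://justblahblahblah.com/link2.html>; rel="next"'
)

# response.links is a dict keyed by each link's rel value.
next_url = response.links["next"]["url"]
last_url = response.links["last"]["url"]
print(next_url)
print(last_url)

# A header named "links" (as in the question) is not read by response.links.
nonstandard = requests.Response()
nonstandard.headers["links"] = '<http://justblahblahblah.com/link2.html>;rel="next">'
print(nonstandard.links)
```

In that second case the raw value is still available as response.headers['links'] and has to be parsed separately.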

answered Sep 20 '22 14:09

Atul Mishra

You can parse the header's value manually. To make things easier, you might want to use requests' own parsing function, parse_header_links, as a reference.

Or you can do a find/replace to remove the stray > characters and then use the original parse_header_links:

In [1]: import requests

In [2]: d = {'content-length': '12276', 'via': '1.1 varnish-v4', 'links': '<http://justblahblahblah.com/link8.html>;rel="last">,<http://justblahblahblah.com/link2.html>;rel="next">', 'vary': 'Accept-Encoding, Origin'}

In [3]: requests.utils.parse_header_links(d['links'].rstrip('>').replace('>,<', ',<'))
Out[3]:
[{'rel': 'last', 'url': 'http://justblahblahblah.com/link8.html'},
 {'rel': 'next', 'url': 'http://justblahblahblah.com/link2.html'}]

If there might be a space or two between >, and <, a plain string replace will not match, and you need to do the replacement with a regular expression instead.
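One possible shape for that regular-expression variant is sketched below, using only the standard library. The helper name normalize_links is hypothetical (it is not part of requests), and the sample value with extra spaces around the comma is an assumption made for illustration.

```python
import re

def normalize_links(value):
    """Remove the stray '>' after each rel parameter, tolerating whitespace."""
    # Drop surrounding whitespace, then the stray '>' at the very end.
    value = value.strip().rstrip('>')
    # Collapse '>', optional spaces, ',', optional spaces, '<' into ',<'.
    return re.sub(r'>\s*,\s*<', ',<', value)

# Same header value as in the question, but with spaces around the comma.
raw = ('<http://justblahblahblah.com/link8.html>;rel="last"> , '
       '<http://justblahblahblah.com/link2.html>;rel="next">')
cleaned = normalize_links(raw)
print(cleaned)
```

The cleaned string can then be passed to requests.utils.parse_header_links in the same way as in the session above.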

answered Sep 21 '22 14:09

Alik
