Python Requests - ChunkedEncodingError(e) - requests.iter_lines

Tags:

I'm getting a ChunkedEncodingError(e) using Python requests. I'm using the following to rip down JSON:

r = requests.get(url, headers=auth, stream=True)

And the iterating over each line, using the carriage return as a delimiter, which is how this API distinguishes between distinct JSON events.

for d in r.iter_lines(delimiter="\n"):
    d += "\n"
    sock.send(d)

I'm delimiting on the carriage return and then adding it back in as the endpoint I'm pushing the logs to actually expects a carriage return at the end of each event also. This seems to work for roughly 100k log files. When I try to make a larger call I'll get this following thrown:

for d in r.iter_lines(delimiter="\n"):
logs_1           |   File "/usr/local/lib/python2.7/dist-packages/requests/models.py", line 783, in iter_lines
logs_1           |     for chunk in self.iter_content(chunk_size=chunk_size, decode_unicode=decode_unicode):
logs_1           |   File "/usr/local/lib/python2.7/dist-packages/requests/models.py", line 742, in generate
logs_1           |     raise ChunkedEncodingError(e)
logs_1           | requests.exceptions.ChunkedEncodingError: ('Connection broken: IncompleteRead(0 bytes read)', IncompleteRead(0 bytes read))

UPDATE: I've discovered the API is sending back a NoneType at some point as well. So how can I account for this null byte somewhere in the response without blowing everything up? Each individual event is ended with a \n, and I need to be able to inspect each even individually. Should I chunk the content instead of iter_lines? Then ensure there is no NoneType in the chunk? That way I don't try to iter_lines over a NoneType and it blows up?

993

asked Jun 12 '17 22:06

HectorOfTroy407

1 Answers

ChunkedEncodingError is caused by: httplib.IncompletedRead

enter image description here

import httplib

def patch_http_response_read(func):
    def inner(*args):
        try:
            return func(*args)
        except httplib.IncompleteRead, e:
            return e.partial
    return inner

httplib.HTTPResponse.read = patch_http_response_read(httplib.HTTPResponse.read)

I think this could be a patch. It allows you to deal with defective http servers.

Most servers transmit all data, but due implementation errors they wrongly close session and httplib raise error and bury your precious bytes.

103

answered Sep 22 '22 09:09

gushitong

Related questions
                            
                                Profiling memory usage on App Engine
                            
                                Python calculating Catalan Numbers
                            
                                How to check if a function is pure in Python?
                            
                                The differences between MySQLdb and mysqlconnector
                            
                                Import a module from a directory (package) one level up
                            
                                Can pandas.DataFrame have list type column?
                            
                                How to save and load MLLib model in Apache Spark?
                            
                                Add metadata comment to Numpy ndarray
                            
                                How to use technical indicators of TA-Lib with pandas in python
                            
                                How to send a colored text message?
                            
                                Jupyter: Write a custom magic that modifies the contents of the cell it's in
                            
                                zip_longest without fillvalue
                            
                                How to optimize multiprocessing in Python
                            
                                How to split a list into n groups in all possible combinations of group length and elements within group?
                            
                                Spyder 3 "Set Console Working Directory" not working
                            
                                How do I feed Tensorflow placeholders with numpy arrays?
                            
                                What should I put in the body of an abstract method?
                            
                                What's the difference between dummy variable and one-hot encoding?
                            
                                TypeError: init() missing 1 required positional argument: 'message' using Multiprocessing
                            
                                Pipe PIL images to ffmpeg stdin - Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python Requests - ChunkedEncodingError(e) - requests.iter_lines

Tags:

python

python-requests

chunked-encoding

http-chunked

HectorOfTroy407

People also ask

1 Answers

gushitong

Recent Activity

Donate For Us