
Set request timeout in Elasticsearch for bulk loads [duplicate]

I want to set the request timeout to 20 seconds or more for Elasticsearch bulk uploads. The default is 10 seconds, and the warning message shows the request taking just over 10 seconds. Right after the warning is displayed, execution throws an error.

Now, I want to set the request timeout for every request, either taking the value as user input or using a default value.

Error Message:

    WARNING:elasticsearch:HEAD /opportunityci/predictionsci [status:404 request:0.080s]
validated the index and mapping...!
WARNING:elasticsearch:POST http://192.168.204.154:9200/_bulk [status:N/A request:10.003s]
Traceback (most recent call last):
  File "/Users/adaggula/anaconda/lib/python2.7/site-packages/elasticsearch/connection/http_urllib3.py", line 94, in perform_request
    response = self.pool.urlopen(method, url, body, retries=False, headers=self.headers, **kw)
  File "/Users/adaggula/anaconda/lib/python2.7/site-packages/urllib3/connectionpool.py", line 640, in urlopen
    _stacktrace=sys.exc_info()[2])
  File "/Users/adaggula/anaconda/lib/python2.7/site-packages/urllib3/util/retry.py", line 238, in increment
    raise six.reraise(type(error), error, _stacktrace)
  File "/Users/adaggula/anaconda/lib/python2.7/site-packages/urllib3/connectionpool.py", line 595, in urlopen
    chunked=chunked)
  File "/Users/adaggula/anaconda/lib/python2.7/site-packages/urllib3/connectionpool.py", line 395, in _make_request
    self._raise_timeout(err=e, url=url, timeout_value=read_timeout)
  File "/Users/adaggula/anaconda/lib/python2.7/site-packages/urllib3/connectionpool.py", line 315, in _raise_timeout
    raise ReadTimeoutError(self, url, "Read timed out. (read timeout=%s)" % timeout_value)
ReadTimeoutError: HTTPConnectionPool(host='192.168.204.154', port='9200'): Read timed out. (read timeout=10)
ERROR:DataScience:init exception : Traceback (most recent call last):
  File "/Users/adaggula/Documents/workspace/LatestDemo/demo/com/ci/dataScience/engine/Driver.py", line 194, in <module>
    sample.persist(finalResults)
  File "/Users/adaggula/Documents/workspace/LatestDemo/demo/com/ci/dataScience/ES/sample.py", line 68, in persist
    res = helpers.bulk(client,data,stats_only=True)
  File "/Users/adaggula/anaconda/lib/python2.7/site-packages/elasticsearch/helpers/__init__.py", line 188, in bulk
    for ok, item in streaming_bulk(client, actions, **kwargs):
  File "/Users/adaggula/anaconda/lib/python2.7/site-packages/elasticsearch/helpers/__init__.py", line 160, in streaming_bulk
    for result in _process_bulk_chunk(client, bulk_actions, raise_on_exception, raise_on_error, **kwargs):
  File "/Users/adaggula/anaconda/lib/python2.7/site-packages/elasticsearch/helpers/__init__.py", line 89, in _process_bulk_chunk
    raise e
ConnectionTimeout: ConnectionTimeout caused by - ReadTimeoutError(HTTPConnectionPool(host='192.168.204.154', port='9200'): Read timed out. (read timeout=10))
asked Jul 12 '16 by Jack Daniel

People also ask

How to increase the default request timeout in Python for Elasticsearch?

1. Increase the default timeout globally when you create the ES client by passing the timeout parameter.
2. Set the timeout per request made by the client. Taken from the Elasticsearch Python docs:

    # only wait for 1 second, regardless of the client's default
    es.cluster.health(wait_for_status='yellow', request_timeout=1)
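A minimal sketch of the first option, assuming the elasticsearch-py client and the host from the question; timeout here sets the client-wide default read timeout:

    from elasticsearch import Elasticsearch

    # every request now gets a 20-second read timeout instead of
    # the library default of 10 seconds
    es = Elasticsearch(['http://192.168.204.154:9200'], timeout=20)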

What is the default wait time in Elasticsearch?

It defaults to 1m (one minute). This guarantees Elasticsearch waits for at least the timeout before failing; the actual wait time can be longer, particularly when multiple waits occur. Note this is the server-side timeout parameter, distinct from the client's request_timeout. The related wait_for_active_shards parameter (optional, string) sets the number of shard copies that must be active before proceeding with the operation.
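For illustration, a hedged example of passing the server-side timeout on a single index call, reusing the index and type names from the question (the document body is made up, and doc_type availability depends on the client version):

    # 'timeout' is the server-side wait (e.g. for shards to become
    # available); it is not the client-side socket read timeout
    es.index(index='opportunityci', doc_type='predictionsci',
             body={'prediction': 0.87}, timeout='1m')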

Can I override a global request timeout per request?

While you can specify the request timeout globally, you can also override it per request. For example, suppose a 10-node cluster has a global timeout of 20 seconds and each call to a node takes 10 seconds: the call can only be tried on 2 nodes before the overall request timeout kills the client call.
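A sketch of that interaction with elasticsearch-py (the second host is hypothetical); note that in the Python client the timeout applies per connection attempt, and retry_on_timeout lets a timed-out call move on to another node:

    from elasticsearch import Elasticsearch

    # 20-second read timeout per attempt; on a read timeout, retry
    # the call on another node, up to max_retries times
    es = Elasticsearch(
        ['http://192.168.204.154:9200', 'http://192.168.204.155:9200'],
        timeout=20,
        retry_on_timeout=True,
        max_retries=2,
    )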

How to increase indexing speed with the bulk API?

The bulk API makes it possible to perform many index/delete operations in a single API call, which can greatly increase indexing speed. Some of the officially supported clients provide helpers to assist with bulk requests and reindexing of documents from one index to another.
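A minimal sketch of the Python bulk helper; the index, type, and document fields are assumptions for illustration:

    from elasticsearch import Elasticsearch
    from elasticsearch.helpers import bulk

    es = Elasticsearch(['http://192.168.204.154:9200'])

    # each action dict becomes one operation in a single bulk call
    actions = [
        {'_index': 'opportunityci', '_type': 'predictionsci',
         '_source': {'prediction': p}}
        for p in (0.1, 0.5, 0.9)
    ]
    success, errors = bulk(es, actions, chunk_size=500)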


1 Answer

Use the request_timeout parameter.

E.g.:

    from elasticsearch.helpers import bulk

    bulk(es, records, chunk_size=500, request_timeout=20)
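Applied to the call in the question's traceback, the fix would presumably look like this; helpers.bulk forwards extra keyword arguments to the underlying client.bulk call:

    # override the client's 10-second default for this bulk load only
    res = helpers.bulk(client, data, stats_only=True, request_timeout=20)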
answered Sep 27 '22 by Lyncean Patel