Persistent/keepalive HTTP with the PHP Curl library?

Question

I'm using a simple PHP library to add documents to a Solr index via HTTP.

There are 3 servers involved, currently:

The PHP box running the indexing job
A database box holding the data being indexed
The Solr box

At 80 documents/sec (out of 1 million docs), I'm noticing an unusually high interrupt rate on the network interfaces of the PHP and Solr boxes (2000/sec; what's more, the graphs are nearly identical: when the interrupt rate on the PHP box spikes, it also spikes on the Solr box), but a much lower rate on the database box (300/sec). I imagine this is simply because I open and reuse a single connection to the database server, whereas every single Solr request currently opens a new HTTP connection via cURL, thanks to the way the Solr client library is written.

So, my question is:

Can cURL be made to open a keepalive session?
What does it take to reuse a connection? Is it as simple as reusing the cURL handle resource?
Do I need to set any special cURL options? (e.g. force HTTP 1.1?)
Are there any gotchas with cURL keepalive connections? This script runs for hours at a time; will I be able to use a single connection, or will I need to periodically reconnect?

Piskvor left the building · Accepted Answer

cURL PHP documentation (curl_setopt) says:

CURLOPT_FORBID_REUSE - TRUE to force the connection to explicitly close when it has finished processing, and not be pooled for reuse.

So:

Yes. cURL should reuse connections by default, as long as you reuse the cURL handle.
By default, cURL handles persistent connections by itself; should you need special headers, check CURLOPT_HTTPHEADER.
The server may enforce a keep-alive limit (with a default Apache install, it is 15 seconds or 100 requests, whichever comes first), but cURL will just open another connection when that happens.
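One way to check whether connections are actually being reused is to ask cURL how many new connections each transfer needed. The following is a minimal sketch, not a definitive test: the function name countNewConnections and the URL in the usage note are made up for illustration, and it assumes a PHP build where the CURLINFO_NUM_CONNECTS constant is available.

```php
<?php
// Sketch: run several requests on ONE cURL handle and count how many
// new connections libcurl had to open. countNewConnections() is a
// hypothetical helper, not part of any library.
function countNewConnections(string $url, int $requests): int
{
    $ch = curl_init();
    // Return the response body instead of printing it.
    curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
    // CURLOPT_FORBID_REUSE is left at its default (false), so the
    // connection stays available for the next request on this handle.

    $opened = 0;
    for ($i = 0; $i < $requests; $i++) {
        curl_setopt($ch, CURLOPT_URL, $url);
        if (curl_exec($ch) === false) {
            continue; // transfer failed; skip the counter for this round
        }
        // Number of connections created for the transfer that just finished.
        $opened += curl_getinfo($ch, CURLINFO_NUM_CONNECTS);
    }

    // The handle and its cached connection are released when $ch
    // goes out of scope at the end of this function.
    return $opened;
}
```

Calling something like countNewConnections('http://solr.example/solr/update', 50) against a server you control should return a number far below 50 if keep-alive is working; a result equal to the request count would suggest that every request opened a fresh connection.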

Richard Keizer · Answer

cURL sends the keep-alive header by default, but to actually reuse the connection you need to:

Create a cURL handle (the "context") using curl_init() without any parameters.
Store the handle in a scope where it will survive between requests (not a local variable).
Use the CURLOPT_URL option to pass the URL to the handle.
Execute the request using curl_exec().
Don't close the handle with curl_close() until all requests are done.

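A very basic example:

    function get($url) {
        global $context;
        curl_setopt($context, CURLOPT_URL, $url);
        return curl_exec($context);
    }

    $context = curl_init();
    // return the response body from curl_exec() instead of printing it
    curl_setopt($context, CURLOPT_RETURNTRANSFER, true);
    // multiple calls to get() here
    curl_close($context);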
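If a global variable is undesirable, the same idea can be expressed by keeping the handle in an object property, so it lives as long as the object does. This is only a sketch under assumptions: the class name PersistentHttpClient and its methods are hypothetical and are not part of any Solr client library, and the text/xml content type is just a placeholder default.

```php
<?php
// Hypothetical wrapper that keeps one cURL handle alive across requests.
final class PersistentHttpClient
{
    /** @var resource|\CurlHandle|null Lazily created, then reused. */
    private $handle = null;

    /** Returns the shared handle, creating it on first use. */
    public function getHandle()
    {
        if ($this->handle === null) {
            $this->handle = curl_init();
            // Return response bodies instead of printing them.
            curl_setopt($this->handle, CURLOPT_RETURNTRANSFER, true);
        }
        return $this->handle;
    }

    /** POSTs $body to $url on the shared handle and returns the response body. */
    public function post(string $url, string $body, string $contentType = 'text/xml'): string
    {
        $ch = $this->getHandle();
        curl_setopt_array($ch, [
            CURLOPT_URL        => $url,
            CURLOPT_POST       => true,
            CURLOPT_POSTFIELDS => $body,
            CURLOPT_HTTPHEADER => ['Content-Type: ' . $contentType],
        ]);
        $response = curl_exec($ch);
        if ($response === false) {
            throw new RuntimeException('cURL error: ' . curl_error($ch));
        }
        return $response;
    }
}
```

Create one PersistentHttpClient at the start of the indexing job and call post() for each document; because every call goes through the same handle, cURL can reuse the open connection and reconnect on its own when the server closes it.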

Tags: http, php, curl, libcurl, keep-alive

Frank Farmer
