In Kubernetes services talk to each other via a service ip. With iptables or something similar each TCP connection is transparently routed to one of the pods that are available for the called service. If the calling service is not closing the TCP connection (e.g. using TCP keepalive or a connection pool) it will connect to one pod and not use the other pods that may be spawned. What is the correct way to handle such a situation? <hr> My own unsatisfying ideas: <h3>Closing connection after each api call</h3> Am I making every call slower only to be able to distribute requests to different pods? Doesn't feel right. <h3>Minimum number of connections</h3> I could force the caller to open multiple connections (assuming it would then distribute the requests across these connections) but how many should be open? The caller has (and probably should not have) no idea how many pods there are. <h3>Disable bursting</h3> I could limit the resources of the called services so it gets slow on multiple requests and the caller will open more connections (hopefully to other pods). Again I don't like the idea of arbitrarily slowing down the requests and this will only work on cpu bound services.

The keep-alive behavior can be tuned by options specified in the Keep-Alive general header: E.g: <pre class="prettyprint"><code>Connection: Keep-Alive Keep-Alive: max=10, timeout=60 </code></pre> Thus, you could re-open a tcp connection after a specific timeout instead than at each API request or after a max number of http transactions. Keep in mind that timeout and max are not guaranteed. EDIT: Note that If you use k8s service you can choose two LB mode: <ul> <li>iptables proxy mode (By default, kube-proxy in iptables mode chooses a backend at random.)</li> <li>IPVS proxy mode where you have different load balancing options:</li> </ul> IPVS provides more options for balancing traffic to backend Pods; these are: rr: round-robin lc: least connection (smallest number of open connections) dh: destination hashing sh: source hashing sed: shortest expected delay nq: never queue check this link

How to manage persistent connections in kubernetes

Tags:

kubernetes

kube-proxy

In Kubernetes services talk to each other via a service ip. With iptables or something similar each TCP connection is transparently routed to one of the pods that are available for the called service. If the calling service is not closing the TCP connection (e.g. using TCP keepalive or a connection pool) it will connect to one pod and not use the other pods that may be spawned.

What is the correct way to handle such a situation?

My own unsatisfying ideas:

Closing connection after each api call

Am I making every call slower only to be able to distribute requests to different pods? Doesn't feel right.

Minimum number of connections

I could force the caller to open multiple connections (assuming it would then distribute the requests across these connections) but how many should be open? The caller has (and probably should not have) no idea how many pods there are.

Disable bursting

I could limit the resources of the called services so it gets slow on multiple requests and the caller will open more connections (hopefully to other pods). Again I don't like the idea of arbitrarily slowing down the requests and this will only work on cpu bound services.

614

asked Jul 23 '19 06:07

deflomu

1 Answers

The keep-alive behavior can be tuned by options specified in the Keep-Alive general header:

E.g:

Connection: Keep-Alive
Keep-Alive: max=10, timeout=60

Thus, you could re-open a tcp connection after a specific timeout instead than at each API request or after a max number of http transactions.

Keep in mind that timeout and max are not guaranteed.

EDIT:

Note that If you use k8s service you can choose two LB mode:

iptables proxy mode (By default, kube-proxy in iptables mode chooses a backend at random.)
IPVS proxy mode where you have different load balancing options:

IPVS provides more options for balancing traffic to backend Pods; these are:

rr: round-robin lc: least connection (smallest number of open connections) dh: destination hashing sh: source hashing sed: shortest expected delay nq: never queue

check this link

179

answered Sep 28 '22 20:09

melix

Related questions
                            
                                kubernetes mountPath vs hostPath
                            
                                What is the meaning of CPU and core in Kubernetes?
                            
                                Helm upgrade doesn't pull new container
                            
                                Kubernetes autoscaler targetCPUUtilizationPercentage
                            
                                The connection to the server localhost:8080 was refused
                            
                                How to resolve scheduler and controller-manager unhealthy state in Kubernetes [closed]
                            
                                Is there a way to add arbitrary records to kube-dns?
                            
                                How do I get the pod ID in Kubernetes?
                            
                                Error: Kubernetes cluster unreachable: Get "http://localhost:8080/version?timeout=32s": dial tcp 127.0.0.1:8080: connect: connection refused
                            
                                CoreDNS fails to run in Kubernetes cluster
                            
                                Ensure Kubernetes Deployment has completed and all pods are updated and available
                            
                                Draft and Helm vs Ksonnet? [closed]
                            
                                How do you get the Node IP from inside a Pod?
                            
                                How to set GOOGLE_APPLICATION_CREDENTIALS on GKE running through Kubernetes
                            
                                Disabling network logs on Kubernetes when running kubectl exec
                            
                                What is the difference between Mixer and Pilot in Istio?
                            
                                Pod limit on Node - AWS EKS
                            
                                Kubernetes master username and password
                            
                                Error: error installing: the server could not find the requested resource HELM Kubernetes
                            
                                Possible Memory Leak in Ignite DataStreamer

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With