
Appropriate Kubernetes Readiness and Liveness Probes for Kestrel .NET Core Web API

Our aim is to horizontally scale a .NET Core 2.0 Web API using Kubernetes. The Web API application will be served by Kestrel.

It looks like we can gracefully handle the termination of pods by configuring Kestrel's shutdown timeout, so now we are looking into how to probe the application to determine readiness and liveness.
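For reference, the shutdown timeout can be set on the web host in Program.cs, roughly like this (the 30-second value is just an illustration):

using System;
using Microsoft.AspNetCore;
using Microsoft.AspNetCore.Hosting;

public class Program
{
    public static void Main(string[] args) =>
        WebHost.CreateDefaultBuilder(args)
            // Give in-flight requests up to 30 seconds to finish after SIGTERM.
            .UseShutdownTimeout(TimeSpan.FromSeconds(30))
            .UseStartup<Startup>()
            .Build()
            .Run();
}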

Would it be enough to simply probe the Web API with an HTTP request? If so, would it be a good idea to create a new healthcheck controller to handle these probing requests, or would it make more sense to probe an actual endpoint that would be consumed in normal use?

What should we consider when differentiating between the liveness and readiness probes?

asked Dec 06 '17 by Alasdair Stark




2 Answers

I would recommend performing health checks through separate endpoints. In general, there are a number of good reasons for doing so, for example:

  1. Checking that the application is live/ready or, more generally, in a healthy state is not necessarily the same as sending a user request to your web service. When performing health checks you should define what makes your web service healthy: this could mean, for example, checking access to external resources, like a database.
  2. It is easier to control who can actually call your health-check endpoints.
  3. More generally, you do not want health checks to interfere with the actual service functionality: otherwise you would need to rethink your health checks every time the service's functionality changes. E.g. if your service interacts with a database, a health check should verify that the connection to the database is fine, without caring about the data being manipulated internally by your service.
  4. Things get even more complicated if your web service is not stateless: in that case, you will need to make sure data remains consistent independently of your health checks.

As you pointed out, a good way to address all of the above is to set up a separate Controller to handle health checks.
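A minimal sketch of such a controller might look like the following (CheckDependencies is a placeholder; what you actually verify depends on your service):

using Microsoft.AspNetCore.Mvc;

[Route("health")]
public class HealthController : Controller
{
    [HttpGet]
    public IActionResult Get()
    {
        // Placeholder: verify external dependencies (database, downstream services, ...).
        bool healthy = CheckDependencies();

        // 200 marks the instance healthy to Kubernetes, 503 unhealthy.
        if (!healthy)
        {
            return StatusCode(503);
        }
        return Ok("OK");
    }

    private static bool CheckDependencies() => true;
}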

As an alternative, there is a standard library for enabling Health Checks on an ASP.NET Core web service. At the time of writing this answer, it is not officially part of ASP.NET Core and no NuGet packages are available yet, but it is planned for a future release: it is currently scheduled to ship with ASP.NET Core 2.2, as described in the ASP.NET Core 2.2 Roadmap. For now, you can pull the code from the Official Repository and include it in your solution as explained in the Microsoft documentation.

I personally find it very elegant: you configure everything through Startup.cs and Program.cs and do not need to explicitly create a new endpoint, as the library already handles that for you.

I have been using it in a few projects and I would definitely recommend it. The repository includes an example specific to ASP.NET Core projects that you can use to get up to speed quickly.
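For reference, the API as it eventually shipped in ASP.NET Core 2.2 looks roughly like this (the "self" check is a trivial placeholder for your real dependency checks):

using Microsoft.AspNetCore.Builder;
using Microsoft.Extensions.DependencyInjection;
using Microsoft.Extensions.Diagnostics.HealthChecks;

public class Startup
{
    public void ConfigureServices(IServiceCollection services)
    {
        services.AddMvc();
        // Register health checks; "self" simply reports the process as healthy.
        services.AddHealthChecks()
            .AddCheck("self", () => HealthCheckResult.Healthy());
    }

    public void Configure(IApplicationBuilder app)
    {
        // The middleware exposes the endpoint; no dedicated controller is needed.
        app.UseHealthChecks("/health");
        app.UseMvc();
    }
}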

Liveness vs Readiness

In Kubernetes, you may then set up liveness and readiness probes over HTTP. As explained in the Kubernetes documentation, while the setup for both is almost identical, Kubernetes takes different actions depending on which probe fails:

Liveness probe from Kubernetes documentation:

Many applications running for long periods of time eventually transition to broken states, and cannot recover except by being restarted. Kubernetes provides liveness probes to detect and remedy such situations.

Readiness probe from Kubernetes documentation:

Sometimes, applications are temporarily unable to serve traffic. For example, an application might need to load large data or configuration files during startup. In such cases, you don’t want to kill the application, but you don’t want to send it requests either. Kubernetes provides readiness probes to detect and mitigate these situations. A pod with containers reporting that they are not ready does not receive traffic through Kubernetes Services.

So, while an unhealthy response to a liveness probe causes the container to be restarted, an unhealthy response to a readiness probe simply causes the Pod to stop receiving traffic until it gets back to a healthy status.

What to consider when differentiating liveness and readiness probes?

For the liveness probe: I would recommend defining what makes your application healthy, i.e. the minimum requirements for user consumption, and implementing health checks based on that. This typically involves external resources or applications running as separate processes, e.g. databases, web services, etc. You may define health checks using the ASP.NET Core Health Checks library or manually with a separate Controller.

For the readiness probe: you simply want to hit your service to verify that it responds in time, so that Kubernetes can balance traffic accordingly. Trivially (and in most cases, as suggested by Lukas in another answer), you may use the exact same endpoint you use for liveness while setting up different timeouts, but this really depends on your needs and requirements.
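If you do want separate endpoints for the two probes, a rough sketch following the split described above might look like this (SqlConnection and the "Default" connection string name are assumptions; swap in whatever dependencies your service actually has):

using System.Data.SqlClient;
using System.Threading.Tasks;
using Microsoft.AspNetCore.Mvc;
using Microsoft.Extensions.Configuration;

[Route("health")]
public class ProbeController : Controller
{
    private readonly string _connectionString;

    public ProbeController(IConfiguration configuration)
    {
        // "Default" is a hypothetical connection string name.
        _connectionString = configuration.GetConnectionString("Default");
    }

    // Liveness: verify the minimum requirements for user consumption,
    // here represented by database connectivity.
    [HttpGet("live")]
    public async Task<IActionResult> Live()
    {
        try
        {
            using (var connection = new SqlConnection(_connectionString))
            {
                await connection.OpenAsync();
            }
            return Ok("OK");
        }
        catch (SqlException)
        {
            return StatusCode(503);
        }
    }

    // Readiness: just verify the service responds in time.
    [HttpGet("ready")]
    public IActionResult Ready() => Ok("OK");
}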

answered Oct 13 '22 by smn.tino


Would it be enough to simply probe the Web API with an HTTP request? If so, would it be a good idea to create a new healthcheck controller to handle these probing requests?

My recommendation would be to provide a /health endpoint in your application, separate from your application endpoints. This is useful if you want to block consumers from calling your internal health endpoint. You can then configure Kubernetes to query your HTTP /health endpoint as in the example below.

apiVersion: v1
kind: Pod
metadata:
  name: goproxy
spec:
  containers:
  - name: goproxy
    image: k8s.gcr.io/goproxy:0.1
    ports:
    - name: http
      containerPort: 8080
    readinessProbe:
      httpGet:
        port: http
        path: /health
      # Give the application 60 seconds to start before the first probe.
      initialDelaySeconds: 60
    livenessProbe:
      httpGet:
        port: http
        path: /health

Inside your /health endpoint you should check the internal state of your application and return a status code of either 200 if everything is OK or 503 if your application is having issues. Keep in mind that health checks are usually performed every 10 to 15 seconds for every instance (Kubernetes' periodSeconds defaults to 10), so if you perform expensive operations to determine your application's state you might slow down your application.
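To keep frequent probes cheap, one option is to cache the result of an expensive dependency check for a few seconds. A rough sketch using IMemoryCache (the 10-second window and CheckDependencies are illustrative assumptions; services.AddMemoryCache() must be registered in Startup.cs):

using System;
using Microsoft.AspNetCore.Mvc;
using Microsoft.Extensions.Caching.Memory;

[Route("health")]
public class HealthController : Controller
{
    private readonly IMemoryCache _cache;

    public HealthController(IMemoryCache cache) => _cache = cache;

    [HttpGet]
    public IActionResult Get()
    {
        // The expensive check runs at most once every 10 seconds;
        // in between, probes are answered from the cache.
        bool healthy = _cache.GetOrCreate("health", entry =>
        {
            entry.AbsoluteExpirationRelativeToNow = TimeSpan.FromSeconds(10);
            return CheckDependencies();
        });

        return healthy ? (IActionResult)Ok("OK") : StatusCode(503);
    }

    private static bool CheckDependencies()
    {
        // Placeholder: probe the database, message broker, etc.
        return true;
    }
}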

What should we consider when differentiating between the liveness and readiness probes

Usually the only difference between the liveness and readiness probes is the timing configured for each. If your application needs 60 seconds to start, for example, you would set the readiness probe's initialDelaySeconds to 60 while keeping the liveness probe's defaults.

answered Oct 13 '22 by Lukas Eichler