Understanding when to use stateful services and when to rely on external persistence in Azure Service Fabric

Tags:

I'm spending my evenings evaluating Azure Service Fabric as a replacement for our current WebApps/CloudServices stack, and feel a little bit unsure about how to decide when services/actors with state should be stateful actors, and when they should be stateless actors with externally persisted state (Azure SQL, Azure Storage and DocumentDB). I know this is a fairly new product (to the general public at least), so there's probably not a lot of best practices in regards to this yet, but I've read through most of the documentation made available by Microsoft without finding a definite answer for this.

The current problem domain I'm approaching is our event store; parts of our applications are based on event sourcing and CQRS, and I'm evaluating how to move this event store over to the Service Fabric platform. The event store is going to contain a lot time series-data, and as it's our only source of truth for the data being persisted there it must be consistent, replicated and stored to some form of durable storage.

One way I have considered doing this is with stateful "EventStream" actor; each instance of an aggregate using event sourcing stores its events within an isolated stream. This means the stateful actor could keep track of all the events for its own stream, and I'd have met my requirements as to how the data is stored (transactional, replicated and durable). However, some streams may grow very large (hundreds of thousands, if not millions, of events), and this is where I'm starting to get unsure. Having an actor with a large amount of state will, I imagine, have impacts on the performance of the system when these large data models needs to be serialized to or deserialized from disk.

Another option is to keep these actors stateless, and have them just read their data from some external storage like Azure SQL - or just go with stateless services instead of actors.

Basically, when is the amount of state for an actor/service "too much" and you should start considering other ways of handling state?

Also, this section in the Service Fabric Actors design pattern: Some anti-patterns documentation leave me a little bit puzzled:

Treat Azure Service Fabric Actors as a transactional system. Azure Service Fabric Actors is not a two phase commit-based system offering ACID. If we do not implement the optional persistence, and the machine the actor is running on dies, its current state will go with it. The actor will be coming up on another node very fast, but unless we have implemented the backing persistence, the state will be gone. However, between leveraging retries, duplicate filtering, and/or idempotent design, you can achieve a high level of reliability and consistency.

What does "if we do not implement the optional persistance" indicate here? I was under the impression that as long as your transaction modifying the state succeeded, your data was persisted to durable storage and replicated to at least a subset of the replicas. This paragraph leaves me wondering if there are situations where state within my actors/services will get lost, and if this is something I need to handle myself. The impression I got from the stateful model in other parts of the documentation seems to counteract this statement.

691

asked May 05 '15 11:05

Trond Nordheim

2 Answers

One option that you have is to keep 'some' of the state in the actor (let's say what could be considered to be hot data that needs to be quickly available) and store everything else on a 'traditional' storage infrastructure such as SQL Azure, DocDB, .... It is difficult to have a general rule about too much local state but, maybe, it helps to think about hot vs. cold data. Reliable Actors also offer the ability to customize the StateProvider so you can also consider implementing a customized StateProvider (by implementing the IActorStateProvider) with the specific policies that you need to be more efficient with the requirements that you have in terms of amount of data, latency, reliability and so on (note: documentation is still very minimal on the StateProvider interface but we can publish some sample code if this is something you want to pursue).

About the anti-patterns: the note is more about implementing transactions across multiple actors. Reliable Actors provides full guarantee on reliability of the data within the boundaries of an actor. Because of the distributed and loosly coupled nature of the Actor model, implementing transactions that involve multiple actors is not a trivial task. If 'distributed' transactions is a strong requirement, the Reliable Services programming model is probably a better fit.

152

answered Oct 03 '22 08:10

clca

I know this has been answered, but recently found myself in the same predicament with a CQRS/ES system and here's how I went about it:

Each Aggregate was an actor with only the current state stored in it.
On a command, the aggregate would effect a state change and raise an event.
Events themselves were stored in a DocDb.
On activation, AggregateActor instances read events from DocDb if available to recreate its state. This is obviously only performed once per actor activation. This took care of the case where an actor instance is migrated from one node to another.

answered Oct 03 '22 08:10

Raghu

Related questions
                            
                                Where is the Microsoft.IdentityModel dll
                            
                                The subscription is not registered to use namespace 'Microsoft.DataFactory error
                            
                                Use SQL Server Management Studio to connect remotely to an SQL Server Express instance hosted on an Azure Virtual Machine
                            
                                Azure Storage Blob Rename
                            
                                How do I construct an ISO 8601 datetime in C++?
                            
                                CI/CD of a ASP.NET Core Web API using VSTS
                            
                                Windows Azure - The current service model is out of sync
                            
                                Web Publish password not the same as my Azure admin password?
                            
                                Clean Windows Azure Website
                            
                                Azure: Moving an App Service to another existing App Service Plan
                            
                                "Use a tenant-specific endpoint or configure the application to be multi-tenant" when signing into my Azure website
                            
                                How to get all rows in Azure table Storage in C#?
                            
                                'Unable to Authenticate' when trying to connect to Azure DevOps Artifacts feed through npm; I get an E401 error
                            
                                TokenValidationParameters no longer working after upgrade to 5.0.0
                            
                                How to enable gzip HTTP compression on Windows Azure dynamic content
                            
                                Windows Azure or Amazon EC2 for ASP.NET MVC Development?
                            
                                Sending email from Azure
                            
                                Azure WebApp Asp.NET Core 2 error: An error occurred while starting the application
                            
                                How to switch accounts via VS Code Azure Account Extension
                            
                                SQL Azure table size

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Understanding when to use stateful services and when to rely on external persistence in Azure Service Fabric

Tags:

azure

azure-service-fabric

Trond Nordheim

People also ask

2 Answers

clca

Raghu

Recent Activity

Donate For Us