Apache Flink: Why to choose the MemoryStateBackend over the FsStateBackend?

Tags:

Flink has a MemoryStateBackend and a FsStateBackend (and a RocksDBStateBackend). Both seem to extend the HeapKeyedStateBackend, i.e. the mechanism for storing the current working state is entirely the same.

This SO answer says that the main difference lies in the MemoryStateBackend keeping a copy of the checkpoints in the JobManagers memory. (I wasn't able to glean any evidence for that from the source code.) The MemoryStateBackend also limits the maximum state size per subtask.

Now I wonder: Why would you ever want to use the MemoryStateBackend?

914

asked Jan 23 '19 01:01

Caesar

1 Answers

As you said, both MemoryStateBackend and FSStateBackend are based on HeapKeyedStateBackend. This means, that both state backends maintain the state of an operator as regular objects on the JVM heap of the TaskManager, i.e., state is always accessed in memory.

The backends differ in how they persist the state for checkpoints. A checkpoint is a copy of the state of all operators of an application that is stored somewhere. In case of a failure, the application is restarted and the state of the operators is initialized from the checkpoint.

The FSStateBackend stores the checkpoint in a file system, typically HDFS, S3, or a NFS that is mounted on all worker nodes. The MemoryStateBackend stores the state in the JVM of the JobManager. This has the following pros and cons:

Pros:

No need to setup a (distributed) file system.
No need to configure a storage location.

Cons:

State is lost if the JobManager process dies.
Size of state is bound by the size of the JobManager memory.

Since checkpoints are lost if the JM goes down, the MemoryStateBackend is unsuitable for most production use cases. It can be useful for developing and testing stateful applications, because it requires not configuration or setup.

answered Sep 30 '22 09:09

Fabian Hueske

Related questions
                            
                                Flink exactly-once message processing
                            
                                Flink Error - Key group is not in KeyGroupRange
                            
                                What is Apache Flink's detached mode?
                            
                                Apache Flink: My application does not resume from a checkpoint when I restart it
                            
                                What do terms like Hash, Forward mean in the Flink plan?
                            
                                Apache Flink Rest-Client Jar-Upload not working
                            
                                Apache Flink: ClassNotFoundException on remote cluster
                            
                                java.lang.ClassNotFoundException: com.fasterxml.jackson.databind.ser.FilterProvider when flink boot up
                            
                                apache flink - the correct way of error handling
                            
                                Integration - Apache Flink + Spring Boot
                            
                                Iterator behaviour in flink reduceGroup
                            
                                Flink Windows Boundaries, Watermark, Event Timestamp & Processing Time
                            
                                Flink job with CassandrSink fails with Error writing
                            
                                flink kafka consumer groupId not working
                            
                                Can I use Flink state to perform join?
                            
                                Get file name of DataStream with Flink
                            
                                How does Apache Flink implement iteration?
                            
                                How Apache Flink deal with skewed data?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Apache Flink: Why to choose the MemoryStateBackend over the FsStateBackend?

Tags:

apache-flink

flink-streaming

Caesar

People also ask

1 Answers

Fabian Hueske

Recent Activity

Donate For Us