Let's say the partition has 4 replicas (1 leader, 3 followers) and all are currently in sync. min.insync.replicas is set to 3 and request.required.acks is set to all (-1).
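Concretely, I mean a setup like this (a sketch; the topic and host names are just placeholders, and min.insync.replicas is a broker/topic setting while request.required.acks is the old producer-side name for acks):
$ ./bin/kafka-configs.sh --zookeeper zookeeper-1 --alter --entity-type topics --entity-name topic1 --add-config min.insync.replicas=3
$ ./bin/kafka-console-producer.sh --broker-list kafka-1:9092 --topic topic1 --request-required-acks -1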
The producer sends a message to the leader, and the leader appends it to its log. After that, two of the replicas crash before they can fetch this message. The one remaining replica successfully fetches the message and appends it to its own log.
The leader, after a certain timeout, will send an error (NotEnoughReplicas, I think) to the producer, since the min.insync.replicas condition is not met.
My question is: what happens to the message that was appended to the logs of the leader and the surviving replica?
Will it be delivered to consumers when the crashed replicas come back online and the broker starts accepting and committing new messages (i.e., the high watermark advances in the log)?
Several variables can cause data loss in Kafka, including offset handling, consumer auto-commit configuration, producer acknowledgements, and replication.
At-Least-Once Delivery in Apache Kafka
At-least-once delivery requires the producer to maintain extra state about message status and to resend failed messages. This means that at-least-once delivery sacrifices some performance in exchange for the guarantee that all messages will be delivered.
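With the console producer used below, that resend behavior is controlled by explicit retry options (a sketch; the retry count and backoff values are arbitrary, and the option names are those of the older tooling shown in this answer):
$ ./bin/kafka-console-producer.sh --broker-list kafka-1:9092 --topic topic1 --request-required-acks -1 --message-send-max-retries 5 --retry-backoff-ms 200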
The Kafka cluster retains all published messages, whether or not they have been consumed, for a configurable period of time. For example, if log retention is set to two days, then for two days after a message is published it is available for consumption; after that it is discarded to free up space.
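For instance, a two-day retention (172800000 ms) can be set per topic with the stock tooling (a sketch reusing the topic1 name from the test below):
$ ./bin/kafka-configs.sh --zookeeper zookeeper-1 --alter --entity-type topics --entity-name topic1 --add-config retention.ms=172800000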
Kafka provides "at least once" delivery semantics. This means that a message that is sent may delivered one or more times. What people really want is "exactly once" semantics whereby duplicate messages are not delivered.
If fewer than min.insync.replicas replicas are in sync and the producer uses acks=all, then the message is not committed and consumers will not receive it, even after the crashed replicas come back and rejoin the ISR list. You can test this in the following way.
Start two brokers with min.insync.replicas = 2
$ ./bin/kafka-server-start.sh ./config/server-1.properties
$ ./bin/kafka-server-start.sh ./config/server-2.properties
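If you'd rather not edit the two properties files, the same setting can, as far as I know, be passed as a command-line override to the stock start script:
$ ./bin/kafka-server-start.sh ./config/server-1.properties --override min.insync.replicas=2
$ ./bin/kafka-server-start.sh ./config/server-2.properties --override min.insync.replicas=2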
Create a topic with 1 partition and RF=2. Make sure both brokers are in the ISR list.
$ ./bin/kafka-topics.sh --zookeeper zookeeper-1 --create --topic topic1 --partitions 1 --replication-factor 2
Created topic "topic1".
$ ./bin/kafka-topics.sh --zookeeper zookeeper-1 --describe --topic topic1
Topic:topic1 PartitionCount:1 ReplicationFactor:2 Configs:
Topic: topic1 Partition: 0 Leader: 1 Replicas: 1,2 Isr: 1,2
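Alternatively, min.insync.replicas can be set per topic at creation time instead of broker-wide (a variant of the same create command):
$ ./bin/kafka-topics.sh --zookeeper zookeeper-1 --create --topic topic1 --partitions 1 --replication-factor 2 --config min.insync.replicas=2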
Run a console consumer and a console producer. Make sure the producer uses acks=-1:
$ ./bin/kafka-console-consumer.sh --new-consumer --bootstrap-server kafka-1:9092,kafka-2:9092 --topic topic1
$ ./bin/kafka-console-producer.sh --broker-list kafka-1:9092,kafka-2:9092 --topic topic1 --request-required-acks -1
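(On newer Kafka versions the --new-consumer flag has been removed and the consumer command is simply the following; the producer command is unchanged.)
$ ./bin/kafka-console-consumer.sh --bootstrap-server kafka-1:9092,kafka-2:9092 --topic topic1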
Produce some messages. The consumer should receive them.
Kill one of the brokers (I killed the broker with id=2). Check that the ISR list has shrunk to one broker:
$ ./bin/kafka-topics.sh --zookeeper zookeeper-1 --describe --topic topic1
Topic:topic1 PartitionCount:1 ReplicationFactor:2 Configs:
Topic: topic1 Partition: 0 Leader: 1 Replicas: 1,2 Isr: 1
Try to produce again. In the producer you should get several
Error: NOT_ENOUGH_REPLICAS
errors (one per retry) and finally
Messages are rejected since there are fewer in-sync replicas than required.
The consumer will not receive these messages.
Restart the killed broker and try to produce again. The consumer will receive the new messages, but not the ones you sent while one of the replicas was down.
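Before producing again, you can confirm the restarted replica has rejoined the ISR (same describe command as above; expected output sketched):
$ ./bin/kafka-topics.sh --zookeeper zookeeper-1 --describe --topic topic1
Topic:topic1 PartitionCount:1 ReplicationFactor:2 Configs:
Topic: topic1 Partition: 0 Leader: 1 Replicas: 1,2 Isr: 1,2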