Kafka only once consumption guarantee

Tags:

apache-kafka

I see in some answers around stack-overflow and in general in the web the idea that Kafka does not support consumption acknowledge or that exactly once consumption is hard to achieve.

In the following entry as a sample Is there any reason to use RabbitMQ over Kafka?, I can read the following statements:

RabbitMQ will keep all states about consumed/acknowledged/unacknowledged messages while Kafka doesn't

Exactly once guarantees are hard to get with Kafka.

This is not what I understand by reading the official Kafka documentation at: https://kafka.apache.org/documentation/#design_consumerposition

The previous documentation states that Kafka does not use a traditional acknowledge implementation (as RabbitMQ). Instead they rely on the relationship partition-consumer and offset...

This makes the equivalent of message acknowledgements very cheap

Could somebody please explain why "only once consumption guarantee" in Kafka is difficult to achieve? and How this differs from Kafka vs other more traditional Message Broker as RabbitMQ? What am I missing?

979

asked Feb 10 '17 17:02

Teimatini Marin

1 Answers

If you mean exactly once the problem is like this. Kafka consumer as you may know use a polling mechanism, that is consumers ask the server for messages. Also, you need to recall that the consumer commit message offsets, that is, it tells the cluster what is the next expected offset. So, imagine what could happen.

Consumer poll for messages and get message with offset = 1.

A) If consumer commit that offset immediately before processing the message, then it can crash and will never receive that message again because it was already committed, on next poll Kafka will return message with offset = 2. This is what they call at most once semantic.

B) If consumer process the message first and then commit the offset, what could happen is that after processing the message but before committing, the consumer crashes, so in that case next poll will get again the same message with offset = 1 and that message will be processed twice. This is what they call at least once.

138

answered Nov 16 '22 01:11

Luciano Afranllie

Related questions
                            
                                Amazon Kinesis vs AWS Manage Service Kafka (MSK) - (Connect from on-prem)
                            
                                Spring Boot Kafka Listener vs Consumer
                            
                                How to configure kafka topic retention policy during creation with Spring?
                            
                                Kafka message codec - compress and decompress
                            
                                Kafka-python get number of partitions for topic
                            
                                Kafka multiple consumers for a partition
                            
                                Kafka consumer - what's the relation of consumer processes and threads with topic partitions
                            
                                I can't run zookeeper
                            
                                org.apache.kafka.common.config.ConfigException: Missing required configuration "bootstrap.servers" which has no default value
                            
                                max.poll.intervals.ms set to int.Max by default
                            
                                Consuming from single kafka partition by multiple consumers
                            
                                Kafka - Docker - Error when sending message from Host to Container (Batch Expired)
                            
                                Spring Boot Kafka: Unable to start consumer due to NoSuchBeanDefinitionException
                            
                                How to reset offsets to arbitrary value in Kafka Consumer Group?
                            
                                Does Kafka support secure communication?
                            
                                Kafka Consumer hanging at .hasNext in java
                            
                                How to get kafka consume lag in java program
                            
                                Error when Spark 2.2.0 standalone mode write Dataframe to local single-node Kafka
                            
                                RecordTooLargeException in Kafka streams join
                            
                                How to manage Kafka KStream to Kstream windowed join?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With