Difference between session.timeout.ms and max.poll.interval.ms for Kafka >= 0.10.1

Tags:

I am unclear why we need both session.timeout.ms and max.poll.interval.ms and when would we use one or the other or both? It seems like both settings indicate the upper bound on the time the coordinator will wait to get the heartbeat from a consumer before assuming it's dead.

Also how does it behave for versions 0.10.1.0+ based on KIP-62?

381

asked Sep 27 '16 16:09

Deeps

1 Answers

Before KIP-62, there is only session.timeout.ms (ie, Kafka 0.10.0 and earlier). max.poll.interval.ms is introduced via KIP-62 (part of Kafka 0.10.1).

KIP-62, decouples heartbeats from calls to poll() via a background heartbeat thread, allowing for a longer processing time (ie, time between two consecutive poll()) than heartbeat interval.

Assume processing a message takes 1 minute. If heartbeat and poll are coupled (ie, before KIP-62), you will need to set session.timeout.ms larger than 1 minute to prevent consumer to time out. However, if a consumer dies, it also takes longer than 1 minute to detect the failed consumer.

KIP-62 decouples polling and heartbeat allowing to send heartbeats between two consecutive polls. Now you have two threads running, the heartbeat thread and the processing thread and thus, KIP-62 introduced a timeout for each. session.timeout.ms is for the heartbeat thread while max.poll.interval.ms is for the processing thread.

Assume, you set session.timeout.ms=30000, thus, the consumer heartbeat thread must sent a heartbeat to the broker before this time expires. On the other hand, if processing of a single message takes 1 minutes, you can set max.poll.interval.ms larger than one minute to give the processing thread more time to process a message.

If the processing thread dies, it takes max.poll.interval.ms to detect this. However, if the whole consumer dies (and a dying processing thread most likely crashes the whole consumer including the heartbeat thread), it takes only session.timeout.ms to detect it.

The idea is, to allow for a quick detection of a failing consumer even if processing itself takes quite long.

Implemenation Detail

The new timeout max.poll.interval.ms is mainly a client side concept: if poll() is not called within max.poll.interval.ms, the heartbeat thread will detect this case and send a leave-group request to the broker. -- max.poll.interval.ms is still relevant for consumer group rebalances: if a rebalance is triggered, consumers have max.poll.interval.ms time to re-join the group by calling poll() client side which triggers a join-group request.

answered Nov 05 '22 13:11

Matthias J. Sax

Related questions
                            
                                Is Apache Kafka appropriate for use as an unordered task queue?
                            
                                How to handle HTTP requests in a Microservice / Event Driven Architecture?
                            
                                Running into LeaderNotAvailableException when using Kafka 0.8.1 with Zookeeper 3.4.6
                            
                                Kafka in Docker not working
                            
                                What command shows all of the topics and offsets of partitions in Kafka?
                            
                                What does "Rebalancing" mean in Apache Kafka context?
                            
                                ActiveMQ vs Apollo vs Kafka
                            
                                changing kafka retention period during runtime
                            
                                NServiceBus and Rabbit MQ or Kafka
                            
                                Can multiple Kafka consumers read same message from the partition
                            
                                When/how does a topic "marked for deletion" get finally removed?
                            
                                Kafka or SNS or something else? [closed]
                            
                                How to view kafka message
                            
                                How to check if ZooKeeper is running or up from command prompt?
                            
                                How to change the number of replicas of a Kafka topic?
                            
                                Kafka consumer list
                            
                                What is the difference between Apache kafka vs ActiveMQ
                            
                                In Apache Kafka why can't there be more consumer instances than partitions?
                            
                                How to list all available Kafka brokers in a cluster?
                            
                                Difference between stream processing and message processing

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Difference between session.timeout.ms and max.poll.interval.ms for Kafka >= 0.10.1

Tags:

apache-kafka

kafka-consumer-api

Deeps

People also ask

1 Answers

Matthias J. Sax

Recent Activity

Donate For Us