During rolling upgrade/restart, how to detect when a kafka broker is "done"?

Tags:

I need to automate a rolling restart of a kafka cluster (3 kafka brokers). I can easily do it manually - restart one after the other, while checking the log to see when it's fine (e.g., when the new process has joined the cluster).

What is a good way to automate this check? How can I ask the broker whether it's up and running, connected to its peers, all topics up-to-date and such? In my restart script, I have access to the metrics, but to be frank, I did not really see one there which gives me a clear picture.

Another way would be to ask what a good "readyness" probe would be that does not simply check some TCP/IP port, but looks at the actual server...

708

asked Jan 18 '19 08:01

AnoE

1 Answers

I would suggest exposing JMX metrics and tracking the following for cluster health

the controller count (must be 1 over the whole cluster)
under replicated partitions (should be zero for healthy cluster)
unclean leader elections (if you don't disable this in server.properties make sure there are none in the metric counts)
ISR shrinks within a reasonable time period, like 10 minute window (should be none)

Also, Yelp has tooling for rolling restarts implemented in Python, which requires Jolokia JMX Agents installed on the brokers, and it polls the metrics to make sure some of the above conditions are true

183

answered Nov 15 '22 08:11

OneCricketeer

Related questions
                            
                                How can I get the offset value in KStream
                            
                                How to start Zookeeper and then Kafka?
                            
                                Is KafkaTemplate thread safe
                            
                                Getting Illegal initial character: in schema parsing
                            
                                How to write streaming dataset to Kafka?
                            
                                Kafka Producer Exception NoClassDefFoundError
                            
                                kafka-console-producer ignores value serializer?
                            
                                How do I configure spring-kafka to ignore messages in the wrong format?
                            
                                Maximum subscription limit of Kafka Topics Per Consumer
                            
                                KafkaIO checkpoint - how to commit offsets to Kafka
                            
                                Kafka connector logs
                            
                                Kafka Streams closing processor's state store
                            
                                How to join multiple Kafka topics?
                            
                                Kafka Streams: Is it possible to have "compact,delete" policy on state stores?
                            
                                Automatic change kafka topic partition leader
                            
                                Use message key in Kafka connect source connector
                            
                                Adding new ZooKeeper node in Kafka cluster?
                            
                                Kafka ordering with multiple producers on same topic and parititon
                            
                                Kafka broker auto scaling
                            
                                Are there any problems with this way of starting an infinite loop in a Spring Boot application?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

During rolling upgrade/restart, how to detect when a kafka broker is "done"?

Tags:

upgrade

apache-kafka

AnoE

People also ask

1 Answers

OneCricketeer

Recent Activity

Donate For Us