 

Kafka broker auto scaling

Tags:

apache-kafka

I am looking for suggestions on auto-scaling Kafka brokers up and down based on load.

Let's say we have an e-commerce site on which we capture certain activities or events, and these events are sent to Kafka. During peak hours/days the site traffic is much higher, so running a Kafka cluster with a fixed number of brokers at all times is not ideal. We want to scale up the number of brokers when site traffic is high and scale it down when traffic is low.

How do people solve this kind of issue? I am not able to find any resources on this topic. Any help will be greatly appreciated.

asked Nov 13 '18 by Kalaiselvam M


People also ask

Does Kafka auto scale?

You can't implement automatic scaling when you create a cluster. You must first create the cluster, and then create and enable an auto-scaling policy for it. However, you can create the policy while the Amazon MSK service creates your cluster.
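For illustration, here is a minimal boto3 sketch of what such a policy might look like. Note that MSK's auto-scaling policies target broker storage, not broker count; the cluster ARN, capacity limits, and target value below are all placeholder assumptions:

```python
# Hypothetical sketch: enabling MSK broker-storage auto scaling via the
# Application Auto Scaling API. All values here are made-up placeholders.
import boto3

CLUSTER_ARN = "arn:aws:kafka:us-east-1:123456789012:cluster/demo/abc"  # placeholder

autoscaling = boto3.client("application-autoscaling")

# Register the cluster's broker storage as a scalable target.
autoscaling.register_scalable_target(
    ServiceNamespace="kafka",
    ResourceId=CLUSTER_ARN,
    ScalableDimension="kafka:broker-storage:VolumeSize",
    MinCapacity=100,    # current per-broker volume size in GiB (assumption)
    MaxCapacity=1000,   # upper bound storage may be scaled to (assumption)
)

# Attach a target-tracking policy: expand storage when utilization passes ~60%.
autoscaling.put_scaling_policy(
    PolicyName="msk-storage-scaling",
    ServiceNamespace="kafka",
    ResourceId=CLUSTER_ARN,
    ScalableDimension="kafka:broker-storage:VolumeSize",
    PolicyType="TargetTrackingScaling",
    TargetTrackingScalingPolicyConfiguration={
        "TargetValue": 60.0,
        "PredefinedMetricSpecification": {
            "PredefinedMetricType": "KafkaBrokerStorageUtilization"
        },
    },
)
```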

How do I scale a Kafka connection?

To scale the Kafka Connect side, you have to increase the number of tasks, ensuring that there are sufficient partitions. In theory you could set the number of partitions to a large number up front, but in practice this is a bad idea.
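As a sketch of those two steps, assuming a hypothetical topic `site-events` and connector `site-events-sink`, you could add partitions with the confluent-kafka AdminClient and then raise `tasks.max` through the Kafka Connect REST API:

```python
# Minimal sketch: grow the topic (partitions can only be increased, never
# decreased), then bump the connector's tasks.max. Topic name, connector
# name, and hosts are hypothetical.
import requests
from confluent_kafka.admin import AdminClient, NewPartitions

admin = AdminClient({"bootstrap.servers": "broker1:9092"})

# Grow the topic to 12 partitions so up to 12 sink tasks can read in parallel.
futures = admin.create_partitions([NewPartitions("site-events", 12)])
for topic, future in futures.items():
    future.result()  # raises if the partition increase failed

# Raise tasks.max on the connector; Connect rebalances tasks automatically.
connect_url = "http://connect:8083/connectors/site-events-sink/config"
config = requests.get(connect_url).json()
config["tasks.max"] = "12"
requests.put(connect_url, json=config).raise_for_status()
```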

Is Kafka horizontally scalable?

This is all well and great, but stripped down to its core, Kafka is a distributed, horizontally scalable, fault-tolerant commit log. Those were some fancy words – let's go at them one by one and see what they mean.


1 Answer

Kafka doesn't really work that way. Adding/removing brokers from the cluster is a very hands-on process, and it creates a lot of additional load/overhead on the cluster, so you wouldn't want the cluster to be automatically scaling up or down by itself.

The main reason it creates so much overhead is that adding or removing brokers requires lots of data copying across the cluster, on top of the normal traffic. All the data from a dead broker needs to be copied somewhere else to keep the same replication factor for the topic/partitions, and if it's a new broker, data needs to be shuffled into it from the other brokers so that the load on the cluster as a whole is reduced. All this data being copied around creates lots of IO/CPU load on the cluster, which can be enough to cause significant problems.
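To make the copying concrete: a new broker holds no partitions until you explicitly reassign some onto it, typically with Kafka's stock kafka-reassign-partitions.sh tool, and every replica that moves is streamed over the network. Below is a small sketch (topic name and broker IDs are hypothetical) of building the JSON plan that tool consumes:

```python
# Illustrative sketch: move one replica of each partition of "site-events"
# onto a newly added broker (ID 4) by writing the reassignment JSON that
# kafka-reassign-partitions.sh consumes. Every replica move copies that
# partition's data across the network.
import json

reassignment = {
    "version": 1,
    "partitions": [
        {"topic": "site-events", "partition": 0, "replicas": [1, 2, 4]},
        {"topic": "site-events", "partition": 1, "replicas": [2, 3, 4]},
        {"topic": "site-events", "partition": 2, "replicas": [3, 1, 4]},
    ],
}

with open("reassignment.json", "w") as f:
    json.dump(reassignment, f, indent=2)

# Then run the tool (older Kafka versions take --zookeeper instead):
#   kafka-reassign-partitions.sh --bootstrap-server broker1:9092 \
#       --reassignment-json-file reassignment.json --execute
```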

The best way to handle this scenario is to do performance testing and optimization with 2x or even 3x the traffic you'd expect during peak hours, and to build out the cluster accordingly. That way you'll have plenty of headroom for sudden spikes, and you won't have to scale out or scale in at all.
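As a back-of-the-envelope illustration of that sizing exercise, where every input number is an assumption you'd replace with your own measurements:

```python
# Rough capacity sketch: size the cluster for 3x expected peak so spikes
# never force a scale-out. All inputs are made-up assumptions.
peak_msgs_per_sec = 200_000      # expected peak event rate (assumption)
avg_msg_bytes = 1_000            # average event size (assumption)
replication_factor = 3           # each write lands on 3 brokers
headroom = 3                     # provision for 3x the expected peak

ingress = peak_msgs_per_sec * avg_msg_bytes              # producer bytes/s
cluster_write = ingress * replication_factor * headroom  # incl. replica copies

per_broker_write = 50 * 1024**2  # sustainable write budget per broker (assumption)

brokers = -(-cluster_write // per_broker_write)  # ceiling division
print(f"cluster write load: {cluster_write / 1024**2:.0f} MiB/s -> {brokers} brokers")
```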

Kafka is extremely performant, even for traffic of millions of messages per second, so you will probably find that the cluster size your application/system requires is not as large/expensive as you initially thought.

answered Sep 21 '22 by mjuarez