I am connecting to Kafka using the 0.8.2.1 kafka-clients library. I am able to successfully connect when Kafka is up, but I want to handle failure gracefully when Kafka is down. Here is my configuration:
Properties kafkaProperties = new Properties();
kafkaProperties.setProperty(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG, kafkaUrl);
kafkaProperties.setProperty(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG, "org.apache.kafka.common.serialization.StringSerializer");
kafkaProperties.setProperty(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG, "org.apache.kafka.common.serialization.StringSerializer");
kafkaProperties.setProperty(ProducerConfig.RETRIES_CONFIG, "3");
producer = new KafkaProducer<String, String>(kafkaProperties);
When Kafka is down, I get the following error in my logs:
WARN: 07 Apr 2015 14:09:49.230 org.apache.kafka.common.network.Selector:276 - [] Error in I/O with localhost/127.0.0.1
java.net.ConnectException: Connection refused
at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) ~[na:1.7.0_75]
at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) ~[na:1.7.0_75]
at org.apache.kafka.common.network.Selector.poll(Selector.java:238) ~[kafka-clients-0.8.2.1.jar:na]
at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:192) [kafka-clients-0.8.2.1.jar:na]
at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:191) [kafka-clients-0.8.2.1.jar:na]
at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:122) [kafka-clients-0.8.2.1.jar:na]
at java.lang.Thread.run(Thread.java:745) [na:1.7.0_75]
This error repeats in an infinite loop and locks up my Java application. I have tried various configuration settings related to timeouts, retries, and acknowledgements, but I have been unable to prevent this loop from occurring.
Is there a configuration setting that can prevent this? Do I need to try a different version of the client? How can a Kafka outage be handled gracefully?
Processing failed messages can be achieved by cloning the message and republishing it to one of the retry topics, with updated information about the attempt number and the next retry timestamp. Consumers of the retry topics should block the thread until it is time to process the message; the producer side of that pattern is sketched below.
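A rough sketch of the republish step (the retry-topic naming scheme and the "attempt|notBefore|body" payload layout are invented for this example; the classes come from org.apache.kafka.clients.producer):

// Republish a failed message to a retry topic, carrying the attempt count
// and the earliest time at which it should be processed again.
public void scheduleRetry(KafkaProducer<String, String> producer,
                          String key, String body,
                          int attempt, long retryDelayMs) {
    String retryTopic = "orders-retry-" + attempt;            // e.g. orders-retry-1, orders-retry-2
    long notBefore = System.currentTimeMillis() + retryDelayMs;
    String payload = attempt + "|" + notBefore + "|" + body;  // retry metadata travels with the message
    producer.send(new ProducerRecord<String, String>(retryTopic, key, payload));
}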
If a consumer crashes or is shut down, its partitions will be reassigned to another member of the group, which will resume consumption from the last committed offset of each partition. If the consumer crashes before any offset has been committed, the consumer that takes over its partitions falls back to the offset reset policy (auto.offset.reset).
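For reference, a minimal sketch of the consumer settings involved (the values follow the newer 0.9+ consumer API conventions, so they are not directly usable with the 0.8.2.1 client; the group id is made up):

Properties consumerProps = new Properties();
consumerProps.setProperty("bootstrap.servers", kafkaUrl);
consumerProps.setProperty("group.id", "order-processors");    // members of a group share the topic's partitions
consumerProps.setProperty("enable.auto.commit", "false");     // commit explicitly only after a message is processed
consumerProps.setProperty("auto.offset.reset", "earliest");   // applied when no committed offset exists for a partition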
If a single broker fails, data should not be lost: for fault tolerance, each partition is replicated across several brokers. If the leader broker for a partition fails, the controller elects one of the replicas as the new leader.
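The producer also has to opt in to that durability guarantee; a minimal sketch, assuming the topic was created with a replication factor greater than one:

// Wait for the in-sync replicas to acknowledge each write, so a message
// acknowledged to the producer survives the loss of the partition leader.
kafkaProperties.setProperty(ProducerConfig.ACKS_CONFIG, "all");
// Note: replication.factor and min.insync.replicas are set when the topic is
// created on the brokers, not in the producer configuration.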
I figured out that this combination of settings allows the Kafka client to fail quickly without holding the thread or spamming the logs:
kafkaProperties.setProperty(ProducerConfig.METADATA_FETCH_TIMEOUT_CONFIG, "300");   // give up on a send after 300 ms if metadata cannot be fetched
kafkaProperties.setProperty(ProducerConfig.TIMEOUT_CONFIG, "300");                  // how long the broker waits for follower acknowledgements
kafkaProperties.setProperty(ProducerConfig.RETRY_BACKOFF_MS_CONFIG, "10000");       // wait 10 s before retrying a failed request or metadata refresh
kafkaProperties.setProperty(ProducerConfig.RECONNECT_BACKOFF_MS_CONFIG, "10000");   // wait 10 s before re-attempting a broken broker connection
I dislike that the Kafka client blocks the calling thread while trying to connect to the broker, rather than being fully asynchronous, but this at least is functional.
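To keep failures off the calling thread as far as possible, sends can also be made with a callback instead of blocking on the returned Future (with 0.8.2, send() may still block briefly while fetching metadata, which is what metadata.fetch.timeout.ms above bounds). A sketch with a hypothetical topic name and logger:

// Failures are reported asynchronously on the producer's I/O thread.
producer.send(new ProducerRecord<String, String>("events", key, value), new Callback() {
    @Override
    public void onCompletion(RecordMetadata metadata, Exception exception) {
        if (exception != null) {
            log.warn("Send failed, queueing message for retry", exception);
        }
    }
});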