 

How to handle backpressure in a Kafka Connect Sink?

We build a custom Kafka Connect sink which in turn calls a remote REST API. How do I propagate backpressure to the Kafka Connect infrastructure, so that put() is called less often when the remote system is slower than the rate at which the internal consumer delivers messages to put()?

The Kafka Connect documentation says that we should not block in put(), but rather block in flush(). But not blocking in put() means that we have to buffer data, which will surely lead to OOM exceptions at some point if put() is called more often than flush().

I've seen that a Kafka consumer is allowed to call pause() or block in the poll loop. Is it possible to leverage this in a Kafka Connect sink?
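For context, here is a minimal sketch of the SinkTask contract being described (the class name is illustrative and the bodies are stubs):

```java
import java.util.Collection;
import java.util.Map;

import org.apache.kafka.clients.consumer.OffsetAndMetadata;
import org.apache.kafka.common.TopicPartition;
import org.apache.kafka.connect.sink.SinkRecord;
import org.apache.kafka.connect.sink.SinkTask;

public class RestSinkTask extends SinkTask {
    @Override
    public void put(Collection<SinkRecord> records) {
        // called repeatedly as Connect's internal consumer polls;
        // the docs say this should not block
    }

    @Override
    public void flush(Map<TopicPartition, OffsetAndMetadata> offsets) {
        // called before offsets are committed; blocking here is allowed
    }

    @Override
    public void start(Map<String, String> props) { }

    @Override
    public void stop() { }

    @Override
    public String version() { return "0.1"; }
}
```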

asked Apr 19 '18 by longliveenduro

People also ask

How does Kafka sink connector work?

A sink connector delivers data from Kafka topics into other systems, which might be indexes such as Elasticsearch, batch systems such as Hadoop, or any kind of database. Some connectors are maintained by the community, while others are supported by Confluent or its partners.

Is Kafka connect reliable?

Kafka Connect is a tool for scalably and reliably streaming data between Apache Kafka® and other data systems. It makes it simple to quickly define connectors that move large data sets in and out of Kafka.

What is the difference between Kafka and Kafka connect?

Kafka Streams is the Streams API used to transform, aggregate, and process records from a stream and produce derivative streams. Kafka Connect is the connector API used to create reusable producers and consumers (e.g., a stream of changes from DynamoDB). The Kafka REST Proxy is used to produce and consume over REST (HTTP).


1 Answer

I've seen that a Kafka consumer is allowed to call pause() or block in the poll loop. Is it possible to leverage this in a Kafka Connect sink?

The raw consumer is not exposed to a sink task, so no. You could call the /pause endpoint of the Connect REST API on the whole connector, though I'm not sure what happens to un-flushed messages at that point.
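For illustration, pausing a connector through the Connect REST API could look like the following (the connector name my-http-sink and the localhost:8083 address are placeholders for your deployment). Note this pauses every task of the connector, so it's a coarse lever rather than per-partition backpressure:

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class ConnectorPauser {
    public static void main(String[] args) throws Exception {
        HttpClient client = HttpClient.newHttpClient();
        // PUT /connectors/{name}/pause stops the connector's tasks from
        // consuming; PUT /connectors/{name}/resume starts them again.
        HttpRequest pause = HttpRequest.newBuilder()
                .uri(URI.create("http://localhost:8083/connectors/my-http-sink/pause"))
                .PUT(HttpRequest.BodyPublishers.noBody())
                .build();
        HttpResponse<Void> response =
                client.send(pause, HttpResponse.BodyHandlers.discarding());
        System.out.println("Pause returned HTTP " + response.statusCode()); // 202 on success
    }
}
```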

But not blocking in put() means that we have to buffer data, which will surely lead to OOM exceptions at some point

It can, sure, but in-memory buffering is really the only viable option when you have to hold on to data for longer than a single put() call. For instance, this is how the S3 and HDFS connectors work: they buffer records and commit them to the target system periodically, controlled by settings such as the one below (a sketch of the pattern follows the excerpt).

rotate.interval.ms
The time interval in milliseconds to invoke file commits...
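A sketch of that buffering pattern in a sink task, assuming a bounded in-memory buffer and using Connect's RetriableException together with context.timeout() to back off when the buffer fills; the class name, buffer limit, and sendToRemoteApi helper are illustrative:

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.List;
import java.util.Map;

import org.apache.kafka.clients.consumer.OffsetAndMetadata;
import org.apache.kafka.common.TopicPartition;
import org.apache.kafka.connect.errors.RetriableException;
import org.apache.kafka.connect.sink.SinkRecord;
import org.apache.kafka.connect.sink.SinkTask;

public class BufferedRestSinkTask extends SinkTask {
    private static final int MAX_BUFFERED = 10_000; // illustrative: size to your heap budget
    private final List<SinkRecord> buffer = new ArrayList<>();

    @Override
    public void put(Collection<SinkRecord> records) {
        if (buffer.size() + records.size() > MAX_BUFFERED) {
            // Bound the buffer: ask the framework to back off and redeliver
            // this batch later instead of growing memory without limit.
            context.timeout(5_000L);
            throw new RetriableException("buffer full, applying backpressure");
        }
        buffer.addAll(records);
    }

    @Override
    public void flush(Map<TopicPartition, OffsetAndMetadata> offsets) {
        // Blocking is permitted here: drain the buffer to the remote system,
        // then let Connect commit the offsets.
        sendToRemoteApi(buffer); // hypothetical helper wrapping the REST call
        buffer.clear();
    }

    private void sendToRemoteApi(List<SinkRecord> records) {
        // issue the (blocking) HTTP request(s) here
    }

    @Override
    public void start(Map<String, String> props) { }

    @Override
    public void stop() { }

    @Override
    public String version() { return "0.1"; }
}
```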

Your HTTP client is likely blocking while it makes the request anyway, is it not?

The alternative would be to make your HTTP server embed a Kafka consumer so it can poll messages itself and act on them locally rather than needing to be sent requests over HTTP.
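A rough sketch of that alternative, with placeholder bootstrap, group, and topic settings; blocking in the handler naturally throttles poll(), which is the backpressure you were after:

```java
import java.time.Duration;
import java.util.List;
import java.util.Properties;

import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.clients.consumer.KafkaConsumer;

public class EmbeddedConsumer {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092"); // placeholder
        props.put("group.id", "http-server-group");       // placeholder
        props.put("key.deserializer",
                "org.apache.kafka.common.serialization.StringDeserializer");
        props.put("value.deserializer",
                "org.apache.kafka.common.serialization.StringDeserializer");

        try (KafkaConsumer<String, String> consumer = new KafkaConsumer<>(props)) {
            consumer.subscribe(List.of("events")); // placeholder topic
            while (true) {
                ConsumerRecords<String, String> records =
                        consumer.poll(Duration.ofMillis(500));
                for (ConsumerRecord<String, String> record : records) {
                    // Processing inline throttles poll() naturally; for very slow
                    // processing, pause() the assignment and keep polling so the
                    // consumer stays within max.poll.interval.ms.
                    handle(record.value());
                }
            }
        }
    }

    private static void handle(String payload) {
        // act on the message locally instead of receiving it over HTTP
    }
}
```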

answered Sep 23 '22 by OneCricketeer