I have a Kafka consumer that I create on a schedule. It attempts to consume all of the new messages that have been added since the last commit was made.
I would like to shut the consumer down once it consumes all of the new messages in the log instead of waiting indefinitely for new messages to come in.
I'm having trouble finding a solution via Kafka's documentation.
I see a number of timeout related properties available in the Confluent.Kafka.ConsumerConfig and ClientConfig classes, including FetchWaitMaxMs, but am unable to decipher which to use. I'm using the .NET client.
Any advice would be appreciated.
When a consumer commits an offset to Kafka, it records how far it has read; the next time a consumer in the same group starts, consumption resumes from the committed offset. Within a running session, poll() advances through the log in memory regardless of commits, but if the consumer never commits, it will receive the same messages again after a restart or rebalance.
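To make that concrete, here is a minimal sketch of explicit offset commits with the Confluent.Kafka .NET client. The broker address, group id, and topic name are placeholders, not values from the question:

```csharp
using Confluent.Kafka;

var config = new ConsumerConfig
{
    BootstrapServers = "localhost:9092",  // placeholder
    GroupId = "scheduled-consumer",       // placeholder
    AutoOffsetReset = AutoOffsetReset.Earliest,
    EnableAutoCommit = false              // commit explicitly after processing
};

using var consumer = new ConsumerBuilder<Ignore, string>(config).Build();
consumer.Subscribe("my-topic");           // placeholder

var result = consumer.Consume(TimeSpan.FromSeconds(5));
if (result != null)
{
    // process result.Message.Value ...
    consumer.Commit(result);  // records result.Offset + 1 as the group's position
}
consumer.Close();
```

Committing after processing (rather than auto-committing on an interval) gives at-least-once delivery: a crash between processing and commit means the message is redelivered, never silently skipped.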
I have found a solution. Version 1.0.0-beta2 of Confluent's .NET Kafka library provides a method called .Consume(TimeSpan timeout). It returns null if no message arrives within the timeout, i.e. when there are no new messages to consume or we're at the partition EOF. I was previously using the .Consume(CancellationToken cancellationToken) overload, which blocked indefinitely and prevented me from shutting down the consumer. More here: https://github.com/confluentinc/confluent-kafka-dotnet/issues/614#issuecomment-433848857
Another option is to upgrade to version 1.0.0-beta3, which adds a boolean flag on the ConsumeResult object called IsPartitionEOF. This is what I was initially looking for: a way to know when I've reached the end of the partition.
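Putting the two together, here is a minimal sketch of the shutdown loop. It assumes a 1.0+ release of Confluent.Kafka (where the beta-era API settled into ConsumerBuilder); the broker address, group id, and topic name are placeholders:

```csharp
using Confluent.Kafka;

var config = new ConsumerConfig
{
    BootstrapServers = "localhost:9092",  // placeholder
    GroupId = "scheduled-consumer",       // placeholder
    AutoOffsetReset = AutoOffsetReset.Earliest,
    EnablePartitionEof = true  // emit an EOF result when a partition is exhausted
};

using var consumer = new ConsumerBuilder<Ignore, string>(config).Build();
consumer.Subscribe("my-topic");           // placeholder

while (true)
{
    // Returns null if nothing (not even an EOF event) arrives within the timeout.
    var result = consumer.Consume(TimeSpan.FromSeconds(5));
    if (result == null || result.IsPartitionEOF)
        break;  // caught up: shut down instead of waiting for new messages

    // process result.Message.Value ...
}
consumer.Close();  // leave the group cleanly and commit final offsets
```

Note that IsPartitionEOF fires per partition, so with a multi-partition topic, breaking on the first EOF can exit before the other assigned partitions are drained; you may want to track EOF per partition and stop only once every assigned partition has reported it.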
I have never used the .NET client, but assuming it isn't all that different from the Java client, the poll() method should accept a timeout value in milliseconds, so setting that to 5000 should work in most cases. There's no need to fiddle with the config classes.
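The .NET client does mirror this: its Consume method has an overload taking a timeout in milliseconds (alongside the TimeSpan one) and returns null when nothing arrives in time. A minimal sketch; the broker address, group id, and topic name are placeholders:

```csharp
using Confluent.Kafka;

var config = new ConsumerConfig
{
    BootstrapServers = "localhost:9092",  // placeholder
    GroupId = "scheduled-consumer"        // placeholder
};

using var consumer = new ConsumerBuilder<Ignore, string>(config).Build();
consumer.Subscribe("my-topic");           // placeholder

while (true)
{
    var result = consumer.Consume(5000);  // timeout in milliseconds
    if (result == null)
        break;  // nothing arrived within 5 s: assume we're caught up

    // process result.Message.Value ...
}
consumer.Close();
```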
Another approach is to find the maximum offset at the time that your consumer is created, and only read up until that offset. This would theoretically prevent your consumer from running indefinitely if, by any chance, it is not consuming as fast as producers produce. But I have never tried that approach.
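That idea can be sketched with the .NET client's QueryWatermarkOffsets, which returns a partition's low and high watermarks; the high watermark is the offset one past the last message at query time. The broker address and topic name are placeholders, and for simplicity this handles a single, manually assigned partition:

```csharp
using Confluent.Kafka;

var config = new ConsumerConfig
{
    BootstrapServers = "localhost:9092",  // placeholder
    GroupId = "scheduled-consumer"        // placeholder
};

using var consumer = new ConsumerBuilder<Ignore, string>(config).Build();
var tp = new TopicPartition("my-topic", 0);  // placeholder; one partition only
consumer.Assign(tp);

// Snapshot the end of the partition before consuming anything.
var watermarks = consumer.QueryWatermarkOffsets(tp, TimeSpan.FromSeconds(5));
long end = watermarks.High.Value;  // offset one past the last message right now

while (true)
{
    var result = consumer.Consume(TimeSpan.FromSeconds(5));
    if (result == null)
        break;  // timed out before reaching the snapshot; bail out anyway

    // process result.Message.Value ...

    if (result.Offset.Value + 1 >= end)
        break;  // consumed everything that existed when we started
}
consumer.Close();
```

The snapshot bounds the run even if producers keep writing: messages appended after the query are simply left for the next scheduled run.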