Apache Kafka Streams Materializing KTables to a topic seems slow

Tags: stream, reactive-programming, apache-kafka

I'm using Kafka Streams and I'm trying to materialize a KTable into a topic.

It works, but the output only seems to be written every 30 seconds or so.

How and when does Kafka Streams decide to materialize the current state of a KTable into a topic?

Is there any way to shorten this interval and make it more "real-time"?

Here is the actual code I'm using:

// Stream of random ints: (1,1) -> (6,6) -> (3,3)
// one record every 500ms
KStream<Integer, Integer> kStream = builder.stream(Serdes.Integer(), Serdes.Integer(), RandomNumberProducer.TOPIC);

// grouping by key
KGroupedStream<Integer, Integer> byKey = kStream.groupByKey(Serdes.Integer(), Serdes.Integer());

// same behaviour with or without the TimeWindow
KTable<Windowed<Integer>, Long> count = byKey.count(TimeWindows.of(1000L),"total");

// same behaviour with only count.to(Serdes.Integer(), Serdes.Long(), RandomCountConsumer.TOPIC);
count.toStream().map((k,v) -> new KeyValue<>(k.key(), v)).to(Serdes.Integer(), Serdes.Long(), RandomCountConsumer.TOPIC);


asked Jun 23 '17 00:06

thomas.g

1 Answer

This is controlled by commit.interval.ms, which defaults to 30 seconds (30000 ms), which matches the delay you are seeing. More details here: http://docs.confluent.io/current/streams/developer-guide.html

The semantics of caching is that data is flushed to the state store and forwarded to the next downstream processor node whenever the earliest of commit.interval.ms or cache.max.bytes.buffering (cache pressure) hits.

and here:

https://cwiki.apache.org/confluence/display/KAFKA/KIP-63%3A+Unify+store+and+downstream+caching+in+streams
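
To make this concrete, here is a minimal sketch of how the two settings named above could be lowered. It only builds a plain java.util.Properties object using the literal config keys, so it runs without Kafka on the classpath; the class name, the application id, the broker address, and the values 100 and 0 are assumptions picked for illustration, not recommendations.

```java
import java.util.Properties;

public class LowLatencyStreamsConfig {

    // Builds the properties that would be passed to the KafkaStreams constructor.
    // applicationId and bootstrapServers are placeholders supplied by the caller.
    static Properties build(String applicationId, String bootstrapServers) {
        Properties props = new Properties();
        props.put("application.id", applicationId);
        props.put("bootstrap.servers", bootstrapServers);

        // Commit (and flush the cache) every 100 ms instead of the 30 s default.
        // 100 is an illustrative value, not a tuned one.
        props.put("commit.interval.ms", "100");

        // A cache size of 0 bytes disables record caching, so every KTable
        // update is forwarded downstream instead of being deduplicated first.
        props.put("cache.max.bytes.buffering", "0");
        return props;
    }

    public static void main(String[] args) {
        Properties props = build("random-count-app", "localhost:9092");
        props.forEach((key, value) -> System.out.println(key + "=" + value));
    }
}
```

Either setting alone should shorten the delay. The trade-off is that a smaller commit interval means more frequent commits, and a disabled cache means more records written to the output topic, since intermediate updates for the same key are no longer collapsed.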


answered Oct 17 '22 18:10

Michal Borowiecki

