During investigation within new features in Apache Kafka 0.9 and 0.10, we had used KStreams and KTables. There is an interesting fact that Kafka uses RocksDB internally. See Introducing Kafka Streams: Stream Processing Made Simple. RocksDB is not written in JVM compatible language, so it needs careful handling of the deployment, as it needs extra shared library (OS dependent). And here there are simple questions: <ul> <li>Why Apache Kafka Streams uses RocksDB?</li> <li>How is it possible to change it?</li> </ul> I had tried to search the answer, but I see only implicit reason, that RocksDB is very fast for operations in the range of about millions of operations per second. On the other hand, I see some DBs that are coded in Java and perhaps end to end they could do that as well as they are not going over JNI.

RocksDB is used for several (internal) reasons (as you mentioned already for example its performance). Conceptually, Kafka Streams does not need RocksDB -- it is used as internal key-value cache and any other store offering similar functionality would work, too. Comment from @miguno below (rephrased): <blockquote> One important advantage of RocksDB in contrast to pure in-memory key-value stores is its ability to write to disc. Thus, a state larger than available main memory can be supported by Kafka Streams. </blockquote> Comment from @miguno above: <blockquote> FYI: <code>"RocksDB is not written in JVM compatible language, so it needs careful handling of the deployment, as it needs extra shared library (OS dependent)."</code> As a user of Kafka Streams you don't need to install anything. </blockquote> Using Kafka Streams DSL, as of 0.10.2 release (KAFKA-3825) it's possible to plug in custom state stores and to use a different key-value store. Using Kafka Streams Processor API, you can implement your own store via <code>StateStore</code> interface and connect it to a processor node in your topology.

Why Apache Kafka Streams uses RocksDB and if how is it possible to change it?

Tags:

java-native-interface

key-value-store

apache-kafka-streams

in-memory-database

rocksdb

During investigation within new features in Apache Kafka 0.9 and 0.10, we had used KStreams and KTables. There is an interesting fact that Kafka uses RocksDB internally. See Introducing Kafka Streams: Stream Processing Made Simple. RocksDB is not written in JVM compatible language, so it needs careful handling of the deployment, as it needs extra shared library (OS dependent).

And here there are simple questions:

Why Apache Kafka Streams uses RocksDB?
How is it possible to change it?

I had tried to search the answer, but I see only implicit reason, that RocksDB is very fast for operations in the range of about millions of operations per second.

On the other hand, I see some DBs that are coded in Java and perhaps end to end they could do that as well as they are not going over JNI.

611

asked Oct 18 '16 14:10

Seweryn Habdank-Wojewódzki

1 Answers

RocksDB is used for several (internal) reasons (as you mentioned already for example its performance). Conceptually, Kafka Streams does not need RocksDB -- it is used as internal key-value cache and any other store offering similar functionality would work, too.

Comment from @miguno below (rephrased):

One important advantage of RocksDB in contrast to pure in-memory key-value stores is its ability to write to disc. Thus, a state larger than available main memory can be supported by Kafka Streams.

Comment from @miguno above:

FYI: "RocksDB is not written in JVM compatible language, so it needs careful handling of the deployment, as it needs extra shared library (OS dependent)." As a user of Kafka Streams you don't need to install anything.

Using Kafka Streams DSL, as of 0.10.2 release (KAFKA-3825) it's possible to plug in custom state stores and to use a different key-value store.

Using Kafka Streams Processor API, you can implement your own store via StateStore interface and connect it to a processor node in your topology.

154

answered Sep 29 '22 08:09

Matthias J. Sax

Related questions
                            
                                Debugging C++ library with Android Studio
                            
                                What is the 'correct' way to store a native pointer inside a Java object?
                            
                                How to target multiple architectures using NDK?
                            
                                What happens when I throw a C++ exception from a native Java method?
                            
                                Invoking JNI functions in Android package name containing underscore
                            
                                Can OpenCV for Android leverage the standard C++ Support to get native build support on Android Studio 2.2 for Windows?
                            
                                jstring(JNI) to std::string(c++) with utf8 characters
                            
                                when to use JNIEXPORT and JNICALL in Android NDK?
                            
                                In Java Swing how do you get a Win32 window handle (hwnd) reference to a window?
                            
                                Linking using g++ fails searching for -lstdc++
                            
                                Tomcat startup fails due to 'java.net.SocketException Invalid argument' on Mac OS X
                            
                                Unable to launch cygpath in android
                            
                                WARNING: ABIs [armeabi-v7a,armeabi] set by 'android.injected.build.abi' gradle flag contained 'ARMEABI' not targeted by this project
                            
                                How to create an Android RFCOMM socket without any input from the user?
                            
                                JNI - Passing large amounts of data between Java and Native code
                            
                                jni.h: no such file or directory
                            
                                Use 32-bit jni libraries on 64-bit android
                            
                                error: base operand of ‘->’ has non-pointer type ‘JNIEnv’
                            
                                Underlying technique of Android's FaceDetector
                            
                                Looking for a convenient way to call Java from C++

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With