What is considered best practice when creating topics for Apache Kafka?
Does everyone allow automatic creation of topics, or how do you do it? Do you bundle the topic-creation step with the startup of the Kafka instance?
I have a Docker-based Kafka installation which is going to be used by multiple applications. How can I keep the topic creation for every application separate from the startup of the Kafka container? Looking at Confluent's music demo, they create the topics by spinning up a new Kafka image, calling the create-topic script, and then leaving the container to die. This feels a bit "hacky", but maybe it's the only way?
Regards
Since Kafka is a distributed system, topics are partitioned and replicated across multiple nodes. Kafka treats each topic partition as a log (an ordered sequence of messages), and each message in a partition is assigned a unique offset. A topic itself is a user-defined category (or feed name) to which messages are published.
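To make the log/offset model concrete, here is a minimal Java sketch (not part of the original answer; the topic name and broker address are placeholders). The metadata returned for each send reports the partition the record landed in and its offset within that partition:

```java
import java.util.Properties;

import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerConfig;
import org.apache.kafka.clients.producer.ProducerRecord;
import org.apache.kafka.clients.producer.RecordMetadata;
import org.apache.kafka.common.serialization.StringSerializer;

public class OffsetDemo {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092"); // placeholder broker address
        props.put(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG, StringSerializer.class.getName());
        props.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG, StringSerializer.class.getName());

        try (KafkaProducer<String, String> producer = new KafkaProducer<>(props)) {
            // "demo-topic" is a placeholder; it must already exist (or
            // broker-side auto-creation must be enabled) for this to succeed.
            RecordMetadata md = producer.send(
                    new ProducerRecord<>("demo-topic", "key", "value")).get();
            // Each record lands in exactly one partition and is assigned a
            // monotonically increasing offset within that partition.
            System.out.printf("partition=%d offset=%d%n", md.partition(), md.offset());
        }
    }
}
```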
As for cluster guidelines: there is no limit on the number of topics as such, but there is an upper limit on the aggregate number of partitions across all topics. A Kafka cluster should have a maximum of 200,000 partitions across all brokers when managed by ZooKeeper, because if brokers go down, ZooKeeper needs to perform a large number of leader elections. Confluent also recommends up to 4,000 partitions per broker in your cluster.
What is considered best practice when creating topics for Apache Kafka? Does everyone allow automatic creation of topics, or how do you do it?
It depends on what you're doing. You can definitely use topic auto-creation, but then the automatically created topics will have the default broker-wide configuration in terms of partitions and replication factor (the broker settings num.partitions and default.replication.factor, both of which default to 1).
For Kafka Streams, a Confluent engineer writes that manually creating topics before starting the application is recommended:
I also want to point out, that it is highly recommended to not use auto topic create for Streams, but to manually create all input/output topics before you start your Streams application.
For more details, see http://docs.confluent.io/current/streams/developer-guide.html#managing-topics-of-a-kafka-streams-application
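To illustrate what that looks like in practice, here is a minimal Streams topology sketch (not from the linked guide); "input-topic", "output-topic", the application id, and the broker address are placeholders, and both topics are assumed to have been created beforehand, as the quote recommends:

```java
import java.util.Properties;

import org.apache.kafka.common.serialization.Serdes;
import org.apache.kafka.streams.KafkaStreams;
import org.apache.kafka.streams.StreamsBuilder;
import org.apache.kafka.streams.StreamsConfig;

public class CopyApp {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put(StreamsConfig.APPLICATION_ID_CONFIG, "copy-app"); // placeholder app id
        props.put(StreamsConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092"); // placeholder broker address
        props.put(StreamsConfig.DEFAULT_KEY_SERDE_CLASS_CONFIG, Serdes.String().getClass());
        props.put(StreamsConfig.DEFAULT_VALUE_SERDE_CLASS_CONFIG, Serdes.String().getClass());

        StreamsBuilder builder = new StreamsBuilder();
        // Per the recommendation above, both "input-topic" and
        // "output-topic" should already exist before the app starts.
        builder.stream("input-topic").to("output-topic");

        KafkaStreams streams = new KafkaStreams(builder.build(), props);
        Runtime.getRuntime().addShutdownHook(new Thread(streams::close));
        streams.start();
    }
}
```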
With regard to:
Do you bundle the topic-creation step with the startup of the Kafka instance?
Yes. If you have a Java application, you can use AdminClient in the main method of your application before you start your application. If you have some other kind of application, you can run an init script that calls bin/kafka-topics.sh before your application. If you're using Kubernetes, you can use a Kubernetes Init Container. But there are obviously any number of ways you can do this.
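For the Java route, an init step along these lines might look like the following sketch; the topic name, partition count, replication factor, and broker address are placeholders to adapt to your setup:

```java
import java.util.List;
import java.util.Properties;
import java.util.concurrent.ExecutionException;

import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.clients.admin.NewTopic;
import org.apache.kafka.common.errors.TopicExistsException;

public class TopicInit {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092"); // placeholder broker address

        try (AdminClient admin = AdminClient.create(props)) {
            // Placeholder name/partitions/replication factor; pick values
            // that suit your cluster instead of relying on broker defaults.
            NewTopic topic = new NewTopic("my-app-events", 3, (short) 1);
            try {
                admin.createTopics(List.of(topic)).all().get();
            } catch (ExecutionException e) {
                // Treat "topic already exists" as success so the init step
                // is idempotent across application restarts.
                if (!(e.getCause() instanceof TopicExistsException)) {
                    throw e;
                }
            }
        }

        // ...now start the rest of the application.
    }
}
```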
This feels a bit "hacky", but maybe it's the only way?
I don't think this is hacky. Pretty normal to have init steps, I think.
Finally, also note that you may want to configure the retention policy on your topics. This can be done with broker-wide defaults or on a per-topic basis: https://stackoverflow.com/a/48504305/741970.
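If you create topics programmatically as above, per-topic retention can also be set at creation time. A small sketch, with a placeholder topic definition and seven-day retention:

```java
import java.util.Map;

import org.apache.kafka.clients.admin.NewTopic;
import org.apache.kafka.common.config.TopicConfig;

public class RetentionTopics {
    // Builds a topic definition with an explicit retention; the topic name,
    // partition count, replication factor, and 7-day value are placeholders.
    static NewTopic eventsTopicWithRetention() {
        long sevenDaysMs = 7L * 24 * 60 * 60 * 1000;
        return new NewTopic("my-app-events", 3, (short) 1)
                // retention.ms controls how long the broker keeps messages
                // in this topic before they become eligible for deletion.
                .configs(Map.of(TopicConfig.RETENTION_MS_CONFIG, Long.toString(sevenDaysMs)));
    }
}
```

The resulting NewTopic can be passed to AdminClient.createTopics exactly as in the earlier sketch.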
Another option for managing topics is to take a declarative, GitOps approach. This separates topic management from the runtime of applications completely. Whether this constitutes "best practice" or not depends, I think, on the situation; for some use cases and teams it can work very well.
See the following question/answer for tooling supporting this approach.
How to declaratively manage Kafka topics?