I'm running a Kafka streaming application in a Docker container. For testing purposes I have a docker-compose file that runs the streaming application, a single instance of Kafka, and ZooKeeper. The configuration for both Kafka and ZooKeeper has worked before.
It takes upwards of 5 minutes for the streaming application to be assigned partitions. If I delay starting the stream container until Kafka and ZooKeeper are up and the topic the streaming application consumes has been created properly, it is assigned partitions almost instantly.
It seems like the Kafka Streams consumer group is being created, but the application is being assigned no partitions, presumably because the topic hasn't been fully created yet. Partitions are not assigned until the next rebalance generation, which takes almost exactly 5 minutes.
In my (limited) understanding of the situation, I have a few options for decreasing this delay. However, I realize I might be missing something obvious, given my limited knowledge in this area.
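For illustration, the kind of "wait until the topic exists" delay I describe above could also live inside the application itself rather than in the compose orchestration. This is only a rough sketch, not what I currently do; the bootstrap address kafka:9092, the topic name kafka-queue, and the timeout are assumptions taken from the compose file below.

import java.util.Properties;
import java.util.Set;
import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.AdminClientConfig;

public class TopicReadyCheck {

    // Blocks until the given topic shows up in the broker metadata,
    // polling once per second up to a timeout.
    static void waitForTopic(String bootstrapServers, String topic, int timeoutSeconds) throws Exception {
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, bootstrapServers);
        try (AdminClient admin = AdminClient.create(props)) {
            for (int i = 0; i < timeoutSeconds; i++) {
                Set<String> topics = admin.listTopics().names().get();
                if (topics.contains(topic)) {
                    return;
                }
                Thread.sleep(1000);
            }
            throw new IllegalStateException("Topic " + topic + " not created within " + timeoutSeconds + "s");
        }
    }

    public static void main(String[] args) throws Exception {
        // "kafka:9092" and "kafka-queue" are placeholders matching the compose file below.
        waitForTopic("kafka:9092", "kafka-queue", 120);
        // ...then build and start the Kafka Streams topology as usual.
    }
}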
EDIT: docker-compose file for reference
version: "3.3"
services:
  kafka-stream-ingestor:
    build:
      context: .
      dockerfile: Dockerfile
      args:
        - version
    networks:
      - services
  zookeeper:
    image: wurstmeister/zookeeper
    ports:
      - 2181:2181
    networks:
      - services
  kafka:
    image: wurstmeister/kafka:latest
    ports:
      - 9094:9094
      - 9092:9092
    environment:
      KAFKA_ADVERTISED_HOST_NAME: ${DOCKER_KAFKA_HOST}
      KAFKA_ZOOKEEPER_CONNECT: zookeeper:2181
      KAFKA_LISTENER_SECURITY_PROTOCOL_MAP: INSIDE:PLAINTEXT,OUTSIDE:PLAINTEXT
      KAFKA_ADVERTISED_PROTOCOL_NAME: OUTSIDE
      KAFKA_ADVERTISED_PORT: 9094
      # creates topic "kafka-queue" with 12 partitions and replication factor 1
      KAFKA_CREATE_TOPICS: "kafka-queue:12:1"
      KAFKA_PROTOCOL_NAME: INSIDE
      KAFKA_PORT: 9092
    volumes:
      - /var/run/docker.sock:/var/run/docker.sock
    networks:
      - services

networks:
  services:

volumes:
  testresult:
I've found a temporary workaround that works under limited circumstances (it's only needed for local testing and integration tests). I will not mark this as solved, to allow for better answers.
Essentially, the stream app requests topic metadata before the partitions are ready. Kafka says 'there are no partitions yet', the app concludes 'okay, there are no partitions to assign', and then waits a (configurable) amount of time until the cached metadata is considered stale. It then makes another metadata request to Kafka, which by this point has created the partitions.
The configuration that dictates this refresh interval is kafka.metadata.max.age.ms (the underlying Kafka client property is metadata.max.age.ms). Its default is 300000 ms, i.e. 5 minutes, which matches the delay I was seeing. I set it to 1000 ms.
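For reference, in a plain Java Kafka Streams application the setting can be passed to the embedded consumer roughly as in the sketch below. The application id, bootstrap address, topic name, and String serdes are assumptions based on my compose file, not part of the original setup.

import java.util.Properties;
import org.apache.kafka.clients.consumer.ConsumerConfig;
import org.apache.kafka.common.serialization.Serdes;
import org.apache.kafka.streams.KafkaStreams;
import org.apache.kafka.streams.StreamsBuilder;
import org.apache.kafka.streams.StreamsConfig;

public class IngestorApp {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put(StreamsConfig.APPLICATION_ID_CONFIG, "kafka-stream-ingestor");
        props.put(StreamsConfig.BOOTSTRAP_SERVERS_CONFIG, "kafka:9092");
        // String serdes are just a placeholder for whatever the real app uses.
        props.put(StreamsConfig.DEFAULT_KEY_SERDE_CLASS_CONFIG, Serdes.String().getClass());
        props.put(StreamsConfig.DEFAULT_VALUE_SERDE_CLASS_CONFIG, Serdes.String().getClass());

        // Refresh cluster metadata every second instead of the default 5 minutes,
        // so the consumer notices newly created partitions quickly.
        props.put(StreamsConfig.consumerPrefix(ConsumerConfig.METADATA_MAX_AGE_CONFIG), "1000");

        StreamsBuilder builder = new StreamsBuilder();
        builder.stream("kafka-queue").foreach((key, value) -> { /* ... actual processing ... */ });

        KafkaStreams streams = new KafkaStreams(builder.build(), props);
        streams.start();
        Runtime.getRuntime().addShutdownHook(new Thread(streams::close));
    }
}

With the metadata age lowered to one second, the consumer re-fetches metadata shortly after the topic is auto-created and picks up its partitions at the next rebalance instead of waiting out the 5-minute default.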