This question is part of my continuing exploration of Docker and in some ways follows up on one of my earlier questions. I have now understood how one can get a full application stack (effectively a mini VPS) working by linking together a bunch of Docker containers. For example one could create a stack that provides Apache + PHP5 with a sheaf of extensions + Redis + MemCached + MySQL all running on top of Ubuntu with or without an additional data container to make it easy to serialize user data. All very nice and elegant. However, I cannot but help wonder... . 5 containers to run that little VPS (I count 5 not 6 since Apache + PHP5 go into one container). So suppose I have 100 such VPSs running? That means I have 500 containers running! I understand the arguments here - it is easy to compose new app stacks, update one component of the stack etc. But are there no unnecessary overheads to operating this way? Suppose I did this <ul> <li>Put all my apps inside one container</li> <li> Write up a little shell script !/bin/bash service memcached start service redis-server start .... service apache2 start while: do : done </li> </ul> In my Dockerfile I have <pre class="prettyprint"><code>ADD start.sh /usr/local/bin/start.sh RUN chmod +x /usr/local/bin/start.sh .... ENTRYPOINT ["/bin/bash"] CMD ["/usr/local/bin/start.sh"] </code></pre> I then get that container up & running <pre class="prettyprint"><code>docker run -d -p 8080:80 -v /var/droidos/site:/var/www/html -v /var/droidos/logs:/var/log/apache2 droidos/minivps </code></pre> and I am in business. Now when I want to shut down that container programmatically I can do so by executing one single docker command. There are many questions of a similar nature to be found when one Google's for them. Apart from the arguments I have reproduced above one of the commonest reasons given for the one-app-per-container approach is "that is the way Docker is designed to work". What I would like to know <ul> <li>What are the downsides to running x100 instances of N linked containers - tradeoffs by way of speed, memory usage etc on the host?</li> <li>What is wrong with what I have done here?</li> </ul>

@Bryan's answer is solid, particularly in relation to the overheads of a container that just runs one process being low. That said, you should at least read the arguments at https://phusion.github.io/baseimage-docker/, which makes a case for having containers with multiple processes. Without them, docker is light on provision for: <ul> <li>process supervision</li> <li>cron jobs</li> <li>syslog</li> </ul> baseimage-docker runs an init process which fires up a few processes besides the main one in the container. For some purposes this is a good idea, but also be aware that for instance having a cron daemon and a syslog daemon per container adds up a bit more overhead. I expect that as the docker ecosystem matures we'll see better solutions that don't require this.

Running multiple applications in one docker container

Tags:

docker

This question is part of my continuing exploration of Docker and in some ways follows up on one of my earlier questions. I have now understood how one can get a full application stack (effectively a mini VPS) working by linking together a bunch of Docker containers. For example one could create a stack that provides Apache + PHP5 with a sheaf of extensions + Redis + MemCached + MySQL all running on top of Ubuntu with or without an additional data container to make it easy to serialize user data.

All very nice and elegant. However, I cannot but help wonder... . 5 containers to run that little VPS (I count 5 not 6 since Apache + PHP5 go into one container). So suppose I have 100 such VPSs running? That means I have 500 containers running! I understand the arguments here - it is easy to compose new app stacks, update one component of the stack etc. But are there no unnecessary overheads to operating this way?

Suppose I did this

Put all my apps inside one container
Write up a little shell script

!/bin/bash service memcached start service redis-server start .... service apache2 start while: do : done

In my Dockerfile I have

ADD start.sh /usr/local/bin/start.sh
RUN chmod +x /usr/local/bin/start.sh

....
ENTRYPOINT ["/bin/bash"]
CMD ["/usr/local/bin/start.sh"]

I then get that container up & running

docker run -d -p 8080:80 -v /var/droidos/site:/var/www/html -v /var/droidos/logs:/var/log/apache2 droidos/minivps

and I am in business. Now when I want to shut down that container programmatically I can do so by executing one single docker command.

There are many questions of a similar nature to be found when one Google's for them. Apart from the arguments I have reproduced above one of the commonest reasons given for the one-app-per-container approach is "that is the way Docker is designed to work". What I would like to know

What are the downsides to running x100 instances of N linked containers - tradeoffs by way of speed, memory usage etc on the host?
What is wrong with what I have done here?

727

asked Dec 27 '14 05:12

DroidOS

2 Answers

A container is basically a process. There is no technical issue with running 500 processes on a decent-sized Linux system, although they will have to share the CPU(s) and memory.

The cost of a container over a process is some extra kernel resources to manage namespaces, file systems and control groups, and some management structures inside the Docker daemon, particularly to handle stdout and stderr.

The namespaces are introduced to provide isolation, so that one container does not interfere with any others. If your groups of 5 containers form a unit that does not need this isolation then you can share the network namespace using --net=container. There is no feature at present to share cgroups, AFAIK.

What is wrong with what you suggest:

it is not "the Docker way". Which may not be important to you.
you have to maintain the scripting to get it to work, worry about process restart, etc., as opposed to using an orchestrator designed for the task
you will have to manage conflicts in the filesystem, e.g. two processes need different versions of a library, or they both write to the same output file
stdout and stderr will be intermingled for the five processes

117

answered Oct 23 '22 03:10

Bryan

@Bryan's answer is solid, particularly in relation to the overheads of a container that just runs one process being low.

That said, you should at least read the arguments at https://phusion.github.io/baseimage-docker/, which makes a case for having containers with multiple processes. Without them, docker is light on provision for:

process supervision
cron jobs
syslog

baseimage-docker runs an init process which fires up a few processes besides the main one in the container.

For some purposes this is a good idea, but also be aware that for instance having a cron daemon and a syslog daemon per container adds up a bit more overhead. I expect that as the docker ecosystem matures we'll see better solutions that don't require this.

answered Oct 23 '22 03:10

mc0e

Related questions
                            
                                "Address already in use" error upon docker-compose up
                            
                                docker container exits immediately even with Console.ReadLine() in a .NET Core console application
                            
                                Pip install -e packages don't appear in Docker
                            
                                How do I fix my docker-compose.yml? - expected <block end>, but found '<block mapping start>'
                            
                                Failed to create endpoint on network nat: hnsCall failed in Win32: The process cannot access the file
                            
                                Embed sqlite database to docker container?
                            
                                get label value from docker inspect [duplicate]
                            
                                './docker-compose.yml', service must be a mapping, not a NoneType
                            
                                Docker push to AWS ECR private repo failing with malformed JSON
                            
                                Docker load and save: "archive/tar: invalid tar header"
                            
                                Create kubernetes docker-registry secret from yaml file?
                            
                                Access to mysql container from other container
                            
                                Upgrade docker on CentOS 7
                            
                                Install python package in docker file
                            
                                Importing self-signed cert into Docker's JRE cacert is not recognized by the service
                            
                                how do I clean up my docker host machine
                            
                                How to idiomatically access sensitive data when building a Docker image?
                            
                                Is there a way to view the Kubernetes image download progress during pod initialization?
                            
                                Jenkins GitHub plugin can't choose my credentials
                            
                                Capturing output of python script run inside a docker container

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With