Persisting data in a docker swarm with glusterfs

Tags:

I have a docker swarm with a lot of containers, but in particolar:

mysql
mongodb
fluentd
elasticsearch

My problem is that when a node fails, the manager discards the current container and creates a new one in another node. So everytime i lost the persisting data stored in that particular container even using docker volumes.

So i would create four distributed glusterfs volumes over my cluster, and mount them as docker volumes into my containers.

Is this a correct way to resolve my problem?

If it is, what type of filesystem should i use for my glusterfs volumes?

Are there perfomance problems with this approch?

914

asked Jan 18 '18 21:01

Antonio Caristia

1 Answers

GlusterFS would not be the correct way to resolve this for all of your containers since Gluster does not support "structured data", as stated in the GlusterFS Install Guide:

Gluster does not support so called “structured data”, meaning live, SQL databases. Of course, using Gluster to backup and restore the database would be fine - Gluster is traditionally better when using file sizes at of least 16KB (with a sweet spot around 128KB or so).

One solution to this would be master slave replication for the data in your databases. MySQL and mongoDB both support this (as described here and here), as do most common DBMSs.

Master slave replication is basically where for 2 or more copies of your database, one will be the master and the rest will be slaves. All write operations happen on the master, and all read operations happen on the slaves. Any data written to the master will be replicated across the slaves, by the master. Some DBMSs also provide a way to check if the master goes down and elect a new master if this happens, but I don't think all DBMSs do this.

You could alternatively set up a Galera Cluster, but as far as I'm aware this only supports MySQL.

I would have thought you could use GlusterFS for Fluentd and Elasticsearch, but I'm not familiar with either of those so I couldn't say for certain. I imagine it would depend on how they store any data they collect (if they collect any at all).

answered Sep 19 '22 00:09

benjilev08

Related questions
                            
                                How to build an Image using Docker API Python Client?
                            
                                Understanding Docker in Production
                            
                                List of used files in the Docker context
                            
                                Create neo4j databse from backup inside neo4j docker
                            
                                Alpine Linux docker set hostname
                            
                                docker - driver "devicemapper" failed to remove root filesystem after process in container killed
                            
                                Persisting content across docker restart within an Azure Web App
                            
                                How to setup laravel with npm using docker-compose?
                            
                                GitLab CI docker in docker can't create volume
                            
                                Docker stat network traffic
                            
                                Set environment variables in Docker
                            
                                Download speed for pip install inside docker container very slow
                            
                                Programmatically add a service to docker compose project
                            
                                Cannot stop 10 containers after Kubernetes minikube tutorial
                            
                                Hosting Jenkins on Kubernetes while using docker.sock
                            
                                Unable to connect outside database from Docker container App
                            
                                Show volume files in the GUI of Docker Jupyter notebook
                            
                                Cannot restart Docker container due to input/output error?
                            
                                Stall when debugging with gdbserver in VSCode - "The preLaunchTask 'docker gdb' cannot be tracked."
                            
                                Kubernetes pod cpu usage calculation method for HPA

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Persisting data in a docker swarm with glusterfs

Tags:

docker

docker-swarm

glusterfs

Antonio Caristia

People also ask

1 Answers

benjilev08

Recent Activity

Donate For Us