i am relatively new to docker. I'd like to set up a postgres database but I wonder how to make sure that the data isn't being lost if I recreated the container. Then I stumbled over named volumes (not bind volumes) and how to use them. But... in a Dockerfile you can't use named volumes. E.g. data:/var/lib etc. As I understood using a Dockerfile it's always an anonymous volume. So every single time I'd recreate a container it would get its own new volume. So here comes my question: Firstly: how do I make sure, if the container get's updated or recreated that the postgres database from within the new container references to the same data and not losing the reference to the previously created anonymous volume. Secondly: how does this work with a yml file? is it possible to reference multiple replicas of such a database container to one volume? (High Availability Mode)? It would really be great if someone could get me a hint or best practices. Thank you in advance.

Looking at the Dockerfile for Postgres, you see that it declares a volume instruction: <pre class="prettyprint"><code>VOLUME /var/lib/postgresql/data </code></pre> Everytime you run a new Postgres container, without specifying a <code>--volume</code> option, docker automatically creates a new volume. The volume is given a random name. You can see all volumes by running the command: <pre class="prettyprint"><code>docker volume ls </code></pre> You can also inspect the files stored on the host by the volume, by inspecting the host path using: <pre class="prettyprint"><code>docker volume inspect <volume-name> </code></pre> So when you don't specify the <code>--volume</code> option for the run command, docker create volumes for all volumes declared in the Dockerfile. This is mainly a safety if you forget to name your volume and the data shouldn't be lost. <blockquote> Firstly: how do I make sure, if the container get's updated or recreated that the postgres database from within the new container references to the same data and not losing the reference to the previously created anonymous volume. </blockquote> If you want docker to use the same volume, you need to specify the <code>--volume</code> option. Once specified, docker won't create a new volume and it will simply mount the existing volume onto the specified folder in the docker command. As a best practice, name your volumes that have valuable data. For example: <pre class="prettyprint"><code>docker run --volume postgresData:/var/lib/postgresql/data ... </code></pre> If you run this command for the first time the volume <code>postgresData</code> will be created and will backup <code>/var/lib/postgresql/data</code> on the host. The second time you run it the same data backed up on the host will be mounted onto the container. <blockquote> Secondly: how does this work with a yml file? is it possible to reference multiple replicas of such a database container to one volume? </blockquote> Yes, volumes can be shared between multiple containers. You can mount the same volume onto multiple containers, and the containers will use the same files. Docker compose allows you to do that ... However, beware that volumes are limited to the host they were created. When running containers on multiple machines, the volume needs to be accessible from all the machines. There are ways/tools to achieve that but they are a bit complex. This is still a limitation to be addressed in Docker.

dockerized postgresql with volumes

Tags:

docker

postgresql

volumes

i am relatively new to docker. I'd like to set up a postgres database but I wonder how to make sure that the data isn't being lost if I recreated the container.

Then I stumbled over named volumes (not bind volumes) and how to use them. But... in a Dockerfile you can't use named volumes. E.g. data:/var/lib etc. As I understood using a Dockerfile it's always an anonymous volume. So every single time I'd recreate a container it would get its own new volume.

So here comes my question:

Firstly: how do I make sure, if the container get's updated or recreated that the postgres database from within the new container references to the same data and not losing the reference to the previously created anonymous volume.

Secondly: how does this work with a yml file? is it possible to reference multiple replicas of such a database container to one volume? (High Availability Mode)?

It would really be great if someone could get me a hint or best practices.

Thank you in advance.

208

asked Nov 23 '17 14:11

Chris

1 Answers

Looking at the Dockerfile for Postgres, you see that it declares a volume instruction:

VOLUME /var/lib/postgresql/data

Everytime you run a new Postgres container, without specifying a --volume option, docker automatically creates a new volume. The volume is given a random name.

You can see all volumes by running the command:

docker volume ls

You can also inspect the files stored on the host by the volume, by inspecting the host path using:

docker volume inspect <volume-name>

So when you don't specify the --volume option for the run command, docker create volumes for all volumes declared in the Dockerfile. This is mainly a safety if you forget to name your volume and the data shouldn't be lost.

Firstly: how do I make sure, if the container get's updated or recreated that the postgres database from within the new container references to the same data and not losing the reference to the previously created anonymous volume.

If you want docker to use the same volume, you need to specify the --volume option. Once specified, docker won't create a new volume and it will simply mount the existing volume onto the specified folder in the docker command.

As a best practice, name your volumes that have valuable data. For example:

docker run --volume postgresData:/var/lib/postgresql/data ...

If you run this command for the first time the volume postgresData will be created and will backup /var/lib/postgresql/data on the host. The second time you run it the same data backed up on the host will be mounted onto the container.

Secondly: how does this work with a yml file? is it possible to reference multiple replicas of such a database container to one volume?

Yes, volumes can be shared between multiple containers. You can mount the same volume onto multiple containers, and the containers will use the same files. Docker compose allows you to do that ...

However, beware that volumes are limited to the host they were created. When running containers on multiple machines, the volume needs to be accessible from all the machines. There are ways/tools to achieve that but they are a bit complex. This is still a limitation to be addressed in Docker.

168

answered Sep 25 '22 05:09

yamenk

Related questions
                            
                                What is a row constructor used for?
                            
                                SQL to get the 2nd (nth) record for each group (postgresql or mysql)
                            
                                How to select the latest entry from database with Ecto/Phoenix
                            
                                Update PostgreSQL hstore field with sql variable
                            
                                Transactions are auto committed on PostgreSQL 9.5.2 with no option to change it?
                            
                                Using WITH + DELETE clause in a single query in postgresql
                            
                                SQLAlchemy func.count on boolean column
                            
                                Rails console can't connect to database but rake tasks can
                            
                                Rename all columns from all tables with specific column name in PostgreSQL?
                            
                                PostgreSQL multi-row updates in Node.js
                            
                                Use Ecto to generate_series in postgres and also retrieve Null-values as “0”
                            
                                How to sanitize input data in golang?
                            
                                Need to insert struct directly in a PostgreSQL DB
                            
                                PostgreSQL `analyse` vs `analyze`
                            
                                python & postgresql: reliably check for updates in a specific table
                            
                                How to copy a csv file from a url to Postgresql
                            
                                PostgreSQL, regex to match text fields with numeric values
                            
                                How to recreate Docker container?
                            
                                Get million record from django with queryset is slow
                            
                                DBeaver on OSX - null connection returned

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With