How to handle DB migration using AWS deployment tools

Amazon Web Services offers a number of continuous deployment and management tools, such as Elastic Beanstalk, OpsWorks, CloudFormation and CodeDeploy, to suit different needs. The basic idea is to facilitate code deployment and upgrades with zero downtime. They also help you follow architectural best practices when using AWS resources.

For simplicity, let's assume a basic two-tier architecture: a collection of application servers behind a load balancer, and a persistence layer using a Multi-AZ RDS DB.

The actual code upgrade across a fleet of instances (app servers) is easy to understand. As a very simplistic overview, the AWS service upgrades each node in turn, handing its connections off to other nodes so that the instance being upgraded is not in use.
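To make the rolling upgrade concrete, here is a minimal Python sketch. It is only a toy model: the function name `rolling_upgrade` and the list-of-version-strings representation of the fleet are assumptions made for illustration, not anything an AWS service exposes. It shows that the fleet runs a mix of versions after every step except the last.

```python
def rolling_upgrade(fleet, new_version):
    """Upgrade one node at a time, yielding a snapshot of the fleet after each step.

    `fleet` is a list of version strings, one per app server (hypothetical model).
    """
    for index in range(len(fleet)):
        # In a real deployment the node would be drained from the load
        # balancer here, upgraded, and then put back into rotation.
        fleet[index] = new_version
        yield list(fleet)


snapshots = list(rolling_upgrade(["1.0.0", "1.0.0", "1.0.0"], "2.0.0"))

# Count the intermediate states in which both versions are serving traffic.
mixed_states = [s for s in snapshots if len(set(s)) > 1]
```

With three nodes, two of the three snapshots contain both 1.0.0 and 2.0.0 nodes, which is the window the rest of this question is about.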

However, I can't understand how DB upgrades are managed. Assume that we are going from version 1.0.0 to 2.0.0 of an application and that there is a requirement to change the DB structure. Normally you would use a script or a library like Flyway to perform the upgrade. However, if there is a fleet of servers to upgrade, there is a point where both 1.0.0 and 2.0.0 applications exist across the fleet, each requiring a different DB structure.

I need to understand how this is actually achieved (at a high level) to know the best way and time to perform the DB migration. I guess there are a couple of ways they could be achieving this, but I am struggling to see how they can do it and still allow both 1.0.0 and 2.0.0 to persist data without loss.

One possibility is that they migrate the DB structure with the first app node upgrade and, at the same time, create a cached copy of the 1.0.0 DB. Users connected to a 1.0.0 app persist to the cached copy, and users connected to a 2.0.0 app persist to the newly migrated DB. Once all the app nodes are upgraded, the cached data is merged into the migrated DB.

It seems unlikely they can do this, as the merge would be pretty complex, but I can't see another way. Any pointers or help would be appreciated.

asked Feb 15 '15 09:02

tarka

1 Answer

This is a common problem once your application infrastructure grows to multiple application nodes. In the olden days, you could take your application offline for a "maintenance window", during which you would:

Replace the application with a "System Maintenance, back soon" page
Perform database migrations (schema and/or data)
Deploy the new application code
Put the application back online

In 2015, and really for several years before that, this approach has not been acceptable. Your users expect 24/7 operation, so there must be a better way. Of course there is: the answer is a series of patterns known as database refactorings.

The basic concept to keep in mind is that you must always be able to run two concurrent versions of your application, with no breaking changes between them. This means you have one version (v1.0.0) currently in production and another (v2.0.0) scheduled to be deployed, and both must work on the same schema. Once v2.0.0 is fully deployed across all application servers, you can then develop v3.0.0, which allows you to complete any final database changes.
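As an illustration of one such refactoring, here is a minimal Python sketch using the standard-library `sqlite3` module. Everything specific in it is an assumption made for the example: the `users` table, the split of `name` into `first_name` and `last_name`, and the function names are hypothetical, and SQLite stands in for RDS. The idea is that the v2.0.0 migration only adds nullable columns, v2.0.0 code writes both the old and the new columns, and the old column is only dropped in a later release once no v1.0.0 nodes remain.

```python
import sqlite3


def create_v1_schema(conn):
    # Schema as v1.0.0 expects it (hypothetical example table).
    conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT NOT NULL)")


def expand_schema(conn):
    # Additive change only: the new columns are nullable, so v1.0.0
    # INSERTs that do not mention them keep working.
    conn.execute("ALTER TABLE users ADD COLUMN first_name TEXT")
    conn.execute("ALTER TABLE users ADD COLUMN last_name TEXT")


def v1_save_user(conn, name):
    # v1.0.0 code path: knows only about the old column.
    conn.execute("INSERT INTO users (name) VALUES (?)", (name,))


def v2_save_user(conn, first_name, last_name):
    # v2.0.0 code path: writes the new columns AND the old one,
    # so rows stay readable by any v1.0.0 node still in the fleet.
    conn.execute(
        "INSERT INTO users (name, first_name, last_name) VALUES (?, ?, ?)",
        (f"{first_name} {last_name}", first_name, last_name),
    )


def backfill(conn):
    # Fill the new columns for rows written by v1.0.0 nodes.
    rows = conn.execute(
        "SELECT id, name FROM users WHERE first_name IS NULL"
    ).fetchall()
    for user_id, name in rows:
        first, _, last = name.partition(" ")
        conn.execute(
            "UPDATE users SET first_name = ?, last_name = ? WHERE id = ?",
            (first, last, user_id),
        )


def run_demo():
    conn = sqlite3.connect(":memory:")
    create_v1_schema(conn)
    v1_save_user(conn, "Ada Lovelace")      # before the deployment starts
    expand_schema(conn)                     # migration shipped with v2.0.0
    v1_save_user(conn, "Alan Turing")       # an old node is still serving
    v2_save_user(conn, "Grace", "Hopper")   # a new node is already serving
    backfill(conn)                          # after the fleet is fully on v2.0.0
    return conn.execute(
        "SELECT name, first_name, last_name FROM users ORDER BY id"
    ).fetchall()
```

Dropping the old `name` column is deliberately left out of the sketch: that is the kind of final change that would wait for v3.0.0, when no deployed version reads or writes it any more.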

answered Oct 08 '22 17:10

Mikelax


Tags:

amazon-web-services

amazon-elastic-beanstalk

amazon-cloudformation

aws-opsworks

aws-code-deploy
