Can Amazon Glacier mirror an Amazon S3 bucket?

Tags:

amazon-s3

I'd like to mirror an S3 bucket with Amazon Glacier.

The Glacier FAQ states:

Amazon S3 now provides a new storage option that enables you to utilize Amazon Glacier’s extremely low-cost storage service for data archiving. You can define S3 lifecycle rules to automatically archive sets of Amazon S3 objects to Amazon Glacier to reduce your storage costs. You can learn more by visiting the Object Lifecycle Management topic in the Amazon S3 Developer Guide.

This is close, but I'd like to mirror. I do not want to delete the content on S3, only copy it to Glacier.

Is it possible to set this up automatically with AWS?

Or does the content need to be uploaded to Glacier manually?

987

asked Mar 10 '13 18:03

3 Answers

It is now possible to achieve an S3-to-Glacier "mirror" in two steps. First, create a cross-region replication bucket on Amazon S3; this replication bucket will be a mirror of your original bucket (see http://docs.aws.amazon.com/AmazonS3/latest/dev/crr.html). Then set up a lifecycle rule on the replication bucket to move its data to Glacier.
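The two steps above can be sketched with boto3. This is only an illustration under stated assumptions: both buckets already exist, an IAM role that S3 may assume for replication already exists, and the rule IDs, the function names, and the `days` default are hypothetical choices, not anything prescribed by the answer. The two builder functions only assemble the request dictionaries; `configure_mirror` is the part that actually talks to AWS.

```python
def glacier_lifecycle_rules(days=1, prefix=""):
    """Build a lifecycle configuration that moves objects to Glacier."""
    return {
        "Rules": [
            {
                "ID": "archive-to-glacier",  # hypothetical rule name
                "Filter": {"Prefix": prefix},  # empty prefix = whole bucket
                "Status": "Enabled",
                "Transitions": [{"Days": days, "StorageClass": "GLACIER"}],
            }
        ]
    }


def replication_rules(role_arn, replica_bucket):
    """Build a replication configuration that mirrors the whole bucket."""
    return {
        "Role": role_arn,  # assumed: a role S3 is allowed to assume
        "Rules": [
            {
                "ID": "mirror-to-replica",  # hypothetical rule name
                "Prefix": "",  # empty prefix = replicate every key
                "Status": "Enabled",
                "Destination": {"Bucket": "arn:aws:s3:::" + replica_bucket},
            }
        ],
    }


def configure_mirror(source_bucket, replica_bucket, role_arn, days=1):
    """Apply both configurations. Requires boto3 and AWS credentials."""
    import boto3  # imported here so the builders work without boto3

    s3 = boto3.client("s3")

    # Replication only works when versioning is enabled on both buckets.
    for bucket in (source_bucket, replica_bucket):
        s3.put_bucket_versioning(
            Bucket=bucket,
            VersioningConfiguration={"Status": "Enabled"},
        )

    # Step 1: mirror the source bucket into the replica bucket.
    s3.put_bucket_replication(
        Bucket=source_bucket,
        ReplicationConfiguration=replication_rules(role_arn, replica_bucket),
    )

    # Step 2: move the replica's objects to Glacier; the source is untouched.
    s3.put_bucket_lifecycle_configuration(
        Bucket=replica_bucket,
        LifecycleConfiguration=glacier_lifecycle_rules(days),
    )
```

Calling `configure_mirror("my-bucket", "my-bucket-replica", role_arn)` with your own names would apply both settings. Keep in mind that a replication rule covers objects written after it is enabled, so content already in the bucket may need a one-off copy.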

116

answered Oct 22 '22 04:10

Jordan Magnuson

Amazon doesn't offer this feature through its API. We had the same problem and solved it by running a daily cron job that re-uploads files to Glacier.

Here is a snippet you can run with Python and boto to copy a file to a Glacier vault. Note that it only uploads a local file: you have to download the file from S3 first (using s3cmd, for instance) before running it.

import boto.glacier.layer2

# Set up your AWS key and secret, and vault name
aws_key = "AKIA1234"
aws_secret = "ABC123"
glacierVault = "someName"

# Assumption is that this file has been downloaded from S3
fileName = "localfile.tgz"

try:
    # Connect to Glacier through boto
    layer2 = boto.glacier.layer2.Layer2(aws_access_key_id=aws_key, aws_secret_access_key=aws_secret)

    # Get your Glacier vault
    vault = layer2.get_vault(glacierVault)

    # Upload file using concurrent upload (so large files are OK).
    # Depending on your boto version, this call may also need a
    # description argument.
    archiveID = vault.concurrent_create_archive_from_file(fileName)

    # Append this archiveID to a local file, that way you remember what file
    # in Glacier corresponds to a local file. Glacier has no concept of files.
    with open("glacier.txt", "a") as index:
        index.write(fileName + " " + archiveID + "\n")
except Exception as e:
    # Report the reason instead of silently swallowing every error
    print("Could not upload gzipped file to Glacier: %s" % e)
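Since the cron job runs daily, it may help to read the `glacier.txt` index back so that files already archived are not uploaded again. The following is a rough sketch of that idea, not part of the original job: the helper names are made up for illustration, and it assumes each line has the form `fileName archiveID` and that archive IDs contain no spaces.

```python
import os


def load_archive_index(index_path="glacier.txt"):
    """Return {file_name: archive_id} parsed from the local index file."""
    index = {}
    if not os.path.exists(index_path):
        return index  # nothing has been uploaded yet
    with open(index_path) as handle:
        for line in handle:
            line = line.strip()
            # Split on the last space so file names with spaces still work
            # (assumes the archive ID itself contains no spaces).
            parts = line.rsplit(" ", 1)
            if len(parts) != 2:
                continue  # skip blank or malformed lines
            file_name, archive_id = parts
            index[file_name] = archive_id
    return index


def files_needing_upload(candidates, index):
    """Keep only the files that have no archive ID recorded yet."""
    return [name for name in candidates if name not in index]
```

The cron job could call `files_needing_upload(downloaded_files, load_archive_index())` and run the upload snippet only for the names it returns.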

answered Oct 22 '22 05:10

Suman

This is done via a lifecycle policy, but once an object is archived it is no longer directly available in S3. To keep it, you can duplicate it into a separate bucket.
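Duplicating the content into a separate bucket could look roughly like the sketch below. It assumes a boto3 S3 client is passed in and that the second bucket already exists; `copy_bucket` is a hypothetical helper name. The lifecycle rule would then be applied to only one of the two buckets.

```python
def copy_bucket(client, source_bucket, backup_bucket):
    """Copy every object from source_bucket to backup_bucket.

    `client` is expected to be a boto3 S3 client, e.g. boto3.client("s3").
    Returns the list of keys that were copied.
    """
    copied = []
    # list_objects_v2 returns results in pages, so use a paginator.
    paginator = client.get_paginator("list_objects_v2")
    for page in paginator.paginate(Bucket=source_bucket):
        # Pages of an empty bucket carry no "Contents" entry.
        for obj in page.get("Contents", []):
            key = obj["Key"]
            # Managed copy: same key, different bucket.
            client.copy({"Bucket": source_bucket, "Key": key}, backup_bucket, key)
            copied.append(key)
    return copied
```

This is a one-off copy; it does not keep the two buckets in sync when new objects arrive later.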

answered Oct 22 '22 05:10

Ahmed Al Hafoudh

Tags:

amazon-web-services

amazon-s3

Justin Tanner
