How long does it take for AWS S3 to save and load an item?

Tags:

amazon-web-services

amazon-s3

data-consistency

The S3 FAQ mentions that "Amazon S3 buckets in all Regions provide read-after-write consistency for PUTS of new objects and eventual consistency for overwrite PUTS and DELETES." However, I don't know how long it takes to reach eventual consistency. I searched for this but couldn't find an answer in the S3 documentation.

Situation:

We have a website that consists of 7 steps. When the user clicks Save on each step, we want to save a JSON document (containing the information for all 7 steps) to Amazon S3. Currently we plan to:

1. Create a single S3 bucket to store all JSON documents.
2. When the user saves step 1, we create a new item in S3.
3. When the user saves steps 2-7, we overwrite the existing item.
4. After the user saves a step and refreshes the page, they should see the information they just saved, i.e. we want to make sure that we always read after write.

The full JSON document (all 7 steps completed) is around 20 KB. After the user clicks the Save button, we can freeze the page for some time so that they cannot make other changes until the save is finished.

Questions:

1. How long does it take for AWS S3 to save and load an item? (We can freeze our website when document is being saved to S3)
2. Is there a function to calculate save/load time based on item size?
3. Is the save/load time gonna be different if I choose another S3 region? If so which is the best region for Seattle?

asked Jun 06 '16 22:06

EV3

2 Answers

I wanted to add to @error2007s's answer.

How long does it take for AWS S3 to save and load an item? (We can freeze our website when document is being saved to S3)

It's not only that you will not find the exact time anywhere: there is actually no such thing as an exact time. That's just what "eventual consistency" is all about: consistency will be achieved eventually. You can't know when.

If somebody gave you an upper bound for how long a system would take to achieve consistency, then you wouldn't call it "eventually consistent" anymore. It would be "consistent within X amount of time".

The problem now becomes "How do I deal with eventual consistency?" (instead of trying to "beat it").

To really find the answer to that question, you need to first understand what kind of consistency you truly need, and how exactly the eventual consistency of S3 could affect your workflow.

Based on your description, I understand that you would write a total of 7 times to S3, once for each step you have. For the first write, as you correctly cited the FAQs, you get strong consistency for any reads after that. For all the subsequent writes (which are really "replacing" the original object), you might observe eventual consistency - that is, if you try to read the overwritten object, you might get the most recent version, or you might get an older version. This is what is referred to as "eventual consistency" on S3 in this scenario.
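One way to live with that behaviour is to make staleness detectable instead of trying to time it. A minimal sketch, assuming the JSON document carries a `last_saved_step` field and that `read_document` is whatever function fetches the object's bytes from S3 (both names are hypothetical, not taken from the question): the page remembers which step it just saved and re-reads until the document has caught up. Note that S3 itself has offered strong read-after-write consistency for overwrites since December 2020, so a retry like this only matters against a store that still behaves as described above.

```python
import json
import time
from typing import Callable, Optional


def load_at_least_step(
    read_document: Callable[[], bytes],
    expected_step: int,
    attempts: int = 5,
    delay_seconds: float = 0.2,
) -> Optional[dict]:
    """Re-read the document until it reflects at least `expected_step`.

    `read_document` is a hypothetical callable that returns the raw JSON
    bytes (for example, a thin wrapper around an S3 GET).
    Returns the parsed document, or None if every attempt was stale.
    """
    for attempt in range(attempts):
        document = json.loads(read_document())
        # "last_saved_step" is an assumed field written by the save handler.
        if document.get("last_saved_step", 0) >= expected_step:
            return document
        if attempt < attempts - 1:
            time.sleep(delay_seconds)
    return None


if __name__ == "__main__":
    # Fake reader for illustration: two stale replies, then the fresh one.
    replies = iter([
        b'{"last_saved_step": 1}',
        b'{"last_saved_step": 1}',
        b'{"last_saved_step": 2}',
    ])
    print(load_at_least_step(lambda: next(replies), expected_step=2, delay_seconds=0.0))
```

Under eventual consistency, one fresh read does not strictly guarantee that the next read is fresh too, so this is a mitigation rather than a guarantee.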

A few alternatives for you to consider:

- Don't write to S3 on every single step. Instead, keep the data for each step on the client side, and write a single object to S3 only after the 7th step. This way there is only one write and no overwrites, so eventual consistency never comes into play. This may or may not be possible in your specific scenario; you need to evaluate that.
- Alternatively, write to S3 objects with a different name for each step. For example: after step 1, save to bruno-preferences-step-1.json; after step 2, save to bruno-preferences-step-2.json; and so on. Then save the final preferences file to bruno-preferences.json, or maybe even bruno-preferences-step-7.json, giving yourself the flexibility to add more steps in the future. The idea here is to avoid overwrites, which could cause eventual consistency issues. With this approach you only ever write new objects; you never overwrite them.
- Finally, you might want to consider Amazon DynamoDB. It's a NoSQL database that you can securely connect to directly from the browser or from your server. It provides replication, automatic scaling, and load distribution (just like S3). You also have the option to tell DynamoDB to perform strongly consistent reads (the default is eventually consistent reads; you have to change a parameter to get strongly consistent reads). DynamoDB is typically used for "small" records, and 20 KB is definitely within range: the maximum size of a record is 400 KB as of today. You might want to check this out: DynamoDB FAQs: What is the consistency model of Amazon DynamoDB?
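The second alternative (one new object per step) can be sketched as below. This is only an illustration under stated assumptions: `store` is an in-memory mapping standing in for the bucket, and `step_key`, `save_step`, and `load_step` are hypothetical helper names. The key format follows the bruno-preferences-step-N.json example above.

```python
from typing import MutableMapping

TOTAL_STEPS = 7  # the wizard in the question has 7 steps


def step_key(user_id: str, step: int) -> str:
    """Build the object key for one step, e.g. 'bruno-preferences-step-3.json'."""
    if not 1 <= step <= TOTAL_STEPS:
        raise ValueError(f"step must be between 1 and {TOTAL_STEPS}, got {step}")
    return f"{user_id}-preferences-step-{step}.json"


def save_step(store: MutableMapping[str, bytes], user_id: str, step: int, body: bytes) -> str:
    """Write the snapshot for `step` under a key that was never written before."""
    key = step_key(user_id, step)
    if key in store:
        # Refuse to overwrite: the whole point is to create only new objects.
        raise FileExistsError(f"{key} already exists")
    store[key] = body
    return key


def load_step(store: MutableMapping[str, bytes], user_id: str, step: int) -> bytes:
    """Read the snapshot for a step the caller already knows was saved."""
    return store[step_key(user_id, step)]


if __name__ == "__main__":
    bucket: dict = {}  # stand-in for the S3 bucket
    save_step(bucket, "bruno", 1, b'{"step1": "done"}')
    save_step(bucket, "bruno", 2, b'{"step1": "done", "step2": "done"}')
    print(load_step(bucket, "bruno", 2))
```

In this sketch the caller is expected to remember the last saved step (for example in the session) and read that key directly, rather than probing for keys that may not exist yet. The S3 documentation of that era noted that a GET or HEAD on a key before the object is created makes the subsequent read of the new object eventually consistent as well.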

answered Oct 12 '22 07:10

Bruno Reis

How long does it take for AWS S3 to save and load an item? (We can freeze our website when document is being saved to S3)

You will not find the exact time anywhere. If you ask AWS, they will give you approximate timings. Your file is 20 KB, so in my experience with S3 the time will be roughly 60-90 seconds.

Is there a function to calculate save/load time based on item size?

No, there is no function you can use to calculate this.
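Since no formula exists, the practical substitute is to measure from your own server and region. A rough sketch, assuming `operation` is any zero-argument callable that performs one save or load (the helper name `time_operation` is hypothetical); here it is exercised against an in-memory dict, so the numbers say nothing about S3 itself:

```python
import statistics
import time
from typing import Callable, Dict


def time_operation(operation: Callable[[], object], runs: int = 10) -> Dict[str, float]:
    """Call `operation` `runs` times and summarize wall-clock latency in seconds."""
    if runs < 1:
        raise ValueError("runs must be at least 1")
    samples = []
    for _ in range(runs):
        started = time.perf_counter()
        operation()
        samples.append(time.perf_counter() - started)
    return {
        "min": min(samples),
        "median": statistics.median(samples),
        "max": max(samples),
    }


if __name__ == "__main__":
    payload = b"x" * 20 * 1024  # roughly the 20 KB document from the question
    fake_bucket: dict = {}      # stand-in; swap the lambda for a real upload call
    print(time_operation(lambda: fake_bucket.__setitem__("doc.json", payload)))
```

Replacing the lambda with a real upload or download call, and running it from the machine that will serve the site, gives numbers for that specific size and region.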

Is the save/load time gonna be different if I choose another S3 region? If so which is the best region for Seattle?

For Seattle, US West (Oregon) will work with no problem.

You can also take a look at this experiment for comparison: https://github.com/andrewgaul/are-we-consistent-yet

answered Oct 12 '22 07:10

error2007s
