DynamoDB: When does 1MB limit for queries apply

1 Answer

Indeed, your interpretation is correct. With KeyConditionExpression, DynamoDB can efficiently fetch only the data matching the key condition, so you pay only for the matching data, and the 1MB read limit applies to the matching data. With FilterExpression the story is different: DynamoDB has no efficient way to skip the non-matching items, so it first reads all the items and only then filters out the ones you don't want. You therefore pay for reading the entire unfiltered data (before FilterExpression is applied), and the 1MB maximum also applies to the unfiltered data.
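One way to see this on your own table is to compare the ScannedCount and Count fields that Query and Scan return with every page. The helper below is only a minimal sketch: summarize_page is a hypothetical name, and it assumes you pass it the response dictionary returned by a client call such as boto3's query or scan.

```python
def summarize_page(response):
    """Summarize one Query/Scan response page.

    ScannedCount is the number of items read before FilterExpression was
    applied; Count is the number of items left after filtering.
    """
    scanned = response["ScannedCount"]
    returned = response["Count"]
    return {
        "scanned": scanned,
        "returned": returned,
        "discarded_by_filter": scanned - returned,
        # A LastEvaluatedKey means the read stopped early and more data remains.
        "more_pages": "LastEvaluatedKey" in response,
    }
```

A large gap between "scanned" and "returned" means most of what was read (and billed) was thrown away by the filter.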

If you're still unconvinced that this is the way it should be, here's another issue to consider: imagine that you have 1 gigabyte of data in your table to be scanned (or under a single partition key to be queried), and that after filtering, the result will be just 1 kilobyte. If you made this request and expected to get the 1 kilobyte back in one response, DynamoDB would need to read and process the entire 1 gigabyte of data before returning. This could take a very long time, you would have no idea how long, and the request would likely time out while you waited for the result. So instead, DynamoDB makes sure to return to you after every 1MB of data it reads from disk (and for which you pay ;-)). Control will return to you about 1000 (= 1 gigabyte / 1 MB) times during the long query, and no single request runs long enough to time out. Whether a 1MB limit actually makes sense here or whether it should have been larger, I don't know, and maybe there should have been separate limits for the response size and the amount read. But some sort of limit on the amount read was definitely needed, even if it doesn't translate to large responses.
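In client code, "control returns to you" means a loop that keeps re-issuing the request with ExclusiveStartKey set to the previous response's LastEvaluatedKey. Here is a minimal sketch, assuming a low-level client such as the one returned by boto3.client("dynamodb"); query_all is a hypothetical helper name, and the table and expressions are whatever you pass in.

```python
def query_all(client, **query_kwargs):
    """Follow LastEvaluatedKey until DynamoDB reports no more pages.

    Returns the collected items and the number of requests made.
    """
    items = []
    pages = 0
    while True:
        response = client.query(**query_kwargs)
        pages += 1
        # A page can be empty after filtering and still not be the last one.
        items.extend(response.get("Items", []))
        last_key = response.get("LastEvaluatedKey")
        if last_key is None:
            break
        # Resume reading where the previous page stopped.
        query_kwargs["ExclusiveStartKey"] = last_key
    return items, pages
```

Note that the loop stops only when LastEvaluatedKey is absent, not when a page comes back empty: with a selective FilterExpression, many pages may contain no items at all.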

By the way, the Scan documentation includes a slightly differently worded explanation of the 1MB limit; you may find it clearer than the version in the Query documentation:

A single Scan operation will read up to the maximum number of items set (if using the Limit parameter) or a maximum of 1 MB of data and then apply any filtering to the results using FilterExpression.
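The order of operations in that sentence (read first, filter afterwards) can be illustrated with a toy model. This is not DynamoDB's implementation, just a sketch under simplifying assumptions: read_one_page is a hypothetical function, each item carries a made-up "size" field in bytes, max_bytes stands in for the 1 MB cap, and the page is assumed to stop before an item that would exceed the cap.

```python
def read_one_page(items, start, predicate, limit=None, max_bytes=1_000_000):
    """Toy model of one Scan page: read up to a cap, then filter."""
    read = []
    used = 0
    index = start
    while index < len(items):
        # Limit counts items read, not items that survive the filter.
        if limit is not None and len(read) >= limit:
            break
        size = items[index]["size"]
        # Stop once the next item would push the page past the byte cap.
        if read and used + size > max_bytes:
            break
        read.append(items[index])
        used += size
        index += 1
    # Filtering happens only after the read budget has been spent.
    matched = [item for item in read if predicate(item)]
    next_start = index if index < len(items) else None
    return matched, len(read), next_start


# Ten 400 KB items; only the last one passes the filter.
table = [{"id": i, "size": 400_000, "keep": i == 9} for i in range(10)]
matched, scanned, next_start = read_one_page(table, 0, lambda item: item["keep"])
print(matched, scanned, next_start)
```

In this model the first page reads two items, returns none of them, and still hands back a position to continue from, which mirrors the behavior the documentation describes.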

109

answered Sep 22 '22 19:09

Nadav Har'El


Tags:

amazon-web-services

amazon-dynamodb

dynamodb-queries

J. Hesters
