It's unclear to me, after reading the docs, how many read capacity units are consumed during a scan operation with a filter in DynamoDB. For example, with this Ruby request:
table.items.where(:MyAttribute => "Some Value").each do |item_data|
  # do something with the item_data
end
My understanding is that this will result in a table scan, but DynamoDB will only return the items I'm interested in. But if my table has 10,000 items and only 5 of them get through my filter, am I still being "charged" for a huge number of read capacity units?
The attribute I'm filtering on is not a hash key, a range key, or a secondary index. I only had to add that attribute recently, and unexpectedly, which is why I'm not using a query instead.
One read capacity unit (RCU) buys one strongly consistent read per second, or two eventually consistent reads per second, for an item up to 4 KB. A transactional read request costs 2 RCUs for one read per second of an item up to 4 KB. For items larger than 4 KB, the number of units required = (total item size / 4 KB), rounded up.
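For illustration, here's that arithmetic in plain Ruby (the 9.5 KB item size is just a made-up example):

item_size_bytes = 9_500
four_kb_blocks  = (item_size_bytes / 4096.0).ceil   # => 3 blocks of 4 KB

strongly_consistent_rcus   = four_kb_blocks * 1.0   # => 3.0
eventually_consistent_rcus = four_kb_blocks * 0.5   # => 1.5
transactional_rcus         = four_kb_blocks * 2.0   # => 6.0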
Should I use DynamoDB scans? Generally speaking, no. Scans are expensive, slow, and against best practices. To fetch one item by its key, use the Get operation; to fetch a collection of items, use Query, as in the sketch below.
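A minimal sketch of both, using the current aws-sdk-dynamodb gem rather than the older v1 API from the question; the table name, key schema, and region here are hypothetical:

require "aws-sdk-dynamodb"

client = Aws::DynamoDB::Client.new(region: "us-east-1")

# Get: fetch a single item by primary key; consumes capacity for that item only.
item = client.get_item(
  table_name: "MyTable",
  key: { "Id" => "item-123" }
).item

# Query: fetch the items sharing a partition key; reads only the matching items.
resp = client.query(
  table_name: "MyTable",
  key_condition_expression: "Id = :id",
  expression_attribute_values: { ":id" => "item-123" }
)
resp.items.each { |i| puts i }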
One write capacity unit represents one write per second for an item up to 1 KB in size. If you need to write an item that is larger than 1 KB, DynamoDB must consume additional write capacity units. Transactional write requests require 2 write capacity units to perform one write per second for items up to 1 KB.
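The same back-of-the-envelope math for writes (again with a made-up item size):

item_size_bytes = 2_600
one_kb_blocks   = (item_size_bytes / 1024.0).ceil   # => 3 blocks of 1 KB

standard_write_wcus      = one_kb_blocks * 1        # => 3
transactional_write_wcus = one_kb_blocks * 2        # => 6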
A read capacity unit represents one strongly consistent read per second, or two eventually consistent reads per second, for an item up to 4 KB in size (see the Read consistency section of the docs for the consistency models). For example, a table provisioned with 10 read capacity units can serve 10 strongly consistent reads per second, or 20 eventually consistent reads per second, for items up to 4 KB.
In short, you are "charged" for the total number of items scanned, not the number of items returned. Compared to Query, Scan is (as you already suspected) an expensive operation.
Worth mentioning is that invoking a scan does not mean the whole table is read in a single request. If the size of the scanned items exceeds the 1 MB limit, the scan stops, and you have to invoke it again to scan the next portion of the table (a sketch of that loop follows the quote below).
This is taken from the official docs:
If the total number of scanned items exceeds the maximum data set size limit of 1 MB, the scan stops and results are returned to the user as a LastEvaluatedKey value to continue the scan in a subsequent operation. The results also include the number of items exceeding the limit. A scan can result in no table data meeting the filter criteria.
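In Ruby, the paging loop could look roughly like this (aws-sdk-dynamodb gem; the table name, attribute, and region are hypothetical):

require "aws-sdk-dynamodb"

client = Aws::DynamoDB::Client.new(region: "us-east-1")
params = {
  table_name: "MyTable",
  filter_expression: "MyAttribute = :v",
  expression_attribute_values: { ":v" => "Some Value" }
}

loop do
  resp = client.scan(params)
  resp.items.each { |item| puts item }  # only items that passed the filter
  break unless resp.last_evaluated_key  # nil once the whole table has been scanned
  params[:exclusive_start_key] = resp.last_evaluated_key
end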
The filter is applied after the items have been read, so it does not reduce the consumed capacity at all; you pay for every item scanned, whether or not it passes the filter.
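You can see this directly in the scan response: ScannedCount is what you pay for, Count is what survives the filter, and return_consumed_capacity asks DynamoDB to report the capacity actually charged. A sketch with hypothetical names:

require "aws-sdk-dynamodb"

client = Aws::DynamoDB::Client.new(region: "us-east-1")
resp = client.scan(
  table_name: "MyTable",
  filter_expression: "MyAttribute = :v",
  expression_attribute_values: { ":v" => "Some Value" },
  return_consumed_capacity: "TOTAL"
)

puts resp.scanned_count                     # items read, e.g. 10000 -- what you pay for
puts resp.count                             # items returned, e.g. 5
puts resp.consumed_capacity.capacity_units  # RCUs consumed for this page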
If you are going to perform these operations regularly, it may be worth adding a secondary index or rethinking your hash and range keys.
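For example, if MyAttribute were backed by a global secondary index (a hypothetical one named "MyAttribute-index" here), the same lookup would become a cheap Query that reads only the 5 matching items:

require "aws-sdk-dynamodb"

client = Aws::DynamoDB::Client.new(region: "us-east-1")
resp = client.query(
  table_name: "MyTable",
  index_name: "MyAttribute-index",
  key_condition_expression: "MyAttribute = :v",
  expression_attribute_values: { ":v" => "Some Value" }
)
resp.items.each { |item| puts item }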