I am trying to write a huge number of records into DynamoDB and I would like to know the correct way of doing that. Currently I am using the DynamoDBMapper to do the job in a single batchWrite operation, but after reading the documentation I am not sure this is the correct way (especially regarding the limits on the size and number of written items).
Let's say that I have an ArrayList with 10000 records and I am saving it like this:
mapper.batchWrite(recordsToSave, new ArrayList<BillingRecord>());
The first argument is the list of records to be written and the second one contains items to be deleted (no such items in this case).
Does the mapper split this write into multiple writes and handle the limits or should it be handled explicitly?
I have only found examples of batchWrite done directly with the AmazonDynamoDB client (like THIS one). Is using the client directly for batch operations the correct way? If so, what is the point of having a mapper?
Item size: the maximum item size in DynamoDB is 400 KB, which includes both attribute name lengths (UTF-8 binary length) and attribute value lengths (again binary length); attribute names count towards the size limit.
DynamoDB throttling: each partition of a DynamoDB table is subject to a hard limit of 1,000 write capacity units and 3,000 read capacity units.
Does the mapper split your list of objects into multiple batches and then write each batch separately? Yes, it does the batching for you: you can see that it splits the items to be written into batches of up to 25 items here. It then tries writing each batch, and some of the items in each batch can fail. An example of a failure is given in the mapper documentation:
This method fails to save the batch if the size of an individual object in the batch exceeds 400 KB. For more information on batch restrictions see, http://docs.aws.amazon.com/amazondynamodb/latest/APIReference/API_BatchWriteItem.html
The example is talking about the size of one record (one BillingRecord instance in your case) exceeding 400 KB, which, at the time of writing this answer, is the maximum size of an item in DynamoDB.
In case a particular batch fails, it moves on to the next batch (sleeping the thread for a bit in case the failure was caused by throttling). In the end, all of the failed batches are returned in a List of FailedBatch instances. Each FailedBatch instance contains the exception that caused the failure and the list of unprocessed items that weren't written to DynamoDB.
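To make that concrete, here is a minimal sketch of inspecting the returned list; the logging policy is just an assumption, the mapper only hands you back the unprocessed items and the exception:

import java.util.ArrayList;
import java.util.List;
import com.amazonaws.services.dynamodbv2.datamodeling.DynamoDBMapper;

// Assumes 'mapper' and 'recordsToSave' are the objects from your snippet.
List<DynamoDBMapper.FailedBatch> failedBatches =
        mapper.batchWrite(recordsToSave, new ArrayList<BillingRecord>());

for (DynamoDBMapper.FailedBatch failed : failedBatches) {
    // The exception explains why the batch failed (throttling, oversized item, ...).
    System.err.println("Batch failed: " + failed.getException());
    // Unprocessed items are keyed by table name; these were never written.
    failed.getUnprocessedItems().forEach((table, writeRequests) ->
            System.err.println(writeRequests.size() + " unprocessed items for table " + table));
}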
Is the snippet that you provided the correct way of doing batch writes? I can think of two suggestions. The batchSave method is more appropriate if you have no items to delete. You might also want to think about what you want to do with the failed batches.
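For the first suggestion, a sketch (what you do with a non-empty result is up to you):

// batchSave is equivalent to batchWrite with an empty delete list.
List<DynamoDBMapper.FailedBatch> failures = mapper.batchSave(recordsToSave);

if (!failures.isEmpty()) {
    // Decide what to do with the records that were not written:
    // retry them, write them to a dead-letter store, or fail the whole job.
}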
Is using the client directly the correct way? If so, what is the point of the mapper? The mapper is simply a wrapper around the client. It provides an ORM layer that converts your BillingRecord instances into the sort-of nested hash maps that the low-level client works with. There is nothing wrong with using the client directly, and this does tend to happen in special cases where functionality not offered by the mapper needs to be coded against the low-level API.
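To illustrate the difference, here is a hedged sketch of a single batch through the low-level client; the table name "BillingRecord" and the attribute names are made up for the example:

import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import com.amazonaws.services.dynamodbv2.AmazonDynamoDB;
import com.amazonaws.services.dynamodbv2.AmazonDynamoDBClientBuilder;
import com.amazonaws.services.dynamodbv2.model.AttributeValue;
import com.amazonaws.services.dynamodbv2.model.BatchWriteItemResult;
import com.amazonaws.services.dynamodbv2.model.PutRequest;
import com.amazonaws.services.dynamodbv2.model.WriteRequest;

AmazonDynamoDB client = AmazonDynamoDBClientBuilder.defaultClient();

// Build the attribute map by hand -- this is what the mapper derives from the
// annotations on BillingRecord. Table and attribute names are illustrative only.
Map<String, AttributeValue> item = new HashMap<>();
item.put("id", new AttributeValue().withS("record-1"));
item.put("amount", new AttributeValue().withN("42"));

Map<String, List<WriteRequest>> requestItems = new HashMap<>();
requestItems.put("BillingRecord",
        Collections.singletonList(new WriteRequest(new PutRequest(item))));

// A single call accepts at most 25 write requests; chunking a 10000-element list
// and retrying unprocessed items (ideally with a back-off) is your responsibility
// at this level.
BatchWriteItemResult result = client.batchWriteItem(requestItems);
while (!result.getUnprocessedItems().isEmpty()) {
    result = client.batchWriteItem(result.getUnprocessedItems());
}

Compared with the mapper version you gain full control over chunking and retries, at the cost of writing the marshalling yourself.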