I'm facing some questions about my database design. I'm developing an API that lets users do the following:
Calling the API methods triggers AWS Lambda to perform the requested operations in the DynamoDB tables.
My current plan looks like this:
It should be possible to query items by specifying a time frame and a Profile ID. But I think my design completely defeats the purpose of DynamoDB; the AWS documentation says that a well-designed application requires only one table.
If you have multiple services in your application, each of them should have its own DynamoDB table. You should think of a DynamoDB table as similar to an RDBMS instance: everywhere you would have a separate RDBMS instance, you should have a separate DynamoDB table.
As a general rule, you should maintain as few tables as possible in a DynamoDB application. To better understand why keeping few tables (ideally only one) is beneficial, let's briefly review the DynamoDB data model.
The main schema difference you will see between single-table and multi-table models is that a single-table design will have generically named attributes that form the table's partition and sort key. This is required because different entity types will likely have differently named primary key fields.
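To make that concrete, here is a rough boto3 sketch of such a generically keyed table definition; the table name "app-data" and the key names "pkey"/"skey" are placeholders chosen for illustration, not anything DynamoDB mandates:

```python
import boto3

# Minimal single-table definition sketch: the partition and sort keys get
# generic names so that any entity type (Account, Profile, Item, ...) can be
# stored under them. Table name and billing mode are assumptions.
dynamodb = boto3.client("dynamodb")
dynamodb.create_table(
    TableName="app-data",
    AttributeDefinitions=[
        {"AttributeName": "pkey", "AttributeType": "S"},
        {"AttributeName": "skey", "AttributeType": "S"},
    ],
    KeySchema=[
        {"AttributeName": "pkey", "KeyType": "HASH"},   # partition key
        {"AttributeName": "skey", "KeyType": "RANGE"},  # sort key
    ],
    BillingMode="PAY_PER_REQUEST",
)
```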
DynamoDB increased the default quota for the number of DynamoDB tables you can create and manage per AWS account and AWS Region from 256 to 2,500 tables. DynamoDB also increased the number of table management operations you can perform concurrently from 50 to 500.
I’m going to give this answer assuming that you need to be able to do the following queries.
One of the beauties of DynamoDB (and also a bane, perhaps) is that it is mostly schema-less. You need to have the mandatory Primary Key attributes for every item in the table, but all of the other attributes can be anything you like. In order to have a DynamoDB design with only one table, you usually need to get used to the idea of having mixed types of objects in the same table.
That being said, here’s a possible schema for your use case. My suggestion assumes that you are using something like UUIDs for your identifiers.
The partition key is a field that is simply called pkey (or whatever you want). We'll also call the sort key skey (but again, it doesn't really matter). Now, for an Account, the value of pkey is Account-{{uuid}} and the value of skey would be the same. For a Profile, the pkey value is also Account-{{uuid}}, but the skey value is Profile-{{uuid}}. Finally, for an Item, the pkey is Profile-{{uuid}} and the skey is Item-{{type}}-{{uuid}}. As for the other attributes of an item, don't worry about them; just use whatever attributes you want.
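As a rough illustration of that key scheme (the table name, extra attributes, and example values below are assumptions, not part of the question), writing the three entity types with boto3 could look like this:

```python
import uuid
import boto3

table = boto3.resource("dynamodb").Table("app-data")  # hypothetical table name

account_id = str(uuid.uuid4())
profile_id = str(uuid.uuid4())
item_id = str(uuid.uuid4())

# Account: pkey and skey are both "Account-{uuid}"
table.put_item(Item={
    "pkey": f"Account-{account_id}",
    "skey": f"Account-{account_id}",
    "email": "owner@example.com",            # any other attributes you like
})

# Profile: pkey points at the owning Account, skey identifies the Profile
table.put_item(Item={
    "pkey": f"Account-{account_id}",
    "skey": f"Profile-{profile_id}",
    "displayName": "example profile",
})

# Item: pkey points at the owning Profile, skey encodes the item type and id
table.put_item(Item={
    "pkey": f"Profile-{profile_id}",
    "skey": f"Item-Type2-{item_id}",
    "createdAt": "2021-06-01T12:00:00Z",
})
```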
Since the "parent" object is always the partition key, you can get any of the "child" objects simply by querying for the ID of the parent. For example, your key condition expression to get all the 'ItemType2's for a Profile would be:
pkey = "Profile-{{uuid}}" AND begins_with(skey, "Item-Type2")
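With boto3, that key condition could be expressed roughly like this (the table name and the placeholder Profile ID are assumptions):

```python
import boto3
from boto3.dynamodb.conditions import Key

table = boto3.resource("dynamodb").Table("app-data")   # hypothetical table name
profile_uuid = "11111111-1111-1111-1111-111111111111"  # placeholder Profile id

# All 'ItemType2' children of one Profile
resp = table.query(
    KeyConditionExpression=Key("pkey").eq(f"Profile-{profile_uuid}")
    & Key("skey").begins_with("Item-Type2")
)
item_type2s = resp["Items"]
```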
In this schema, your GSI has the same keys as the table, but reversed. You can query the GSI for Item-{{type}}-{{uuid}} to get the owning Profile, and similarly query for a Profile's key to get the owning Account.
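As a sketch, a reverse lookup on that GSI could look like the following; the index name "skey-pkey-index" is an assumption, so substitute whatever you name yours:

```python
import boto3
from boto3.dynamodb.conditions import Key

table = boto3.resource("dynamodb").Table("app-data")   # hypothetical table name
item_uuid = "22222222-2222-2222-2222-222222222222"     # placeholder Item id

# The GSI's partition key is the table's sort key (skey), so querying it by the
# Item's key returns that item stored under the pkey of its owning Profile.
resp = table.query(
    IndexName="skey-pkey-index",                       # assumed GSI name
    KeyConditionExpression=Key("skey").eq(f"Item-Type2-{item_uuid}"),
)
owning_profile = resp["Items"][0]["pkey"]              # e.g. "Profile-{uuid}"
```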
What I have illustrated here is the adjacency list pattern. The DynamoDB documentation also describes how to use composite sort keys for hierarchical data, which would also be suitable for your data and, depending on your expected queries, might be a better fit than the adjacency list.
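For the time-frame requirement from the question, a composite sort key is one way to go. The sketch below assumes the Item sort key is changed to embed an ISO 8601 timestamp (e.g. Item-{{type}}-{{timestamp}}-{{uuid}}), which is an alternative to the format shown above rather than part of it:

```python
import boto3
from boto3.dynamodb.conditions import Key

table = boto3.resource("dynamodb").Table("app-data")   # hypothetical table name
profile_uuid = "11111111-1111-1111-1111-111111111111"  # placeholder Profile id

# If Items were stored with skey = "Item-Type2-<ISO timestamp>-<uuid>", a
# time-frame query for one Profile becomes a single key condition, because
# ISO 8601 timestamps sort lexicographically.
resp = table.query(
    KeyConditionExpression=Key("pkey").eq(f"Profile-{profile_uuid}")
    & Key("skey").between("Item-Type2-2021-06-01", "Item-Type2-2021-06-30T23:59:59Z")
)
june_items = resp["Items"]
```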
You don't have to put everything in a single table. Yes, DynamoDB recommends it, but it is far more important to make sure that your application is correct and maintainable. If having multiple tables means it's easier to write a defect-free application, then use multiple tables.