I am migrating my persistence tier from Riak to DynamoDB. My data model contains an optional business identifier field, which is desired to be able to be queried as an alternative to the key. It appears that DynamoDB secondary indexes can't be <code>null</code> and require a range key, so despite the similar name to Riak's secondary indexes, make this appear quite a different beast. Is there an elegant way to efficiently query my optional field, short of throwing the data in an external search index?

When you asked this question, DynamoDB did not have Global Secondary Indexes: http://aws.amazon.com/about-aws/whats-new/2013/12/12/announcing-amazon-dynamodb-global-secondary-indexes/ Now, it does. A local secondary index is best thought of, and functions as, a secondary range key. @andreimarinescu is right: you still must query by the item's hash key, only with a secondary index you can use a limited subset of a DynamoDB query's comparison operators on that range key (e.g. greater than, equal to, less than, etc.) So, you still need to know which "hash bucket" you're performing the comparison within. Global secondary indexes are a bit of a different beast. They are more like a secondary version of your table (and Amazon charges you similarly in terms of provisioned throughput). You can use non-primary key attributes of your table as primary key attributes of your index in a global secondary index, and query them accordingly. For example, if your table looks like: <pre class="prettyprint"><code>|**Hash key**: Item ID | **Range Key**: Serial No | **Attribute**: Business ID | -------------------------------------------------------------------------------- | 1 | 12345 | 1A | -------------------------------------------------------------------------------- | 2 | 45678 | 2B | -------------------------------------------------------------------------------- | 3 | 34567 | (empty) | -------------------------------------------------------------------------------- | 3 | 12345 | 2B | -------------------------------------------------------------------------------- </code></pre> Then, with a local secondary index on <code>Business ID</code> you could perform queries like, "find all the items with a hash key of <code>3</code> and a business ID equal to <code>2B</code>", but you could not do "find all items with a business ID equal to <code>2B</code>" because the secondary index requires a hash key. If you were to add a global secondary index using business ID, then you could perform such queries. You would essentially be providing an alternate primary key for the table. You could perform a query like "find all items with a business ID equal to <code>2B</code> and get items <code>2-45678</code> and <code>3-12345</code> as a response. Sparse indexes work fine with DynamoDB; it's perfectly allowable that not all the items have a business ID and can allow you to keep the provisioned throughput on your index lower than that of the table depending on how many items you anticipate having a business ID.

Optional secondary indexes in DynamoDB

1 Answers

When you asked this question, DynamoDB did not have Global Secondary Indexes: http://aws.amazon.com/about-aws/whats-new/2013/12/12/announcing-amazon-dynamodb-global-secondary-indexes/

Now, it does.

A local secondary index is best thought of, and functions as, a secondary range key. @andreimarinescu is right: you still must query by the item's hash key, only with a secondary index you can use a limited subset of a DynamoDB query's comparison operators on that range key (e.g. greater than, equal to, less than, etc.) So, you still need to know which "hash bucket" you're performing the comparison within.

Global secondary indexes are a bit of a different beast. They are more like a secondary version of your table (and Amazon charges you similarly in terms of provisioned throughput). You can use non-primary key attributes of your table as primary key attributes of your index in a global secondary index, and query them accordingly.

For example, if your table looks like:

|**Hash key**: Item ID | **Range Key**: Serial No | **Attribute**: Business ID |
--------------------------------------------------------------------------------
|           1          |        12345             |             1A             |
--------------------------------------------------------------------------------    
|           2          |        45678             |             2B             |
-------------------------------------------------------------------------------- 
|           3          |        34567             |            (empty)         |
--------------------------------------------------------------------------------
|           3          |        12345             |             2B             |
--------------------------------------------------------------------------------

Then, with a local secondary index on Business ID you could perform queries like, "find all the items with a hash key of 3 and a business ID equal to 2B", but you could not do "find all items with a business ID equal to 2B" because the secondary index requires a hash key.

If you were to add a global secondary index using business ID, then you could perform such queries. You would essentially be providing an alternate primary key for the table. You could perform a query like "find all items with a business ID equal to 2B and get items 2-45678 and 3-12345 as a response.

Sparse indexes work fine with DynamoDB; it's perfectly allowable that not all the items have a business ID and can allow you to keep the provisioned throughput on your index lower than that of the table depending on how many items you anticipate having a business ID.

102

answered Oct 17 '22 06:10

rpmartz

Related questions
                            
                                Folder won't delete on Amazon S3
                            
                                Deploying Django to AWS - WSGIPath refers to a file that does not exist
                            
                                AWS Elastic Beanstalk - ERROR: No Application Version named 'v0_9_2-76-gf5a4' found
                            
                                Why does a AWS NAT Gateway require an ElasticIP?
                            
                                ImageMagick not converting pdfs anymore in AWS Lambda
                            
                                How to reference a resource ARN in a cloudformation policy document ? (yaml)
                            
                                What are the valid instanceState's for the Amazon EC2 API?
                            
                                Using PHPMailer and Amazon SES
                            
                                How to specify root volume size of core-os ec2 instance using boto3?
                            
                                How do I dynamically change the table accessed using DynamoDB's Java Mapper?
                            
                                AWS Scale out , Scale Up
                            
                                Method PUT is not allowed by Access-Control-Allow-Methods in preflight response, from AWS API Gateway
                            
                                How to set the password of a cognito user as the admin?
                            
                                GUI in Amazon EC2 Linux instance
                            
                                Extra folder appended to my web root on AWS
                            
                                Amazon SNS For Apple - Error loading apple credentials from file
                            
                                Change S3 Bucket Storage class to S3 Infrequent Access
                            
                                Webpack and AWS Lambda issue - handler missing on module
                            
                                Passing NODE_ENV to docker to run package.json scripts
                            
                                Method of finding instances attached to ELB

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Optional secondary indexes in DynamoDB

Tags:

amazon-web-services

amazon-dynamodb

secondary-indexes

nullPainter

People also ask

1 Answers

rpmartz

Recent Activity

Donate For Us