DynamoDB Query with multiple tags

Tags:

I am rather new to DynamoDB and currently we are thinking about migrating an existing project to a serverless application using DynamoDB where we want to adapt the following setup from a RDMS database:

Tables:

Projects (ProjectID)
Files (FileID, ProjectID, Filename)
Tags (FileID, Tag)

We want to make a query with DynamoDB to fetch all Files for a specific Project (by ProjectID) with one or multiple Tags (by Tag). In an RDMS this query would be simple with something like:

SELECT * FROM Files JOIN Tags ON Tags.FileID = Files.FileID WHERE Files.ProjectID = ?PROJECT AND Tags.Tag = ?TAG_1 OR ?TAG_2 ...

At the moment, we have the following DynamoDB setup (but it can still be changed):

Projects (ProjectID [HashKey], ...)
Files (ProjectID [HashKey], FileID [RangeKey], ...)

Please also consider that the number of project entries is huge (between 1000 - 30000) and also the number of files for each project (is between 50 and 100.000) and the query should be really fast.

How can this be achieved using DynamoDB-query, best without using filter expressions since they are applied after data selection? It would be perfect if the table Files could have a StringSet Tags as column but I guess that this cannot be used for an efficient DynamoDB-query (so without using DynamoDB-scan) since DynamoDB-indices can only be of type String, Binary and Number and not of type StringSet? Is this maybe an applicable use case for the Global Secondary Index (GSI)?

553

asked Mar 08 '17 13:03

Tom

1 Answers

A bit late, just saw this question referenced from another one.

I guess you've went and solved it something like this?

DynamoDB tables

Projects (ProjectID [HashKey], ...)
Files (ProjectID [HashKey], FileID [RangeKey], ...)
Tags (Tag [HashKey], FileID [RangeKey], ProjectID [LSI Sort Key])

On the FileTags, you need the FileID to make the primary key unique, but you can add the ProjectID as a sort key for a Local Secondary Index, so you can search on Tag + ProjectID.

It's some sort of Data Denormalization, but that's what it takes to go NoSQL :-( . E.g. if your File would be switched to another Project, you'll need to update the ProjectID not only on the File, but also on all the Tags.

answered Sep 28 '22 08:09

GeertPt

Related questions
                            
                                Docker container keeps growing
                            
                                AWS API Gateway accept Content-type: application/xml
                            
                                DynamoDB count operation capacity units consumption
                            
                                Install pgAgent on AWS RDS for Postgres
                            
                                AWS IAM Policy to allow user to create IAM Roles (from Management Console & AWS CLI)
                            
                                AWS Lambda - callback("some error type") equivalent in Java 8
                            
                                Migrating from SQL Server to AWS Aurora
                            
                                Performance of listing S3 bucket with prefix and delimiter
                            
                                How do I set my Elastic Beanstalk application to use an Application Load Balancer?
                            
                                Fetch external link of ecs task running from aws cli
                            
                                Can't associate an Elastic IP on Amazon EC2 Instance
                            
                                How to use S3 SSE C (Server Side Encryption with Client Provided Keys) on NodeJS
                            
                                Does aws s3 sync s3://mybucket s3://mybucket2 copy files to local?
                            
                                Amazon Linux machine - Ansible ansible_distribution* variables major release distribution
                            
                                Elastic beanstalk require python 3.5
                            
                                Can I attach an EC2 instance to an existing Load Balancer using CloudFormation
                            
                                How to copy file while preserving directory structure using AWS command line
                            
                                DynamoDB Validation Exception - Key element does not match the schema
                            
                                API Gateway variable number of path parameters
                            
                                Multiple AWS routes with same Destination Cidr Blocks

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

DynamoDB Query with multiple tags

Tags:

amazon-web-services

amazon-dynamodb

Tom

People also ask

1 Answers

GeertPt

Recent Activity

Donate For Us