How do you query DynamoDB?

Tags: database, nosql, amazon-dynamodb

I'm looking at Amazon's DynamoDB as it looks like it takes away all of the hassle of maintaining and scaling your database server. I'm currently using MySQL, and maintaining and scaling the database is a complete headache.

I've gone through the documentation and I'm having a hard time trying to wrap my head around how you would structure your data so it could be easily retrieved.

I'm totally new to NoSQL and non-relational databases.

From the DynamoDB documentation, it sounds like you can only query a table on the primary hash key, plus the primary range key with a limited number of comparison operators.

Or you can run a full table scan and apply a filter to it. The catch is that it will only scan 1 MB at a time, so you'd likely have to repeat your scan to find X number of results.

I realize these limitations allow them to provide predictable performance, but it seems like it makes it really difficult to get your data out. And performing full table scans seems like it would be really inefficient, and would only become less efficient over time as your table grows.

For instance, say I have a Flickr clone. My Images table might look something like:

Image ID (Number, Primary Hash Key)
Date Added (Number, Primary Range Key)
User ID (String)
Tags (String Set)
etc

So using query I would be able to list all images from the last 7 days and limit it to X number of results pretty easily.

But if I wanted to list all images from a particular user I would need to do a full table scan and filter by username. Same would go for tags.

And because you can only scan 1 MB at a time, you may need to do multiple scans to find X number of images. I also don't see a way to easily stop at X number of images. If you're trying to grab 30 images, your first scan might find 5, and your second may find 40.
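To make the paging concern concrete, here is a rough sketch of that loop in Python. It is only an illustration under stated assumptions: `scan_page` is a hypothetical callable standing in for one filtered Scan request, and it is assumed to return a dict shaped like a Scan response, with `Items` and, when more data remains, `LastEvaluatedKey`. Stopping at X then comes down to truncating on the client side.

```python
from typing import Any, Callable, Optional

Page = dict[str, Any]


def collect_matches(
    scan_page: Callable[[Optional[dict]], Page],
    wanted: int,
) -> list[dict]:
    """Request scan pages until `wanted` items are found or the table ends."""
    found: list[dict] = []
    start_key: Optional[dict] = None  # None means "start from the beginning"
    while len(found) < wanted:
        # scan_page is hypothetical: one Scan request, resumed from start_key.
        page = scan_page(start_key)
        found.extend(page.get("Items", []))
        start_key = page.get("LastEvaluatedKey")
        if start_key is None:
            break  # no more pages: the whole table has been scanned
    # A page may overshoot (5 items, then 40), so trim to exactly `wanted`.
    return found[:wanted]
```

In real code, `scan_page` would issue a single Scan request and pass the previous `LastEvaluatedKey` back as `ExclusiveStartKey`. Note that trimming discards the surplus items, so resuming exactly where the trimmed list ends would need extra bookkeeping.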

Do I have this right? Is it basically a trade-off? You get really fast, predictable database performance that is virtually maintenance-free, but in exchange you need to build way more logic to deal with the results?

Or am I totally off base here?

asked Feb 03 '12 15:02

chriserwin

2 Answers

Yes, you are correct about the trade-off between performance and query flexibility.

But there are a few tricks to reduce the pain - secondary indexes/denormalising probably being the most important.

You would have another table keyed on user ID, listing all their images, for example. When you add an image, you update this table as well as adding a row to the table keyed on image ID.
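As a minimal sketch of that double write, assuming plain in-memory dictionaries in place of the two DynamoDB tables (the names `images_by_id`, `images_by_user`, `add_image`, and `images_for_user` are made up for illustration):

```python
from collections import defaultdict

# In-memory stand-ins for the two tables (assumption: real code would
# write to DynamoDB instead of these dictionaries).
images_by_id = {}                   # image_id -> image attributes
images_by_user = defaultdict(list)  # user_id -> [(date_added, image_id), ...]


def add_image(image_id: int, user_id: str, date_added: int, tags: set) -> None:
    """Write the image row, then the matching row in the per-user table."""
    images_by_id[image_id] = {
        "user_id": user_id,
        "date_added": date_added,
        "tags": tags,
    }
    images_by_user[user_id].append((date_added, image_id))


def images_for_user(user_id: str) -> list:
    """Return a user's image IDs, oldest first, without scanning images_by_id."""
    return [image_id for _, image_id in sorted(images_by_user.get(user_id, []))]
```

The cost of this design is that every write touches two places, and the application has to keep them consistent itself. The same pattern would apply to a table keyed on tag.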

You have to decide what queries you need, then design the data model around them.

answered Sep 21 '22 15:09

DNA

I think you need to create your own secondary index, using another table.

This table "schema" could be:

    User ID (String, Primary Key)
    Date Added (Number, Range Key)
    Image ID (Number)

That way you can query by User ID and filter by Date as well.
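Here is a rough sketch of how a lookup against such an index table behaves, using a sorted in-memory list per user in place of the real table. The names `index`, `put`, and `query` are hypothetical and only illustrate the hash key plus range key access pattern.

```python
import bisect
from collections import defaultdict

# user_id -> list of (date_added, image_id), kept sorted by date.
# Assumption: this dictionary stands in for the index table.
index = defaultdict(list)


def put(user_id: str, date_added: int, image_id: int) -> None:
    """Insert one index row, keeping the user's rows ordered by date."""
    bisect.insort(index[user_id], (date_added, image_id))


def query(user_id: str, since: int, until: int, limit: int) -> list:
    """Image IDs for one user with since <= date_added <= until, oldest first."""
    rows = index.get(user_id, [])
    # (since,) sorts before every (since, image_id), so lo is the first match.
    lo = bisect.bisect_left(rows, (since,))
    # (until, inf) sorts after every (until, image_id), so hi is past the last match.
    hi = bisect.bisect_right(rows, (until, float("inf")))
    return [image_id for _, image_id in rows[lo:hi][:limit]]
```

Because the rows for one user are already ordered by date, a date range and a result limit are cheap here, which is the property the range key is meant to provide.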

answered Sep 20 '22 15:09

Rodrigo Ribeiro
