NoSQL: Getting the latest values from tables DynamoDB/Azure Table Storage

I have a small problem and would welcome some suggestions:

Let's say we have a few hundred data tables with a few dozen million rows each.
Each data table maps a timestamp (the key) to a value.
Each data table is written to once every second.

The latest entry of each table should be quick to obtain and will most likely be queried the most (sort of like "follow data in real time"). Lacking a 'Last()' or similar, I was thinking of creating another table, "LatestValues", in which the latest entry of each data table is updated for faster retrieval. This, however, would add an extra update to each write operation. Also, most of the traffic would be concentrated on this table (good/bad?). Is there a better solution for this, or am I missing something?

Also, let's say we want to query for the values in the data tables. Since scanning is obviously out of the question, is the only option left to create a secondary index by duplicating the data, effectively doubling the storage requirements and the number of write operations? Are there any other solutions?

I'm primarily looking at DynamoDB and Azure Table Storage, but I'm also curious how BigTable handles this.

asked Oct 09 '12 22:10

user1597701

1 Answer

I just published an article today with some common "recipes" for DynamoDB. One of them is "Storing article revisions, getting always the latest". I think it might interest you :)

In a nutshell, you can get the latest item using Query(hash_key=..., ScanIndexForward=False, Limit=1)

But this assumes you have a range_key defined.

Scan has no equivalent of ScanIndexForward=False and, in any case, you cannot rely on the order of its results, since the data is spread over partitions and the Scan request is load balanced across them.

To achieve your goal with DynamoDB, you may "split" your timestamp this way:

hash_key: date
range_key: time or full timestamp, as you prefer

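Then you can use the 'trick' of Query + Limit=1 + ScanIndexForward=False. With ScanIndexForward=False the results come back in descending range_key order, so the single item returned is the most recent one for that date.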
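As a rough illustration of the recipe above, here is a minimal Python sketch. It assumes the boto3 low-level client interface, whose query call accepts KeyConditionExpression, ScanIndexForward and Limit. The attribute name "date", the table name and the helper function names are all hypothetical, chosen only for this example. "date" is aliased through ExpressionAttributeNames because DATE is a reserved word in DynamoDB expressions.

```python
from datetime import datetime, timezone


def split_timestamp(ts):
    """Split an aware datetime into (hash_key, range_key) = (date, time) in UTC."""
    ts = ts.astimezone(timezone.utc)
    return ts.strftime("%Y-%m-%d"), ts.strftime("%H:%M:%S")


def latest_query_params(table_name, date_key):
    """Build Query parameters that return only the newest item for one date.

    Assumption: the hash key attribute is called "date" (hypothetical name).
    """
    return {
        "TableName": table_name,
        "KeyConditionExpression": "#d = :d",
        "ExpressionAttributeNames": {"#d": "date"},
        "ExpressionAttributeValues": {":d": {"S": date_key}},
        "ScanIndexForward": False,  # descending range_key order
        "Limit": 1,                 # stop after the first (newest) item
    }


def fetch_latest(client, table_name, date_key):
    """Run the query on a boto3-style client; return the item, or None if empty."""
    response = client.query(**latest_query_params(table_name, date_key))
    items = response.get("Items", [])
    return items[0] if items else None
```

With a real client this might be called as fetch_latest(boto3.client("dynamodb"), "sensor_42", "2012-10-09"). Note that right after midnight the current date's partition may still be empty, so a caller would probably want to fall back to the previous date when None is returned.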

answered Oct 02 '22 07:10

yadutaf

Tags:

nosql

amazon-dynamodb

azure-table-storage
