Object-Relational-Mappers have been created to help applications (which think in terms of objects) deal with stored data in a more application-friendly way like every other class/object. However, I have never seen a OKM (Object-Key/Value-Mapper) for NoSQL "Key/Value" storage systems. Which seems odd because the need should be far greater given the fact that more value-relations will have to be hard-coded into the app than a regular, single SQL table row object. <pre class="prettyprint"><code>four requests: user:id user:id:name user:id:email user:id:created vs one request: user = [id => ..., name => ..., email => ...] </code></pre> Plus you must keep track of "lists" (post has_many comments) since you don't have has_many through tables or foreign keys. <pre class="prettyprint"><code>INSERT INTO user_groups (user_id, group_id) VALUES (23, 54) vs usergroups:user_id = {54,108,32,..} groupsuser:group_id = {23,12,645,..} </code></pre> And there are lots more examples of the added logic that an application would need to replicate some basic features that normal relational databases use. All of these reasons make the idea of a OKM sound like a shoe-in. Are there any? Are there any reasons there are not any?

One of the design goals of SQL is that any data can be stored/queried in any relational database - There are some differences between platforms, but in general the correct way to handle a particular data structure is well known and easily automated but requiring fairly verbose code. That is not the case with NoSQL - generally you will be directly storing the data as used in your application rather than trying to map it to a relational structure, and without joins or other object/relational differences the mapping code is trivial. Beyond generating the boilerplate data access code, one of the main purposes of an ORM is abstraction of differences between platforms. In my experience the ability to switch platforms has always been purely theoretical, and this lowest common denominator approach simply won't work for NoSQL as the platform is usually chosen specifically for capabilities not present on other platforms. Your example is only for the most trivial key value store - depending on your platform you most likely have some useful additional commands, so your first example could be MGET user:id:name user:id:email ... (multiget - get any number of keys in a single call) GET user:id:* (key wildcards) HGETALL user:id (redis hash - gets all subkeys of user) You might also have your user object stored in a serialized form - unlike in a relational database this will not break all your queries. Working with lists isn't great if your platform doesn't have support built in - native list/set support is one of the reasons I like to use redis - but aside from potentially needing locks it's no worse than getting the list out of sql. It's also worth noting that you may not need all the relationships you would define in sql - for example if you have a group containing a million users, the ability to get a list of all users in a group is completely useless, so you would never create the groupsuser list at all and rather than a seperate usergroups list have user:id:groups as a multivalue property. If you just need to check for membership you could set up keys as usergroups:userid:groupid and get constant time lookup. I find it helps to think in terms of indexes rather than relationships - when setting up your data access code decide which fields will need to be queried and adding appropriate index records when those fields are written.

Are there ORM (OKM) for key-value stores?

Tags:

database

orm

nosql

Object-Relational-Mappers have been created to help applications (which think in terms of objects) deal with stored data in a more application-friendly way like every other class/object.

However, I have never seen a OKM (Object-Key/Value-Mapper) for NoSQL "Key/Value" storage systems. Which seems odd because the need should be far greater given the fact that more value-relations will have to be hard-coded into the app than a regular, single SQL table row object.

four requests:
user:id
user:id:name
user:id:email
user:id:created

vs one request:
user = [id => ..., name => ..., email => ...]

Plus you must keep track of "lists" (post has_many comments) since you don't have has_many through tables or foreign keys.

INSERT INTO user_groups (user_id, group_id) VALUES (23, 54)

vs

usergroups:user_id = {54,108,32,..}
groupsuser:group_id = {23,12,645,..}

And there are lots more examples of the added logic that an application would need to replicate some basic features that normal relational databases use. All of these reasons make the idea of a OKM sound like a shoe-in.

Are there any? Are there any reasons there are not any?

586

asked Nov 04 '10 16:11

Xeoncross

2 Answers

Ruby's DataMapper project is an ORM and will happily talk to a key-value store through the use of an adapter.

Redis and MongoDB have adapters that already exist. CouchDB has an adapter — it's not maintained, but at one point it worked pretty well. I don't think anyone's done anything with Cassandra yet, but there's no reason it couldn't be done. The Dubious framework for Google App Engine takes a very similar approach to Data Mapper to make the Data Store available to applications.

So it's very possible to do ORM with key-value stores. The ORM just really needs to avoid the assumption that SQL is its primary vocabulary.

144

answered Nov 15 '22 10:11

Bob Aman

One of the design goals of SQL is that any data can be stored/queried in any relational database - There are some differences between platforms, but in general the correct way to handle a particular data structure is well known and easily automated but requiring fairly verbose code. That is not the case with NoSQL - generally you will be directly storing the data as used in your application rather than trying to map it to a relational structure, and without joins or other object/relational differences the mapping code is trivial.

Beyond generating the boilerplate data access code, one of the main purposes of an ORM is abstraction of differences between platforms. In my experience the ability to switch platforms has always been purely theoretical, and this lowest common denominator approach simply won't work for NoSQL as the platform is usually chosen specifically for capabilities not present on other platforms. Your example is only for the most trivial key value store - depending on your platform you most likely have some useful additional commands, so your first example could be

MGET user:id:name user:id:email ... (multiget - get any number of keys in a single call)

GET user:id:* (key wildcards)

HGETALL user:id (redis hash - gets all subkeys of user)

You might also have your user object stored in a serialized form - unlike in a relational database this will not break all your queries.

Working with lists isn't great if your platform doesn't have support built in - native list/set support is one of the reasons I like to use redis - but aside from potentially needing locks it's no worse than getting the list out of sql.

It's also worth noting that you may not need all the relationships you would define in sql - for example if you have a group containing a million users, the ability to get a list of all users in a group is completely useless, so you would never create the groupsuser list at all and rather than a seperate usergroups list have user:id:groups as a multivalue property. If you just need to check for membership you could set up keys as usergroups:userid:groupid and get constant time lookup.

I find it helps to think in terms of indexes rather than relationships - when setting up your data access code decide which fields will need to be queried and adding appropriate index records when those fields are written.

answered Nov 15 '22 12:11

Tom Clarkson

Related questions
                            
                                Ansible mysql grant
                            
                                Is adding and dropping indexes everyday on huge tables a good practice?
                            
                                Retrieve data in chunks in postgresql
                            
                                what is difference between atomicity and isolation in ACID properties?
                            
                                Slow running MySql restore - 10 times slower than backup speed and still going
                            
                                MySQL select from INT column
                            
                                How to fix "java.sql.SQLSyntaxErrorException: Unknown column 'product0_.return_policy' in 'field list' " exception?
                            
                                Can I download firebase database json
                            
                                Best Practices for uploading files to database
                            
                                Stategies for coping with schema evolution? [closed]
                            
                                Include a database file with Android application via Eclipse
                            
                                How to model this[Networks, details in post] in database for efficiency and ease of use?
                            
                                Are there any existing solutions for creating a generic DNA sequence database with a website front end?
                            
                                How to restore MySQL database from WAMP?
                            
                                Checking if INSERT INTO statement was successful
                            
                                Using ruby with databases, but without rails
                            
                                SQL Server Performance with Key/Pair Table vs XML Field and XPath
                            
                                How should I build a simple database package for my python application?
                            
                                Struggling with a MySQL database of phone numbers
                            
                                Difference between Blob and a file

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With