I understand that document-oriented NoSQL DBs are "extensions" of the KV model in that they allow you to query more than just a single lookup key. But once something is a "document", I feel like it already has a relational model baked into it: <pre class="prettyprint"><code>"myJson": { "fizz": 4, "buzz": "true", "widget" : { ...etc. } } </code></pre> To me, I don't see the difference between this JSON, and a <code>json_objects</code> table with a <code>fizz</code> and <code>buzz</code> field, and a foreign key relationship to a second <code>widgets</code> table. And "columnar" DB's like Cassandra just sound like straight-up relational/table DBs. So I ask: what is so different about document- and column-oriented DBs, and so distinguishing (from RDBMSes) about them? What problems are they best suited to solve that render them superior to relational DBs under certain circumstances? Thanks in advance!

Firstly I'd like to say that you are very correct in saying that NoSql is different from Relational Databases and so its hard to make a comparison. With that being said there are many big distinctions between the two that can be compared. Scaling Although you can shard a MySql database there are issues with sharding and enforcing ACID properties when a RDMS is on multiple machines will be very challenging, NoSql solutions like Cassandra are famous for their ability to grow without problems with some cases managing 400 nodes in a cluster without a problem. Not only is it easy to grow a Cassandra database, but performance does not take a hit. Schema(less) model. NoSQL database systems are developed to manage large volumes of data that don't follow a fixed schema. This means that for example you wish to add a new column to an existing column family in Cassandra you don't need to go back and amend the column family so no need for this: <pre class="prettyprint"><code>ALTER TABLE table_name ALTER COLUMN column_name datatype; </code></pre> We can instead just add new columns as we go, and might end up with the following 'table': <pre class="prettyprint"><code> key | follower1 | follower2 | follower2 -------------+------------+-------------+----------- lyubent | joeb | chuckn | gordonf chuckn | joeb | gordonf gordonf | chuckn joeb | chuckn | lyubent | joeb </code></pre> This allows data models to be flexible and easily extended but in doing so data becomes less structured. Speed NoSql databases are optimized for high write speeds while the RDBMs' aim for high read speeds. But even with that in mind NoSql solutions still tend to outperform RDBMs systems when it comes to reads. This is because the NoSql databases don't implement many of the functions that slow down read/write/update operations in the Relational Model like for example ACID properties and transactions. When should it be used? <ul> <li>Your application/website will need to grow rapidly but you want to start off small.</li> <li>You're more concerned with writing data than reading it back. (Lots of tweets are posted but not all of them are read)</li> <li>Availability of your system is more important that data being 100% updated. (So if you are a bank, you don't want NoSql but if you are a website that needs 100% uptime it could be a good choice)</li> <li>If the data being written needs to succeed 100% of the time, but eventual consistency isn't a problem.</li> </ul> Just for a visual illustration, this helped me out a lot in understanding where the different sql solutions fit into the database world and how each fits a purpose. <img src="https://i.stack.imgur.com/rOeRQ.png" alt="Database Triad - Availability, Consistency and Partition Tolerance">

Relational vs Columnar and Document Databases - aren't they one in the same?

Tags:

mongodb

nosql

cassandra

column-oriented

document-oriented-db

I understand that document-oriented NoSQL DBs are "extensions" of the KV model in that they allow you to query more than just a single lookup key. But once something is a "document", I feel like it already has a relational model baked into it:

"myJson": {
    "fizz": 4,
    "buzz": "true",
    "widget" : {
        ...etc.
    }
}

To me, I don't see the difference between this JSON, and a json_objects table with a fizz and buzz field, and a foreign key relationship to a second widgets table.

And "columnar" DB's like Cassandra just sound like straight-up relational/table DBs.

So I ask: what is so different about document- and column-oriented DBs, and so distinguishing (from RDBMSes) about them? What problems are they best suited to solve that render them superior to relational DBs under certain circumstances? Thanks in advance!

876

asked Mar 08 '13 21:03

IAmYourFaja

2 Answers

Firstly I'd like to say that you are very correct in saying that NoSql is different from Relational Databases and so its hard to make a comparison. With that being said there are many big distinctions between the two that can be compared.

Scaling
Although you can shard a MySql database there are issues with sharding and enforcing ACID properties when a RDMS is on multiple machines will be very challenging, NoSql solutions like Cassandra are famous for their ability to grow without problems with some cases managing 400 nodes in a cluster without a problem. Not only is it easy to grow a Cassandra database, but performance does not take a hit.

Schema(less) model.
NoSQL database systems are developed to manage large volumes of data that don't follow a fixed schema. This means that for example you wish to add a new column to an existing column family in Cassandra you don't need to go back and amend the column family so no need for this:

ALTER TABLE table_name ALTER COLUMN column_name datatype;

We can instead just add new columns as we go, and might end up with the following 'table':

 key         | follower1  | follower2   | follower2          
-------------+------------+-------------+-----------
 lyubent     | joeb       | chuckn      | gordonf     
 chuckn      | joeb       | gordonf                   
 gordonf     | chuckn                                 
 joeb        | chuckn     | lyubent     | joeb

This allows data models to be flexible and easily extended but in doing so data becomes less structured.

Speed
NoSql databases are optimized for high write speeds while the RDBMs' aim for high read speeds. But even with that in mind NoSql solutions still tend to outperform RDBMs systems when it comes to reads. This is because the NoSql databases don't implement many of the functions that slow down read/write/update operations in the Relational Model like for example ACID properties and transactions.

When should it be used?

Your application/website will need to grow rapidly but you want to start off small.
You're more concerned with writing data than reading it back. (Lots of tweets are posted but not all of them are read)
Availability of your system is more important that data being 100% updated. (So if you are a bank, you don't want NoSql but if you are a website that needs 100% uptime it could be a good choice)
If the data being written needs to succeed 100% of the time, but eventual consistency isn't a problem.

Just for a visual illustration, this helped me out a lot in understanding where the different sql solutions fit into the database world and how each fits a purpose.

Database Triad - Availability, Consistency and Partition Tolerance

156

answered Nov 03 '22 19:11

Lyuben Todorov

In no schema db you don't have fixed columns and types.

For example product 'Jeans' can have attributes 'price', 'length' and 'model' (M/W) but for product book you have attributes 'price', 'authors' and 'title'. For mobile phones you will have 'screen type', 'operating system' etc.

It is very difficult to model that in RDBMS because you are not flexible and user cannot insert arbitrary attributes so it is easier to use a document database which are optimized for this kind of data so that you can easily search and filter by value on arbitrary attributes (eg. all products with length>30 and model=w).

answered Nov 03 '22 17:11

user1944408

Related questions
                            
                                Index mongoDB with ElasticSearch
                            
                                Insert DBObject into MongoDB using Spring Data
                            
                                Converting DBObject to Java Object while retrieve values from MongoDB
                            
                                Node mongoose find query in loop not working
                            
                                Why is the following Spring Boot + HATEOAS with mongodb not working (MarshalException)?
                            
                                Autowire MongoRepository in Spring MVC
                            
                                mongoose schema set max length for a String [duplicate]
                            
                                Insert JSON into an existing MongoDB collection
                            
                                Check object existence in mongo using gopkg.in/mgo.v2
                            
                                Get index of given element in array field in MongoDB
                            
                                MongoDB jsonSchema validation additionalProperties
                            
                                MongoDB Error: Cannot use retryable writes with limit=0
                            
                                How do I connect to the Azure CosmosDB Emulator for MongoDB?
                            
                                How do I define a different primary key other than _id in Mongoose?
                            
                                MongoDB SpiderMonkey doesn't understand UTF-8
                            
                                Export one object with mongoexport, how to specify _id?
                            
                                Why do my MongooseJS ObjectIds fail the equality test?
                            
                                Finding the next document in MongoDb
                            
                                What can be done with Mongo Aggregation / Performance of Mongo Aggregation
                            
                                Auto increment document number in Mongo / Mongoose

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With