Why do we need Apache Kafka with NoSQL databases?

Tags:

mongodb

nosql

apache-kafka

Apache Kafka is a real-time messaging service. It stores streams of data safely in a distributed, fault-tolerant way. We can filter the streaming data as it comes from the producer. I don't understand why we need a NoSQL database such as MongoDB to store the same data that is already in Apache Kafka. The real question is: why do we store the same data in both a NoSQL database and Apache Kafka?

I think that if we need a NoSQL database, we can collect the streams of data from clients directly in MongoDB, without using Apache Kafka. But most big data architectures prefer to put Apache Kafka between the data source and the NoSQL database (see https://i.stack.imgur.com/bHYc3.jpg).

What are the advantages of that for real systems?

asked Feb 12 '18 09:02

tolgabuyuktanir

1 Answer

This architecture has several advantages:

Kafka as a Data Integration Bus

It makes it easy to distribute data between several producers and many consumers. Here Apache Kafka serves as a data integration message bus.
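To make the "bus" idea concrete, here is a minimal in-memory sketch. It is not Kafka's client API: `ToyTopic` and the group names are made up for illustration, and the only thing it models is that each consumer group tracks its own read position over one shared log.

```python
from collections import defaultdict


class ToyTopic:
    """Hypothetical stand-in for a topic: an append-only log plus one offset per consumer group."""

    def __init__(self):
        self.log = []                    # records in arrival order
        self.offsets = defaultdict(int)  # group name -> next index to read

    def produce(self, record):
        self.log.append(record)

    def poll(self, group):
        """Return every record this group has not read yet and advance its offset."""
        start = self.offsets[group]
        self.offsets[group] = len(self.log)
        return self.log[start:]


topic = ToyTopic()

# Two independent producers write to the same topic.
topic.produce({"source": "web", "event": "click"})
topic.produce({"source": "mobile", "event": "login"})

# Two independent consumer groups each receive the full stream.
mongo_batch = topic.poll("mongo-writer")
analytics_batch = topic.poll("analytics")
```

Both groups get both records, and neither has to know that the other exists. Writing straight to MongoDB would give you only the first of these two consumers.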

Kafka as a Data Buffer

Putting Kafka in front of your "end" data stores, such as MongoDB or MySQL, gives you a natural data buffer. You can therefore deploy, maintain, and redeploy your consumer services independently. While your service is down for maintenance, Kafka keeps storing all incoming data, which is quite useful.
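A rough sketch of the buffering behaviour, again as a toy model rather than real Kafka code. `ToyPipeline`, its fields, and the record names are assumptions made for this example; the committed offset is what lets the consumer pick up where it stopped.

```python
from dataclasses import dataclass, field


@dataclass
class ToyPipeline:
    log: list = field(default_factory=list)       # stand-in for a topic partition
    committed: int = 0                            # consumer's committed offset
    database: list = field(default_factory=list)  # stand-in for MongoDB or MySQL

    def produce(self, record):
        self.log.append(record)

    def run_consumer(self):
        """Copy everything after the committed offset into the database, then commit."""
        self.database.extend(self.log[self.committed:])
        self.committed = len(self.log)


pipeline = ToyPipeline()

pipeline.produce("order-1")
pipeline.run_consumer()          # consumer is up: order-1 reaches the database

# The consumer is stopped for maintenance; producers keep writing.
pipeline.produce("order-2")
pipeline.produce("order-3")
during_outage = list(pipeline.database)

pipeline.run_consumer()          # the consumer restarts and catches up
```

During the outage the database only holds `order-1`, but nothing is lost: the two later records wait in the log and are written once the consumer is back.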

Kafka as Short-Term Data Storage

You don't have to store everything in Kafka: very often you use Kafka topics with retention, which means that all data older than a configured age is deleted by Kafka automatically. For example, you may have a Kafka topic with 1 week retention (so it holds only 1 week of data) while the same data lives in long-term storage services such as classic SQL databases or Cassandra.
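Time-based retention is controlled by the topic setting `retention.ms`, so one week is 604800000 ms. The snippet below only illustrates the effect: `apply_retention` is a hypothetical helper, and it filters record by record, whereas a real broker deletes whole log segments, so treat it as a simplification.

```python
ONE_WEEK_MS = 7 * 24 * 60 * 60 * 1000  # the value you would set for retention.ms
DAY_MS = 24 * 60 * 60 * 1000


def apply_retention(records, now_ms, retention_ms=ONE_WEEK_MS):
    """Keep only (timestamp_ms, value) records that are at most retention_ms old."""
    return [(ts, value) for ts, value in records if now_ms - ts <= retention_ms]


now = 30 * DAY_MS  # arbitrary "current time" for the example
records = [
    (now - 10 * DAY_MS, "ten days old"),
    (now - 3 * DAY_MS, "three days old"),
    (now, "just produced"),
]
kept = apply_retention(records, now)
```

Only the two records from the last week survive. The ten-day-old record is gone from the topic, which is fine as long as a consumer already copied it into the long-term store.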

Kafka as Long-Term Data Storage

On the other hand, you can use Apache Kafka as a long-term storage system. Compacted topics let you keep only the last value for each key, so your topic becomes a store of the latest state of your app.
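Compaction is enabled per topic with `cleanup.policy=compact`. What it leaves behind can be sketched in a few lines of plain Python. The `compact` function and the sample keys are made up for this example, and details such as delete markers (tombstones) and the uncompacted tail of the log are not modelled.

```python
def compact(log):
    """Return the log with only the latest (key, value) record per key, in offset order."""
    last_offset = {key: offset for offset, (key, _) in enumerate(log)}
    return [record for offset, record in enumerate(log) if last_offset[record[0]] == offset]


log = [
    ("user-1", "pending"),
    ("user-2", "pending"),
    ("user-1", "active"),
    ("user-2", "suspended"),
    ("user-1", "closed"),
]
compacted = compact(log)
state = dict(compacted)  # the "last state" view of the topic
```

Five records shrink to two, one per key, and replaying the compacted topic from the start is enough to rebuild the current state of every key.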

answered Oct 19 '22 23:10

codejitsu
