Can druid replace hadoop?

Tags:

Druid is used for both real time and batch processing. But can it totally replace hadoop? If not why? As in what is the advantage of hadoop over druid? I have read that druid is used along with hadoop. So can the use of Hadoop be avoided?

937

asked Jun 09 '14 14:06

Amit Sharma

1 Answers

Can you avoid using Hadoop with Druid? Yes, you can stream data in real-time into a Druid cluster rather than batch-loading it with Hadoop. One way to do this is to stream data into Kafka, which will handle incoming events and pass them into Storm, which can then process and load them into Druid Realtime nodes.

Typically this setup is used with Hadoop in parallel, because streamed real-time data comes with its own baggage and often needs to be fixed up and backfilled. That whole architecture has been dubbed "Lambda" by some.

169

answered Sep 29 '22 20:09

user766353

Related questions
                            
                                Spark Job running on Yarn Cluster java.io.FileNotFoundException: File does not exits , eventhough the file exits on the master node
                            
                                Working with input splits(HADOOP)
                            
                                What is the meaning of EOF exceptions in hadoop namenode connections from hbase/filesystem?
                            
                                Hadoop HDFS - Cannot connect to port on master
                            
                                problems running simple map-reduce hadoop examples in cygwin
                            
                                Join of two datasets in Mapreduce/Hadoop
                            
                                Differences between hadoop jar and yarn -jar
                            
                                Send KafkaProducer from local machine to hortonworks sandbox on virtualbox
                            
                                Setting spark classpaths on EC2: spark.driver.extraClassPath and spark.executor.extraClassPath
                            
                                winutils.exe chmod command doesn't set permission
                            
                                sparkSession/sparkContext can not get hadoop configuration
                            
                                Are there any distributed machine learning libraries for using Python with Hadoop? [closed]
                            
                                (HBase) Error: JAVA_HOME is not set and Java could not be found
                            
                                Incorrect memory allocation for Yarn/Spark after automatic setup of Dataproc Cluster
                            
                                Using S3 (Frankfurt) with Spark
                            
                                Python Spark / Yarn memory usage
                            
                                httpfs error Operation category READ is not supported in state standby
                            
                                MapReduceBase and Mapper deprecated
                            
                                GUI for using Hadoop [closed]
                            
                                how to deploy and run oozie job?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Can druid replace hadoop?

Tags:

hadoop

druid

Amit Sharma

People also ask

1 Answers

user766353

Recent Activity

Donate For Us