Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How Blockchain is different from HDFS and how bitcoin mining is different from Map reduce or spark?

i am going through with blockchain technology and what i found is that it saves files on multiple different server.Now as per Hadoop documentation it is also stores files on multiple servers and keep 1 master copy on 1 master node. It also has zoo keeper which maintains the data between master node and all the secondry nodes.

Now my question is there any zookeeper for blockchain as well which is maintaing master copy or i am on the wrong way of understanding?

like image 275
Atul Agrawal Avatar asked Apr 28 '17 05:04

Atul Agrawal


People also ask

What is the main difference between Bitcoin and Blockchain?

Bitcoin is a cryptocurrency, while blockchain is a distributed database. Bitcoin is powered by blockchain technology, but blockchain has found many uses beyond Bitcoin. Bitcoin promotes anonymity, while blockchain is about transparency.

Is Hadoop a Blockchain?

1 Answer. The basic difference between Blockchain and Hadoop can be implied from the file distribution systems that they pertain to. While Blockchain networks are based on IPFS (InterPlanetary File System), Hadoop is based on HDFS (Hadoop Distributed File System).

What is HDFS in Spark?

HDFS is a distributed file system designed to store large files spread across multiple physical machines and hard drives. Spark is a tool for running distributed computations over large datasets. Spark is a successor to the popular Hadoop MapReduce computation framework.

What are the differences between Blockchains?

Public blockchains allow anyone to access them; private blockchains are closed to only selected users; permissioned blockchains are a hybrid of public and private blockchains where anyone can access them as long as they have permission from the administrators to do so.


1 Answers

These are fairly different animals. Most blockchain full nodes operate completely independently and spend their time validating transactions. The fact is that every full node in Bitcoin protocol or Ethereum has a full copy of the the entire chain--that is, every transaction that has ever occurred since block zero. They operate and validate on a consensus basis, so it is truly headless, not a distributed cluster.

HDFS is organized on a node (usually) as "just a bunch of disks" (JBOD), meaning they don't even usually bother with RAID. Replication is accomplished across nodes, but it's not full replication, meaning that there may be 5 copies of a file in a medium or large cluster and there is potential (which increases with the size of the cluster) for disk failures to cause actual data loss.

HDFS scales massively, but is inherently risky and requires a backup strategy. With blockchain, on the other hand, is much more compact, but since there are thousands or millions of complete copies of the ledger, each of which is improbably difficult to modify after the fact, it is virtually impossible to lose data.

like image 199
Jim O'Donnell Avatar answered Sep 22 '22 07:09

Jim O'Donnell