Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Can sqoop run without hadoop?

Tags:

hadoop

sqoop

Just wondering can sqoop run without a hadoop cluster? sort of in a standalone mode? Has anyone tried to run sqoop on spark, please share some experiences on it.

like image 634
Shengjie Avatar asked Apr 24 '17 14:04

Shengjie


People also ask

Can we use Sqoop without Hadoop?

You can't use it directly and will need to first compile the sources into binary executables. For your convenience, the Sqoop community provides a binary tarball for each major supported version of Hadoop along with the source tarball.

How does Sqoop work with Hadoop?

Sqoop distributes the input data among the mappers equally to get high performance. Then each mapper creates connection with the database using JDBC and fetches the part of data assigned by Sqoop and writes it into HDFS or Hive or HBase based on the option provided in the command line.

What are the 2 main functions of Sqoop?

Sqoop has two main functions: importing and exporting. Importing transfers structured data into HDFS; exporting moves this data from Hadoop to external databases in the cloud or on-premises. Importing involves Sqoop assessing the external database's metadata before mapping it to Hadoop.


1 Answers

To run Sqoop commands (both sqoop1 and sqoop2), Hadoop is a mandatory prerequisite. You cannot run sqoop commands without the Hadoop libraries.

Sqoop works in local mode too, so it is not a requirement that the Hadoop daemons must be running. To run sqoop in local mode,

sqoop [tool-name] -fs local -jt local [tool-arguments]

Sqoop on Spark is still In-Progress. See SQOOP-1532

like image 182
franklinsijo Avatar answered Sep 23 '22 05:09

franklinsijo