Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Deploying Mahout on hadoop cluster

I want to run Mahout's K-Means example in a hadoop cluster of 5 machines. Which Mahout jar files should I need to keep in all the nodes, in order for the K-Means to be executed in a distributed manner. Thanks.

-Venkiram

like image 347
Venkiram Avatar asked Apr 15 '11 11:04

Venkiram


1 Answers

If you really just want to run the built-in K-Means, or other jobs with static drivers, the answer is 'none'. The mahout 'job' jars are self-contained hadoop job jars. If you submit a job to the cluster with 'hadoop job' it will work without any other jars.

like image 173
bmargulies Avatar answered Sep 18 '22 18:09

bmargulies