I want to run Mahout's K-Means example in a hadoop cluster of 5 machines. Which Mahout jar files should I need to keep in all the nodes, in order for the K-Means to be executed in a distributed manner. Thanks.
-Venkiram
If you really just want to run the built-in K-Means, or other jobs with static drivers, the answer is 'none'. The mahout 'job' jars are self-contained hadoop job jars. If you submit a job to the cluster with 'hadoop job' it will work without any other jars.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With