Deploying Mahout on hadoop cluster
Asked Answered
P

1

7

I want to run Mahout's K-Means example in a hadoop cluster of 5 machines. Which Mahout jar files should I need to keep in all the nodes, in order for the K-Means to be executed in a distributed manner. Thanks.

-Venkiram

Primus answered 15/4, 2011 at 11:59 Comment(0)
O
2

If you really just want to run the built-in K-Means, or other jobs with static drivers, the answer is 'none'. The mahout 'job' jars are self-contained hadoop job jars. If you submit a job to the cluster with 'hadoop job' it will work without any other jars.

Optimum answered 15/4, 2011 at 12:7 Comment(3)
Thanks. I have installed Mahout jars in master node, should I replicate it in all slave nodes also?Primus
Now you are beyond what I know about hadoop.Optimum
Isn't the job driver supposed to take care of this by localizing the .jar on all the nodes it uses?Breckenridge

© 2022 - 2024 — McMap. All rights reserved.