 

Ubuntu cluster management

I am trying to figure out a solution for managing a set of Linux machines (OS: Ubuntu, ~40 nodes, same hardware). These machines are supposed to be images of each other: software installed on one needs to be installed on all the others. My software requirements are Hadoop, R and ServiceMix. R packages also need to be synchronized across machines (a package installed on one needs to be available on all the others).

The solution I am using right now relies on NFS and pssh. I am hoping there is a better/easier solution out there that would make my life a bit easier. Any suggestion is appreciated.
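For reference, the current pssh-based workflow can be sketched roughly like this. The hosts file, login user, and package names below are illustrative placeholders, not taken from the question:

```shell
# hosts.txt lists one node per line, e.g. node01 .. node40 (hypothetical names)

# Install an Ubuntu package on every node in parallel
# (Ubuntu's pssh package installs the binary as parallel-ssh):
parallel-ssh -h hosts.txt -l admin -i "sudo apt-get -y install openjdk-8-jdk"

# Install an R package everywhere; each node writes into its own library,
# or into the shared NFS library if R_LIBS_SITE points there:
parallel-ssh -h hosts.txt -l admin -i \
  "Rscript -e 'install.packages(\"data.table\", repos=\"https://cloud.r-project.org\")'"
```

This keeps the nodes in sync only as long as every change goes through the same script, which is exactly the maintenance burden the question is trying to escape.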

asked Apr 05 '11 by smschauhan


People also ask

Does Linux support clustering?

Some Linux operating system vendors offer clustering software, such as SUSE Linux HAE, Red Hat Pacemaker, and Oracle Real Application Clusters (RAC). While they allow you to create a failover cluster, they present a variety of challenges.

What are cluster services in Linux?

A cluster is a group of computers (nodes) which work together to provide a shared solution. At a high level, a cluster can be viewed as having three parts (often defined as the cluster stack).

What is an advantage of using a Linux cluster?

Cluster computing provides a number of benefits: high availability through fault tolerance and resilience, load balancing and scaling capabilities, and performance improvements.


2 Answers

Two popular choices are Puppet from Puppet Labs and Chef from OpsCode.

Another potential mechanism is creating a metapackage that Depends: on the packages you want installed on all machines. When you modify your metapackage, an apt-get update && apt-get -u dist-upgrade would install the new packages on all your systems.
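As a sketch, the metapackage route can be done with Debian's equivs tool; the package name and dependency list below are illustrative, not from the answer:

```shell
# equivs builds trivial .deb packages from a plain control file
sudo apt-get install equivs

# Minimal control file for a hypothetical "cluster-base" metapackage
cat > cluster-base.ctl <<'EOF'
Section: metapackages
Priority: optional
Package: cluster-base
Version: 1.0
Depends: openjdk-8-jdk, r-base, ssh
Description: packages required on every cluster node
 Bump Version and rebuild whenever the node package set changes.
EOF

# Build the .deb and install it; serving it from a local apt
# repository lets every node pick up changes via dist-upgrade:
equivs-build cluster-base.ctl
sudo dpkg -i cluster-base_1.0_all.deb
```

Pushing the built package into a local apt repository is what makes the "upgrade everywhere at once" step work; installing the .deb by hand on each node would just recreate the pssh problem.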

The metapackage approach might be less work to configure and use initially, but Puppet or Chef might provide better returns on investment in the long run, as they can manage far more than just package installs.

answered Oct 09 '22 by sarnold


I have used a low-tech approach for this in the past: simply sharing (at least parts of) /usr/local/ to keep a common R library in /usr/local/lib/R/site-library/. I guess that could work for your Hadoop installation too.

I tried to keep the rest in Debian / Ubuntu packages and kept all nodes current. Local R and Ubuntu package repositories (for locally created packages) can also help, but are a bit more work.
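Assuming one node acts as the NFS server, the shared site-library setup might look like this; the hostname, subnet, and export options are assumptions for illustration:

```shell
# On the NFS server (say, node01), export the shared R library:
echo "/usr/local/lib/R/site-library  10.0.0.0/24(rw,sync,no_subtree_check)" \
  | sudo tee -a /etc/exports
sudo exportfs -ra

# On every other node, mount the share and point R at it;
# /etc/R/Renviron.site is read by R on Debian/Ubuntu systems:
sudo mount -t nfs node01:/usr/local/lib/R/site-library /usr/local/lib/R/site-library
echo 'R_LIBS_SITE=/usr/local/lib/R/site-library' | sudo tee -a /etc/R/Renviron.site
```

With this in place, install.packages() run on any node (with write access to the share) makes the package visible to all of them, which is the effect the question asks for.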

answered Oct 09 '22 by Dirk Eddelbuettel