I need to run thousands* of models on 15 machines (each of 4 cores), all Windows. I started to learn parallel
, snow
and snowfall
packages and read a bunch of intro's, but they mainly focus on the setup of the master. There is only a little information on how to set up the worker (slave) nodes on Windows. The information is often contradictory: some say that SOCK cluster is practically the easiest way to go, others claim that SOCK cluster setup is complicated on Windows (sshd setup) and the best way to go is MPI.
So, what is an easiest way to install slave nodes on Windows? MPI, PVM, SOCK or NWS? My, possibly naive ideas were (listed by priority):
- To use all 4 cores on the slave nodes (required).
- Ideally, I need only R with some packages and a slave R script or R function that would listen on some port and wait for tasks from master.
- Ideally, nodes can be added/removed dynamically from the cluster.
- Ideally, the slaves would connect to the master - so I wouldn't have to list all the slaves IP's in configuration of the master.
Only 1 is 100% required, 2-4 are "would be good". Is it too naive to request?
I am sorry but I have not been able to figure this out from the available docs and tutorials. I would be grateful if you point me out to the right source.
* Note that each of those thousands of models will take at least 7 minutes, so there won't be a big communication overhead.
sfExport
andsfLibrary
). – BonannoSOCK
. – Bonanno