I asked a related question here and the response worked well: using parallel's parLapply: unable to access variables within parallel code
The problem is when I try to use the answer inside of the function it won't work as I think it has to the default environment of clusterExport
. I've read the vignette and looked at the help file but am approaching this with a very limited knowledge base. The way I used parLapply
I expected it to behave similar to lapply
but it doesn't appear to.
Here is my attempt:
par.test <- function(text.var, gc.rate=10){
ntv <- length(text.var)
require(parallel)
pos <- function(i) {
paste(sapply(strsplit(tolower(i), " "), nchar), collapse=" | ")
}
cl <- makeCluster(mc <- getOption("cl.cores", 4))
clusterExport(cl=cl, varlist=c("text.var", "ntv", "gc.rate", "pos"))
parLapply(cl, seq_len(ntv), function(i) {
x <- pos(text.var[i])
if (i%%gc.rate==0) gc()
return(x)
}
)
}
par.test(rep("I like cake and ice cream so much!", 20))
#gives this error message
> par.test(rep("I like cake and ice cream so much!", 20))
Error in get(name, envir = envir) : object 'text.var' not found
envir
argument toclusterExport
asvarlist
is exported from the.GlobalEnv
by default. Doesenvir=environment()
work? – Bayne