Using lapply to apply a function over list of data frames and saving output to files with different names
Asked Answered
B

3

42

I have a list of data frames and have given each element in the list (e.g. each data frame) a name:

e.g.

df1 <- data.frame(x = c(1:5), y = c(11:15))  
df2 <- data.frame(x = c(1:5), y = c(11:15))  
mylist <- list(A = df1, B = df2)  

I have a function that I want to apply to each data frame; In this function, I want to include a line to write the results to file (eventually I want to do more complicated things like save plots of the correlation between two variables for each data frame but thought I'd start simple)

e.g.

NewVar <- function(mydata, whichVar, i) {  
mydata$newVar <- mydata[, whichVar] + 1  
write.csv(mydata, file = i)  
}

I want to use lapply() to apply this function to each data frame in my list

something like:

hh<-lapply(mylist, NewVar, whichVar = "y")

I can't figure out how to assign the "i" within the context of lapply so that i iterates over the names in the list of data frames, saving multiple files with different names (in this case, two files named A and B) that correspond with the modified data frames.

Buke answered 10/6, 2013 at 6:39 Comment(1)
I know you said you're trying to avoid using a for loop but I think it's right to use a for loop in this case. I don't like using lapply() if nothing is being returned. Something like l_ply() from the plyr package is probably more appropriate. Or just a for loop.Handlebar
P
16

It will work with the following lapply call:

lapply(names(mylist), function(x) NewVar(mylist[[x]], "y", x))
Perk answered 10/6, 2013 at 6:52 Comment(0)
G
14

There are many options. For example:

  lapply(names(mylist),
         function(x)write.csv(mylist[x],
                              file =paste0(x,'.csv')))

or using indexes :

 lapply(seq_along(mylist),
     function(i)write.csv(mylist[i],
                          file =paste0(names(mylist)[i],'.csv')))
Gosnell answered 10/6, 2013 at 6:49 Comment(1)
Thanks for the reply...However, I tried both of these options and they don't seem to work for a function that is defined outside of the lapply command. My real function needs to do a bunch of analyses and then save all the results so no option for defining it in one line.Buke
P
0

I had the same problem and I solved it using l_ply() function from plyr() package.

my_list <- list(df1,df2) my_function <- function(i) {...} library(plyr) l_ply(my_list, my_function)

Photina answered 11/5, 2023 at 8:4 Comment(0)

© 2022 - 2024 — McMap. All rights reserved.