I am trying to find the percentage of NAs in columns as well as inside the whole dataframe:
The first method which I have commented gives me zero and the second method which is not commented gives me a matrix. Not sure what I am missing. Any hint is truly appreciated!
cp.2006<-read.csv(file="cp2006.csv",head=TRUE)
#countNAs <- function(x) {
# sum(is.na(x))
#}
#total=0
#for (i in col(cp.2006)) {
# total=countNAs(i)+total
#}
#print(total)
count<-apply(cp.2006, 1, function(x) sum(is.na(x)))
dims<-dim(cp.2006)
num<-dims[1]*dims[2]
NApercentage<-(count/num) * 100
print(NApercentage)
> sum(is.na(cp.2006)) [1] 138
– Longwise