I have a huge dataset which is similar to the columns posted below
NameofEmployee <- c(x, y, z, a)
Region <- c("Pune", "Orissa", "Orisa", "Poone")
As you can see, in the Region
column, the region "Pune" is spelled in two different ways- i.e "Pune" and "Poona".
Similarly, "Orissa" is spelled as "Orissa" and "Orisa".
I have multiple regions which are actually the same but are spelled in different ways. This will cause problems when I analyze the data.
I want to automatically be able to obtain a list of these mismatched spellings with the help of R.
I would also like to replace the spellings with the correct spellings automatically.