I have data where the words as follows
location<- c("xyz, sss, New Zealand", "USA", "Pris,France")
id<- c(1,2,3)
df<-data.frame(location,id)
I would like to extract the country name from the data. The tricky part is if i extract just the last word then I will have only one record (France).
library(stringr)
df$country<- word(df$location,-1)
Any ideas on how to extract country data from this data?
id location country
1 xyz, sss, New Zealand New Zealand
2 USA USA
3 Pris,France France
explanation [sub]:
fromdf$location
, replace any character.
, occuring any number of times*
, upto a comma, followed by any number/type of whitespace\\s
with nothing''
explanation [str_extract]:
fromdf$location
, provide 1 or multiple+
whole words\\b
, not[ ]
in a string that ends in a comma^,
until the end of the string$
. (so basically, provide all whole words after comma) – Citrate