I am currently working on a project that should identify each part of an address. For example, from "str. Jack London 121, Corvallis, ARAD, ap. 1603, 973130" the output should look like this:
street name: Jack London;
no: 121;
city: Corvallis;
state: ARAD;
apartment: 1603;
zip code: 973130
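
To make the target concrete, here is a rough sketch of the result I would like to get for that example. The class name `ParsedAddress` and its field names are only my own illustrative choices, not something taken from an existing library; every field defaults to `None` because not every address contains every part.

```python
from dataclasses import dataclass, asdict
from typing import Optional


@dataclass
class ParsedAddress:
    # Hypothetical container for the parsed parts; all fields are optional
    # because any of them may be missing from a given input.
    street_name: Optional[str] = None
    number: Optional[str] = None
    city: Optional[str] = None
    state: Optional[str] = None
    apartment: Optional[str] = None
    zip_code: Optional[str] = None


raw = "str. Jack London 121, Corvallis, ARAD, ap. 1603, 973130"

# The output I expect a parser to produce for `raw` (written by hand here).
expected = ParsedAddress(
    street_name="Jack London",
    number="121",
    city="Corvallis",
    state="ARAD",
    apartment="1603",
    zip_code="973130",
)

print(asdict(expected))
```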
The problem is that not all of the input data are in the same format, so some elements may be missing or appear in a different order, but each input is guaranteed to be an address.
I checked some sources on the internet, but many of them are tailored to US addresses only, like the Google Places API, and I will be using this for another country.
Regex is not an option, since the addresses may vary too much.
I also thought about using NLP, specifically a Named Entity Recognition model, but I'm not sure that will work.
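
This is roughly how I imagine framing the NER idea: split the address into tokens and give each token a BIO label, so that a sequence-labeling model could be trained on such pairs. It is only a minimal sketch of the data preparation, not a working model. The `tokenize` and `bio_tags` helpers and the label names are my own assumptions for illustration.

```python
import re


def tokenize(address):
    # Words/numbers (with an optional trailing dot, e.g. "str.") and
    # punctuation marks become separate tokens.
    return re.findall(r"\w+\.?|[^\w\s]", address)


def bio_tags(tokens, fields):
    # Naive labeling: tag the first occurrence of each field value.
    # Assumes field values do not repeat or overlap within one address.
    tags = ["O"] * len(tokens)
    for label, value in fields.items():
        value_tokens = tokenize(value)
        n = len(value_tokens)
        for i in range(len(tokens) - n + 1):
            if tokens[i:i + n] == value_tokens:
                tags[i] = "B-" + label
                for j in range(i + 1, i + n):
                    tags[j] = "I-" + label
                break
    return tags


address = "str. Jack London 121, Corvallis, ARAD, ap. 1603, 973130"
# Hand-made annotation for this one example (hypothetical label names).
fields = {
    "STREET": "Jack London",
    "NUMBER": "121",
    "CITY": "Corvallis",
    "STATE": "ARAD",
    "APARTMENT": "1603",
    "ZIP": "973130",
}

tokens = tokenize(address)
tags = bio_tags(tokens, fields)
for token, tag in zip(tokens, tags):
    print(token, tag)
```

With a set of addresses annotated this way, the order of the parts should matter less than with regex, since the model would label each token from its context rather than from a fixed position.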
Do you know what could be a good way to start, and could you give me some tips?