From what sources does geolocation databases like http://www.maxmind.com/ get its data? As i have understood it the IP registry places like ARIN and RIPE just hold information about what company is assigned the IP range so it has to be from the ISP's right? If so, there has to be some way of accessing this.
I had the same question and found the following information.
As I use MaxMind information I wanted their validity their website states: "MaxMind tests the accuracy of the GeoIP2 and GeoIP Legacy Databases on a periodic basis. In our recent tests, the downloadable databases were 99.8% accurate on a country level, 90% accurate on a state level in the US, and 86% accurate for cities in the US within a 50 kilometer radius. For more details, see GeoIP2 City Coverage and Accuracy. MaxMind periodically tests the accuracy of the data used in GeoIP2 products and services. Accuracy is calculated by checking known web user IP address and location pairs against the data within MaxMind's GeoIP2 Precision Web service as well as the GeoIP2 City and GeoLite2 City database offerings."
For ISP information they state: "The ISP name is about 95% accurate in the US. Outside the US, accuracy ranges from 50% to 80%, depending on the country. The data is generally more accurate for countries with more Internet users."
source: https://www.maxmind.com/en/geoip2-isp-database
As to the process the following answer seemed informative:
IP geolocation databases are generally gathered based on the following:
IP spidering--traceroutes and other automated methods designed to map the routing infrastructure of the Internet. These techniques can be fairly complex and time consuming, given the task (4+billion IP addresses that constantly are allocated, deallocated, or moved). Plus, with IPv6, this becomes orders of magnitude more difficult.
Data supplied by users tied to IP addresses--some companies take anonymous user data (postal codes/city) tied to IP addresses and use that to help populate their databases. Obviously, this data needs to be carefully scrubbed to make sure it's reliable.
Sharing relationships with ISPs. Companies such as mine (Digital Element...http://www.digitalelement.com/) are often contacted by ISPs to make sure our data is accurate, because they don't want their users to be incorrectly targeted by services such as Hulu or ESPN and possibly blocked from content when they should otherwise be able to get it. This data is usually highly accurate, assuming it is kept up to date, because ISPs have perfect knowledge of the location of their own IP addresses.
Registry data--looking at ARIN, RIPE, etc. [Generally not that accurate.] \
There are a number of MAXMIND databases, some are free and some are commercially licensed. In part, the information in these databases comes from other public databases, such as RIPE and in part from MAXMIND's internal processing/matching.
You can write your own tools to parse RIPE database, extract all of the fields and build your own database if you want, or you can pay someone who has already done it.
If you are going to query their databases programatically, there are APIs on dev.maxmind.com for most languages.
Just a word of caution - if you use the MAXMIND GeoASN database to lookup ASN from IP, the AS name will not necessarily match the ISP name associated with the given IP. (e.g. AS2856 resolves to many different organisations, incl. BT and TalkTalk). Which is why the GeoISP database is keyed off IP address ranges, rather than ASNs.
-- ab1
© 2022 - 2024 — McMap. All rights reserved.