IP Geolocation accuracy
6/1/2023 Daily accuracy report - Kenya
Why and how do we do that?
It is difficult to overestimate the importance of a precise IP address geolocation. Often, it is the only entirely non-intrusive tool available to indicate where our online customers/visitors are coming from and learn how we can serve them better by catering to their geographical properties.
However, IP geolocation cannot be 100% accurate and has limitations. Some of which we've described in our How accurate can IP Geolocation get? blog post.
Moreover, estimating an overall IP Geolocation accuracy is a complex problem. We have to put it up to test against a massive range of IP addresses with diverse locations, sources, and providers to test the accuracy comprehensively. Ideally, we should try it against our potential customer base, where IP addresses are associated with real people, just like our visitors are. Typically, this ground-truth reference data is rare and must be harvested thoughtfully.
So, where are we getting our reference data from?
- First, we kindly ask our website visitors to share their location with us. You may have noticed it when you visited our What is my IP Address? page.
- Then we provide a range of iOS and Android apps, such as
These apps are the best in range and must-have tools for every network professional and enthusiast. We provide these apps for absolutely free and without ads. In exchange and only with the user's consent, we collect location information associated with the public IP address used.
- We also offer a FREE Client-side reverse geocoding API. It is an industry-first free reverse geocoding and IP Geolocation API without access rate or volume limitations. We only enforce a strict client-side use policy, ensuring that the requested location will likely reflect the caller's IP address and actual service location.
At all times, BigDataCloud refrains from gathering any data that could establish a connection between the user's IP address or location and their actual identity. We do not seek to obtain information about end-users or their activities, nor do we engage in surveillance or store personal data associated with IP addresses, even for internal purposes. It should be noted that we do not possess any such information, and we do not sell any personal data.
What do we do with the received data?
Whenever an IP address/location pair is received on any collection channels, we instantly carry out the IP Geolocation check against our latest dataset and store the accuracy results. We also perform the same check against the competitive datasets and automatically generate a comprehensive report presented below.
Noticeably, the data we collect is perfectly suitable to represent real-life internet users, website visitors, mobile app users and IoT devices around the world. However, it is less likely to include infrastructural segments such as network routers or public websites.
We also go to great lengths to make sure we exclude abusers. However, we allow VPN and other anonymisers, likely in the same way and proportion one would expect from the traffic hitting a random website or another public service.
We often get several location samples for the same IP address, for example, when the user is on the move. We take the best hit point accuracy record to report, so it is based on distinct IP addresses regardless of the number of occurrences.
We operate precisely the same algorithm and sample categorisation for all datasets, constantly comparing apples to apples.
The Scope
This report can be customised to represent IP addresses reported from a selected country or territory only.
IP Address Geolocation Service Providers
We are aiming to include as many IP Geolocation original data vendors as possible. We guarantee that we're making absolutely no use of another provider's data beyond the scope of this report.
If you are an IP Geolocation provider, please get in touch with us and let us access your data, API access is prefered, and we will happily include it in our automated report. We promise to handle your data with great respect and confidentiality expected.
Legend BDC | Product / website link | Blocks Count* 510,320 | Distinct lat/lng locations 2,337 | Dataset 5/31/2023 |
Legend IP2Location | Product / website link | Blocks Count* 19,734 | Distinct lat/lng locations 150 | Dataset 5/3/2023 |
Legend MaxMind GeoLite2 | Product / website link | Blocks Count* 2,409 | Distinct lat/lng locations 89 | Dataset 5/26/2023 |
Legend db-ip | Product / website link | Blocks Count* 47,473 | Distinct lat/lng locations 354 | Dataset 5/1/2023 |
Legend WhoisXmlApi | Product / website link | Blocks Count* 44,774 | Distinct lat/lng locations 213 | Dataset 5/29/2023 |
* The blocks count field represents the total number of unique records presented in the dataset. This value is generally applicable to flat data sources. However, the BigDataCloud database does not have such a metric available as we get down to a single IP address resolution. Therefore, we offer a total number of network segments detected by the likelihood of serving the same territory and purpose.
Confidence area accuracy
BigDataCloud's innovative patent-pending technology enables accurate estimation of the geographical area served by an IP address, rather than a specific device or location. This area is known as the confidence area, and it is represented by a polygon. Some other providers refer to it as an accuracy radius represented by a circle.
The chart provided below showcases the accuracy of this area. If the ground-truth location data is within the confidence area or the accuracy radius of the IP address, it is considered a hit. Therefore, the percentage of location data that hits within this area is known as the Hit Ratio.
Point accuracy
The point accuracy data represents the actual error in the straight distance detected between the actual location reported to the one estimated by an IP Geolocation.
Wired consumers
Wired consumers' chart data primarily represents fixed network installations such as wired WiFi, home and office consumers.
Fixed network detection is a complex issue therefore we must assume that some false-positive cases are included.
Cellular consumers
Cellular consumers classification represents the network IP addresses that were detected as servicing cellular networks.
Hosting and mixed network consumers
This category represents the IP addresses that were detected as servicing hosting networks, VPNs, proxies and mixed networks. The mixed networks are the network subnets that serve legit customers like cellular users and are also subleased to host some VPN or proxy services.
In addition, this category also usually includes residential proxies.