Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

How Accurate are WorldPop-Global Gridded Population Data at the Cell-Level?: A Simulation Analysis in Urban Namibia

Version 1 : Received: 22 February 2021 / Approved: 22 February 2021 / Online: 22 February 2021 (15:36:31 CET)
Version 2 : Received: 14 May 2021 / Approved: 17 May 2021 / Online: 17 May 2021 (15:54:59 CEST)
Version 3 : Received: 31 March 2022 / Approved: 1 April 2022 / Online: 1 April 2022 (06:22:53 CEST)

A peer-reviewed article of this Preprint also exists.

Thomson, D. R.; Leasure, D. R.; Bird, T.; Tzavidis, N.; Tatem, A. J. How Accurate Are WorldPop-Global-Unconstrained Gridded Population Data at the Cell-Level?: A Simulation Analysis in Urban Namibia. PLOS ONE, 2022, 17, e0271504. https://doi.org/10.1371/journal.pone.0271504. Thomson, D. R.; Leasure, D. R.; Bird, T.; Tzavidis, N.; Tatem, A. J. How Accurate Are WorldPop-Global-Unconstrained Gridded Population Data at the Cell-Level?: A Simulation Analysis in Urban Namibia. PLOS ONE, 2022, 17, e0271504. https://doi.org/10.1371/journal.pone.0271504.

Abstract

Disaggregated population counts are needed to calculate health, economic, and development indicators in Low- and Middle-Income Countries (LMICs), especially in settings of rapid urbanisation. Censuses are often outdated and inaccurate in LMIC settings, and rarely disaggregated at fine geographic scale. Modelled gridded population datasets derived from census data have become widely used by development researchers and practitioners; however, none of these datasets have been evaluated for accuracy of population estimates at the grid cell-level. This is because the finest-scale population figures generally available to data producers are those input into gridded population models and disaggregated to smaller grid cells (e.g., 100x100m). We simulate a realistic "true" 2016 population in Khomas, Namibia, a majority urban region, and introduce realistic levels of outdatedness (over 15 years) and inaccuracy in slum, non-slum, and rural areas. We then aggregate these simulated realistic populations by census and administrative boundaries (to mimic census data), and generate 32 gridded population datasets that are typical of a LMIC setting using WorldPop-Global's gridded population approach. We evaluate the cell-level accuracy of these simulated WorldPop-Global datasets, using the original "true" population as a reference. In our simulation, we found large cell-level errors, particularly in urban cells, driven by WorldPop-Global's use of average population densities in large areal units to determine cell-level population densities. Age, accuracy, and aggregation of the input data did play a primary role in these errors. We suggest incorporating finer-scale training data into gridded population models generally, and WorldPop-Global in particular (e.g., from simulated populations, routine household surveys, or slum community profiles), and use of new building footprint datasets as a covariate to improve cell-level accuracy of gridded population data. It is important to measure cell-level accuracy of all gridded population datasets, especially if they are to be used for monitoring key development indicators.

Keywords

LMIC; Global South; indicator; Random Forrest

Subject

Business, Economics and Management, Accounting and Taxation

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.