Letter | Open | Published:

# Mapping changes in housing in sub-Saharan Africa from 2000 to 2015

## Abstract

Access to adequate housing is a fundamental human right, essential to human security, nutrition and health, and a core objective of the United Nations Sustainable Development Goals1,2. Globally, the housing need is most acute in Africa, where the population will more than double by 2050. However, existing data on housing quality across Africa are limited primarily to urban areas and are mostly recorded at the national level. Here we quantify changes in housing in sub-Saharan Africa from 2000 to 2015 by combining national survey data within a geostatistical framework. We show a marked transformation of housing in urban and rural sub-Saharan Africa between 2000 and 2015, with the prevalence of improved housing (with improved water and sanitation, sufficient living area and durable construction) doubling from 11% (95% confidence interval, 10–12%) to 23% (21–25%). However, 53 (50–57) million urban Africans (47% (44–50%) of the urban population analysed) were living in unimproved housing in 2015. We provide high-resolution, standardized estimates of housing conditions across sub-Saharan Africa. Our maps provide a baseline for measuring change and a mechanism to guide interventions during the era of the Sustainable Development Goals.

## Main

Access to adequate housing and shelter is a fundamental human right, considered central to human wellbeing through the provision of facilities that are essential to security, comfort, health and nutrition. However, major inequalities persist, and a third of the world’s urban population lived in slum conditions in 20141. In response, Sustainable Development Goal 11 aims for universal access to adequate, safe and affordable housing, and to upgrade slums by 20302. This goal builds on Millennium Development Goal 7, which aimed for a substantial improvement in the lives of 100 million people who lived in slums by 20203.

The opportunity and need for better housing is particularly acute in Africa, with its rapidly shifting economic and demographic profile. The continent’s population is the fastest growing in the world and is predicted to increase from 1.2 billion in 2015 to 2.5 billion by 2050 (an addition equivalent to the current population of India)4, which will necessitate hundreds of millions of new homes. Alongside increased housing demand, the existing housing stock is steadily transforming—for example, thatch roofs are being replaced by corrugated metal roofs, and mud walls by concrete and brick walls5. These changes present a powerful opportunity to improve human wellbeing, and they also demonstrate the urgent need for investment in housing infrastructure to ensure that vulnerable populations are not left behind6.

Reliable measurements of house types in Africa are critical for tracking changes and targeting interventions, but existing data on African housing are limited7. The primary housing indicator for the United Nations Millennium Development Goals and Sustainable Development Goals is the prevalence of urban slum housing, estimates of which are limited to urban areas only, derived from basic extrapolations from national survey data, restricted to specific years and not standardized across the continent at any subnational scale8,9. Other detailed records of African housing conditions are focused on housing costs and finance10. Here we conduct a standardized analysis using a geospatial framework to quantify the changing profile of housing in urban and rural sub-Saharan Africa during the era of the Millennium Development Goals. We show that African housing underwent a marked change between 2000 and 2015, but unimproved housing persists.

To quantify changes in housing across sub-Saharan Africa, we leveraged 62 georeferenced national household surveys, representing 661,945 unique households in 31 countries (Extended Data Fig. 1). We designed a geostatistical regression model to map house construction materials and overall house type at 5 × 5-km2 resolution across sub-Saharan Africa. We categorized house construction materials into a binary variable that compared houses built from finished materials (for example, parquet, vinyl, tiled, cement or carpet flooring) to those built from natural or unfinished materials (for example, earth, sand, dung or palm flooring) (Extended Data Table 1). We based our categorization of house type on the Millennium Development Goal and Sustainable Development Goal definition of slum housing. We considered ‘unimproved’ housing to have at least one of four characteristics: (1) unimproved water supply; (2) unimproved sanitation; (3) more than three people per bedroom; and (4) house made of natural or unfinished materials (Supplementary Methods). We considered houses that had none of these characteristics as ‘improved’.

The independent variables (covariates) used in our model were aridity11, urbanicity12, accessibility13, travel friction13, night-time lights14 and irrigation15, which are commonly used in Africa-focused geostatistical models16; we also included space and time to account for autocorrelated residual effects. Our geostatistical model utilizes the random Fourier feature approach17 in which a nonlinear, interacting function is defined through high-dimensional feature spaces computed in explicit form in a feature map. The feature map that characterizes this relationship $$\left(\varphi :{\mathscr{X}}\to {\mathscr{H}}\right)$$ induces a measure of similarity—the kernel function $$\left(k:{\mathscr{X}}\times {\mathscr{X}}\to {\mathbb{R}}\right)$$—such that $$k\left({x}_{i},{x}_{j}\right)={\left(\varphi \left({x}_{i}\right),\varphi \left({x}_{j}\right)\right)}_{{\mathscr{H}}}$$ . Expanding on a previous study17, our feature map takes the Fourier spectral form $$z(x|\omega )={[{\rm{c}}{\rm{o}}{\rm{s}}({x}^{T}\omega ){\rm{s}}{\rm{i}}{\rm{n}}({x}^{T}\omega )]}^{T}$$such that $$k\left({x}_{i},{x}_{j}|\theta \right)\approx \frac{{\sigma }^{2}}{{N}_{{\rm{feat}}}}{\sum }_{r=1}^{M}z{\left({x}_{i}|{\omega }_{r}\right)}^{T}z({x}_{j}|{\omega }_{r})$$   , with spectral measure ωr. Rather than assuming a specific spectral distribution ωr, we obtain this Lebesgue measure directly from the data18. Given a response variable (for example, wall material), we used a beta-binomial likelihood function p([y+ytotal] | xωϕ) = BetaBinomial(z(x|ω), ϕ) that enabled overdispersion in the data while simultaneously accounting for sample size variation (from ytotalytotal). We performed regularization using dropout19. Approximate posterior confidence intervals were estimated using the weighted likelihood bootstrap20. Fitting was performed using ADAM stochastic gradient descent21. Cross-validation was performed to check for model fit and to assess the predictive accuracy of the model (Supplementary Text, Extended Data Figs. 2, 9). Population-weighted prevalences of people living in different house types in urban and rural areas were calculated using yearly population data from the WorldPop project22 and a static urban–rural definition from the Global Urban Footprint project23.

Our analysis revealed a marked transformation of housing in sub-Saharan Africa from 2000 to 2015. Across all sub-Saharan countries (excluding South Africa, Comoros and desert areas), the prevalence of houses that were built with finished materials increased from 32% (29–33%) in 2000 to 51% (49–54%) in 2015 (Table 1 and Figs. 1, 2). Our analysis suggests a widespread pattern of incremental modifications to the roof, then the walls and finally to the floor of houses (Extended Data Fig. 3). Overall, the predicted prevalence of improved housing (with improved water and sanitation, sufficient living area and durable construction) doubled from 11% (10–12%) in 2000 to 23% (21–25%) in 2015 (Table 1), with prevalences ranging from 5% (5–6%) in rural Ethiopia to 76% (71–80%) in urban Zimbabwe in 2015 (Supplementary Table 1). Between 2000 and 2015, 134 (118–147) million Africans in the analysed countries gained access to improved housing. However, unacceptable inequalities persist, with 53 (50–57) million urban inhabitants (47% (44–50%) of the total urban population of sub-Saharan Africa analysed) and 595 (585–607) million rural inhabitants (82% (80–83%) of the rural population) living in unimproved housing in 2015.

To examine the links between housing and socioeconomic factors, we quantified the association between house type and household characteristics in 51 national surveys, representing 588,892 households (Supplementary Table 2). For each survey, controlling for cluster-level variation, we jointly estimated the odds of improved housing in relation to education level of the household head, household wealth and age of the household head. We found that the odds of improved housing were 80% higher in more educated households (adjusted odds ratio, 1.80; 95% confidence interval, 1.68–1.93; P < 0.001; Fig. 3a), more than double in the wealthiest households (adjusted odds ratio, 2.53; 95% confidence interval, 2.28–2.82; P < 0.001; Fig. 3b) and 31% higher with increased age of the household head (adjusted odds ratio, 1.31; 95% confidence interval, 1.24–1.39; Extended Data Fig. 4). We also observed a higher prevalence of improved housing in urban survey clusters than rural survey clusters (Extended Data Fig. 5). Across all surveys, a 10% increase in the prevalence of urban clusters (including small rural towns that have grown from villages and account for much of sub-Saharan Africa’s urban growth) was associated with a 7.5% increase in improved housing (Extended Data Fig. 6).

Here we quantified housing conditions across urban and rural sub-Saharan Africa during the era of the Millenium Development Goals, and provided a detailed baseline measurement for the Sustainable Development Goals. By applying a geospatial approach to empirical observations, we have built considerably on existing measurements of African housing, which are limited to urban areas, not standardized at any subnational scale and are derived from more simplistic extrapolations from survey data. We show that the prevalence of improved housing (defined as housing with improved water and sanitation, sufficient living area and durable construction) doubled during 2000–2015, but that an unacceptably large proportion of people still live in unimproved housing in urban areas.

Our findings are consistent with continent-wide changes to African housing being driven by economic growth24. Increasing household spending is likely to have led people to invest more in their homes and, indeed, we found a clear increase in the prevalence of houses built with finished materials since 2000. Furthermore, house types changed the most in countries with the highest baseline prevalence of improved housing (Extended Data Fig. 7) and house type was clearly associated at the household level with education, wealth and age of the household head. In urban areas, the changes may also have been driven by a lack of traditional materials and the commodification of housing. In the future, continued population and urban growth in sub-Saharan Africa may help to sustain housing demand and incremental housing changes. In turn, ‘healthy urbanization’ has been recognized as important for maintaining economic productivity and growth24.

Our study has important implications for international goals, which have sought to address housing inequalities by achieving a substantial improvement in the lives of 100 million people living in slums by 20203 and universal access to adequate, safe and affordable housing by 20302. We show a considerable reduction in the prevalence of urban unimproved housing across sub-Saharan Africa from 68% (65–71%) in 2000 to 47% (44–50%) in 2015, similar to the equivalent estimates from the United Nations of 65% in 2000 and 55% in 201425. However, nearly half of Africa’s urban population still lives in unimproved conditions, which is partly explained by widespread unimproved sanitation—the most common housing deprivation in 75% (52 out of 69) of surveys analysed (Extended Data Fig. 8). These findings highlight the urgent need for governments to improve water and sanitation infrastructure as households continue to spend individually on their homes.

Housing is a central pillar of human security and wellbeing and is increasingly vital in the context of Africa’s urbanization and population growth. For example, house design is integral to Sustainable Development Goal 3 through a myriad of associated health outcomes, including mental health, respiratory disease, soil-transmitted helminths, diarrhoeal disease, leishmaniasis and malaria26,27,28. As towns and cities in sub-Saharan Africa grow, rapid development of luxury housing in major urban centres is occurring alongside the expansion of informal settlements that lack basic infrastructure. In addition, formal housing investment typically lags behind urbanization and the continent’s major urban growth is concentrated in smaller urban centres that have limited capacity to organize construction6. Addressing the housing needs of a growing population is key for sustainable urban development and the health and wellbeing of millions of Africans29, and will facilitate faster attainment of the Sustainable Development Goals. Our maps provide a critical mechanism to guide intervention and the measurement of change.

## Methods

### Data reporting

No statistical methods were used to predetermine sample size. The experiments were not randomized and the investigators were not blinded to allocation during experiments and outcome assessment.

### Data sources

Data were sourced from Demographic and Health Surveys (DHS), Malaria Indicator Surveys (MIS) and AIDS Indicator Surveys (AIS), which are cross-sectional surveys designed to collect nationally representative health and sociodemographic data, typically at 3–5-year intervals30,31. Surveys are administered using a stratified two-stage cluster design in which primary sampling units are randomly selected from census data and households are randomly selected within primary sampling units from an updated enumeration list. Raw DHS datasets were downloaded from www.dhsprogram.com and data were directly categorized into the household-level variables included in the analysis, using the definitions below.

### Definition of house type

We explored two aspects of housing: (1) house construction materials; and (2) overall house type (Extended Data Table 1). DHS and MIS record the main materials used for the roof, walls and/or floor32 and categorize these as ‘natural’, ‘rudimentary’ or ‘finished’ (for example, finished floor materials include parquet, vinyl, ceramic tiles, cement and carpet, whereas natural or rudimentary floor materials include earth, sand, dung, wood planks, palm and bamboo33). We classified houses as ‘built from finished materials’ if at least two out of three of the materials for the walls, roof and floor were finished and as ‘built from natural or unfinished materials’ if this criterion was not met. We based our categorization of house type on the definition of slum housing3 of the United Nations Millennium Development Goals and Sustainable Development Goals. ‘Unimproved’ houses were considered to have at least one of four characteristics: (1) unimproved water supply (as defined by the World Health Organization Joint Monitoring Programme (WHO JMP)34 (Supplementary Table 3)); (2) unimproved sanitation (as defined by WHO JMP34); (3) more than three people per bedroom; and (4) house made of natural or unfinished material. Houses that had none of these characteristics were considered as ‘improved’. Following the United Nations protocol, we excluded a fifth characteristic of unimproved housing from our definition (insecurity of tenure) due to the lack of internationally comparable data3.

### Predicting changes in housing from 2000 to 2015

We included in the analysis all georeferenced surveys (that is, latitude and longitude were available for each geographical cluster) with data on water supply, sanitation facilities, number of household members, number of bedrooms and main material of the roof, walls and floor (Extended Data Fig. 1 and Supplementary Table 4). We designed a geostatistical regression model to map (1) main material of the roof, walls and floor; and (2) overall house type at 5 × 5-km2 resolution across sub-Saharan Africa. The independent variables (covariates) used in our model were aridity index11, degree of urbanicity12, accessibility to large cities13, travel friction surface13, night-time lights14 and irrigation15. These independent variables were chosen as close proxies for factors that affect house type, such as poverty, development, urbanization, transport access and population density. We also included spatial coordinates and time, to account for spatio-temporally autocorrelated residual effects. Our geostatistical model utilizes the random Fourier feature approach17 for which a nonlinear, interacting function is defined through high-dimensional feature spaces computed in explicit form through a feature map. This approach approximates a kernel function through an explicit rather than implicit map. The feature map associates a kernel function $$k:{\mathscr{X}}\times {\mathscr{X}}\to {\mathbb{R}}$$, which is defined on an input domain $${\mathscr{X}}\in \left\{{x}_{1},..,{x}_{d}\right\}\in {{\mathbb{R}}}^{d}$$ such that $$k\left({x}_{i},{x}_{j}\right)={\left(\varphi \left({x}_{i}\right),\varphi \left({x}_{j}\right)\right)}_{{\mathscr{H}}}$$ where $$\varphi :{\mathscr{X}}\to {\mathscr{H}}$$ is the feature map that associates kernel k with an embedding of the input space into a reproducing kernel Hilbert space $${\mathscr{H}}$$. On the basis of a previous study17, our feature map takes the form $$z(x|\omega )={[{\rm{c}}{\rm{o}}{\rm{s}}({x}^{T}\omega ){\rm{s}}{\rm{i}}{\rm{n}}({x}^{T}\omega )]}^{T}$$ such that $$k\left({x}_{i},{x}_{j}|\theta \right)\approx \frac{{\sigma }^{2}}{{N}_{{\rm{feat}}}}{\sum }_{r=1}^{M}z{\left({x}_{i}|{\omega }_{r}\right)}^{T}z({x}_{j}|{\omega }_{r})$$ with a given spectral measure ωr. Rather than assuming a spectral distribution in which ωr is associated with a given kernel (for example, Student’s t for the Matérn kernel), we obtained this distribution (empirical Lebesgue measure) directly from the data18. Given a response variable (for example, wall type), we used a beta-binomial likelihood function $$p(y|x,\omega ,\varphi )={\rm{BetaBinomial}}\left(z\left(x|\omega \right),\varphi \right)$$ to perform inference, allowing for overdispersion and sample size effects in the data. We performed regularization using dropout19. Dropout was chosen over standard $${{\ell }}_{1},{{\ell }}_{2}$$ penalties owing to ease of implementation, parameter tuning and superior cross-validation performance. We included length scale parameters for each covariate dimension by shrinking or expanding the empirical measure ω. These unconstrained length scale parameters were passed through a reticulated linear function to ensure positivity and sparse covariate selection (known as automatic relevance determination). Fitting was performed using ADAM stochastic gradient descent21 with GPU TensorFlow. Validation was carried out to check for appropriate model fit and to assess the predictive accuracy of the model. The predictive performance of the model at the pixel level and administrative division level 1 was assessed via out-of-sample validation. The dropout hyperparameter was selected by random search. The robustness of this hyperparameter was double-checked by implementing automatic selection of the hyperparameter through concrete dropout35. Confidence intervals and uncertainty were estimated using the weighted likelihood bootstrap, a method that generates samples from an approximate Bayesian posterior of a parametric model20. In brief, this approach involved repeated fitting of the model while weighting the likelihood with uniform Dirichlet weights—that is, $$p{\left(y|x,\omega \right)}^{w}$$ where $$w \sim {\rm{Dirichlet}}\left(1,1,\ldots ,n\right)$$. We performed this procedure 100 times to obtain 100 realizations of the model and subsequent surfaces. Population-weighted prevalences of people living in different house types in urban and rural areas were calculated using yearly population data from the WorldPop project22 and using a static urban-rural definition from the Global Urban Footprint project23.

### Model performance

The predictive performance of both the improved housing and finished materials models at pixel level was assessed through out-of-sample validation. In this validation scheme, the full dataset was randomly partitioned such that 75% of the dataset was used to fit the model and the remaining 25% was used to test the model predictive performance. This scheme was repeated 10 times and the scores were averaged. The mean squared error36 and the correlation coefficient were used to evaluate model performance. We evaluated the model at both the pixel level and the aggregate survey level. The latter was included to give an indication of how well the model predicts the survey as a whole. The predictive scores (Supplementary Table 6 and Extended Data Fig. 9) indicate excellent model performance, comparable to those from widely used, established models16. In addition, as pixel-level estimates we performed additional cross-validation analyses in which entire administrative divisions of data were held out. This allowed us to compare our predictions with those from surveys at a resolution at which the surveys are reliable. The results from this validation support the conclusions from our primary analysis (Supplementary Table 7). Confidence intervals were evaluated by the widely used continuous ranked probability score, a score that generalizes the mean absolute error to probability distributions36. The continuous ranked probability scores show suitable credible intervals with most scores clustering towards zero.

### Household-level association between house type and socioeconomic factors

Binary variables for education, wealth and age of the household head were created at the household level. National surveys record the highest level of education of the household head and we compared more than primary education with primary education or less. DHS and MIS household wealth index scores are developed using principal component analyses that typically include variables that describe durable asset ownership, access to utilities and infrastructure and house construction materials37. To enable estimation of the association between house type and wealth, we constructed a new asset-based wealth index for each survey that excluded variables related to house construction. For each household survey, we applied inclusion criteria27 of (1) fewer than 10% missing values; and (2) frequency values between 5% and 95% for the following set of assets: (a) car; (b) motorboat; (c) scooter; (d) cart; (e) bicycle; (f) television; (g) refrigerator; (h) radio; (i) watch; (j) mobile telephone; (k) landline telephone; and (l) electrification of the household. We tested several dimensionality reduction algorithms, with the goal of condensing these twelve assets into a single dimensional index. We tested ISOMAP38, kernel principal component analysis (PCA)39, t-distributed stochastic neighbour embedding40 and linear PCA41. We found minimal differences between algorithms and therefore opted for linear PCA. Using the first principal component, we created a binary wealth variable comparing households belonging to the upper 75th wealth quartile with all other households.

We examined the association between house type and education level of the household head, household wealth and age of the household head in 51 cross-sectional household surveys, representing 588,892 households. For each survey, we jointly estimated the odds of improved housing in relation to whether the household head completed at least primary education, whether a household belonged to the upper 75% wealth quartile and whether the household head was aged over 55 years (which is approximately the upper 75% age quartile). We performed conditional logistic regression to allow these associations to be estimated within geographical cluster in order to eliminate confounding due to inter-cluster variation in urbanicity, regional wealth, climate and other survey design factors. Individual survey odds ratios were combined to determine a summary odds ratio for all surveys using random effects meta-analysis. Individual and summary odds ratios were displayed in forest plots.

### Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this paper.

### Code availability

Analysis code is available via codeshare: https://codeshare.io/2pm4Px.

## Data availability

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## References

1. 1.

United Nations. Progress towards the Sustainable Development Goals. Report of the Secretary-General. (UN Economic and Social Council, 2016).

2. 2.

United Nations. New Urban Agenda. In The United Nations Conference on Housing and Sustainable Urban Development Habitat III. (United Nations, 2016).

3. 3.

United Nations. Indicators for Monitoring the Millennium Development Goals: 7.10 Proportion of Urban Population Living in Slums. (United Nations, 2012).

4. 4.

United Nations. World Population Prospects, 2015 Revision. (United Nations Department of Economic and Social Affairs, 2015).

5. 5.

Rek, J. C. et al. Rapid improvements to rural Ugandan housing and their association with malaria from intense to reduced transmission: a cohort study. Lancet Planet. Health 2, e83–e94 (2018).

6. 6.

Parby, J. I. et al. Stocktaking of the Housing Sector in Sub-Saharan Africa: Challenges and Opportunities Vol. 2 (World Bank, Washington DC, 2015).

7. 7.

Habitat for Humanity International. Mapping Poverty Housing. (Habitat for Humanity International, 2008).

8. 8.

UN Habitat. UN-Habitat Urban Data http://urbandata.unhabitat.org/ (UN Habitat, 2018).

9. 9.

United Nations. Millennium Development Goals Indicators: Data Download https://unstats.un.org/UNSD/MDG/Data.aspx (United Nations, 2018).

10. 10.

CAHF. Housing Finance in Africa: A Review of Africa’s Housing Finance Markets. (The Centre for Affordable Housing Finance in Africa, 2018).

11. 11.

Trabucco, A. & Zomer, R. J. Global Aridity Index (Global-Aridity) and Global Potential Evapo-Transpiration (Global-PET) Geospatial Database https://cgiarcsi.community/ (CGIAR Consortium for Spatial Information, 2009).

12. 12.

Esch, T. et al. Dimensioning urbanization — an advanced procedure for characterizing human settlement properties and patterns using spatial network analysis. Appl. Geogr. 55, 212–228 (2014).

13. 13.

Weiss, D. J. et al. A global map of travel time to cities to assess inequalities in accessibility in 2015. Nature 553, 333–336 (2018).

14. 14.

Noor, A. M., Alegana, V. A., Gething, P. W., Tatem, A. J. & Snow, R. W. Using remotely sensed night-time light as a proxy for poverty in Africa. Popul. Health Metr. 6, 5 (2008).

15. 15.

FAO. Global Map of Irrigation Areas (GMIA). (Food and Agriculture Organization of the United Nations, 2016).

16. 16.

Bhatt, S. et al. The effect of malaria control on Plasmodium falciparum in Africa between 2000 and 2015. Nature 526, 207–211 (2015).

17. 17.

Rahimi, A. & Recht, B. Random features for large-scale kernel machines. In Proc. Advances in Neural Information Processing Systems 20 (eds Platt, J. C. et al.) (Neural Information Processing Systems, 2007).

18. 18.

Ton, J.-F., Flaxman, S., Sejdinovic, D. & Bhatt, S. Spatial mapping with Gaussian processes and nonstationary Fourier features. Spat. Stat. 28, 59–78 (2018).

19. 19.

Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I. & Salakhutdinov, R. Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014).

20. 20.

Newton, M. A. & Raftery, A. E. Approximate Bayesian inference with the weighted likelihood bootstrap. J. R. Stat. Soc. B 56, 3–26 (1994).

21. 21.

Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. Preprint at https://arxiv.org/abs/1412.6980 (2017).

22. 22.

Lloyd, C. T., Sorichetta, A. & Tatem, A. J. High resolution global gridded data for use in population studies. Sci. Data 4, 170001 (2017).

23. 23.

Esch, T. et al. Breaking new ground in mapping human settlements from space — The Global Urban Footprint. ISPRS J. Photogramm. Remote Sens. 134, 30–42 (2017).

24. 24.

Bughin, J. et al. Lions on the Move II: Realizing the Potential of Africa’s Economies. (McKinsey Global Institute, 2016).

25. 25.

UN Habitat. State of African Cities 2014: Re-imagining Sustainable Urban Transitions. (UN Habitat, 2014).

26. 26.

Wilson, A. L. et al. Benefit of insecticide-treated nets, curtains and screening on vector borne diseases, excluding malaria: a systematic review and meta-analysis. PLoS Negl. Trop. Dis. 8, e3228 (2014).

27. 27.

Tusting, L. S. et al. Housing improvements and malaria risk in sub-Saharan Africa: a multi-country analysis of survey data. PLoS Med. 14, e1002234 (2017).

28. 28.

Clasen, T. F. et al. Interventions to improve disposal of human excreta for preventing diarrhoea. Cochrane Database Syst. Rev. 6, CD007180 (2010).

29. 29.

Farha, L. Adequate Housing as a Component of the Right to an Adequate Standard of Living. (UN General Assembly, 2015).

30. 30.

DHS Program. Survey Search https://dhsprogram.com/what-we-do/survey-search.cfm (ICF International, 2015).

31. 31.

UNICEF. Multiple Indicator Cluster Survey List. (United Nations Childrens’ Fund, 2015).

32. 32.

UNDESA. Methodology Sheets: Proportion of Urban Population Living in Slums. (United Nations Department of Economic and Social Affairs, 2007).

33. 33.

ICF International. Demographic and Health Surveys Methodology. (ICF International, 2011).

34. 34.

WHO/UNICEF Joint Monitoring Programme. Drinking-Water and Sanitation Categories for Monitoring Purposes. (WHO/UNICEF Joint Monitoring Programme, 2016).

35. 35.

Gal, Y., Hron, J. & Kendall, A. Concrete dropout. Preprint at https://arxiv.org/abs/1705.07832 (2017).

36. 36.

Gneiting, T. & Raftery, A. E. Strictly proper scoring rules, prediction, and estimation. J. Am. Stat. Assoc. 102, 359–378 (2007).

37. 37.

Rutstein, S. O. Steps to Constructing the New DHS Wealth Index (ICF International, 2015).

38. 38.

Tenenbaum, J. B., de Silva, V. & Langford, J. C. A global geometric framework for nonlinear dimensionality reduction. Science 290, 2319–2323 (2000).

39. 39.

Schölkopf, B., Smola, A. & Müller, K.-R. Kernel principal component analysis. In Artificial Neural Networks — ICANN’97. Lecture Notes in Computer Science, Vol. 1327 (eds Gerstner, W., Germond, A., Hasler, M. & Nicoud, J.-D.) 583–588 (Springer, Berlin, Heidelberg, 1997).

40. 40.

Van Der Maaten, L. & Hinton, G. H. Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008).

41. 41.

Vyas, S. & Kumaranayake, L. Constructing socio-economic status indices: how to use principal components analysis. Health Policy Plan. 21, 459–468 (2006).

## Acknowledgements

L.S.T. is a Skills Development Fellow (N011570) jointly funded by the UK Medical Research Council and the UK Department for International Development under the MRC/DFID Concordat agreement (http://www.mrc.ac.uk/). S.W.L. is supported by the Global Health Trials funded by the MRC/DFID–Wellcome Trust, and the Global Challenges Research Fund funded by BBSRC, DFID and MRC. P.W.G. receives support from the Bill and Melinda Gates Foundation (OPP1068048, OPP1106023 and OPP1132415) (http://www.gatesfoundation.org/). R.C. is employed by the World Health Organization. The author alone is responsible for the views expressed in this paper, which do not necessarily represent the decisions, policy or views of the WHO.

### Reviewer information

Nature thanks Shuaib Lwasa and the other anonymous reviewer(s) for their contribution to the peer review of this work.

## Author information

### Affiliations

1. #### Department of Disease Control, London School of Hygiene & Tropical Medicine, London, UK

• Lucy S. Tusting
2. #### RTI International, Washington, DC, USA

• Donal Bisanzio
3. #### Division of Epidemiology and Public Health, School of Medicine, University of Nottingham, Nottingham, UK

• Donal Bisanzio
4. #### United Nations Human Settlements Programme, Geneva, Switzerland

• Graham Alabaster
5. #### Big Data Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK

• Ewan Cameron
• , Harry S. Gibson
• , Daniel J. Weiss
• , Peter W. Gething
•  & Samir Bhatt
6. #### Health Metrics and Measurement Cluster, World Health Organization, Geneva, Switzerland

• Richard Cibulskis
7. #### UCL Institute for Environmental Design and Engineering (IEDE), University College London, London, UK

• Michael Davies
8. #### Department of Mathematics and Data Science Institute, Imperial College London, London, UK

• Seth Flaxman
9. #### School of Architecture, The Royal Danish Academy of Fine Arts, Copenhagen, Denmark

• Jakob Knudsen
10. #### Kenya Medical Research Institute, Kilifi, Kenya

• Charles Mbogo
11. #### KEMRI-Wellcome Trust Research Program, Nairobi, Kenya

• Charles Mbogo
12. #### Environmental Health and Ecological Sciences Department, Ifakara Health Institute, Ifakara, Tanzania

• Fredros O. Okumu
13. #### School of Public Health, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa

• Fredros O. Okumu
14. #### Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK

• Fredros O. Okumu
15. #### Mahidol-Oxford Tropical Medicine Research Unit (MORU), Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand

• Lorenz von Seidlein
16. #### Department of Biosciences, Durham University, Durham, UK

• Steve W. Lindsay
17. #### Department of Infectious Disease Epidemiology, Imperial College London, London, UK

• Samir Bhatt

### Contributions

L.S.T., S.W.L. and P.W.G. conceptualized the study; L.S.T., H.S.G. and D.J.W. curated the data; L.S.T., D.B. and S.B. analysed the data; L.S.T. acquired funding; S.F. and S.B. developed the methodology; E.C., D.J.W., H.S.G. and P.W.G. provided resources; S.B. produced the figures; L.S.T. and S.B. wrote the original draft of the manuscript; L.S.T., D.B., G.A., E.C., R.C., M.D., S.F., H.S.G., J.K., C.M., F.O.O., L.v.S., D.J.W., S.W.L., P.W.G. and S.B. reviewed and edited the manuscript. All authors discussed the results and contributed to the revision of the final manuscript.

### Competing interests

The authors declare no competing interests.

### Corresponding author

Correspondence to Lucy S. Tusting.

## Extended data figures and tables

1. ### Extended Data Fig. 1 Availability of national survey data for the period 1990–2016 for the variables that are required to determine house construction materials and house type in sub-Saharan Africa.

a, Availability of surveys for the determination of house construction materials. b, Availability of surveys for the determination of house type. Maps were produced using ArcGIS.

2. ### Extended Data Fig. 2 The continuous ranked probability scores for the models of house type and house construction material.

Continuous ranked probability scores (CRPS) are shown for the house type model (left; mean = 0.11) and the house construction material model (right; mean = 0.08). Both distributions indicate well-calibrated confidence intervals with clustering towards zero; that is, a low error with regards to the distribution of the prediction.

3. ### Extended Data Fig. 3 Prevalence of main roof, wall and floor material of houses in sub-Saharan Africa in 2000 and 2015.

a, Prevalence of houses built with finished (versus natural or unfinished) roof material in 2000 predicted at 5 × 5-km2 resolution. b, Prevalence of houses built with finished (versus natural or unfinished) roof material in 2015 predicted at 5 × 5-km2 resolution. c, Prevalence of houses built with finished (versus natural or unfinished) wall material in 2000 predicted at 5 × 5-km2 resolution. d, Prevalence of houses built with finished (versus natural or unfinished) wall material in 2015 predicted at 5 × 5-km2 resolution. e, Prevalence of houses built with finished (versus natural or unfinished) floor material in 2000 predicted at 5 × 5-km2 resolution. f, Prevalence of houses built with finished (versus natural or unfinished) floor material in 2015 predicted at 5 × 5-km2 resolution. Results are derived from a geospatial model fitted to 66 surveys for roof material, 96 surveys for floor material and 62 surveys for wall material. Maps were produced using the raster package (v.2.6-7) in R. The images were plotted using the rasterVis package (v.3.4).

4. ### Extended Data Fig. 4 Association between house type and age of the household head.

The pooled increase in odds of living in an improved house when the age of the household head is over 55 years, compared to 55 years or less, is shown to the right of the vertical line representing the null value (no difference between groups). Odds ratios are adjusted for wealth index, education level of the household head and geographical cluster. Error bars show 95% confidence intervals. Data are from 48 DHS, 2 MIS and 1 AIS conducted between 1996 and 2015 (Supplementary Table 2).

5. ### Extended Data Fig. 5 Prevalence of improved housing in rural and urban survey clusters.

Data are from 59 national household surveys from 31 countries in sub-Saharan Africa conducted between 1994 and 2015.

6. ### Extended Data Fig. 6 Prevalence of house types in relation to survey-level prevalence of urban clusters.

Left, house construction materials (adjusted R2 = 0.46, P < 0.001). Right, house type (adjusted R2 = 0.35, P < 0.001). Points represent each national survey included in the analysis.

7. ### Extended Data Fig. 7 Changes in housing in sub-Saharan Africa from 2000 to 2015 relative to the 2000 baseline.

Points represent countries stratified by urban (blue) and rural (red) areas.

8. ### Extended Data Fig. 8 Number of households per survey that lack the characteristics of improved houses.

Characteristics shown are improved water source (blue), improved sanitation facilities (red), sufficient living area (purple) and finished house construction materials (brown). Data are from 1 AIS, 15 MIS and 53 DHS that had data on all four of these variables, conducted between 1993 and 2015. Out of a total of 69 surveys, households most frequently lacked improved sanitation facilities (52 surveys; 75%) and finished materials (16 surveys; 23%).

9. ### Extended Data Fig. 9 Observed versus predicted prevalence of improved housing aggregated to district level (administrative division 1 level).

Fit predictions for both observed and predicted prevalence were aggregated to the district level and plotted.

## Supplementary information

1. ### Supplementary Information

This file contains Supplementary Methods and Supplementary Tables 1–7. The Supplementary Methods contain additional details of the house type classification. Seven display items (Supplementary Tables) provide details of the datasets analysed, house type classifications, model performance and detailed predictions of house type prevalence by country for 2000 and 2015.

### DOI

https://doi.org/10.1038/s41586-019-1050-5

• ### Reduced mosquito survival in metal-roof houses may contribute to a decline in malaria transmission in sub-Saharan Africa

• Steve W. Lindsay
• , Musa Jawara
• , Julia Mwesigwa
• , Jane Achan
• , Nabie Bayoh
• , Balla Kandeh
• , Matthew J. Kirby
• , Jakob Knudsen
• , Mike Macdonald
• , Margaret Pinder
• , Lucy S. Tusting
• , Dan J. Weiss
• , Anne L. Wilson
•  & Umberto D’Alessandro

Scientific Reports (2019)