Estimating hospital catchments from in-patient admission records: a spatial statistical approach applied to malaria

Admission records are seldom used in sub-Saharan Africa to delineate hospital catchments for the spatial description of hospitalised disease events. We set out to investigate spatial hospital accessibility for severe malarial anaemia (SMA) and cerebral malaria (CM). Malaria admissions for children between 1 month and 14 years old were identified from prospective clinical surveillance data recorded routinely at four referral hospitals covering two complete years between December 2015 to November 2016 and November 2017 to October 2018. These were linked to census enumeration areas (EAs) with an age-structured population. A novel mathematical-statistical framework that included EAs with zero observations was used to predict hospital catchment for malaria admissions adjusting for spatial distance. From 5766 malaria admissions, 5486 (95.14%) were linked to specific EA address, of which 272 (5%) were classified as cerebral malaria while 1001 (10%) were severe malaria anaemia. Further, results suggest a marked geographic catchment of malaria admission around the four sentinel hospitals although the extent varied. The relative rate-ratio of hospitalisation was highest at <1-hour travel time for SMA and CM although this was lower outside the predicted hospital catchments. Delineation of catchments is important for planning emergency care delivery and in the use of hospital data to define epidemiological disease burdens. Further hospital and community-based studies on treatment-seeking pathways to hospitals for severe disease would improve our understanding of catchments.

In summary, these approaches remain mechanistic and a robust statistical approach improving on these descriptions and integrating disease-specific admission records, mechanistic estimates on distance, population and health system factors to characterize hospital use is required.
Here we employ a mathematical-statistics approach to predict malaria-related hospital access using information recorded at admission at major level 4 and level 5 hospitals. The approach integrates clinical malaria case admissions at the hospital level, fine-spatial resolution mechanistic estimates of distance and travel times, and demographic data among malaria patients reaching the hospitals. A Bayesian geostatistical framework is employed accounting for the spatial variability in admissions related to malaria due to variation in care-seeking at community level [23][24][25] as well as a spatial variation in malaria risk 26 . The method also improves on the previous mechanistic distance or travel-time approaches by incorporating population and the spatial dependencies in the admission rates at the community level as well as measures of uncertainty during catchment predictions. Results present a description of catchments representative of malaria care-seeking, and statistical spatial association with severe malaria cases.

Results
Malaria admissions at the four hospitals. Table 1 summarises malaria admission at four county referral hospitals. Across two years of observation, there were 5766 inpatient malaria admissions for children 1 month to 14 years. It was possible to define a specific EA address for 5486 (95.14%) malaria admissions. Of the geocoded cases, 4972 (90.6%) had complete information for CM classification while all the 5486 (100%) had information on SMA. 272 were CM based on children who were unconscious (U) or could only respond to a painful stimulus (P) (AVPU = P or U) while 1001 were SMA based on the combination of haemoglobin ≤5 g/dL and clinical features (severe pallor or blood transfusion). Out of the 5,766 records investigated, retrospectively, 194 (3.4%) were not geocoded or linked to an EA address. Of the 194, information was only available for 76 records (71, were from Uganda (associated with Busia county referral hospital and 5 from Kisumu county referral hospital). There was no EA information to geocode address for the cases from Uganda. The Kisumu cases were more than 200 km from the Kisumu County hospital (Nandi county (3), Kisii County (4), and Kericho county (1). For the other cases, 3 were for adults while 115 were missing. estimation of hospital catchments from malaria admissions. The estimation of hospital catchment area was conducted using EAs encompassing geocoded malaria admissions. It was assumed that the likelihood of admission varied spatially. A Bayesian hierarchical zero-inflated Poisson spatial regression model was fit to the observed counts (including zero) at the EA (using EA centroid) to analyse admission rates using an underlying population for children between 1 month to 14 years. Thus, each EA was assumed to either generate no malaria admissions (zeros), a single malaria admission, or, a set of several admissions (excluding the CM and SMA cases subset for subsequent catchment analysis). EAs with zero admissions arose if there was no malaria case admitted to the study hospital (an actual zero) or malaria case(s) was present but the patient(s) opted to use a different hospital within the study area. Therefore, the novelty of the modelling approach here included aspects of competition implicitly by including zero counts combined with population and spatial ancillary data. It was not possible to include a hospital competition parameter explicitly at the EA level because geocoded admissions data from the competing hospitals were not available. The Bayesian specification included covariate (distance to the nearest road) and the model goodness of fit was assessed based on the deviance information criterion (DIC). Further, the assessment of model performance such as the root mean square error (RMSE) was based on a 30% subset of data. To avoid methods induced circularity in the subsequent analysis of how SMA and CM admission rates corresponded to the predicted malaria catchments, the severe malaria admissions were excluded from the estimation of general malaria catchments. Further details of the Bayesian model specification are included in the methods section. Figure 1A shows the spatial distribution of malaria admissions (n = 4,281) excluding the severe cases (SMA and CM) at the four sentinel hospitals. Figure 1B shows the delineated catchments representing malaria admissions based on the Bayesian predicted posterior median. The geographic extent of the catchments varied by  Figure 1B shows the malaria admissions catchments were not homogenous around the hospital, did not cover the entire health administrative boundary (i.e. the county), and were not of distinct geometrical shape. This spatial variation suggested either a variation in hospital use for malaria care-seeking at the level 4/5 hospital or in the risk of developing severe malaria.
Sensitivity analysis for hospital catchments. Several candidate models were considered in the Bayesian paradigm based on mean deviance, the DIC, the WAIC, the RMSE and the proportion of variance explained by the candidate model. Table 2 list best model fitting parameters for models selected minimizing the DIC, WAIC, MLS whilst exhibiting the best performance in terms of the proportion of variance explained while Table S1 (in the Supplementary Information) shows a combination of these parameters for the wider candidate models. Model performance improved with a refinement of the Delaunay triangulation of the spatial region considered. The proportion of variance explained by best candidate model based on the observed inpatient paediatric admissions was highest in Vihiga (90%) compared to Kakamega (74%), Busia 55%, and 56% for Kisumu (  Table 3 shows the Bayesian posterior estimates for spatial determinants of CM and SMA for each hospital. The individual-level model for CM and SMA described the rates of admission adjusting for travel time to the hospital, the spatial location of a case within compared to those outside the predicted catchment, age (as a continuous random effect), and seasonality (month). Figure 2 shows the spatial variation of CM and SMA admission at the four sentinel hospitals.  www.nature.com/scientificreports www.nature.com/scientificreports/ The rate of admission was also lower outside of predicted hospital catchment compared to rates inside the catchment. For example, at Busia hospital, CM admission was lower by 69% outside the predicted catchment (posterior median relative rate 0.39 95% Bayesian Credible Interval 0.18-0.75) ( Table 1) and by 66% for SMA admissions 0.34 (0.18-0.55). The rate of admission outside the catchment were also lower for Vihiga (0.59 (0.24-1.27) and 0.53 (0.25-1.00) for CM and SMA admissions outside the catchment, respectively.

Discussion
This study provides an example of how in-patient malaria admission records can be used in a mathematical-statistical framework to delineate catchments. Findings suggest a variation in geographic catchment representative of malaria cases reaching level 4 or level 5 hospital. The largest geographic catchment size was at Kakamega county teaching and referral hospital at 322.0 km 2 and smallest at 24 km 2 at Vihiga county hospital. For severe malaria, there were little differences in SMA and CM admissions by distance or travel time.  (Table 3) and there was a lower likelihood of presentation outside of the predicted hospital catchment.
There were very little differences in travel time or distance between SMA and CM children reaching the hospital. For this group admission was also more likely for communities <1 hour from the hospital. Given a high density of health facilities in the study area, it is likely that presentation at a greater travel time to the hospital is attenuated by competition from other health facilities capable of providing in-patient paediatric services. Elsewhere, travel time is a documented risk factor for child mortality related to malaria 18,20,27 . Moreover, when the density of health facilities is low, previous research has shown a decreasing presentation at the hospital with increasing distance (distance-decay) 18,20,28 . In contrast, when the density of health facilities was higher studies have shown a less marked effect of distance 29,30 . Therefore, for this region, the geographic distance may not be the only barrier for hospital care. A variation in care-seeking practices for malaria may contribute to a delayed presentation at the hospital [31][32][33] . As observed in previous studies, these may include but not limited to: care-givers recognition of danger signs (symptoms), household decision-making, mother's education level, and the use of informal care or self-medication [34][35][36][37] .
These community-level factors combined with health system-level factors contribute to estimated variability in catchments although these are representative of cases reaching the hospital. Larger hospitals such as Kakamega referral and teaching hospital may be more attractive in the presence of danger signs that overcomes distance barriers contributed by referral system for severe malaria cases from lower-level facilities. In Uganda, the rate of referral completion, recommended by community health workers, was less than 50% but 2.8 times higher amongst children with danger signs 38 . Our study did not record information on referral malaria cases or factors associated with referrals. However, Fig. 1B depicts communities around the hospital that might require improved transportation for severe malaria conditions requiring emergency care.
The novelty introduced by the two-component approach allowed for zero inflation by including EAs with zero observations. The Bayesian modelling approach quantified spatial dependencies in observed malaria admissions 39 and incorporated measures of uncertainty 40,41 . The spatial-statistical approach had advantages of examining the distance at which the spatial autocorrelation of observations is minimised 42 . The approach can be improved by including other parameters of health care access based on data from community surveys in a generalized spatial regression framework. For example, the inclusion of socio-demographic factors impacting malaria care-seeking at the community level. From a hospital perspective, the method could be extended by explicitly modelling a competition parameter combining aspects of population demand for malaria treatment with hospital availability. Such an undertaking would, however, require a definition of competing hospitals with geocoded admission data, and a knowledge of care-seeking behaviour for malaria at community level.
The findings presented should be interpreted with reference to limitations of available data both for the numerator and denominators. Inaccuracies in the estimation of census population (the denominator), or, by assuming a static EA population may contribute to less precision in estimation. A constant county-level inter-census growth rate was used to project EA census counts while short-term changes in population structure considered at the EA are unknown. For the numerator, a variation in malaria treatment-seeking behaviour contributes to incomplete case registry at the hospital. This study used malaria admissions, it is likely that some malaria cases at the community do not reach the hospital. Thus, the catchment is representative of malaria cases reaching hospital rather than overall healthcare-seeking for all health conditions in the population. The methodology could be improved by geocoding all hospital-related admissions beyond the scope of the current study. In general, an www.nature.com/scientificreports www.nature.com/scientificreports/ in-depth understanding of treatment-seeking would require additional community surveys. However, for rare conditions requiring emergency care, household-level surveys have both sampling and ethical challenges. Our analysis of travel time followed previous frameworks 43-45 by assuming one mode of travel around major roads (motorized) or walking across other landscapes. In reality, combined modes of travel may be used by an individual with a variation at a household level confounded by social-economic factors 46 . Due to the scope of the study such fine-scale variations were not examined along with maternal characteristics.
The delineation of disease-specific hospital catchments is important in identifying populations marginalized from health services 19,47 and these are best defined with data-driven approaches that include demographic characteristics of populations and clinical case observations. In general, if the catchment population is identified, then various population-level disease indicators could be monitored [48][49][50] . For emergency care, catchments may be useful for designing improved access to ambulatory or referral care, for example, in targeting interventions such as rectal artesunate in patients with CM 51 . In addition, hospital catchment representative of malaria care-seeking could also inform health system impact evaluation cluster randomised controlled interventions such as the RTS,S/AS01 malaria vaccine pilot implementation 52,53 . In such vaccine impact evaluation studies, an imbalanced selection of catchment communities could affect estimates of effectiveness 54 . Other previous routine surveillance studies include the evaluation of the MenAfriVac vaccine for meningitis 55 , and the Rotavirus vaccine 56 .
This study presented a detailed spatial-statistical analysis using malaria admission to delineate catchments and conducted a spatial analysis of SMA and CM. While our methods controlled for spatial autocorrelation, we found that travel time to the hospital was similar between SMA and CM. Additionally, the rate of admissions was lower outside the predicted hospital catchment areas. While this study focused on retrospective events over a 2-year period, an important aspect that should be considered for future studies is an empirical understanding of how, when and what choices caregivers make when seeking care for CM and SMA.

Random effects
Month

Random effects
Month

Methods
Study region and hospitals. The study was conducted in stable, endemic malaria setting 26 in Western Kenya (Fig. 3). The region has a high population density of approximately 600 people per square kilometer (KNBS, 2010). Enhanced routine hospital paediatric surveillance has been in operation since 2013 using training,  www.nature.com/scientificreports www.nature.com/scientificreports/ systematic symptom and laboratory result documentation, and electronic data capture tools. Further details of this Clinical Investigation Network (CIN) are provided elsewhere [57][58][59] . Data were recently re-analyzed to define the clinical characteristics of malaria admissions over two complete years, either side of a national doctor's strike, December 2015 to November 2016 and November 2017 to October 2018 from four county referral hospitals (Busia, Kakamega (teaching and referral), Vihiga, and Kisumu; Fig. 3) 60 . Each hospital in Fig. 3 is a major level 4 and level 5 hospital serving as a link between the national referral hospitals and the sub-county-level health facilities 61 . In addition to routine service delivery, the level4/5 hospitals provide a range of services including but not limited to paediatric, surgical services (have surgical ward), medical ward, education outreach, pharmaceutical, laboratory, accident and emergency unit, medical imaging, blood donation units, emergency obstetric care and anesthesiology.
Hospital malaria admissions. The database comprised children of 1 month of age up to 14 years, where a malaria diagnosis was established by a clinician based on laboratory confirmation (positive smear) and clinical observations including history, clinical signs (presence of fever, temperature >37.5 °C) 60 . Neurological status was based on observations of whether the child was alert, or able to respond to voice or pain or was unconscious, (AVPU) scale. Cerebral malaria (CM) was defined among children who were unconscious (U) or could only respond to a painful stimulus (P) 60 . This combined clinical score approximates to a Blantyre Coma Score of <4. Severe malarial anaemia (SMA) was defined based on a combination of admission haemoglobin (Hb) < 5 g/dl, or using clinical signs of severe pallor or where blood transfusion was prescribed where haemoglobin measurement was missing 60 .
Address matching malaria admissions to enumeration areas (EAs). Each child's electronic data record did not include information on the village of residence. However, details of the child's residence were written on the medical records. For each malaria admission, the in-patient medical record was retrieved at each hospital's records office. Information on place of residence was retrieved including county, sub-country, the nearest school, the nearest health facility or market centre. The village of residence was linked to the 2009 census enumeration areas (EAs) obtained from the Kenya National Bureau of Statistics (KNBS) (Fig. 3) and checked in ArcGIS 10.5 ESRI Geographic Information Systems (GIS) software for errors or anomalies. An EA in the 2009 census comprised of approximately 100 households that could be enumerated in a single census night and was on average 1.5 km 2 . The inter-census population growth rates and population age structure (5-year intervals) was obtained from county-level summaries provided by the KNBS 62 . The population growth rate was used to project the counts at EA-level to the study period (2017-2018) while the average proportion of 0-14 years at the county-level was used to correct for age (0-14 years). Maps on the spatial distribution of admission were produced at the EA-level converted to cluster points based on the EA centroid using ESRI ArcGIS software.
Ancillary spatial data. Ancillary geographic layers for roads, elevation and land cover were assembled. estimation of spatial distance and travel time to hospital. Euclidean distance from the centroid of the EA to hospital and to the nearest road was computed from ArcGIS ESRI 10.5 using the road data set. Distance to the nearest road was used in the predictive catchment modelling described in the next section. For travel time, a spatial analysis was conducted using AccessMod version 5 software 65 based on land cover, elevation and roads 18,43,45,66 . In brief, a new raster land cover layer was generated by combining the Globcover layer with roads. Travel time to the main hospital was analysed by specifying travel speed (impedance) to various land cover classes. For example, a major highway was assigned a speed of 80 kilometres per hour (80 km/h) in line with national road regulations assuming motorised travel. For the other all-weather footpaths, a 4 km/h walking correction was used 65    . The Poisson counts model defined cases form EA seen at the hospital. Each hospital modelling was conducted separately rather than a pooled analysis due to differential qualities of the hospital e.g. size. Covariates were specified through λ as: i is the offset term, α 0 an intercept, β is the regression parameter for a covariate as the spatial distance from the centroid of the EA to the nearest road. w s ( ) represents a mean-zero Gaussian Process (GP) ′ w s GP C s s ( ) {0, ( , )} associated with the spatial association between observations. The associated covariance between two pairs of locations was modelled via a Matérn covariance matrix expressed as σ ρ θ ′ = ′ C s s s s ( , ) (, ; ) 2 where ρ is the correlation function and θ includes parameters quantifying the rate of decay and smoothness of realization.
Computation was carried out via Gaussian Markov Random Fields (GMRF) 73 using a stationary SPDE 42,74,75 . More details of GMRF is presented in the supplementary information. The Bayesian specification was complete by assigning non-informative priors to the fixed and SPDE parameters. For the SPDE, penalized complexity priors 76 were used for the spatial specification thus θ τ = k (log( ), log( )) T ,

Model selection and sensitivity analysis.
Model selection was based on deviance information criterion (DIC), the Watanable Akaike Information criterion (WAIC) 77 , cross-validated mean logarithmic score (MLS). Additionally, the root mean square error (RMSE), the percentage of variance explained by the model were calculated from a 30% subset of EA clusters selected randomly. The cross-validation approach comprised of calculating the conditional predictive ordinate (CPO), which is the probability of observing a value given all other data and was examined for all observations 78 .
Spatial determinants of severe malaria admission. To examine the spatial determinants of CM and SMA admissions at the four hospitals, an individual-level analysis was conducted with dichotomy = y 1 i representing a child that had CM or SMA. A Bayesian hierarchical logistic regression model for CM and SMA with a spatial effect of the following form was implemented, where w s ( ) was modelled as previously stated in Eq. 2 and the set of random and fixed dichotomous regression parameters. These included if CM or SMA was within the predicted catchment (categorical variable), a continuous random effect of child age rather than as a categorical variable, a random effect on the month of admission, and an estimated travel-time to paediatric hospital (<10 minutes, 10-30 minutes, 30 minutes -1 hour, and >1 hour). Random effects allowed an examination of how the variable mean change in a continuous form. Bayesian specification was completed by assigning relevant prior information. Flat priors were assigned to the fixed effects with only the age of child and month of admission allowed to vary randomly based on a second-order random walk for flexible smoothing 79 . The precision parameters of random walk for age were assigned prior distributions using penalized complexity priors with approximate mean and variance (for age parameter (1, 0.001). For the month of admission, the precision parameters were assigned log gamma priors with parameters (0.1, 0.001). This approach was implemented using R-INLA 80,81 and the schematic methodological framework is included in the supplementary material ( Supplementary Fig. S1). ethical approval and consent to participate. This work is published with the permission of the Director of KEMRI. KEMRI Scientific and Ethical Review Committee approved the CIN study (SERU number 2465 and number 3459). All malaria routine data for the study period were retrieved retrospectively in 2019 from the health records office under CIN study protocol. There was no contact made with individuals at the hospital level. Details of study regulations and can be obtained by contacting IRB https://www.kemri.org/.

Data availability
Data for this study are under the primary jurisdiction of the Ministry of Health in Kenya. Enquiries about using the data can be made to the KEMRI-Wellcome Trust Research Programme Data Governance Committee.