A comparison of experience sampled hay fever symptom severity across rural and urban areas of the UK

Gledson, Ann; Lowe, Douglas; Reani, Manuele; Topping, David; Hall, Ian; Cruickshank, Sheena; Harwood, Adrian; Woodcock, Joshua; Jay, Caroline

doi:10.1038/s41598-023-30027-x

Download PDF

Article
Open access
Published: 21 February 2023

A comparison of experience sampled hay fever symptom severity across rural and urban areas of the UK

Ann Gledson¹,
Douglas Lowe¹,
Manuele Reani²,
David Topping³,
Ian Hall⁵,
Sheena Cruickshank⁶,
Adrian Harwood¹,
Joshua Woodcock¹ &
…
Caroline Jay⁴

Scientific Reports volume 13, Article number: 3060 (2023) Cite this article

1698 Accesses
2 Citations
345 Altmetric
Metrics details

Subjects

Abstract

Hay fever affects people differently and can change over a lifetime, but data is lacking on how environmental factors may influence this. This study is the first to combine atmospheric sensor data with real-time, geo-positioned hay fever symptom reports to examine the relationship between symptom severity and air quality, weather and land use. We study 36145 symptom reports submitted over 5 years by over 700 UK residents using a mobile application. Scores were recorded for nose, eyes and breathing. Symptom reports are labelled as urban or rural using land-use data from the UK’s Office for National Statistics. Reports are compared with AURN network pollution measurements and pollen and meteorological data taken from the UK Met Office. Our analysis suggests urban areas record significantly higher symptom severity for all years except 2017. Rural areas do not record significantly higher symptom severity in any year. Additionally, symptom severity correlates with more air quality markers in urban areas than rural areas, indicating that differences in allergy symptoms may be due to variations in the levels of pollutants, pollen counts and seasonality across land-use types. The results suggest that a relationship exists between urban surroundings and hay fever symptoms.

UK daily meteorology, air quality, and pollen measurements for 2016–2019, with estimates for missing data

Article Open access 09 February 2022

Investigating the spatiotemporal associations between meteorological conditions and air pollution in the federal state Baden-Württemberg (Germany)

Article Open access 12 March 2024

Strong variations in urban allergenicity riskscapes due to poor knowledge of tree pollen allergenic potential

Article Open access 13 May 2021

Introduction

The worldwide prevalence of allergic respiratory disease has risen considerably in recent years¹. Whilst air pollution is considered to worsen symptoms for the individual^2,3,4,5,6,7, increase pollen concentrations, and lengthen pollen seasons⁸, the mechanisms of these combined effects on symptom severity are still not fully understood.

To investigate the relationship between air quality and hay fever symptoms, this paper reports the first study to compare the severity and duration of real-time symptom reports across rural and urban areas using experience sampled, geo-positioned cross-sectional data⁹. The use of mobile application data to collect users’ symptom reports for comparison with environment data has increased over the last few years. Peeters et al. used two years of geo-positioned mobile app data to compare chronic rhinosinusitis symptoms with air pollution data in Belgium. They found that, during the spring/summer months, a relationship existed between symptoms and exposure to \(\hbox {O}_3\) and \(\hbox {PM}_{2.5}\)⁴. Cabrera et al., using 2 years of seasonal allergic rhinitis symptom data recorded in Madrid, found that temperature and pollution (most significantly \(\hbox {O}_3\)), out of all the environment indicators investigated, had the highest association with participant symptoms¹⁰. Kim et al. discovered an association between allergic rhinitis and \(\hbox {SO}_2\) in a cohort of elementary school children in an industrial region of Korea⁵.

We hypothesise that people experience more severe symptoms in urban than in rural areas, due to an increase in the immune system burden. Urban and rural regions are reported to vary in pollen counts and types¹¹, pollution levels^12,13,14,15, rates of allergic reactions^{16,17,18,19,20,21,22} and daily mortality rates²³:

Hypothesis 1

(H1) Seasonal allergy symptoms are more severe for those in urban areas than in rural areas.

We also investigate whether higher pollution levels are related to more severe seasonal allergy symptoms:

Hypothesis 2

(H2) Higher levels of pollution lead to worse seasonal allergy symptoms.

The Britain Breathing (BB) mobile application²⁴ supports the collection of experience sampled, cross-sectional hay fever symptom data from the general population. Developed for Android and iOS, and made available through Google Play and the Apple App Store, we used it to recruit a large sample of citizen scientists who live in various locations throughout the UK, and are interested in contributing towards research into hay fever symptoms^9,25. Previous studies of allergy symptoms have compared environmental data with medical data including asthma hospitalisation counts¹, epidemiological studies of asthma²⁶ and allergy questionnaires^27,28. In this study, data is collected from those experiencing a range of hay fever symptoms from mild to severe, thus including people who experience common allergy symptoms but do not present to medical services. This provides a broader picture of chronic health issues experienced by hay fever sufferers, as opposed to only observing those with more acute and/or problematic reactions. Within the last decade, mobile applications have increasingly been used to collect experience sampled data from the general population for public health studies, as they are popular with users, allowing easy recruitment^4,25,29. They are often convenient to use and maintain, lowering the cost of user support, regardless of the number of participants, and potentially improving participant engagement. In addition, the resulting data is available for analysis as soon as it is entered by the user.

Figure 1 displays screenshots from the data collection pages of the BB application, which gathers information about a user’s current condition. It asks whether they have taken any medication for their symptoms that day, and uses sliders to capture the severity of four symptoms: nose, eyes, breathing and tiredness. As tiredness was only added in a later version of the app, we focus in this analysis on nose, eyes and breathing. The sliding scale allows submission of the scores: 0 (no symptoms), 1 (mild symptoms), 2 (moderate symptoms) or 3 (severe symptoms). The time, date and location of each report is logged, providing symptom data at a level of temporal and spatial precision not captured by the medical record, questionnaire or prescription data commonly used for studying allergies. Location data is provided by the mobile device’s GPS sensor when the user permits it. Accuracy is set to 100–500 m to provide sufficient precision for the purposes of the study while complying with our data protection and ethical obligations.

A study conducted for 6 months over March–October 2016 showed that the experience sampling method used in the BB mobile application is a reliable approach for collecting allergy symptom data in the general population⁹. In this study we use data collected via the BB mobile application⁹ from 2016 to 2020 to compare allergy symptoms reported in urban and rural locations over this 5 year period.

Methods

Overview

We primarily investigate whether any significant differences in symptom severity exist between BB application users in urban and rural locations. The allergy symptoms measured are nose, eyes, breathing and max score (a calculation of the maximum of the former 3 scores). Each score is an integer in the range 0 to 3, with 3 being the most severe. The user is also asked if they have taken any allergy medications that day (possible answers are yes or no).

Secondly, we test for urban/rural differences in the correlations between each of these symptoms and a variety of air quality and meteorology measurements: \(\hbox {PM}_{2.5}\), \(\hbox {PM}_{10}\), \(\hbox {NO}_2\), \(\hbox {NO}_X\) (as \(\hbox {NO}_2\)), \(\hbox {SO}_2\), \(\hbox {O}_3\), relative humidity, temperature, air pressure and 12 pollens.

Data collection

Britain breathing data

The Britain Breathing study was approved by the University of Manchester Ethics Committee, number CS 250 and carried out in accordance with all UK guidance and regulations. Informed consent was obtained from all participants at the start of the study.

The Britain Breathing mobile application was first released as an Android app on the Google Play store on March 18, 2016⁹ and this version continued until October 30 2016, and then was used again in March to October in 2017 and 2018. A second version was released in 2019 and also included an Apple version, made available on the Apple Store. The BB project and app were advertised via social media, blogs, websites, public engagement activities, appearances in science festivals and on public television⁹.

At the time that the user installs the application, they are asked for basic demographics such as gender, age and do you have hay fever? Participants are asked to report their symptoms each day for each of the three symptoms, using a simple sliding scale widget, designed to allow easy selection of one of the available 4 (0 to 3) scores and whether they have taken allergy medication that day (see Fig. 1). Each time the user inputs their scores, they are sent to a central server and recorded along with the date, time and the phone’s geographical location at the time of submission. In order to make the data non-sensitive, user identifiers were not included in the 2016 dataset, but they were included during subsequent years.

Britain breathing participants

At the end of October 2016, the app had been downloaded 1530 times, 425 people had the app installed on their phones and 20278 reports had been submitted⁹. In the years 2017 to 2020, 924 users had downloaded the application and 17,526 daily reports were submitted.

To rule out any bias relating to user demographics across rural and urban locations, participant data was analysed for gender, age and allergy medication usage. The female to male ratio in urban locations is 0.944 and in rural locations 0.927. The mean user age for all urban reports is 51.1 and for rural reports 57.3. Finally, the mean value of those who had taken medication for allergies on the same day as the report submission was 0.57 for urban locations and 0.54 for rural.

Land-use data

Land-use data was obtained from the UK’s Office for National Statistics (ONS)³⁰ and used to divide the user reports into rural and urban, using the geographical co-ordinates recorded at the time of report submission. The ONS 2011 Census Rural–Urban Classification³⁰ categories are described in the Supplementary Appendix. Categories A1, B1, C1, and C2 for England and Wales, and 1, 2, and 3 for Scotland, are classified as urban. All remaining categories are classified as rural.

Environment data

Total daily pollen grain counts were available from UK monitoring sites for the 12 pollen types: hazel (Corylus spp., 13 sites), alder (Alnus spp., 13 sites), willow (Salix spp., 13 sites), birch (Betula spp., 13 sites), ash (Fraxinus spp., 13 sites), elm (Ulmus spp., 13 sites), oak (Quercus spp., 13 sites), plane (Platanus spp., 13 sites), grass (Poaceae, 15 sites), nettle family (Urticaceae, 13 sites), mugwort (Artemisia spp., 13 sites), and ragweed (Ambrosia spp., 13 sites). The data collection period for the pollen count monitoring stations is early March to early September in the years 2016 to 2020, and the data are obtained from the UK Met Office (MIDAS dataset) via the MEDMI server. Hourly measurements of \(\hbox {PM}_{2.5}\) (81 sites), \(\hbox {PM}_{10}\) (81 sites), \(\hbox {NO}_2\) (161 sites), \(\hbox {NO}_X\) (as \(\hbox {NO}_2\), 161 sites), \(\hbox {SO}_2\) (28 sites), and \(\hbox {O}_3\) (75 sites) were downloaded from the Automatic Urban and Rural Network (AURN) network. Hourly measurements of relative humidity (323 sites), temperature (323 sites), and air pressure (154 sites), are also obtained from the UK Met Office (MIDAS dataset) via the MEDMI server.

Pre-processing

Britain breathing data

BB data was collected as a CSV file with each row representing a user report submission and each field containing report information such as time, location and symptom scores. Several pre-processing steps were performed on the symptom data before all subsequent analysis: For 2016 data, user identifiers were improvised using year-of-birth, gender and postcode location. See the Methodological Limitations Section for the potential affects of this. Reports were filtered to include only those submitted within the months March to September inclusive. Only the latest report per day per user and only reports from users who submitted on at least 10 days, were used. This left 11,576 reports by 344 users for 2016; and 11,662 reports by 417 users for the years 2017 to 2020.

Each report (row) was assigned a postcode, firstly by inputting the geographical co-ordinates into a postcode finder API³¹. If no postcode was found (approximately 10% of reports), the location co-ordinates were mapped to the closest location found in a further online location-to-postcode mapping tool³². Reports were then labelled as urban or rural, using the ONS postcode classifications and the max_symptom score was calculated, which is the maximum of all 3 symptoms (nose, eyes and breathing).

Environment data

The environment measurements consisted of daily means and maximums for each of the pollutants and meteorological variables (calculated from hourly data) and daily counts for the pollen variables. The pollutant and meteorological variables were cleaned, and missing hourly values were (where appropriate) imputed before the daily means and maximums were calculated. We have made all of the pre-processed pollen, pollutant and meteorological sensor data described above publicly available at^33,34. The cleaning and imputation methods used are described in³⁵ and the pre-processing tools are available at³⁶.

Regional estimation of environment data using concentric regions

To link each BB symptom report to environment variables, a regional estimation method is used. We started with the requirement that we needed to preserve the anonymity of the study participants while, at the same time, linking their reported symptoms with atmospheric measurements. Postcode regions were selected for the estimations as they provide similar population sizes, are large enough to provide anonymity and they have clearly defined geographic areas which can be estimated using nearby sensor data. If one or more sensors exist in the same region as the BB report, the mean is used. If not, a concentric regions method³⁵ is used to find environment measurements from the closest possible regions. This searches for sensors in those postcode regions directly adjacent to the reporting region and if none are found, searches the next ring, until sensors are found, from which the mean is taken. We have made all of the pollen, pollutant and meteorological sensor regional estimations publicly available at^37,38. The regional estimations methods used are described in³⁵ and the tools used to pre-process are available at³⁹.

Methodological limitations

The methods used in this study do have some limitations, which should be noted here. Firstly, all participants included in the study are self-selected and therefore are unlikely to represent a random sample of the population. For example it is hypothesised that those who downloaded the BB mobile application and regularly recorded symptoms are more likely to be hay fever sufferers.

Another limitation is the inability to measure the effect of taking medication (using the ’Taken Medication?’ response on the BB mobile app) on the outcomes of this study. This is because there is no record of whether participants reported symptoms before or after taking any antihistamines. For this reason, we do not stratify results by the responses to this question, but only use it as a potential extra indicator of symptom severity.

Participants were asked to report once-per-day, but as no limit was set by the mobile application, we selected the latest report for each day, for each participant. User identifiers are present in the 2017–2020 data, but for the 2016 data user identifiers had to be improvised using year-of-birth, gender and postcode location. This will result in multiple reports being included for users that reported from different postcodes on the same day. To a lesser extent, it could also result in multiple users who reported from the same postcode, with the same gender and year-of-birth being treated as a single user. Although we cannot quantify the scale of either of these anomalies, it is not expected that they would significantly effect the comparisons between urban and rural symptom severity reports.

Results

Within this section we split the analysis of our dataset into three stages. First, we compare symptom severity and duration in urban and rural areas, finding that symptom durations are longer, and severity higher, in urban areas for four of the five years studied. Secondly, we explore symptom and environmental data correlations for urban and rural areas for the whole of the UK, finding that correlations between symptoms and pollutants are strong, with relationships more likely to be found in urban than rural areas. Finally, we explore relationships between symptom and environmental data at the regional level, using postcode areas for matching data, and the concentric regions method for filling gaps in the environmental dataset³⁵ (see “Methods”, “Pre-processing” section). These regional correlations are found to be weak, which may be due to the complexity of interactions between pollutants and bio-aerosols⁴⁰ and the variability of human biological response to those interactions²⁰, and/or difficulties accurately sampling environmental data at this fine granularity.

The BB n values (user and report counts) for each year, used for all results are as follows (note that 2016 user counts are calculated using derived user IDs, as described in the “Methods”: “Pre-processing” section): urban-2016 285 users, 9543 reports; urban-2017 133 users, 4028 reports; urban-2018 84 users, 3094 reports; urban-2019 30 users, 697 reports; urban-2020 15 users, 778 reports; rural-2016 59 users, 2033 reports; rural-2017 78 users, 1073 reports; rural-2018 50 users, 1294 reports; rural-2019 19 users 424 reports; rural-2020 8 users, 274 reports.

The environmental data used are pollutant measurements sourced from the Automatic Urban and Rural Network (AURN), and meteorological and pollen measurements from the UK Met Office (MIDAS dataset)^33,34. More details of these measurements, and how they are preprocessed, are included in the Methods section. For all analyses comparing BB reports with environmental data, only data from the months March to September (inclusive) are used, as this is when pollen data is collected and pollen allergies are strongest. In all of our analyses, only the latest report per day, per user is included. To avoid including highly disengaged users, only reports from users who have submitted on \(\ge\) 10 days are used. We perform a between-subjects analysis at the level of symptom reports, as the sporadic reporting that is common in longitudinal citizen science studies and the fact that people tend to report from either urban or rural areas, rather than across both, means that an inferential within-subjects analysis at the level of the participant would be unreliable.

We explore the differences between urban and rural symptom reports as these two location types act as abstract intermediaries for representing / grouping the complex interactions between the different environmental factors and the reported hay fever symptoms. We label the BB symptom scores as urban or rural using ONS land-use classifications (see the Methods section for the criteria used).

UK-wide urban vs rural symptom severity

To investigate the first hypothesis (H1), we compared mean daily symptom severity between land-use types. Reports were classified as urban or rural and we calculated the mean scores for each day, for each location type and symptom combination. These mean scores are then aggregated by year (2016 to 2020). Table 1 shows comparisons between urban and rural mean scores for each symptom (or whether medication was taken), for all months of the year. (See Supplementary Table S1b for comparisons using only BB reports from March-September incl., in which the differences are as pronounced, if not more so.) The first row displays the averages across all years, and the remaining rows display each year. Nested rows show data for each symptom. The diff mean column is the urban mean score minus the rural mean score, so positive values indicate a higher urban mean score. The table shows that, when averaging across all years, symptoms reported from urban locations have a considerably higher severity. For an expanded version of this table which includes urban and rural standard deviations, see Supplementary Table S1a. We used Cohen’s d⁴¹ to measure the effect size between the two means. Effect sizes can be categorised as: 0.01 = very small; 0.2 = small; 0.5 = medium; 0.8 = large; 1.2 = very large; 2.0 = huge⁴². Table 1 also displays the non-parametric Kolmogorov–Smirnov test result (U1), used to compare the distance between the urban and rural daily mean distributions. Higher scores represent a greater difference between the two distribution functions, with a possible range of 0 to 1. Each individual year shows considerably higher severity in urban areas in at least one symptom, except in 2017, where no substantial differences exist. Rural areas record no considerably higher symptoms than urban areas (no increases greater than 0.061), in any year. The overall results indicate generally higher symptom severity scores in urban regions, supporting H1.

Table 1 Urban vs rural symptom severity (all months).

Full size table

As the above comparisons were not performed as part of a controlled study, it is necessary to check for any indication that the positive results are biased, for example by very high numbers of reports from individuals in one land use type. To achieve this, we used a re-sampling bootstrap test to repeat these comparisons multiple times on smaller samples: 2000 random samples (each containing 20% of the total number of BB reports) were taken and the mean of each sample calculated and plotted in a histogram. Figure 2 displays each of the resulting histograms. Each year is displayed in a row (the top row represents all years together) and the columns show nose, eyes, breathing, medication taken and max_score. The gold histograms represent the distributions of urban means and green represent rural. The results illustrate the strength and consistency of the differences between urban and rural means for all symptoms in 2020. Other years show more mixed urban/rural differences, although the differences are still clear with the exceptions of breathing in 2016, nose and max score in 2018 and eyes in 2019. For data across all years (top row), these differences are clear for nose, eyes and max score. As we see the same effect when using this re-sampling method, we can be reasonably confident that it isn’t biased by a few individuals in one group.

UK-wide urban vs rural symptom duration

To further test the validity of H1, we examined the duration of reports with symptoms scores greater than 0 (meaning that at least some symptoms were experienced) for each user, also allowing for single non-reporting days. Results indicate that durations are slightly longer in urban areas than in rural, supporting H1. Table 2 shows the average duration in days for which users report higher symptoms from March to September inclusive, and compares the differences in duration between urban and rural users. The rural and urban columns show the average number of unbroken, chains of higher scoring days per participant. Note that 2016 data has derived user IDs, as described in the “Methods”: “Pre-processing and methodological limitations” sections; as a result of this, chains of symptom reports are more easily broken in this cohort, as reports by the same user can appear to come from different users where they report from different postcode locations. It is not expected that this will affect urban and rural comparisons. The min score column shows the minimum score that must be recorded, for the chain to be unbroken. The allowed gap column shows the number of days that a user is allowed to miss (not submit any report), before the chain is broken. The results indicate that most urban symptom duration means are higher than rural, for chains of days with a minimum score of 1 or 2. No difference is recorded for chains of days with scores of only 3, as both medians are of 1 day chains only, for each symptom.

Table 2 Urban vs rural symptom duration: medians of the user average (mean) duration of symptoms in days (March–Sept incl.).

Full size table

UK-wide correlations for urban and rural locations

To explore the validity of our second hypothesis (H2) that higher levels of pollution lead to worse hay fever symptoms, the BB user reports are again divided into urban and rural locations and compared with environment measures using monthly averages. We use all UK sensors and take monthly averages, to discover if correlations exist at a general, coarse-granularity. All pollen and weather sensor data are used as they are not classified into location types, but for the pollutant dataset, we include only urban background and rural background sensor measurements. We do not use any sensor data from roadside or industrial locations as we are most interested in tracking large-scale regional patterns in pollution, and these could be masked by the local pollution events measured at these sites. We begin by correlating urban and rural symptoms with UK-wide averages for all environmental measures without dividing the pollution sensors into urban and rural locations (Table 3). To check if urban and rural reports are more correlated with their respective sensor types, we also look at correlations between monthly symptoms and pollution measurements, grouped by urban background and rural background sensor type (Table 4). (Note that this is not possible for non-pollutant variables, as some sensors are not classified by location type).

Table 3 Correlations of BB symptom scores with average monthly environment variables (UK-wide, March–Sept incl.).

Full size table

Table 3 shows all statistically significant (p\(\le\)0.05) correlations between monthly means of BB symptoms and monthly means of environment measurements. The results are split between BB urban scores (top section) and BB rural scores (bottom section), and each section is ordered by significance. These results indicate that urban symptoms correlate more highly, and with a wider variety of UK-wide pollutant markers, than rural symptoms. The highest (absolute) monthly correlation is urban: a negative correlation of −0.72 between \(\hbox {SO}_2\) (daily mean) and urban eyes symptoms. The strongest correlations for urban locations are for \(\hbox {SO}_2\), \(\hbox {NO}_x\), and \(\hbox {NO}_2\). No symptoms exhibit a strong correlation with particulate matter pollutants (\(\hbox {PM}_{10}\) and \(\hbox {PM}_{2.5}\)). The rural symptoms have only four significant correlations, with the highest correlation being +0.38 for eyes vs grass (Poaceae) (daily mean). The only factors rural symptoms correlate with are \(\hbox {SO}_2\) and grass and, unlike in urban locations, no rural symptoms correlate with \(\hbox {NO}_x\), \(\hbox {NO}_2\), or \(\hbox {O}_3\) factors. Although the highest rural symptom correlations are weaker than the highest urban symptom correlations, those factors common across both land-use types (\(\hbox {SO}_2\) and grass) have similar magnitude correlations with symptoms.

Grass (Poaceae) pollen has similar positive correlations for both urban and rural areas and the only other pollen with significant correlation is hazel (Corylus spp.), but it is negative, perhaps due to its spring (February-March) peak, which would be inversely related to the later summertime peak of grass. Another pattern worthy of note is that all of the urban symptom correlations with pollutants are negative, except for \(\hbox {O}_3\). Rural symptoms, on the other hand, only have positive correlations with one gaseous pollutant \(\hbox {SO}_2\) (no significant correlations exist between rural symptoms and \(\hbox {O}_3\)).

Table 4 (top section) shows the correlations between average monthly urban BB symptom scores and background urban pollutant sensors. When BB urban scores are compared only with urban background sensors in this way, the scores remain very similar to those found when both urban and rural background sensors are used (Table 3). Once more, for urban locations, all correlations with \(\hbox {O}_3\) are positive, and all correlations with other pollutants are negative. Again, the significant correlations are only with the gaseous pollutants; particulate matter (\(\hbox {PM}_{10}\) and \(\hbox {PM}_{2.5}\)) show no significant correlations.

Table 4 Correlations of BB symptom scores with average monthly pollutant variables (UK-wide, but grouped by sensor location type, March–Sept incl.).

Full size table

Table 4 (bottom section) shows the correlations between average monthly rural BB symptom scores and background rural pollutant sensors. When BB rural scores are only compared with rural background sensors in this way, the highest rural correlation (\(\hbox {SO}_2\) max daily and eyes) has risen to +0.5. Once more, all correlations with the \(\hbox {SO}_2\) pollutant levels are positive, and no significant correlations exist with any other pollutants.

Overall, these correlations at the coarse-grained level, using monthly means and UK-wide environment variables, show that relationships exist between the pollution levels and allergy symptoms, but they are complex and will be explored further in the Discussion section.

Regional-level correlations between BB symptoms and environment variables

To test for relationships between hay fever symptoms and environmental factors within more localised areas, we compare correlations between symptoms and pollen, pollutant and meteorological data at the regional level. Symptom reports (as in all previous analyses, these are limited to a maximum of 1 report per day per user, each including nose, eyes, breathing, and a max score: the maximum of these) are matched with mean environment measurements for the same postcode region. A concentric regions method³⁵ (see “Methods” section, “Pre-processing” sub-section) is used to find environmental measurements where no sensors exist in a postcode area. This can lead to the use (in particular for pollen) of environmental data that is quite distant from the BB reports, and so we also calculate correlations for BB symptom reports only where environmental sensors for the given variable are available within the region. The correlations from both methods are presented for individual days, as well as aggregated for weekly and monthly periods (to allow for any potential lags between potential environmental triggers and symptoms). For this regional analysis, all pollution sensor types (urban background, rural background, industrial and urban traffic) are used.

Table 5 Correlations of BB symptom scores with environment variables, grouped by postcode region.

Full size table

Table 5 shows correlations between environment scores (pollen, pollutants and weather) that have a significance of p (2-tailed) \(\le\) 0.05. Note that none of the meteorological variables correlated with the required significance to be included in this table. The top 3 rows show correlations of daily BB scores with daily mean environment scores using concentric regions and, in this case, only one environmental measurement correlates significantly with any BB symptom. Grass (Poaceae) correlates with BB nose symptoms 0.1054, eye symptoms: 0.109 and max score: 0.1139 (all with p (2-tailed) \(\le\) 0.001). Using correlations between BB symptom reports and environmental variables in the same region only, only one significant correlation, between eyes and grass, occurs (shown in row 4), with no increase in correlation. Repeating the same analysis for weekly and monthly time aggregations leads to improved correlations (shown in rows 5–8 and 16–21), while using environmental sensors in the same region only increased correlations at the monthly level (shown in rows 22–32) (but not at the weekly level; shown in rows 9–15). These aggregations also broaden the range of environmental variables with which symptoms are correlated. The highest correlation (using monthly means and only matching BB and environment data from the same region) was 0.21, for breathing with \(\hbox {SO}_2\) mean. It should be noted that this correlation is lower than any significant correlation reported using the urban/rural grouping for symptom reports at UK-wide level, as described in previous sections. The lack of significant increase in correlations when replacing the concentric regions method with one that uses the same region only, to minimise this distance, suggests a number of possibilities: (a) the postcode regions used are too large, (b) the smaller sample size of BB reports with a sensor in the same region allows the results to be more influenced by noise, or (c) that other factors are at play. Previous research suggests there are a number of reasons which could make strong correlations unlikely. For example, they could be affected by the complex dynamics of atmospheric components^23,43,44 or the spatial heterogeneity of environment factors^45,46, across the postcode regions (and so between the nearest sensor and each BB report).

To illustrate the regional variability of atmospheric components, Table 6 shows how measurement sites for each pollutant, weather and pollen variable correlate within single regions (see bold for median regional correlations), and across the entire UK (see \(\dagger\) for the median inter-sensor correlations for all sensor pairs across the UK). The first 9 measurements in the table all have 2 or more working sensors in at least one postcode region and at least one pair of those sensors has a correlation with a p value of p\(\le\)0.05. The strong correlations for these variables (and, in particular, for pressure, temperature, \(\hbox {O}_3\), \(\hbox {PM}_{10}\), and \(\hbox {PM}_{2.5}\)) between the sensor-pairs both regionally and UK-wide give us confidence that it is reasonable for us to use these sensor data to infer temporal patterns (at least) for these at the regional scale. The lowest performing measurement of this group is \(\hbox {SO}_2\), which has only a median UK-wide correlation of 0.146. Only one region (Belfast) with more than one \(\hbox {SO}_2\) sensor (with a correlation p value of p\(\le\)0.05) has three sensors and a median correlation of 0.219. An expanded version of Table 6, showing the regions with the lowest and highest median inter-sensor correlations , as well as the mean, minimum and maximum sensor-pair correlations for these regions, is presented in Supplementary Table S6. The last 12 rows (all pollen type measurements) do not show any regions containing more than one sensor, so only their UK correlations are displayed. In this latter group, the UK-wide correlations are also considerably lower than the first group, bringing their ability to represent the majority of BB user environment conditions into question: the highest pollen median correlation is grass (Poaceae), at 0.302.

Table 6 Pollution, pollen and weather measurement: median correlations both within single postcode regions (bold), and across the whole UK (\(\dagger\)).

Full size table

Discussion

One potential reason for the reduced differences between urban and rural symptom severity in 2017, might be that according to DEFRA reports (see Fig. 3 taken from⁴⁷, as well as¹²), the number of days with moderate or higher \(\hbox {O}_3\) levels dropped slightly in 2017 before rising sharply and staying relatively high in subsequent years. Another factor worth considering is that 2017 was warmer and wetter than other years⁴⁸. Temperature, precipitation and humidity have been found to have an effect on pollen counts^40,49,50,51 and also potentially on pollution levels or participants’ biological reactions to such factors. Figure 4 displays the correlations between yearly means of urban BB symptoms with (a) relative humidity (daily max) and (b) \(\hbox {O}_3\) (daily mean) respectively. The relative humidity correlation suggests that wetter weather could reduce the severity of some symptoms, or possibly that \(\hbox {O}_3\) increases symptoms. Previous work also suggests that \(\hbox {O}_3\) is associated with warmer weather¹⁵. The presence of any of the above types of phenomena could be (directly or indirectly) related to reductions in urban symptoms and/or the increase in rural symptoms, potentially lessening the gap between the two for that year.

It is worthy of note that there is a negative correlation between symptom severity and all gaseous pollutants in urban areas, except for \(\hbox {O}_3\) which showed significant positive correlations. This indicates an inverse relationship, in urban areas, between ozone and other pollutants, which has been discussed in several recent studies highlighting the weekend effect^15,52,53 where \(\hbox {O}_3\) levels increase (often at weekends) as other pollutants reduce. The inverse relationship can also be seen in annual DEFRA figures (Fig. 3)⁴⁷ in urban areas in years 2018 to 2020. We hypothesise that if any causal effect exists, is likely to be \(\hbox {O}_3\) worsening symptoms, rather than other gaseous pollutants lessening them.

Weaker correlations between BB hay fever symptoms and environmental variables are observed at a regional level, using the postcode regions method (Table 5), which we suggest could be due to any combination of the following complex factors: the spatial mix of pollutant emission sources; interactions between atmospheric components³; and the spatial variability of pollens and some pollutants such as \(\hbox {NO}_2\) and \(\hbox {SO}_2\) (e.g.^45,46,54), which will create distribution patterns that do not map cleanly onto postcode regions. Any relationships are likely to be further obfuscated by the variety of possible human immune responses.

In future Britain Breathing studies, we intend to obtain more information from users about whether they report symptoms before or after taking any antihistamines (using responses to the ’Taken Medication?’ question), so that we can stratify all symptom results by this factor. This would rule out any effects due to users’ interpretations of whether the symptoms should be recorded whilst such medications are active.

Conclusions

The main aim of this study was to investigate the relationship between environmental factors and real-time hay fever symptom reports using experience sampled, cross sectional data from the general population⁹. To capture any potential relationship, we used a simple method of dividing and comparing symptoms according to whether they occur in urban or rural areas as these are (as outlined in the Introduction) reported to vary in pollen counts and types, pollution levels and rates of allergic reactions.

Our overall results indicate that H1 (Seasonal allergy symptoms are worse for those in urban areas than in rural areas.) is supported. When observing differences between rural and urban symptom severity, the associated Kolmogorov-Smirnov tests displayed in Table 1 show that in all years except 2017, urban means are significantly higher than rural means for nose, taken_medication and max_score. The bootstrap re-sampling method results, illustrated in Fig. 2 also show considerable differences across similar year symptom combinations, allowing confidence that these differences are not biased by a few individuals reporting from one land use type. This motivates further investigation into the reason why urban and rural symptom levels were more similar in 2017 and we suggest possible reasons for this in the Discussion section.

Symptom duration, measured in unbroken sets of days of reported symptoms, is also higher in urban locations, suggesting that symptoms in urban areas are not only likely to be more severe, but to also last longer.

The results of analyses performed to test H2 (Higher levels of pollution lead to worse seasonal allergy symptoms.) were less conclusive and indicate a complex relationship between environment variables and allergy symptoms. We have measured UK-wide correlations between hay fever symptoms and environmental factors and compared differences between urban and rural locations (Tables 3 and 4). Results indicate higher (moderate) correlations in urban areas. Whilst urban symptoms correlate more highly with all gaseous pollutants, rural symptoms correlate only with \(\hbox {SO}_2\) and grass pollen.

Data availability

All environmental data described in this publication are publicly available^33,34. Britain Breathing data is not publicly accessible, due to the possibility of identifying individual participants from the location data. Should people have queries about this data or wish to obtain it under a data sharing agreement, they are invited to contact the lead author.

References

Osborne, N. J. et al. Pollen exposure and hospitalization due to asthma exacerbations: Daily time series in a European city. Int. J. Biometeorol. 61, 1837–1848 (2017).
Article ADS PubMed PubMed Central Google Scholar
Lee, S.-Y., Chang, Y.-S. & Cho, S.-H. Allergic diseases and air pollution. Asia Pac. Allergy 3, 145–154 (2013).
Article PubMed PubMed Central Google Scholar
Reinmuth-Selzle, K. et al. Air pollution and climate change effects on allergies in the Anthropocene: Abundance, interaction, and modification of allergens and adjuvants. Environ. Sci. Technol. 51, 4119–4141 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Peeters, S. et al. Association between outdoor air pollution and chronic rhinosinusitis patient reported outcomes. Environ. Health 21 (2022).
Kim, S. H. et al. Allergic rhinitis is associated with atmospheric SO\(_2\): Follow-up study of children from elementary schools in Ulsan, Korea. PloS one 16, e0248624 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wang, H. et al. Ambient air pollutants increase the risk of immunoglobulin e-mediated allergic diseases: A systematic review and meta-analysis. Environ. Sci. Pollut. Res. 1–19 (2022).
Li, S. et al. Association between exposure to air pollution and risk of allergic rhinitis: A systematic review and meta-analysis. Environ. Res. 205, 112472 (2022).
Article CAS PubMed Google Scholar
Anderegg, W. R. et al. Anthropogenic climate change is worsening North American pollen seasons. Proc. Natl. Acad. Sci. 118 (2021).
Vigo, M. et al. Britain breathing: Using the experience sampling method to collect the seasonal allergy symptoms of a country. J. Am. Med. Inform. Assoc. 25, 88–92 (2018).
Article PubMed Google Scholar
Cabrera, M. et al. Association between seasonal allergic rhinitis and air pollution, meteorological factors, and grass pollen counts in madrid (1996 and 2009). J. Invest. Allergol. Clin. Immunol. 29, 371–377 (2018).
Article Google Scholar
Bosch-Cano, F., Bernard, N. & Sudre et al., B. Human exposure to allergenic pollens: A comparison between urban and rural areas. Environ. Res. 111, 619–625 (2011).
National Statistics Concentrations of Ozone. https://www.gov.uk/government/statistics/air-quality-statistics/concentrations-of-ozone. Accessed 07 Aug 2021 (2021).
Heald, C. L. & Spracklen, D. V. Land use change impacts on air quality and climate. Chem. Rev. 115, 4476–4496 (2015).
Article CAS PubMed Google Scholar
Yang, W. & Jiang, X. Evaluating the influence of land use and land cover change on fine particulate matter. Sci. Rep. 11, 1–10 (2021).
ADS Google Scholar
Finch, D. P. & Palmer, P. I. Increasing ambient surface ozone levels over the UK accompanied by fewer extreme events. Atmos. Environ. 237, 117627 (2020).
Article CAS Google Scholar
Strachan, D. P. Hay fever, hygiene, and household size. BMJ Br. Med. J. 299, 1259 (1989).
Article CAS Google Scholar
Wickens, K. et al. Farm residence and exposures and the risk of allergic diseases in New Zealand children. Allergy 57, 1171–1179 (2002).
Article CAS PubMed Google Scholar
Eriksson, J. et al. Growing up on a farm leads to lifelong protection against allergic rhinitis. Allergy 65, 1397–1403 (2010).
Article CAS PubMed Google Scholar
Cooper, P. J. et al. Hygiene, atopy and wheeze-eczema-rhinitis symptoms in schoolchildren from urban and rural Ecuador. Thorax 69, 232–239 (2014).
Article PubMed Google Scholar
Schröder, P. C. et al. The rural–urban enigma of allergy: What can we learn from studies around the world?. Pediatric Allergy Immunol. 26, 95–102 (2015).
Article Google Scholar
Elholm, G. et al. The Danish urban–rural gradient of allergic sensitization and disease in adults. Clin. Exp. Allergy 46, 103–111 (2016).
Article CAS PubMed Google Scholar
Patel, N. P. et al. Urban vs rural residency and allergy prevalence among adult women: Iowa women’s health study. Ann. Allergy Asthma Immunol. 120, 654–660 (2018).
Article PubMed PubMed Central Google Scholar
Atkinson, R. Atmospheric chemistry of VOCS and Nox. Atmos. Environ. 34, 2063–2101 (2000).
Article ADS CAS Google Scholar
Britain breathing. https://britainbreathing.org. Accessed 10 June 2022 (2022).
Reade, S. et al. Cloudy with a chance of pain: Engagement and subsequent attrition of daily data entry in a smartphone pilot study tracking weather, disease severity, and physical activity in patients with rheumatoid arthritis. JMIR mHealth uHealth 5, e37 (2017).
Article PubMed PubMed Central Google Scholar
D’Amato, G. et al. Climate change and air pollution: Effects on respiratory allergy. Allergy Asthma Immunol. Res. 8, 391–395 (2016).
Article PubMed PubMed Central Google Scholar
Hwang, B.-F. et al. Relation between air pollution and allergic rhinitis in Taiwanese schoolchildren. Respir. Res. 7, 1–7 (2006).
Article Google Scholar
Wang, J. et al. Asthma and allergic rhinitis among young parents in China in relation to outdoor air pollution, climate and home environment. Sci. Total Environ. 751, 141734 (2021).
Article ADS CAS PubMed Google Scholar
Dixon, W. G. et al. How the weather affects the pain of citizen scientists using a smartphone app. NPJ Digit. Med. 2, 1–9 (2019).
Article Google Scholar
Office for National Statistics. National Statistics Postcode Lookup User Guide.
Postcode and Geolocation API for the UK. https://api.postcodes.io/. Accessed 27 June 2022 (2022).
Freemaptools. https://www.freemaptools.com/download-uk-postcode-lat-lng.htm. Accessed 27 June 2022 (2022).
Lowe, D., Gledson, A., Topping, D. et al. Britain Breathing 2016–2019 Air Quality and Meteorological Dataset (2021).
Lowe, D., Gledson, A., Topping, D. et al. Britain Breathing 2020 Air Quality and Meteorological Dataset (2021).
Reani, M., Lowe, D., Gledson, A., Topping, D. & Jay, C. UK daily meteorology, air quality, and pollen measurements for 2016–2019, with estimates for missing data. Sci. Data 9, 1–12 (2022).
Article Google Scholar
Lowe, D., Gledson, A., Topping, D. et al. Uom_aq_data_tools (2021).
Gledson, A., Lowe, D., Reani, M., Jay, C. & Topping, D. Britain breathing 2016–2019 air quality and meteorological regional estimates dataset. Zenodohttps://doi.org/10.5281/zenodo.5119234 (2021).
Article Google Scholar
Gledson, A. et al. Britain breathing air quality and meteorological regional estimates dataset. Zenodo. https://doi.org/10.5281/zenodo.5457270 (2020).
Article Google Scholar
Gledson, A. et al. Region estimators. Zenodohttps://doi.org/10.5281/zenodo.5119778 (2021).
Article Google Scholar
Grewling, L. et al. Biological and chemical air pollutants in an urban area of central Europe: Co-exposure assessment. Aerosol Air Qual. Res. 19, 1526–1537 (2019).
Article CAS Google Scholar
Cohen, J. Statistical Power Analysis for the Behavioral Sciences. (Routledge, 2013).
Sawilowsky, S. S. New effect size rules of thumb. J. Mod. Appl. Stat. Methods 8, 26 (2009).
Article Google Scholar
Monks, P. S. Gas-phase radical chemistry in the troposphere. Chem. Soc. Rev. 34, 376–395 (2005).
Article CAS PubMed Google Scholar
Jenkin, M. E. Trends in ozone concentration distributions in the UK since 1990: Local, regional and global influences. Atmos. Environ. 42, 5434–5445 (2008).
Article ADS CAS Google Scholar
Sarnat, S. E. et al. An examination of exposure measurement error from air pollutant spatial variability in time-series studies. J. Exposure Sci. Environ. Epidemiol. 20, 135–146 (2010).
Article CAS Google Scholar
Mangia, C., Gianicolo, E. A., Bruni, A., Vigotti, M. A. & Cervino, M. Spatial variability of air pollutants in the city of Taranto, Italy and its potential impact on exposure assessment. Environ. Monit. Assess. 185, 1719–1735 (2013).
Article CAS PubMed Google Scholar
Days with ’moderate’ or higher air pollution (includes sulphur dioxide). https://www.gov.uk/government/statistics/air-quality-statistics/days-with-moderate-or-higher-air-pollution-includes-sulphur-dioxide. (contains public sector information licensed under the Open Government Licence v3.0. https://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/). Accessed 03 July 2022 (2022).
Britain’s summer 2017 was wetter but also warmer than average. https://www.theguardian.com/uk-news/2017/sep/01/britains-summer-2017-was-wetter-but-also-warmer-than-average. Accessed 07 Aug 2021 (2021).
Bartková-Ščevková, J. The influence of temperature, relative humidity and rainfall on the occurrence of pollen allergens (Betula, Poaceae, Ambrosia artemisiifolia) in the atmosphere of Bratislava (slovakia). Int. J. Biometeorol. 48, 1–5 (2003).
Article ADS PubMed Google Scholar
Janati, A., Bouziane, H., del Mar Trigo, M., Kadiri, M. & Kazzaz, M. Poaceae pollen in the atmosphere of Tetouan (NW Morocco): Effect of meteorological parameters and forecast of daily pollen concentration. Aerobiologia 33, 517–528 (2017).
Fernández-González, M., Ribeiro, H., Pereira, J., Rodríguez-Rajo, F. & Abreu, I. Assessment of the potential real pollen related allergenic load on the atmosphere of Porto city. Sci. Total Environ. 668, 333–341 (2019).
Article ADS PubMed Google Scholar
Sicard, P. et al. Ozone weekend effect in cities: Deep insights for urban air pollution control. Environ. Res. 191, 110193 (2020).
Article CAS PubMed PubMed Central Google Scholar
Diaz, F. M. et al. Ozone trends in the United Kingdom over the last 30 years. Atmosphere 11, 534 (2020).
Article ADS CAS Google Scholar
Adams-Groom, B. et al. Regional calendars and seasonal statistics for the United Kingdom’s main pollen allergens. Allergy Eur. J. Allergy Clin. Immunol. 75, 1492–1494 (2020).
Article Google Scholar

Download references

Acknowledgements

Our thanks to the University of Exeter and Met Office for access to the Medical & Environmental Data Mash-up Infrastructure (MEDMI) database, development of which was funded by Medical Research Council (MRC) and Natural Environment Research Council (NERC) grants. We would like to acknowledge the contribution of Markel Vigo, Andy Brass, Lamiece Hassan and William Vance who designed, developed and validated the first version of the app. The authors would also like to acknowledge the assistance given by Research IT, both for the use of the Computational Shared Facility at The University of Manchester, which was used for finding postcodes from geographical co-ordinates; and also for the Mobile Service Application Development team for their ongoing support of the app. The work on this data-set was supported by the Alan Turing Institute grant ”Understanding the relationship between human health and the environment”. This work was also supported by the NERC Digital Solutions Programme (NE/V004069/1). Prior work on BritainBreathing (during 2016) has received funding from the following organizations and grant schemes: Biotechnology and Biological Sciences Research Council Activating Impact award; British Society for Immunology; Medical Research Council award (MR/K006665/1), funded via the Health eResearch Centre; and Wellcome Trust Institutional Strategic Support Fund (105610/Z/14/Z).

Author information

Authors and Affiliations

Research IT, University of Manchester, Manchester, UK
Ann Gledson, Douglas Lowe, Adrian Harwood & Joshua Woodcock
School of Management and Economics, The Chinese University of Hong Kong, Shenzhen, China
Manuele Reani
Department of Earth and Environmental Sciences, University of Manchester, Manchester, UK
David Topping
Department of Computer Science, University of Manchester, Manchester, UK
Caroline Jay
Department of Mathematics, University of Manchester, Manchester, UK
Ian Hall
Division of Infection, Immunity and Respiratory Medicine, University of Manchester, Manchester, UK
Sheena Cruickshank

Authors

Ann Gledson
View author publications
You can also search for this author in PubMed Google Scholar
Douglas Lowe
View author publications
You can also search for this author in PubMed Google Scholar
Manuele Reani
View author publications
You can also search for this author in PubMed Google Scholar
David Topping
View author publications
You can also search for this author in PubMed Google Scholar
Ian Hall
View author publications
You can also search for this author in PubMed Google Scholar
Sheena Cruickshank
View author publications
You can also search for this author in PubMed Google Scholar
Adrian Harwood
View author publications
You can also search for this author in PubMed Google Scholar
Joshua Woodcock
View author publications
You can also search for this author in PubMed Google Scholar
Caroline Jay
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.G. conceived and conducted the analysis, and wrote the manuscript. C.J., D.L., D.T., M.R. and I.H. reviewed/improved analysis methods and edited the manuscript. S.C. leads the Britain Breathing project and reviewed the manuscript. A.H. wrote the later version of the mobile app and edited the manuscript. J.W. wrote the later version of the mobile app.

Corresponding author

Correspondence to Ann Gledson.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gledson, A., Lowe, D., Reani, M. et al. A comparison of experience sampled hay fever symptom severity across rural and urban areas of the UK. Sci Rep 13, 3060 (2023). https://doi.org/10.1038/s41598-023-30027-x

Download citation

Received: 06 July 2022
Accepted: 14 February 2023
Published: 21 February 2023
DOI: https://doi.org/10.1038/s41598-023-30027-x

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

UK daily meteorology, air quality, and pollen measurements for 2016–2019, with estimates for missing data

Investigating the spatiotemporal associations between meteorological conditions and air pollution in the federal state Baden-Württemberg (Germany)

Strong variations in urban allergenicity riskscapes due to poor knowledge of tree pollen allergenic potential

Introduction

Hypothesis 1

Hypothesis 2

Methods

Overview

Data collection

Britain breathing data

Britain breathing participants

Land-use data

Environment data

Pre-processing

Britain breathing data

Environment data

Regional estimation of environment data using concentric regions

Methodological limitations

Results

UK-wide urban vs rural symptom severity

UK-wide urban vs rural symptom duration

UK-wide correlations for urban and rural locations

Regional-level correlations between BB symptoms and environment variables

Discussion

Conclusions

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links