Introduction

Acute lymphocytic leukaemia (ALL) and non-Hodgkin lymphoma (NHL) are among the most common cancers in childhood in developed countries, in that order [1, 2]. The reported increasing incidence rates seen in the US and in other developed countries in pediatric lymphoid leukaemia rates [3], in particular for B-cell precursor ALL (but not for T-ALL nor acute myeloid leukaemia (AML)) [4], and NHL [3, 5] point towards a role for the modern lifestyle. To date there are relatively few well-established environmental risk factors for such paediatic cancers. For ALL these include ionising radiation [6, 7], heavy birth weight [8,9,10,11] and male sex [10]. NHL is not generally thought to be associated with ionising radiation exposure [6, 7]. There is a modest but significant socio-economic gradient for ALL, at least in the UK and USA, with increasing rates of ALL in higher socioeconomic classes [12,13,14]. There is also a substantial body of data suggesting a role for infections in childhood ALL [15,16,17]. The aetiology of NHL in childhood is poorly understood, the only well-established risk factors being those associated with dysregulation of the immune system [18]. There are known difference in rates of most common types of childhood cancer between racial/ethnic groups [19].

A Finnish cancer incidence study suggested that rates of childhood ALL slightly increased during the lighter part of the year, the increase being most pronounced for children aged 2-4 [20]. A study of childhood haematological malignancies in France demonstrated increasing rates of precursor B-cell ALL with increasing levels of solar ultraviolet radiation (UVR), but no significant variation of NHL [21]. A Greek case/control study reported significant reduction in rates of childhood NHL associated with more than 15 days annual sunbathing [22]. A meta-analysis of population-level cancer incidence data from 57 countries suggested decreased risk of ALL and increased risk of NHL in childhood associated with increased solar exposure, although only the former trend was significant (p < 0.01) [23].

In the current study we analyse childhood ALL and NHL risk in relation to solar exposure in the current Surveillance, Epidemiology and End Results (SEER) data [2] aggregated at the county level. We shall consider a number of different metrics of ambient solar exposure, and assess variations in solar-associated risk of ALL and NHL by sex and major racial/ethnic group.

Materials and methods

Study population

County level SEER22 data for cases diagnosed in 2000–2020 was used [2] in population-based SEER cancer registries, restricting to ALL and NHL cases under the age of 20 (not inclusive). We included registries pertaining to parts of the states of California, Connecticut, Georgia, Idaho, Illinois, Iowa, Kentucky, Louisiana, Massachusetts, New Jersey, New Mexico, New York, Texas, Utah and Washington; as detailed in the Supplement A (Methods) certain other states in SEER22 were omitted for various reasons. In the analytical cohort, there were a total of 1078 counties, with total populations (under age 20) ranging from 25 to 2.68 ×106, with mean 3.67 ×104.

ALL was defined by the lymphoid neoplasm recode 2021 Revision of ‘2(a)1 Precursor Non-Hodgkin lymphoma, B-cell’, and ‘2(b)1 Precursor Non-Hodgkin lymphoma, T-cell’ [24]. For cases diagnosed in 2000–2012, we additionally included ALL cases that had a histology code of the international classification of disease for oncology (ICD-O-3) ‘9727’ [25].

NHL was defined using ‘2(a)2 Mature Non-Hodgkin lymphoma, B-cell’, ‘2(a)3 Non-Hodgkin lymphoma, B-cell, NOS ‘, and ‘2(b)2 Mature Non-Hodgkin lymphoma, T-cell’, or with a ICD-O-3 histology code of ‘9832’ [24]. For cases diagnosed in 2013–2020, we additionally included NHL cases with a histology code of ‘9727’ [25].

We restricted analysis to the four main racial/ethnic groups, namely white non-Hispanic, black non-Hispanic, Hispanic (all races), and non-Hispanic Asian or Pacific Islanders. Further details of groups excluded are detailed in the Supplement A (Methods). The county population-year counts used in the calculation of population-years (somewhat analogous to person–years) at risk were based on the 2000 U.S. standard population (single ages to 84 – Census P25-1130). Given the known difference in childhood cancer rates between these racial/ethnic groups [19], and the geographical heterogeneity of distribution of the various racial/ethnic groups, analysis of exposure response could be potentially confounded. We therefore adjusted for racial/ethnic group in all analyses.

Solar radiation exposure assessment

The AVerage daily total GLObal solar radiation (AVGLO) estimates that are employed are derived used the National Solar Radiation Database (NSRAD) produced by the National Renewable Energy Laboratory (NREL) under the US Department of Energy’s Resource Assessment Program. This is the largest ground-based solar measurement network in the US, containing statistical summaries computed from hourly measurement data (with some infilling for missing data) for 239 US radiation stations for the period 1961–1990, including monthly, yearly, and 30-year average global solar radiation measures, and gives estimates of ambient solar exposure cumulated over a day, measured in W hour/m2. We employ county-level interpolations developed by Tatalovich et al. [26] which deliver estimates of potential solar ambient irradiance (~100–3000 nm) at 1 km² resolution in the mainland US. Linkage of SEER data to this interpolated AVGLO exposure database was via the county-level Federal Information Processing System (FIPS) code. Further details, in particular details of data mislinkage (because of incompleteness in either the SEER or AVGLO data), are given in the Supplement A (Methods).

Two candidate exposure metrics are suggested a priori, namely UVR irradiance (in units of mW/cm2), which is proportional to UVR power density on a surface, or UVR cumulative radiant exposure (in units of MJ/cm2), which is proportional to cumulative solar UVR energy deposition on a surface. These are measures of UVR exposure recommended by the Commission Internationale de l’Eclairage (CIE) [27]. As noted in the Supplement A (Methods) AVGLO is approximately proportional to the average total solar irradiance (in units of mW/cm2). We shall also use the measure of cumulative radiant exposure (in units of MJ/cm2). The derivation of both measures is explained in more detail in the Supplement A (Methods), and is as previously employed [28,29,30].

Statistical analysis

Because of marked under-dispersion, with variance generally reduced by about 10% over the Poisson-expected rates in certain race-sex subgroups for both disease endpoints, a quasi-likelihood model was used for all model fits and tests of significance [31]. The model assumes that the expected number of cases in the stratum with population-years \(P{Y}_{i}\), after UVR exposure, \({H}_{i}\) (using either irradiance or cumulative radiant exposure), with various other explanatory covariates, \({X}_{i}=({X}_{ij})\), is given by:

$$P{Y}_{i}\exp \left[\alpha {H}_{i}+\mathop{\sum}\limits_{j}{X}_{ij}{\beta }_{j}\right]$$
(1)

The population in each year and subgroup defined by the stratification is summed over each separate calendar year to give the population-year total \(P{Y}_{i}\) for that subgroup. Model fitting is performed in R [32] using the glm function. Other variables used for adjustment were taken from a set of demographic/socioeconomic variables measured at county level. The variables measured are described in Supplement A Table A1. To avoid variables that could potentially soak up the effect of UVR exposure, we exclude any which had absolute value of the (Pearson) correlation with UVR irradiance of 0.1 or greater. In order to avoid over-parameterised models, the Akaike Information Criterion (AIC) [33, 34] was employed to select the optimal subset of descriptive variables from this set. A mixed forward-backward stepwise algorithm was used to select the set of variables minimising AIC, using R [32]. In order to test the effect of excluding those baseline variables with correlation >0.1, this restriction was relaxed, and AIC used to select the optimal subset of descriptive variables again. We also performed sensitivity analysis via model fits in which the demographic/socioeconomic variables were omitted. Profile-likelihood confidence intervals (CI) were estimated from the quasi-likelihood [31]. All statistical tests were two-sided.

Results

Among the four main racial/ethnic groups analysed here there are 30,349 ALL cases and 8062 cases of NHL among a population with 831,424,805 population-years of follow-up (Tables 1 and 2).

Table 1 Cases of acute lymphocytic leukaemia (ALL) and non-Hodgkin lymphoma (NHL), and population-years of observation under the age of 20 by sex in the SEER 22-registry data, covering the years 2000–2020.
Table 2 Major risk factors of acute lymphocytic leukaemia and non-Hodgkin lymphoma, in the SEER 22-registry dataa.

For ALL there are highly significant effects of age (with risk varying in a U-shaped manner, peaking at ages 2-3 years), sex (risk for males ~1.29× females), racial/ethnic group (risks for black non-Hispanics ~0.57× white non-Hispanics, risks of Hispanics ~1.33× white non-Hispanics, risks of Asian/Pacific Islanders ~0.94× white non-Hispanics), median rent (with risk decreasing with cheaper rent), heterogeneity by SNAP and increasing trends over time (Table 2). For NHL there are highly significant effects of age (with risk increasing with age), sex (risk for males ~1.87× females), racial/ethnic group (risks for black non-Hispanics ~0.88× white non-Hispanics, risks of Hispanics ~0.82× white non-Hispanics, risks of Asian/Pacific Islanders ~1.04× white non-Hispanics), median rent (risk decreasing in lower rental-cost accommodation), heterogeneity by SNAP and increasing trends over time (Table 2). All heterogeneity p-values are highly significant (p < 0.0001).

The stepAIC algorithm suggests that the optimal background model for ALL includes age, racial/ethnic group, sex, median rent, calendar year, age × racial/ethnic group, age × sex, age × calendar year, racial/ethnic group × calendar year in that order; for NHL the optimal set of explanatory variables are age, sex, racial/ethnic group, median rent, calendar year, SNAP, age × sex, and racial/ethnic group × sex, in that order.

Table 3 and Fig. 1 demonstrate that using this model (and obvious simplifications [omitting the interaction terms in racial/ethnic group and sex] in the racial/ethnic group x sex subgroups) there is a highly significant trend of ALL with UVR irradiance, with relative risk (RR) = 1.200/mW/cm2 (95% CI 1.060, 1.359, p = 0.0040). There are increasing trends of ALL with solar exposure in various racial/ethnic group/sex subgroups, in particular in Hispanic boys (p = 0.0007) and girls (p = 0.0020) (Table 3). However, there is only a borderline significant increasing trend of ALL incidence with UVR cumulative radiant exposure, with RR = 1.444/MJ/cm2 (95% CI 0.949, 2.197, p = 0.0865), although there are significant increasing trends of ALL with UVR cumulative radiant exposure among Hispanic boys (p = 0.0337) and girls (p = 0.0133) (Table 3). Very similar results are obtained if demographic/socioeconomic variables are not used for adjustment (Table 3). Supplement A Table A2 demonstrates that there is evidence of modification of trend RR by race, for ALL (all but one p-value < 0.05), although very little evidence of such modification for sex (all p-values > 0.5). The largest UVR irradiance trend RR for ALL is for Hispanics with RR = 1.517 /mW/cm2 (95% CI 1.254, 1.837), compared with RR = 1.079 /mW/cm2 (95% CI 0.896, 1.300) for White non-Hispanics, RR = 0.903 /mW/cm2 (95% CI 0.537, 1.516) for Black non-Hispanics and RR = 0.708 /mW/cm2 (95% CI 0.442, 1.136) for Asian or Pacific Islanders (Supplement A Table A3). Likewise the largest UVR cumulative radiant exposure trend RR for ALL is for Hispanics with RR = 2.593 /MJ/cm2 (95% CI 1.387, 4.859), compared with RR = 0.999 /MJ/cm2 (95% CI 0.521, 1.913) for White non-Hispanics, RR = 1.504 /MJ/cm2 (95% CI 0.288, 7.772) for Black non-Hispanics and RR = 0.245 /MJ/cm2 (95% CI 0.047, 1.283) for Asian or Pacific Islanders (Supplement A Table A3).

Table 3 Trends in relative risk of acute lymphocytic leukaemia in relation to solar ultraviolet radiation exposure.
Fig. 1: Relative risk (+95% CI) in relation to ambient solar UVR exposure.
figure 1

a acute lymphocytic leukaemia (ALL) in relation to UVR irradiance, (b) acute lymphocytic leukaemia (ALL) in relation to UVR cumulative radiant exposure (c) non-Hodgkin lymphoma in relation to UVR irradiance, (d) non-Hodgkin lymphoma in relation to UVR cumulative radiant exposure. Fitted models are the most complete (all racial/ethnic group, both sexes) AIC-optimal models given in Tables 3 and 4.

By contrast, Table 4 and Fig. 1 demonstrate that solar exposure appears protective for NHL, with decreases in incidence for increasing UVR irradiance with RR = 0.646 /mW/cm2 (95% CI 0.512, 0.816, p = 0.0002). This is also observed, if sometimes only at borderline levels of statistical significance, in a number of racial/ethnic/sex subgroups, in particular in white non-Hispanic boys and girls (p = 0.0348, p = 0.0282, respectively), and Hispanic boys (p = 0.0024) (Table 4), although not in Asian or Pacific Islander children, among whom there are indications of increased NHL risk with increasing irradiance, borderline significant for males (p = 0.1074) and significant (p = 0.0282) for females. There is also a highly significant decreasing trend of NHL with UVR cumulative radiant exposure, with RR = 0.284/MJ/cm2 (95% CI 0.166, 0.485, p < 0.0001), with again significant or borderline significant decreasing trends in white non-Hispanic boys (p = 0.0033) and girls (p = 0.0666) and in Hispanic boys (p = 0.0008) (Table 4). By contrast, there are indications of increased NHL incidence with increasing UVR cumulative radiant exposure among Asian or Pacific Islander children, which for girls is significant (p = 0.0074). Very similar results are obtained if demographic/socioeconomic variables are not used for adjustment (Table 4). Supplement A Table A2 demonstrates that there is evidence of modification of RR by race for NHL (all p-values < 0.05), although little evidence of such modification for sex (all p-values > 0.05). The lowest UVR irradiance trend RR for NHL is for Hispanics with RR = 0.545/mW/cm2 (95% CI 0.359, 0.829), compared with RR = 0.602 /mW/cm2 (95% CI 0.434, 0.833) for White non-Hispanics, RR = 0.578/mW/cm2 (95% CI 0.277, 1.200) for Black non-Hispanics and RR = 2.059/mW/cm2 (95% CI 0.933, 4.574) for Asian or Pacific Islanders (Supplement A Table A3). Likewise the lowest UVR cumulative radiant exposure trend RR for NHL is for Hispanics with RR = 0.218/MJ/cm2 (95% CI 0.123, 0.386), compared with RR = 0.336/MJ/cm2 (95% CI 0.188, 0.600) for White non-Hispanics, RR = 0.511/MJ/cm2 (95% CI 0.250, 1.041) for Black non-Hispanics and RR = 0.331/MJ/cm2 (95% CI 0.155, 0.708) for Asian or Pacific Islanders (Supplement A Table A3).

Table 4 Trends in relative risk of non-Hodgkin lymphoma in relation to solar ultraviolet radiation exposure.

Table 5 demonstrates that there is no significant variation in risk of ALL in different age groups whether in relation to irradiance (p = 0.9087) or cumulative radiant exposure (p = 0.3174). Nevertheless, there is reduction in trend RR with increasing age, both in relation to irradiance or cumulative radiant exposure, with large (and significant) risks for the age group 0–3, particularly in relation to cumulative radiant exposure; so that for example for ages 0–3 RR = 1.269 /mW/cm2 (95% CI 1.024, 1.573) and RR = 20.58 /MJ/cm2 (95% CI 1.559, 272.0), both measures generally decreasing with increasing age, with in some cases trend RR < 1 for older ages. Table 5 also shows that the evidence for such heterogeneity for NHL is stronger, at least for irradiance (p = 0.0113), although less so for cumulative radiant exposure (p = 0.1844). As for ALL, it is notable that there are large (and significant) risks for NHL for the age group 0–3, in relation to both UVR metrics and particularly for cumulative radiant exposure; so that for example for ages 0–3 RR = 2.483/mW/cm2 (95% CI 1.129, 5.463) and RR = 21,881/MJ/cm2 (95% CI 1.960, >106), both measures generally decreasing with increasing age, with in most cases trend RR < 1 for older ages. Very similar results are obtained if the analyses omit adjustment for the various demographic/socioeconomic variables.

Table 5 Variation of risk of acute lymphocytic leukaemia and non-Hodgkin lymphoma modified by age, using either UVR cumulative radiant exposure (MJ cm−2) or UVR irradiance (mW cm−2).

Supplement A Table A4 illustrates the effect of relaxing the restriction on baseline variables having correlation with UVR < 0.1. As can be seen, comparing also with the results in Tables 3 and 4, the effect of allowing these extra variables to be used is to generally weaken the UVR-associated trends, whether the generally positive trends for ALL, or the generally negative trends for NHL. Supplement A Table A5 demonstrates that there is no significant variation in risk of ALL in different median rent groups whether in relation to irradiance (p = 0.8023) or cumulative radiant exposure (p = 0.8974); Supplement A Table A6 shows that the evidence for such heterogeneity for NHL is scarcely stronger, whether in relation to irradiance (p = 0.1628) or cumulative radiant exposure (p = 0.1646). There is high negative correlation between solar exposure and latitude, with \(\rho =-0.729\) (data not shown).

Discussion

The present study has demonstrated a highly significant increase in risk of ALL for age <20 with increasing levels of solar UVR exposure (Table 3). However, the significance of the increasing trend is only seen in the Hispanic group. Although there is some overlap of RR and CI for irradiance and cumulative radiant exposure for the four ethnic groups, nevertheless there is significant heterogeneity by ethnic group using both UVR metrics (most p < 0.05), with trend RR highest for Hispanics (Supplement A Tables A2, A3). The increase is seen using UVR irradiance, and much less strongly using UVR cumulative radiant exposure. By contrast, there are highly significant decreases in risk of NHL, using either measure of UVR exposure (Table 4). Although there is some overlap of RR for irradiance and cumulative radiant exposure for the four ethnic groups, nevertheless there is significant heterogeneity by ethnic group using both UVR metrics (all p < 0.05), with trend RR lowest for Hispanics, and for most other racial/ethnic groups trend RR < 1 (Supplement A Tables A2 and A3). Both for ALL and NHL the risk appears to be concentrated in the youngest age group (0–3 years), and in both endpoints and all UVR metrics, but particularly for cumulative radiant exposure, the trend RR is increased in this age group (Table 5).

Our findings of increased risk of ALL with increasing solar exposure parallel those found in a large recent French ecological study of childhood cases, based on high quality ground-based UV measurements, somewhat similar to the AVGLO measurements used in the present study, and which demonstrated increasing risk of precursor B-cell ALL with elevated UVR, with RR = 1.41 (95% CI 1.13, 1.69) per 100 J/cm2 /day [21]. This is similar to the figure we derive of 1.235 per 100 J/cm2 /day (95% CI 1.070, 1.426) (Table 3). A Finnish study suggested that rates of childhood ALL slightly (but non-significantly) increased during the lighter part of the year (April-September), the increase being most pronounced, about 18% (and borderline significant) for children aged 2–4 [20]. However, it is the studies of long-term solar exposure which are more comparable with the structure of our data. A Californian study using AVGLO UVR data suggested a mild and borderline significant (p = 0.042) protective effect of increased UVR on early childhood (under age 5 year) ALL rates [35]. This study also showed a protective effect of UVR in children of Hispanic and Black mothers [35], again in contrast to our findings (Table 3, Supplement A Table A3). A meta-analysis of population-level cancer incidence data by registry suggested that rates of childhood ALL significantly increased (p < 0.01) with increasing degrees of registry latitude [23], again suggestive of a protective effect. A study of all-age leukaemia mortality data in Spain observed decreases with increasing latitude, suggestive of elevated UVR-associated risk [36].

Our findings of a protective effect of solar exposure on NHL (albeit not for ages 0–3) should be compared with those found in a French ecological study which reported a non-significant increased risk of NHL of RR = 1.26 (95% CI 0.89, 1.87) per 100 J/cm2 /day [21]. This is very much higher, and inconsistent with the significant negative trend we derive of RR = 0.603 per 100 J/cm2 /day (95% CI 0.461, 0.790) (Table 4). A meta analysis, conducted at the level of cancer registry (including 75 registries among 57 countries at all levels of economic development) reported weak and non-significant decreasing trends of childhood NHL with increasing registry latitude [23], suggesting a weak positive trend with UVR. Kim et al. [37] conducted a systematic review and meta-analysis of 17 case-control and 9 cohort studies, all but two covering the full age range, and all relating to developed countries (Australia, Denmark, Europe, France, Germany, Greece, Italy, Norway, Singapore, Sweden, UK, USA) mostly in the northern hemisphere, suggested that NHL risk decreased with increasing personal sunlight exposure in relation to a number of metrics, relating both to solar exposure in childhood and in adulthood. On the other hand, Lu et al. [38] conducted a systematic review and meta-analysis of 10 case-control and 1 cohort studies of occupational exposure, again of developed countries (Australia, Denmark, Europe, Germany, Singapore, Sweden, USA) mostly in the northern hemisphere, which suggested a weak positive association of NHL with solar exposure. Petridou et al. [22] studied 87 cases of NHL at ages 0–14 from a Greek national network of oncology units together with 164 age/sex-matched controls. Average time spent sunbathing per year was determined by interviewing guardians. There was a significant (p = 0.002) reduction in rate of childhood NHL incidence associated with more than 15 days annual sunbathing. The Interlymph Consortium case-control study, analysed by combining trends for each centre, each one in a developed country (Australia, Canada, Denmark, France, Germany, Ireland, Italy, Spain, Sweden, UK, USA) mostly in the northern hemisphere, using meta-analytic tools, suggested weak protective effects of UVR for NHL at ages 10–17 in relation to daily hours of UVR exposure, which were borderline significant (p = 0.06) in relation to total sun exposure [39]. A small Californian study using the same AVGLO assessments of UVR as employed here suggested a modest and non-statistically significant (p > 0.1) protective effect of increased solar UVR on childhood NHL, in particular in relation to children of Hispanic and White mothers [35]. There are also a number of all-age studies, in particular Grant [36] who documented increasing trends of all-age NHL mortality with increasing latitude in Spain, suggesting a protective effect of solar exposure. Analysis of all-age NHL incidence in the US at the level of cancer registry, with UVR assessed via data from the satellite-based Total Ozone Mapping Spectrometer (TOMS) database, documented lower incidence among those registries with higher UVR quintile for many subtypes of NHL [40]. A defect of certain of these studies [21, 23, 35] is their ecological design. In all save the French [21] and Californian [35] studies the measures of solar exposure are crude or non-existent, and many studies [36,37,38] are not confined to cancer in childhood. Latitude, which is used in many, is only a proxy for UVR exposure, in particular will not take account of local climatic factors such as cloud cover or height above sea level, which are also known to be important, and so these apparently conflicting findings are perhaps unsurprising.

The EUROSUN UVR data used by Coste et al. [21] is probably the strongest UVR measure apart from the AVGLO data employed here and in the study of Lombardi et al. [35]. The EUROSUN UVR data is satellite based, although with validating ground-based data, so contrasting with the purely ground-based AVGLO data used here. France spans approximately 41.6–51.0 degrees North in latitude, whereas the 15 states in our study cover the range of 25.8–49.0 degrees North, thereby providing us with a much wider variation in UVR irradiance.

Our finding of concentration of UVR risk for ALL among those aged 0–3 (Table 5) is not without precedent. The study of Coste et al. [21] also reported a significant interaction (p = 0.007) between UVR-associated ALL risk and age, so that only for age under 5 years was the trend significant. However, the concentration of UVR risk for NHL among those aged 0–3 (Table 5) is more novel. These findings suggest powerful factors operating soon after (or before) birth. However, like all novel findings, they require replication.

A major strength of our study is the large size, using prospectively gathered cancer status data, which is linked with an independent set of county-level solar exposure measures. The availability of a rich set of lifestyle and environmental measures, albeit ecological (measured at the level of county) is also a strength. However, we were not able to examine the UVR effect in relation to different histological subtypes of ALL/NHL with varying aetiologies. The solar exposure measurements used in our study are based on interpolated solar exposure measurements derived from a 30-year series of measurements at 215 measurement stations distributed across the contiguous 48 US states [26]. What is used here is therefore a climatic average for a region and does not take account of year-to-year variations in solar exposure. The spatial resolution, to the level of US counties is a potential limitation, although the evidence is that UVR does not vary much over even relatively large (100 km square) geographical units [41]. Solar exposure of an individual living at a specific location will exhibit much greater fluctuations than ambient variation because of differences in time spent outdoors and proximity to shade on different days throughout the year. Furthermore, the solar UVR dose absorbed by the skin (assuming that to be the relevant factor) will be further modified by the use of photoprotective agents such as hats, clothing and sunscreens. There is evidence that people tend to cover up more at lower latitudes [42], which implies that personal level exposures might be less than indicated by the ambient exposure data, suggesting a likely underestimation of the slope of the cancer-solar irradiance response. We also note that there is evidence for different use of sun protective measures amongst different racial/ethnic groups [43,44,45]. Our analyses (Supplement A Tables A5 and A6) demonstrate no evidence of socioeconomic modifiers of UVR risk. However, the variation of solar exposure from year to year in this measurement set is relatively slight [46] and climatic norms will give far more representative sunlight values for the region of interest. Given that disease counts and underlying estimates of populations in our database are available only for complete calendar years, this may not matter too much, but inevitably there will be inaccuracies in assessing exposure, even assuming it was known which solar exposure metric, UVR irradiance vs UVR cumulative radiant exposure, was the more relevant, about which information is lacking. The use of these two methods of measuring solar exposure is a novelty of our study, which has not been attempted hitherto.

It is not known what the relevant exposure period for solar exposure is likely to be for the paediatric cancers considered here. It is suspected that the relevant exposure for such cancers is very early in life, as our analysis of interaction of UVR-associated risk with age might suggest (Table 5). As such correlating with solar exposure at diagnosis, an inescapable feature of the SEER registry data is likely to be not altogether the correct thing to do. However, with paediatric cancers, and particularly ALL, most of which occur under the age of 5, one can be much more confident that the solar exposure being measured relates to the entire duration of life up to the point of development of cancer. There are some mechanistic grounds for supposing that our findings with respect to ALL can be interpreted causally. Brady et al. [47] found Signature 7 UV-related mutations were enriched among a group of aneuploid pediatric ALL cases, specifically in cases with gross chromosomal alterations, including hyperdiploid (detected in 17% of samples), near haploid (35% of samples) and iAMP21 B-ALL (46% of samples). Another possible mechanism is UV-associated immunosuppression, for which there is some evidence [48]. The interaction of ALL with the immune system is complex and many autoimmune diseases exhibit elevated risk of ALL [49]. It is also clear that ALL can inhibit the immune system [50]. The protective effects of circulating vitamin D levels on many types of cancer are reasonably well known, possibly mediated via cellular gap-junctional mechanisms [51, 52], reinforced by findings of reductions in cancer mortality following vitamin D supplementation [53] although there no effects of supplementation on cancer incidence [54, 55].

As with all other studies of solar radiation our study does not take account of population migration. In our data, solar exposure is linked to the SEER county of residence at diagnosis. The effect of this is that a proportion of the population in each area, which would be larger with increasing age, will have expected solar exposure which in the worst case, of in-migration from anywhere within the US, will correspond to the US average, so that the variation in true solar exposure (of the underlying population) between areas will be to some extent over-estimated. There will also be Berkson errors resulting from applying the group means to the individual exposures, but at least to first order the effect of these on trend estimates will again be minimal [56], although uncertainties could be underestimated [57]. That said, the great advantage of studying childhood cancers is that they occur early in life, so that the effects of population migration should not be too serious [58,59,60]. Indeed Bell and Belanger [59] reviewing the literature on residential mobility about the time of birth in various developed countries, noted explicitly that most mothers, when they move, stayed within the same county; there was little variation between countries in this respect. Another source of error is the determination of Hispanic origin in SEER. This employs an algorithm using the patient’s surname, so that misclassification of certain cases is possible. However, as such misclassification is unlikely to vary with degree of UVR exposure it will probably not introduce bias in assessments of cancer risk in relation to UVR exposure.

In summary, our findings of a protective effect of solar exposure on NHL (albeit not for ages 0–3) are generally supported by those of a number of previous, but generally lower quality, ecological studies. Our more novel finding, of an adverse effect of solar exposure on childhood ALL, is consistent with those of a large population-based study in France, but to some extent inconsistent with findings in a case-control study in California with little UVR variation [35] and lower quality ecological studies. However, the evidence is not entirely clear-cut and all findings are in need of replication, perhaps by using individual-level data, where solar exposures can be assigned more accurately.