Main

Chronic infections, including Epstein–Barr virus, human herpesvirus 8, human T lymphotropic virus type I, Plasmodium falciparum, hepatitis B virus, hepatitis C virus (HCV), Helicobacter pylori, Campylobacter jejuni, Chlamydia psittaci, Borrelia burgdorferi and human immunodeficiency virus, have been linked to the pathogenesis of non-Hodgkin lymphoma (NHL) (Hjalgrim and Engels, 2008), a heterogeneous group of disease entities characterised by the malignant transformation of B or T lymphocytes. Hodgkin lymphoma (HL), characterised by the presence of Reed–Sternberg cells, has been associated with Epstein–Barr virus, good hygiene and delayed exposure to infection (Serraino et al, 1991; Tavani et al, 2000).

Our immunological defence against pathogens utilises both the innate and adaptive immune systems. These systems working in synergy enable the host to clear infections. Genetic variation of genes involved in the innate immune response, including tumour necrosis factor receptor-associated factor, receptor-interacting serine-threonine kinase 3, BAT2, Toll-like receptor 6 (Cerhan et al, 2007) and Beta-Defensin 126 (Hu et al, 2013), have been associated with an increased risk of NHL and have a role in infection recognition and control. Chronic antigenic stimulation has been postulated as a potential mechanism for lymphomagenesis (Hjalgrim and Engels, 2008), with acute, community-acquired infections potentially playing a role (Cartwright et al, 1988; La Vecchia et al, 1992; Tavani et al, 2000; Engels et al, 2004; Chang et al, 2005; Koshiol et al, 2011; Becker et al, 2012; Karunanayake et al, 2012; Liu et al, 2012).

Although one study of male US veterans reported associations with infections to be more profound in individuals aged 50 years and those of black race (Koshiol et al, 2011); these results have not been replicated in other study populations. In addition, few studies have investigated infection-related conditions by NHL subtype (Anderson et al, 2009; Kristinsson et al, 2010). Using data from the Surveillance Epidemiology and End Results (SEER)-Medicare database, we previously reported an increased risk of chronic lymphocytic lymphoma (CLL), an NHL subtype, in patients with claims for respiratory and skin infections (Anderson et al, 2009). Similar associations have also been reported for lymphoplasmacytic lymphoma/Waldenström macroglobulinaemia using the US Veterans Affairs database (Kristinsson et al, 2010). Given the heterogeneous nature of NHLs and their disparate clinical and prognostic characteristics, the objective of the study was to investigate the role of infection-related conditions by additional NHL subtypes and HL in SEER-Medicare.

Materials and methods

Study design

The SEER-Medicare database links SEER and Medicare data (Anderson et al, 2008). Since 1973, the SEER program has collected demographic and clinical information on cancers diagnosed in multiple US sites and currently covers approximately 28% of the US population (SEER*Stat Databases (submitted November 2011)). Medicare provides federally funded health insurance for US citizens aged 65 years, entitling them to Part A coverage for hospital inpatient care (Warren et al, 2002). Approximately 95% of beneficiaries subscribe to Part B coverage, including physician and outpatient services (Warren et al, 2002). Individuals may also subscribe to a health maintenance organisation (HMO) scheme providing capitated care; associated claims are not captured by Medicare (Engels et al, 2011).

We conducted a retrospective case–control study using SEER-Medicare data. Cases were defined as individuals with a SEER diagnosis of a primary lymphoid malignancy between 1992 and 2005. In addition to overall NHL, we evaluated the main NHL subtypes, including diffuse large B-cell lymphoma (DLBCL), T-cell lymphoma, follicular lymphoma (FL) and marginal zone lymphoma (MZL). Other rarer subtypes and CLL/SLL, which has been previously reported (Anderson et al, 2009), were not reported seperately. HL cases were investigated although data on extra-nodal HL are not presented owing to the small number of cases (N=83). Cancer case definition was based on the InterLymph hierarchical classification of lymphoid neoplasms (Turner et al, 2010). Cases were required to be aged 66–99 years at diagnosis of malignancy and to have at least 13 months of Part A and Part B and no HMO Medicare coverage before diagnosis to ensure sufficient time to accrue exposure status. Patients diagnosed with malignancy only at autopsy or by death certificate were excluded. Controls (N=200 000) were previously selected from the 5% random sample of all Medicare beneficiaries without cancer residing in SEER areas (Engels et al, 2011) and were frequency matched to a larger group of all cancer types in the SEER data set by gender, age (66–69, 70–74, 75–79, 80–84, 85–99 years) and year of selection.

To ascertain exposure to common infection-related conditions, Medicare hospital, physician and outpatient files were searched for claims made by the attending physicians for infections of the respiratory tract (including acute nasopharyngitis (common cold), bronchitis, pharyngitis, laryngitis, sinusitis, pneumonia), skin (cellulitis, herpes zoster), gastrointestinal tract (gingivitis/periodonitis, gastroenteritis) and urinary tract (acute pyelonephritis, cystitis, prostatitis). Infection-related conditions were selected if the prevalence in controls was >0.5%. Cases and controls with at least one claim for an infection were considered exposed. Claims occurring during the 13-month period before case diagnosis/control selection were excluded to minimise the possibility that diagnoses were attributable to the disease process.

Statistical analysis

Polytomous logistic regression was used to calculate odds ratios (ORs) and 95% confidence intervals (CIs) to assess the association of each infectious disease and subsequent risk of lymphoid malignancy subtypes compared with controls. In the variance computation for the ORs, the fact that each case subtype was compared with the same control population, that some controls later served as cases and that individuals could be selected more than once as controls were accommodated (Anderson et al, 2008).

The combined NHL analyses were stratified by white and black race. The Wald P-value for the interaction term between race and an infection was used to evaluate homogeneity between associations in whites and blacks. Stratified results for NHL subtypes, HL and/or other racial groups are not presented given the small sample sizes.

All analyses were adjusted for gender, age and year of diagnosis/selection. No missing data were observed. As 112 main analyses were conducted to investigate the associations between each infectious disease and each lymphoid malignancy subtype, we used Bonferroni correction (P<0.00045) to highlight associations (underlined) that remained significant after controlling for multiple comparisons.

To assess the time point at which infections may be important in the development of lymphoid malignancies, we investigated infections diagnosed at different time intervals preceding diagnosis (i.e., 30 months, 31–48 months, 49–72 months, >72 months) for those that remained significant following Bonferroni correction. Analyses were conducted with and without patients with human immunodeficiency virus (n=39); as similar point estimates were observed (data not shown), results are presented for the full cohort of patients.

Results

There were small differences in the distribution of characteristics between the case and control groups due to non-restricted matching (Table 1). Compared with controls, NHL cases overall were less likely to be male and more likely to be older, of white race, selected in more recent calendar years and to have longer Medicare coverage. Conversely, HL cases appeared younger and had shorter Medicare coverage than controls.

Table 1 characteristics of controls and patients with non-Hodgkin lymphoma (overall and by subtype) and Hodgkin lymphoma

As shown in Table 2, NHL patients were 10–17% more likely than controls to have had claims for respiratory tract infections, except common cold, >13 months before diagnosis (ORs 1.10–1.17). Skin infection claims, including those for cellulitis (OR 1.15, 95% CI 1.12–1.18) and herpes zoster (OR 1.28, 95% CI 1.22–1.35), and claims for prostatitis (OR 1.12, 95% CI 1.07–1.17) were also more common in NHL cases overall than controls. MZL was most strongly associated with respiratory tract infections, including sinusitis, bronchitis, influenza and pneumonia, Table 2. Following adjustment for multiple comparisons, DLBCL, the most common NHL subtype, remained associated with sinusitis, bronchitis, influenza, cellulitis and herpes zoster, Table 2. FL was associated with claims for sinusitis, laryngitis and herpes zoster while T-cell lymphoma remained associated with pharyngitis and cellulitis only, Table 2. Herpes zoster was the only infection to remain significantly associated with HL following multiple comparison adjustment (OR 1.55; Table 2).

Table 2 Associations between common infection-related conditions and non-Hodgkin lymphoma (combined and by main subtypes) and Hodgkin lymphoma

In stratified analyses by race, associations among whites were similar to those all subjects combined for most infection-related conditions, due to the majority of the cohort being of white race (Tables 2 and 3). For blacks, the only infection-related conditions to reach statistical significance following Bonferroni correction were sinusitis, pharyngitis and cellulitis. However, several infections, including sinusitis, pharyngitis, bronchitis and cellulitis, were more commonly associated with NHL in blacks than in whites, P<0.001.

Table 3 Main associations for non-Hodgkin lymphoma (combined) stratified by race

Most infections highly significant in Table 2 remained associated with NHL even when the 6-year period preceding diagnosis was excluded (Table 4). Sinusitis, bronchitis, cellulitis and herpes zoster remained associated with DLBCL while cellulitis was associated with T-cell NHL across all time periods investigated (Table 4). For FL, sinusitis, laryngitis and herpes zoster were significant at longer latencies. Sinusitis was associated with MZL across all time periods investigated (Table 4). Claims for herpes zoster >30 months before diagnosis were associated with HL (Table 4).

Table 4 Association between common infection-related conditions and risk of non-Hodgkin lymphoma/Hodgkin lymphoma by the time of claim before diagnosis

Discussion

In the largest population-based study to date, we identified respiratory and skin infections to be associated with an increased risk of NHL in individuals aged66 years, paralleling our previous findings for CLL (Anderson et al, 2009). MZLs were associated with the broadest range of infection-related conditions and T-cell lymphomas the fewest. Interestingly, only herpes zoster infection was associated with HL, an association which remained even when initial claims were made>6 years before cancer diagnosis.

Consistent with the observed association between respiratory infections and NHL, Koshiol et al (2011) reported an increased risk of NHL among male US Veterans with upper and lower airway infections, including sinusitis and pneumonia, but did not report by NHL subtype. They identified stronger associations close to NHL diagnosis and postulated that this may have been due to reverse causality due to an underlying, undetected NHL (Koshiol et al, 2011; Richardson et al, 2011). In the current investigation, most associations occurring close to diagnosis were for MZL, an indolent, slow growing lymphoma, potentially supporting this hypothesis. Koshiol et al (2011) also suggested that undetected lymphoma would not fully explain why several infection-related conditions, including lower airway infections, occurred more frequently in cases>5 years before diagnosis. This observation was also apparent in the current study and was particularly evident for DLBCL and FL. Most FL cases have t(14;18)-positive B cells, which are thought to be transformed by exogenous antigen stimulation, such as from a viral infection (Roulland et al, 2006). Antigenic stimulation and/or subclinical immune deficiency, predisposing patients to both infections and lymphoma, may therefore explain the associations identified between infection-related conditions and lymphoma.

Interestingly, in the US Veterans study, associations between infections and NHL were most noticeable in those aged<50 years (Koshiol et al, 2011). Unfortunately, we were unable to assess these associations in a similar age group but have demonstrated, in a non-selected, population-based study, that similar associations are observed in individuals aged66 years. In stratified analyses, we substantiated the observation that associations with infection-related conditions, particularly sinusitis, pharyngitis, bronchitis and cellulitis, were significantly stronger in black than in white individuals (Koshiol et al, 2011). Further analysis by NHL subtypes was hindered by small numbers. Differences in immune-associated polymorphisms by race have been reported (Skibola et al, 2010) and could explain the differential effects observed.

Herpes zoster, commonly known as shingles, has previously been reported to precede HL (Tavani et al, 2000) and other haematological malignancies (Gramenzi et al, 1991; Landgren et al, 2007; Brown et al, 2008; Anderson et al, 2009; Kristinsson et al, 2010; Liu et al, 2012). Shingles is a skin rash caused by reactivation of the varicella-zoster virus, which commonly occurs in immunosuppressed individuals (Kennedy, 2002; Grulich et al, 2007). Herpes zoster infection remained significantly associated with HL even when claims were made>6 years previously. Because of limited sample size, we were unable to investigate infection claims in a longer time frame preceding lymphoma diagnosis, but it has been suggested that lymphoma risk is restricted to the first 10 years since infection onset (Tavani et al, 2000). We observed similar associations between herpes zoster and DLBCL and FL, supporting previous reports in some (Karunanayake et al, 2012; Liu et al, 2012) but not all studies (Cartwright et al, 1988; Becker et al, 2012). Given the timing of these associations, it is possible that herpes zoster infection is a marker of an immunocompromised state many years before diagnosis or that it instigates a decline in cell-mediated immunity (Liu et al, 2012).

Cellulitis, a skin infection generally caused by staphylococcus or streptococcus bacteria, was associated with all NHL subtypes, particularly T-cell lymphoma. T-cell lymphomas are a heterogeneous group of lymphomas that can present with extensive skin and soft tissue necrosis, resembling an infectious process (Serra et al, 1998; Jia and Sun, 2004; Falagas et al, 2007). Misdiagnosis of T-cell lymphomas could explain the observed associations particularly as some cutaneous T-cell lymphomas are often difficult to diagnose in the early stages of disease (Serra et al, 1998; Falagas et al, 2007; Soo et al, 2011). However, the long latency period between infection and diagnosis suggests that misdiagnosis of T-cell lymphoma is unlikely to fully explain the observed associations, and further investigation by T-cell subtype may provide further insight.

We did not observe any strong associations between pyelonephritis (or cystitis or prostatitis, which generally precede pyelonephritis) and NHL or associated NHL subtypes despite previous reports (La Vecchia et al, 1992; Tavani et al, 2000). However, these studies have been limited by the number of exposed cases and relied on self-reported diagnoses, potentially explaining the disparate findings (La Vecchia et al, 1992; Tavani et al, 2000). Similarly we did not find any associations between gastrointestinal infections and lymphoma.

Koshiol et al (2011) reported that viral (RR 1.6, 95% CI 1.5–1.8) and parasitic (RR 1.7, 95% CI 1.5–1.8) infections were more strongly associated with NHL than bacterial infections (RR 1.2, 95% CI 1.1–1.3). Although our study did not include these broad categorisations, the vast majority of conditions associated with NHL were of viral origin, including sinusitis, influenza, bronchitis and herpes zoster.

Although we were able, for the first time, to extensively study infection-related conditions by NHL subtype, the heterogeneous nature of NHLs may mean that some associations were masked by incorporating them into these broad categories. We have previously reported associations between infection-related conditions and CLL in the SEER-Medicare data set (Anderson et al, 2009), but despite being the largest study to date, we had limited sample sizes to investigate rare or more specific NHL subtypes. The population-based sampling of cases and controls means that these findings are more representative than those from hospital-based case–control studies (La Vecchia et al, 1992; Tavani et al, 2000) or specialised registers (Doody et al, 1992; Koshiol et al, 2011) but are limited to the elderly population and by lack of lifetime exposure information to chronic or repeated infection exposure.

Exposure status was not limited by recall bias inherent in case–control studies (Cartwright et al, 1988; La Vecchia et al, 1992; Tavani et al, 2000; Chang et al, 2005; Becker et al, 2012; Karunanayake et al, 2012; Liu et al, 2012), although the use of claims data, instead of diagnostically confirmed infections, means that misclassification of exposure status is possible. This misclassification would be unlikely to be differential in nature, especially for claims many years before lymphoma diagnosis; however, overdiagnosis of infections, such as cellulitis, is possible. As both inpatient and outpatient claims were incorporated into the study, we were able to investigate a broader range of common infection-related conditions than previous studies (Cartwright et al, 1988; Doody et al, 1992; La Vecchia et al, 1992; Tavani et al, 2000; Chang et al, 2005; Koshiol et al, 2011; Karunanayake et al, 2012; Liu et al, 2012). Despite this strength, infections requiring few physician visits, such as the common cold, are likely to be underestimated. As we observed no associations between these infections and lymphoma risk, it supports the contention that we did not have differential diagnosis between cases and controls due to early prediagnostic symptoms. As diagnosis of infections were based on the attending physician claiming compensation, it was not possible to examine characteristics of infections more closely (e.g., whether herpes zoster and varicella-zoster virus-positive patients had the characteristic rash or were only antibody positive). Additionally, we were not able to differentiate the cause of the infection-related conditions and unable to comment on the severity of the infections encountered. Our models were adjusted for limited confounding variables, and hence residual confounding effects by other factors, such as comorbidities, could not be captured. Comorbidities with immune disturbance, such as autoimmune conditions which have been linked with lymphoma (Brown et al, 2008), could increase the susceptibility to infection. Similarly, we did not have data on characteristics and behaviours like smoking, drinking and obesity and therefore could not adjust for these factors. Finally, as we investigated numerous associations between infection-related conditions and lymphomas, some of the associations may have occurred by chance. We therefore focussed our discussion on those associations that remained after crude adjustment for multiple comparisons.

It is possible that an infection or other antigen could lead to different clinical manifestations depending on the hosts immune systems. For example, EBV is an extremely common infection that does not lead to cancer in most people. In a small subset, EBV may contribute to DLBCL (Kinch et al, 2013; Ozsan et al, 2013), whereas in most people it does not. Similarly, not all DLBCL cases are EBV-positive, reflecting heterogeneity in the aetiology even of the same NHL subtype. Immune differences resulting in different clinical outcomes could, for example, be driven by differences in HLA polymorphisms, which can affect antigen presentation.

In conclusion, several common infection-related conditions were associated with NHLs but not with HL, where only herpes zoster was more common in cases than in controls after adjustment for multiple comparisons. Herpes zoster showed the strongest associations for both NHL and HL. Most respiratory tract infections were associated with NHL, particularly MZL. Several infection-related conditions were more strongly associated with NHL in blacks than in whites. Precise elucidation of the mechanisms underlying lympho-proliferations may provide important clues for understanding how immune disturbance contributes to the development of both NHL and HL.