Introduction

Osteoarthritis (OA) is the leading cause of chronic disability in the United States, and is the third most rapidly rising chronic medical condition associated with disability worldwide1,2. Despite its importance and economic impact, there are no disease-modifying anti-osteoarthritic drugs (DMOADs) approved by the US Food and Drug Administration (FDA) or European Medicines Agency, in stark contrast to the multitude of biologic and nonbiologic disease-modifying treatments available in other forms of arthritis3. This lag in the development of DMOADs is due in no small part to a lack of easily accessible radiographic and/or biochemical biomarkers to diagnose OA and discriminate patient phenotypes, including prediction of future progressors.

Accordingly, much attention has recently been focused on the development of diagnostic and prognostic biomarkers for OA. Several groups have described biomarker candidates for OA diagnosis, discriminating OA patients from healthy controls based on peripheral blood analytes. In 2014, Ramos et al. described an mRNA-based peripheral blood signature which could discriminate OA patients from matched controls4, with a c-statistic of 0.97. In 2018, Li et al. published a reanalysis of these data using a different machine learning technique that performed at an equally high sensitivity and specificity on a reduced subset genes5. In 2018, Huang and colleagues used data and biospecimens from the DOXY (doxycycline for treatment of OA) clinical trial to examine four plasma biomarkers6, and found baseline levels of lipopolysaccharide binding protein (LBP) were associated with future radiographic progression (TIC over 18 months OR = 1.418).

A more difficult but perhaps more clinically relevant biomarker task, however, is the classification of OA patients into distinct phenotypes. The largest study yet to take on the task of OA phenotype discrimination was an analysis of serum and urine biochemical biomarkers performed by the Foundation for the National Institutes of Health (FNIH) OA Biomarkers Consortium (OABC-FNIH), using data and biospecimens from the longitudinal, US-based Osteoarthritis Initiative (OAI) study7,8. In this study, the authors produced a model combining three serum and urine biomarkers that could discriminate radiographic and pain progressors from nonprogressors with a receiver operator characteristic area under the curve (AUC-ROC, c-statistic) of 0.6319. Baseline values of these parameters did not offer substantial predictive capability; rather, models utilized a time-integrated concentration (TIC) approach tracking analyte measurements over a period of years.

A drawback of traditional biochemical biomarkers such as those used in the OABC-FNIH is their inherent variability. Epigenetic assays offer theoretical advantages as biomarkers. Most notably, an epigenetics-based assay has the potential to offer prognostic information based on a single time point reflecting early, relatively stable gene regulatory changes that precede gene transcription and subsequent protein translation. Early epigenetic changes in peripheral blood have been found to be useful biomarkers in several rheumatic diseases including rheumatoid arthritis10 and systemic lupus erythematosus11, as well as several chronic low-level inflammatory diseases, including type-2 diabetes mellitus risk12 and cardiovascular disease13. Several recent studies, including our own, have demonstrated alterations in joint tissue epigenetic patterns associated with OA development and progression14,15,16,17,18; however, no analyses of blood epigenetic changes have yet been published.

In this pilot study, we aimed to evaluate the potential of peripheral blood mononuclear cell (PBMC)-based epigenetic models to predict future knee OA radiographic progression in a well-matched cohort of patients from the Osteoarthritis Initiative (OAI), using similar definitions to those previously published in OABC-FNIH studies9.

Results

Baseline patient demographic, disease, and PBMC composition characteristics were well matched. Models developed using only patient characteristics were not predictive of future progression

We first identified a group of 58 OA radiographic progressors with baseline Kellergren-Lawrence (K/L) radiographic grade 2–3, symptomatic knee OA who exhibited ≥0.7 mm of joint space width (JSW) loss over the first 24 months of follow-up and persistent JSW loss at 48 months from the longitudinal, US-based Osteoarthritis Initiative (OAI) study. We then matched 58 nonprogressors with ≤0.5 mm of joint space width loss over 48 months of follow-up by age category, sex, BMI category, ethnicity, and baseline KL radiographic grade (Table 1, Supplementary Table 1). Although not included in our matching criteria, there were no statistical differences in baseline JSW, NSAID use, or smoking history between the two groups. We did note a statistically significant increase in baseline Western Ontario and McMaster (WOMAC) pain subscale among progressors (21.5 ± 2 mean ± SEM vs. 17.0 ± 1.5 points on a 0–100 point normalized scale, P = 0.05).

Table 1 Patient group characteristics.

Recent studies have reported that clinical characteristics alone can be used to model future OA progression19. To ensure that our models were not being affected by patient characteristics, we first developed models using baseline patient characteristics data alone, without including DNA methylation data. These models included age, sex, BMI, baseline JSW, baseline WOMAC pain, smoking history, and NSAID use. Baseline models were not able to discriminate the two groups (receiver operator characteristic area under the curve ROC-AUC, c-statistic = 0.49 ± 0.01 mean ± SEM, accuracy = 50 ± 0.7%) (Fig. 1). Although infrequent, we then added data regarding comorbidities to these baseline models, including history of heart attack, heart failure, diabetes, lung disease, and cancer (Table 1). This did not improve the discriminatory capability of baseline patient characteristic models (p = 0.28 for comparison with models not including comorbidity data).

Figure 1
figure 1

Mean receiver operator characteristic (ROC) curves for patient characteristic-only models, Beta value-based models, and M value-based models when tested on unseen (lockbox) data. Curves represent mean values over 40 cycles of development, error bars represent SEM. ROC-AUC (c-statistic) values are given as mean ± SEM.

DNA methylation data from mixed PBMC samples can be skewed by underlying differences in cellular composition among groups20. As the OAI dataset does not include data on individual blood sample cellular composition, we estimated this using a computational approach21. There were no statistically significant differences in estimated PBMC composition between the two groups; however, we did note a trend towards increased monocyte counts in cases (progressors 7.4 ± 0.3% mean ± SEM vs. nonprogressors 6.7 ± 0.3%, P = 0.12) (Table 1). Accordingly, we corrected our dataset for PBMC composition differences using frozen surrogate variable analysis (FSVA), a technique previously demonstrated to be robust in correcting cellular composition differences in genome-wide DNA methylation data22,23, before developing our epigenetic models.

Models developed based on PBMC DNA methylation data are predictive of radiographic progression

Traditionally, epigenome-wide association studies have reported DNA methylation data as beta values, defined as the fraction methylation (0–1 scale) of each CpG site included in the array. However, beta values are characterized by high heteroscedasticity (most beta values fall within extreme high- and low-percent methylation levels). Therefore, we also analyzed M values (the log2 ratio of methylated:unmethylated probe intensities for a given CpG site), which are approximately homoscedastic, in our models24. In our analysis, models based on PBMC DNA methylation data were consistently capable of discriminating those patients who would go on to experience radiographic progression from nonprogressors (Figs 1, 2, Table 2). Models based on M values outperformed those based on beta values (M value models c = 0.81 ± 0.01, mean ± SEM, vs. beta value models 0.75 ± 0.01, comparison p = 0.002), with corresponding accuracies of 73 ± 1%, mean ± SEM, for M and 69 ± 1% for beta based models. The mean number of CpGs selected for inclusion during model development and optimization was 22 ± 2 (mean ± SEM) for M-based and 19 ± 2 for beta value-based assays (Supplementary Fig. 1), this was not statistically significantly different between the two DNA methylation measures (p = 0.32).

Figure 2
figure 2

OA rapid progressor PBMC DNA methylation-based machine learning discriminant model development plan.

Table 2 Performance of PBMC DNA methylation models to predict future radiographic progression in OA patients when evaluating previously unseen data.

We then added patient characteristic covariate data to model development, including age, sex, BMI, baseline JSW, baseline WOMAC pain, smoking history, and NSAID use. Including these data did not alter model performance (M value models c = 0.80 ± 0.01, beta value models c = 0.78 ± 0.01, both nonsignificant compared to models without covariates). Finally, as our analysis included data from two methylation assay types, we then analyzed model performance for 450k and 850k chips separately; both types demonstrated equivalent performance in our models (p = 0.27).

A subset of DNA methylation CpG sites were selected frequently in multiple rounds of model development and were enriched in functional pathways previously associated with OA development

Although 969 CpGs were selected in at least one round of model development in either M or Beta based models, a subset of CpG sites were selected several times (Table 3, complete list of CpGs selected in Supplementary Table 2). Many of these most frequently selected CpGs in both M value-based and Beta value-based model development were shared; for example, 7 of the top 20 most frequently selected CpGs in both M- and Beta models were identical. The majority of the top 20 CpGs selected during modeling were associated with CpG islands (13 of 20 M- and 16 of 20 Beta-based models), including all but one of the CpG sites shared among the two methylation measures, suggesting these sites may play a role in gene regulation25.

Table 3 Top 20 CpGs selected for supervised model development. CpGs shared by both Beta-value-based and M-value-based models are highlighted in bold (n = 7).

Although we did not perform a differential methylation analysis in the traditional sense, we reasoned that CpG sites chosen for inclusion in our models might still cluster in functional pathways important in OA. To investigate this, we performed gene ontology analysis using the Ingenuity Pathway Analysis (IPA) system of genes associated with CpG sites chosen in at least one round of model development. The most significantly enriched canonical pathways enriched in this gene set included the antigen presentation pathway (n = 6 genes, p = 7E-4), adenosine monophosphate kinase (AMPK) signaling (n = 15, p = 1E-3, and sonic hedgehog signaling (n = 4 genes, p = 9E-3), among others (Table 4). The top upstream regulators identified by IPA include the transcription factor PITX2 (n = 14 genes, p = 1E-5), histone H3 (n = 29 genes, p = 7E-5) and H4 (n = 14,p = 9.6E-5), miR-141 (n = 9, p = 1.9E-4), miR-9 (n = 5, p = 3.4E-4), miR-137 (n = 2, p = 8.2E-4), and bone morphogenic protein 2 (BMP2, n = 15, p = 9.0E-4), a TGF-β superfamily member, among others (Table 4).

Table 4 Ontology analysis of genes associated with rapid OA progressor DNA methylation models.

Discussion

Biomarkers hold great potential for improving clinical outcomes in common chronic diseases including OA. The early identification of OA patients who are likely to progress radiographically, clinically, or both, will allow for the enrichment of clinical trials of disease-modifying antiosteoarthritic drugs (DMOADs) with appropriate patients. Biomarkers of progression may also allow clinicians to stratify early OA patients, offering personalized care by taking into account an individual’s likelihood to respond to a particular therapy. In this pilot study, we sought to test whether baseline peripheral blood mononuclear cell DNA methylation data could be used to develop models to discriminate those patients who would go on to experience radiographic progression from nonprogressors. While based on a relatively small number of patients from a single cohort, our results nonetheless suggest that peripheral blood-based epigenetic models may be useful for OA subtype discrimination and should be evaluated in future larger studies of heterogeneous OA patients from additional cohorts.

Several groups have previously investigated biomarkers predictive of future OA progression. The largest biochemical study in this regard was the OAI biomarker studies including the osteoarthritis biomarker consortium of the Foundation for the National Institutes of Health (OABC-FNIH) which investigated a targeted set of biochemical biomarkers as predictors of OA pain and radiographic progression. Their analysis identified a combination of three biochemical biomarkers for discrimination of pain and radiographic progression from nonprogressors with a c-statistic of 0.6319, although they used a time-integrated concentration (TIC) approach, requiring multiple biomarker measurements over time as input into their prognostic model. This same study also evaluated the predictive capability of baseline biomarkers alone, but these not demonstrate robust differentiation of progressors (c = 0.586)9.

Radiographic biomarkers predicting future disease progression have also been evaluated. In 2016, Collins et al. reported models based on changes in MRI semi-quantitative markers over 24 months to predict the likelihood of radiographic progression at 48 months as part of the OABC-FNIH study26; the best model achieved a c-statistic of 0.74. Like the OABC-FNIH biochemical biomarker study, this study leveraged changes in imaging parameters over time to predict future progression. Trabecular bone texture (TBT) is perhaps the best-studied single imaging biomarker for prediction of future OA progression. Kraus et al. evaluated this in FNIH patients, with a best composite model using baseline TBT to predict future radiographic progression with a c-statistic of 0.62427. This is somewhat lower than previous reports using TBT, Janvier and colleagues, for example, reported a c-statistic of AUC 0.77 in 201728.

There have also been recent reports using machine learning to model future progression based on combinations of baseline imaging and demographic data. Joseph and colleagues in 2018 published the results of three such models, with a maximum c-statistic of 0.72 to predict progression to moderate-severe OA over 8 years including demographic data, risk factors, K/L score, cartilage WORMS score, meniscal tear, and cartilage MRI T2 imaging data29 from the OAI cohort. Also in 2018, Halilaj and colleagues published the results of a mixed-effects mixture model within the OAI cohort to predict joint space narrowing in 1243 subjects. They described an impressive c-statistic of 0.86 for prediction of future progression using a combination of baseline and year 1 follow-up variables, using a 10-cycle, 90% development/10% lockbox validation data splitting approach with 10-fold internal cross-validation, not unlike our methodology. Models based solely on baseline data, however, did not predict future radiographic progression (c ≤ 0.6)30.

Several recent studies have also identified mitochondrial haploptypes as biomarkers which can discriminate OA phenotypes. For example, in 2012 Fernandez-Moreno et al. identified significant increases in cartilage-specific biomarkers of OA in patients carrying mitochondrial haplogroup H associated with the onset of OA31. Similarly, this group later identified a decrease in incident OA risk in patients with mitochondrial haplogroup J compared to hpalogroup H (HR 0.7, p < 0.05) in both the OAI and CHECK cohorts32, as well as slowing of the rates of radiographic progression rate among OA patients with haplogroup T compared to other haplogroups within the OAI cohort33. Future studies should investigate potential complementarity between mitochondrial haplogroups and DNA methylation as predictive OA biomarkers.

Much work has been done over the past several years linking epigenetic changes, particularly DNA methylation, with OA. Our group and others have noted differential DNA methylation in inflammatory cell pathways in both cartilage and subchondral bone from OA patients14,15,16,17,34,35. A subgroup of end-stage OA patients’ cartilage demonstrates a fingerprint of epigenetic changes within inflammatory genes18,36. Given chondrocyte and subchondral bone DNA methylation aberrancy in both knee and hip OA, along with the known contributions of chronic inflammation and inflammatory gene epigenetic changes to OA, the DNA methylation patterns within PBMCs associated with OA progression we describe in this report are intuitively consistent with previously published data.

Our PBMC DNA methylation-based approach has a few potential advantages over previously published biomarkers. Most notably, our approach offers predictive capability based on a single baseline blood draw. Further, our models demonstrated higher c-statistic values than previously described biochemical biomarkers, although this should be interpreted cautiously given our relatively small group sample size and lack of non-OAI cohort confirmation. We also performed a traditional DNA methylation analysis of progressors and nonprogressors and did not identify significantly differentially methylated positions (DMPs) with a false data rate-corrected p-value ≤ 0.05. This is not particularly surprising given our mixed PBMC population. Our approach utilized the combined effects of multiple CpG sites when developing DNA methylation-based predictive models.

Relatively few of the CpG sites selected most frequently during model development have been previously associated with OA. A CpG near the transcription start site of KRT18 was selected in both M- and Beta-model development. Reductions in KRT18 expression have been associated with lumbar spine OA37. LIMS2, selected in M-development, is a focal adhesion molecule differentially expressed in cartilage and subchondral bone of frizzled-related protein (Frzb) knockout mice38. A CpG site selected in M-based assay development was located near CLCN6, which has been associated with life expectancy and aging39. This same CpG site is also associated with the gene MTHFR, which is a rheumatoid arthritis (RA) risk allele40, and the protein encoded by MTHFR is the target of the RA disease-modifying antirheumatic drug methotrexate. Only one published association has been found linking MTHFR gene mutations and OA of the knee, hand, and hip in a Turkish population41. A CpG within DLL4 was selected in Beta-based models. Dll4, a component of the Notch pathway, has been previously associated with the development of OA of the temporomandibular joint42. EEF2, also selected during Beta-based model development, has been shown to play a role in intervertebral disc degeneration and is the target of the OA-associated microRNA-143-5p43. This paucity of previously published OA association in model-selected CpGs is not surprising, as our study is the first report of peripheral blood epigenetic patterns in OA.

Despite the lack of previous OA associations among the specific CpG sites included in our models, gene ontology analysis revealed several canonical gene pathways and upstream regulators that have been previously described in OA. Unfortunately, there have not been previous epigenetic analyses of peripheral blood cells in OA in humans or mice to compare our data to. However, many pathways and regulators previously described in articular tissues overlap with our findings. For example, AMP kinase (AMPK) deficiency in chondrocytes accelerates both age- and trauma-associated OA in mice44, whereas a pharmacological stimulator of AMPK attenuates post-traumatic OA in rats45. The endocannabinoid system within synovial tissue has been associated with OA pain46. Autophagy has been widely associated with OA. Age-related loss of autophagy has been linked with OA severity in human47 and murine48 joint tissue. Furthermore, cartilage-specific deletion of mammalian target of rapamycin (mTOR) increases autophagy signaling and protects against post-traumatic OA induction in mic49. Among the topstream regulators identified in our ontology analysis was PITX2, a component of the Wnt signaling pathway key to mensenchymal stem cell function which is altered in OA patients50. MicroRNA-141 is dysregulated in human OA and contributes to pathogenesis by augmentation of lipid metabolism51, and is important in maintaining the appropriate expression of key osteoblast differentiation proteins including Runx2, Sclerostin, ALP, and Dlx552. MicroRNA-9 has been shown to directly augment the expression of the key catabolic enzyme matrix metalloproteinase-13 (MMP13) in OA patients53 and animal models54. MicroRNA-137 regulates chondrocyte metabolism via targeting Runx255. Finally, bone morphogenic protein-2 (BMP2) upregulation has been linked to OA development and increased matrix turnover56. It is a key regulator of chondrocyte metabolism57 and has been suggested as a novel OA therapeutic58. Future studies of circulating inflammatory cells and cell subsets in OA patients and/or OA animal models will be critical to confirmation of our findings and will help elucidate whether common pathway deficiencies, i.e. autophagy or Wnt signaling, are defective globally in OA rather than acting in a jont-specific manner.

Our study does have several weaknesses, the two largest being our small sample size and lack of external validation in a non-OAI cohort. We chose samples carefully in order to minimize group differences and reduce the possibility of batch effects, which we further reduced via fsva correction. Our findings are preliminary and should certainly be confirmed in a larger and more heterogeneous patient cohort. Unfortunately, our precise JSW measurement requirement59 and use of baseline peripheral blood DNA limits the possibility of direct comparison with other cohorts which are not as well characterized. These factors have led several previous studies to include only samples from the OAI7,9,19,28,29,30,60,61,62. Future studies will no doubt relax our inclusion criteria and expand the potential sources for additional samples to include larger sample sizes and additional cohorts.

A major concern with machine learning is generally the problem of overfitting. We reduced this possibility through extensive cross-validation and report the accuracy of models when tested on data unseen during model development. We included in our analysis data from both Illumina 450 k and 850 k chips, as the 450 k assays became unavailable from the manufacturer during this project. All data from 850k chips were subset to only those sites included on the 450k array and were batch corrected. We saw no evidence for differences in predictive capability with the two chip types (see above). We did see differences comparing M value-based to beta value-based models; however, this is not unsurprising given previous reports of variations in statistical measures of these values24 owing to differences in scedasticity. Finally, we are unable to draw substantial conclusions about underlying pathophysiological associations of altered PBMC DNA methylation patterns with OA given our mixed PBMC samples; this would be best investigated by future studies focusing on individual immune cell subsets. There is an emerging literature in the OA field which has demonstrated the importance of activated macrophages in disease development and progression63,64,65. We are unable to comment specifically on whether the nearly-statistically-significant increase in monocytes detected by computational estimation of PBMC composition which we found among progressors reflects this activated phenotype subset; however, future studies should certainly examine this possibility. It is important to highlight that the gene ontology analyses we performed herein were based on CpG sites selected following PBMC cell composition correction. Therefore, there is a possibility that additional, particularly macrophage- or monocyte-related, functional pathways and/or upstream regulators may have been missed by our analysis. These concerns do not, however, prevent us from performing predictive and prognostic biomarker analysis, as a clinically viable test would most easily include DNA from mixed PBMCs or whole blood.

Our data raise several questions which highlight potential avenues for future investigation. For example, it is unclear when epigenetic patterns change in the natural history of OA development and how stable they are over time; a longitudinal DNA methylation analysis in an incident OA cohort would be helpful in this regard. Our approach could also be used to evaluate PBMC epigenetic models for prediction of pain and combined pain and radiographic progression in larger OA cohorts. In this study, we did not seek to produce a consensus model by training on our entire dataset, as this would no doubt have produced significant model overfitting given our small sample size; rather, we sought to determine the feasibility and capability of PBMC methylation-based models for OA subtype discrimination more generally. Future studies on larger cohorts will no doubt have the ability to produce such a consensus model that would be of direct clinical relevance.

In summary, our pilot study provides the first analysis of peripheral blood epigenetic patterns as predictive models for knee OA radiographic progression. Our use of linear modeling applied to a large genome-wide DNA methylation dataset from well-matched case and control cohorts offer the first glimpse into the potential of future OA epigenetic classification. If confirmed, our approach could offer advantages over traditional biochemical biomarkers, including requiring a single baseline blood sample. Our results will certainly require additional validation in larger datasets but offer hope for the future development of easily accessible predictive biomarkers to personalize and improve the care of adults with knee OA.

Methods

Study design

This nested case-control study (116 total patients, 58 progressors, 58 nonprogressors) used data and biospecimens from the Osteoarthritis Initiative (OAI) database, which is available for public access at https://data-archive.nimh.nih.gov/oai/. All Osteoarthritis Initiative (OAI) participants provided written informed consent, and the study was carried out in accordance with the OAI data user agreement, approved by the Committee on Human Research of the Institutional Review Board (IRB) for the University of California, San Francisco (UCSF). The IRBs of the University of Oklahoma Health Sciences Center and Oklahoma Medical Research Foundation also reviewed and approved the project. Detailed case and control criteria are presented in Supplementary Document 1. Participants had baseline and yearly follow-up knee radiographs and PBMC DNA available. Kellgren-Lawrence Grade (KLG) and quantitative joint space width (JSW)59 were assessed by the central reading site using non-fluoroscopic fixed-flexion knee radiographs with a Synaflexer positioning device (Synaflexor, Synarc, Newark, CA). All participants had baseline KLG of 2–3 in at least one knee without a history of total knee joint replacement through 48-months. Participants with a tibial plateau rim distance of 6.5 mm at baseline, or with a change in the rim distance of >2.0 mm between baseline and follow-up were excluded due to unreliable radiographic positioning. Our case and control selection was limited by the availability of matched cases and controls, as below.

Definition of radiographic progression, case and control group characteristics

Our definition of radiographic progression was similar to the case definitions of the OABC-FNIH9. Cases had radiographic progression in the medial tibiofemoral compartment by a longitudinal loss in the minimum JSW of at least 0.7 mm from baseline to 24-month follow-up in one index knee, with persistent narrowing in the same index knee at 48 months based on radiographs obtained from a non-fluoroscopic fixed flexion protocol (Synaflexor, Synarc, Newark, CA). In each case, the contralateral (non-index) knee had less progression than the index knee, or no progression over the follow-up period. Participants with a tibial plateau rim distance of 6.5 mm at baseline, or with a change in the rim distance of >2.0 mm between baseline and follow-up were excluded due to inappropriate and/or unreliable radiographic positioning. Non-progressors were defined as those with ≤0.5 mm of JSW loss from baseline to 48 months in either knee. These thresholds were set to coincide with previous OAI biomarker studies9,60,62,66, as defined in the Foundation for the National Institutes of Health biomarker consortium project (OABC-FNIH) and were set based on the distribution of 1-month change in minimum JSW in normal knees of OAI control participants and estimated to have ≤10% probability of change due to measurement error8,67.

Controls were frequency matched with cases by age category, sex, race, and BMI category as shown in Table 1 and Supplementary Table 1 and were also frequency matched with the larger OAI cohort to broaden the applicability of our findings. Group differences were calculated using a Student t-test and statistically analyzed using a 2-tailed P value. There were no differences among the two groups in any demographic category with the exception of mean baseline Western Ontario and McMaster (WOMAC) pain subscale (Table 1), although it should be noted that there was a trend towards fewer females included among cases (53% vs. 60%, p = 0.13), more NSAID use at baseline among cases (29% vs. 17%, p = 0.13), and an increased mean baseline K/L grade among cases (2.2 vs. 2.0, p = 0.09), although no difference in mean baseline JSW was noted in cases (3.9 vs. 4.0, p = 0.34). We also evaluated the presence of various comorbid conditions among data collected by the OAI, including history of heart attack, failure, stroke, pulmonary disease, diabetes, and cancer (Table 1). We adjusted our models for these variables as part of our analysis (see below).

DNA methylation assays, PBMC composition assessment and adjustment

Five hundred nanograms of DNA was treated with sodium bisulfite (EZ DNA methylation kit, Zymo) and loaded onto Illumina Infinium HumanMethylation450k (n = 62 samples: 30 cases, 32 controls) or 850k (n = 54 samples: 28 cases, 26 controls) arrays. As no direct measures of PBMC cell type composition were available from the OAI, these data were estimated using the estimateCellCounts function of minfi. No statistically significant differences in estimated PBMC cell composition were found between groups (Table 1, Supplementary Table 2), although there was a trend towards increased monocytes among cases (7.4 ± 0.3% vs. 6.7 ± 0.3%, cases vs. controls mean ± SEM, p = 0.12). DNA methylation data were corrected for cell count variation using frozen surrogate variable analysis22 via the sva package (v. 3.28.0). This method has been previously shown to robustly correct for cell count variation and other batch effects in large-scale epigenomic studies22,23,68.

Data preprocessing

Statistical analysis was performed using R (v. 3.5.0). Raw.IDAT files were processed using the minfi package (v. 1.26.2). Illumina 850k (EPIC) array data were subset to only those sites also included in the 450k array. Preprocessing and normalization were performed using the preprocessFunnorm function. All samples passed internal controls included on each chip and had detection p ≤ 0.01 for ≥97% of cytosine-guanosine dinucleotide (CpG) positions included on each array. Epigenome-wide association studies traditionally have used DNA methylation beta values, defined as the fraction methylation (0–1 scale) for a particular CpG site. Beta values, however, are characterized by high heteroscedasticity (most beta values fall within extreme high- and low-percent methylation levels), and questions have been raised regarding statistical validity of beta value analysis24. Therefore, we also analyzed M values (the log2 ratio of methylated:unmethylated probe intensities for a given CpG site), which are approximately homoscedastic. We excluded from analysis CpG probes located on sex chromosomes, probes with known single nucleotide polymorphisms (SNPs) with minor allele frequency of ≥5%, and probes not detected in all samples, leaving a final n = 435,118 CpG sites for analysis in each sample.

Modeling

Elastic-net regularized generalized linear models were developed using the glmnet package (v. 2.0–16). As model overfitting is a frequently-occurring problem when developing classifiers on high-dimensional data, we implemented three strategies to reduce potential overfitting. First, we performed 7-fold internal cross-validation during development via the cv.glmnet function, utilizing the ‘one standard error rule’ when selecting lambda values69. Second, we tested models on lockbox data not used for training (Fig. 2). Finally, we repeated our model development and testing with 40 cycles of random splits of data into development and lockbox (validation) subsets. During each cycle of development, data were first randomly split into 60% development and 40% lockbox sets. Development data were then used to generate regularized cross-validated models and these models were tested on lockbox data and performance characteristics recorded. Model performance was assessed by the mean c-statistic (area under the receiver operator characteristic curve AUC-ROC), diagnostic odds ratio, accuracy, sensitivity, and specificity. The identity of DNA methylation sites (features) selected by gmnet for inclusion in each model, and the total number of sites required by each model, were recorded and compared. The performance of models by chip type (450k vs. 850k) was assessed by comparing accuracies using Fisher’s exact test of a 2 × 2 contingency table.

Ontology analysis

Genes associated with CpG sites selected at least once during model development (n = 969) were analyzed using Ingenuity Pathway Analysis (IPA, Qiagen) v. 48207413 build 2019-06-16 using default settings.