Abstract
Prior studies identified HLA class-II and 57 additional loci as contributors to genetic susceptibility for type 1 diabetes (T1D). We hypothesized that race and/or ethnicity would be contextually important for evaluating genetic risk markers previously identified from Caucasian/European cohorts. We determined the capacity for a combined genetic risk score (GRS) to discriminate disease-risk subgroups in a racially and ethnically diverse cohort from the southeastern U.S. including 637 T1D patients, 46 at-risk relatives having two or more T1D-related autoantibodies (≥2AAb+), 790 first-degree relatives (≤1AAb+), 68 second-degree relatives (≤1 AAb+), and 405 controls. GRS was higher among Caucasian T1D and at-risk subjects versus ≤ 1AAb+ relatives or controls (P < 0.001). GRS receiver operating characteristic AUC (AUROC) for T1D versus controls was 0.86 (P < 0.001, specificity = 73.9%, sensitivity = 83.3%) among all Caucasian subjects and 0.90 for Hispanic Caucasians (P < 0.001, specificity = 86.5%, sensitivity = 84.4%). Age-at-diagnosis negatively correlated with GRS (P < 0.001) and associated with HLA-DR3/DR4 diplotype. Conversely, GRS was less robust (AUROC = 0.75) and did not correlate with age-of-diagnosis for African Americans. Our findings confirm GRS should be further used in Caucasian populations to assign T1D risk for clinical trials designed for biomarker identification and development of personalized treatment strategies. We also highlight the need to develop a GRS model that accommodates racial diversity.
Similar content being viewed by others
Introduction
Type 1 diabetes susceptibility is largely controlled by HLA class-II genotype1, with modest contributions from at least 57 additional loci that confer varying degrees of disease protection or susceptibility2. Combinations of these alleles are thought to collectively determine an individual’s overall genetic risk, potentially resulting in heterogeneous disease presentation and etiology. The major role of HLA has been known for decades and is used as inclusion criteria for most studies of the disorder’s natural history. However, since HLA class-II is generally deemed to confer approximately half of the overall genetic risk3,4,5, it is likely that non-HLA risk loci could also be utilized to improve at-risk cohort stratification. Such measures of “total genetic risk” may be more effective at stratifying heterogeneous etiologies, for example, older progressors, who are less likely to develop multiple pre-clinical AAb and tend to have slower decline in C-peptide versus individuals with latent autoimmune diabetes in adults (LADA)6,7.
Large consortiums, such as the NIH-sponsored TrialNet, follow subjects at risk for type 1 diabetes and conduct intervention trials using composite scores that have factored in the presence of islet autoantibodies, family history, HLA risk haplotypes, as well as metabolic response markers8. These interventional efforts often aim to interdict the disease process at stage one, when there are already multiple autoantibodies present and a high likelihood of disease progression9. However, primary prevention trials aiming to avert the initiation of islet autoimmunity will likely require safe interventional efforts targeted to large population-based cohorts. Such cohorts can only be identified by genetics-based risk modeling near birth, given that autoantibody biomarkers that correspond to ongoing autoimmunity often occur early in life10,11. Consequently, the type 1 diabetes research community has developed increasingly informative genetic risk models based on genome wide association studies (GWAS) for risk prediction12.
The first attempts calculating overall type 1 diabetes genetic risk demonstrated that increased cumulative non-HLA risk alleles were associated with islet autoreactivity and disease onset, especially in the context of high-risk HLA13,14. These simple additive models only achieved area under the receiver operating characteristic curve (AUROC) of 0.66 for disease outcome. A major improvement was reported by Winkler et al. with their development of a multivariable logistic regression model to compute a type 1 diabetes genetic risk score (GRS)15. Using this model, they were able to discern patients from controls with AUROC of 0.87 to more accurately predict disease progression. In more recent works, Oram et al.16 and Patel et al.17 utilized a log-additive model18 to discriminate type 1 diabetes from type 2 diabetes patients (AUROC of 0.88) and subjects with monogenic forms of diabetes (AUROC of 0.87) from the Wellcome Trust Case Control Consortium (WTCCC) of British subjects with European Caucasian ancestry. Given the alarming increase in diabetes prevalence, particularly of type 2 diabetes in youth19 that may be difficult to discern from type 1 diabetes, there is a need to develop the GRS as a diagnostic tool. This requires examination of GRS in geographically distinct and demographically diverse patient populations with the potential for varying allele frequencies.
Our efforts reported herein expand previous analyses to southeastern United States populations, include considerations of race and ethnicity, and support the differential diagnostic utility of a GRS for clinical applications in type 1 diabetes. Such studies in distinct cohorts offer the potential to further refine the stratification of at-risk subjects and potentially elucidate type 1 diabetes subtypes. We examined several key features of the GRS, including its relationship with age of onset and the contribution of HLA class-II to that relationship, its utility in the context of subject race/ethnicity, and its capacity to aid in risk stratification.
Results
GRS effectively discerns type 1 diabetes patients and AAb+ individuals from controls and relatives within a Caucasian cohort
Until now, type 1 diabetes GRS regression models put forth by Oram et al. and Patel et al.16,17 have only been tested and validated in European Caucasian cohorts15,16,17. We sought to determine the efficacy of a similar GRS, calculated as previously described16,17, in our regional southeastern U.S. cohort comprised of type 1 diabetes patients [n = 637, age (years) median (interquartile range) 15.50 (11.67–19.75)], first-degree relatives [≤1AAb+, n = 790, age 20.75 (11.29–40.42)], second-degree relatives [≤1AAb+, n = 68, age 26.79 (12.33–45.02)], at-risk relatives (≥2AAb+, n = 46, age 15.33 (10.33–33.83)], and controls [n = 405, age 23.92 (16.42–33.25)] of various racial and ethnic backgrounds, including Caucasian, African, and Asian Americans (Fig. 1A top and 1B; Supplemental Tables 1 and 2). A genotyping panel composed of HLA imputing SNPs (Fig. 1C) and putatively identified non-HLA SNPs associated with disease risk (Fig. 1D) was utilized to compute a log-additive type 1 diabetes GRS. Since the HLA locus does not fit this log-additive model1,20, we used published ORs for imputed HLA diplotypes (Supplemental Table 3).
We began with the Caucasian subject set from the UFDI cohort, which may be genetically distinct from the European cohorts. Notably, 13.49% the Caucasian subjects reported Hispanic/Latino ethnicity (Fig. 1A bottom). The GRS was significantly higher for type 1 diabetes patients (n = 478, mean ± SD 0.277 ± 0.03, P < 0.0001 for all comparisons) compared to controls (n = 290, 0.231 ± 0.03), second-degree relatives (n = 33, 0.244 ± 0.03) and first-degree relatives (n = 611, 0.253 ± 0.03) (Fig. 2A). Anticipated dilution of risk in first- and second-degree relatives was also evident. Notably, at-risk relatives (≥2 AAb+) had a GRS (0.274 ± 0.03, P > 0.9999) similar to type 1 diabetes patients (Fig. 2A). ROC analysis demonstrated the GRS significantly discriminated between type 1 diabetes patients and control subjects, providing 73.9% specificity with 83.3% sensitivity for accurately detecting type 1 diabetes by GRS alone (AUROC = 0.86, Fig. 2B). As expected, it was less effective at discriminating type 1 diabetes from first-degree relatives (65.0% specificity, 67.4% sensitivity, AUROC = 0.72, Fig. 2B). To examine the utility of this model in classifying individuals as type 1 diabetes subjects, controls, or relatives, we calculated the balanced accuracy across the distribution of the GRS range. We found that 0.251 was the optimal threshold to classify type 1 diabetes and control subjects in this GRS model, with accuracy of 79.0% (Fig. 2C) and that a GRS threshold of 0.267 yielded accuracy of 66.7% for classifying type 1 diabetes subjects from relatives (Fig. 2D).
Multiple AAb+ subjects under 20 years of age have the highest probability of progressing to disease and may already have subclinical type 1 diabetes21. It is less clear whether older multiple AAb+ individuals will progress to disease. Consistent with this, at-risk subjects under 20 years of age in our cohort had a GRS (0.277 ± 0.03) identical to the type 1 diabetes patients’ GRS. We observed that 56.5% of at-risk subjects under 20 years of age had GRS above the UFDI type 1 diabetes patient cohort 50th centile (Fig. 2E, solid horizontal line) compared to only 22.2% of the at-risk subjects above 20 years of age. Further, for subjects <20 years of age, GRS of at-risk subjects was significantly higher than relatives, while for >20 year olds, GRS of at-risk subjects was similar to relatives (Fig. 2F).
In addition to reporting race, subjects also reported ethnicity as Hispanic/Latino or non-Hispanic/Latino. Hispanic individuals represent a genetically diverse population with mostly European, Native American, and African admixtures22,23. Since this GRS was modeled from Caucasian American and European genetic frequencies, we sought to investigate its accuracy on this diverse population. 12.9% (n = 252) of the total UFDI cohort (all races) self-reported as Hispanic/Latino. Within these, 77.38% (n = 195) reported as Caucasian, 3.57% (n = 9) as African American, and 19.05% (n = 48) as Other (multiple race or not reported, Fig. 3A; Supplemental Table 2). We found that the GRS discriminated Hispanic/Latino Caucasian patients (n = 45, 0.281 ± 0.02) from Hispanic/Latino Caucasian controls (n = 37, 0.232 ± 0.03; AUROC = 0.90) with efficacy comparable to non-Hispanic/Latino Caucasian patients (n = 433, 0.275 ± 0.03) from non-Hispanic/Latino Caucasian controls (n = 253, 0.230 ± 0.04; AUROC = 0.85) (Fig. 3B). Among Caucasian controls, mean GRS was similar for Hispanic/Latino and non-Hispanic/Latino cohorts (Mann-Whitney P = 0.981). Moreover, the GRS of this U.S.-derived Caucasian cohort, which includes Hispanic/Latino Caucasian subjects, was comparable to the GRS of a European-derived Caucasian cohort (Supplemental Table 4)16,17,24. The Hispanic African American and Hispanic Other cohorts are shown for comparison, but were not sufficiently powered for analysis (Fig. 3B).
Higher GRS associates with a younger age at diagnosis in Caucasian subjects
We next addressed whether GRS was associated with type 1 diabetes age of onset. Indeed, among Caucasian type 1 diabetes subjects, we observed a significant negative correlation between GRS and age of diagnosis (Pearson’s correlation r = −0.23, P < 0.0001, Fig. 4A). Subjects diagnosed after age 16 had lower GRS than those diagnosed from 8–16 years of age and those diagnosed under age 8 (Fig. 4B), suggesting that a higher GRS may predict earlier disease onset. Prior studies noted the HLA association with earlier onset of disease; however, the contribution of the non-HLA component of risk was not clear25,26,27. Our data clearly demonstrated the majority, if not all, of this negative age association was conferred by the HLA risk component. When the non-HLA loci were removed from the GRS calculation, the negative correlation with age (r = −0.25, P < 0.0001) was virtually the same as the full GRS (Fig. 4C,D). Conversely, when HLA was removed from the calculation, no association with age at diagnosis was observed (Fig. 4E,F).
We next sought to determine which HLA diplotypes may be affecting the age at diagnosis. High-risk HLA-DR3-DQ2 (simplified to DR3) and HLA-DR4-DQ8 (simplified to DR4) were imputed and subjects were categorized into six diplotypes in combination with lower-risk HLA (collectively denoted as DRX): DR3/DR4, DR4/DR4, DR3/DR3, DR4/DRX, DR3/DRX, and DRX/DRX. We observed the known contribution of HLA-DR3/DR4 to earlier clinical onset28, as well as a significant difference in age of diagnosis between HLA-DR3/DR4 and HLA-DR4/DR4 subjects (Fig. 5A). Distributions of numbers (Fig. 5B) and percentages (Fig. 5C) of age of diagnosis stacked by HLA risk diplotypes illustrate the skewing of HLA-DR3/DR4 individuals to earlier diagnoses. To quantify this observation, we calculated the proportion of patients diagnosed prior to 8 years of age, from 8–16 years of age, and older than 16 years of age for each of the six HLA categories (Table 1). We found that the proportion of patients with HLA-DR3/DR4 diagnosed before age 8 (44.4%) was 5.6 times greater (P < 0.01) than those diagnosed after age 16 (7.9%), while the proportion for the other five HLA categories diagnosed before age 8 (28.9%) was only 1.5 times greater than those diagnosed after age 16 (19.1%). Conversely, significantly more patients with HLA-DR4/DR4 (P < 0.01) and DRX/DRX (P < 0.05) diplotypes were diagnosed after age 16. Interestingly, HLA-DR3/DR3 patients were more likely to be diagnosed between age 8 and 16 (P < 0.01, Table 1). These results suggest that contribution of high-risk HLA-DR3 and HLA-DR4 haplotypes to age of clinical onset may be more nuanced than previously reported25,26,27,28.
Oram and Patel initially used a similar GRS model as a tool to assist in the differential diagnoses of early onset type 2 diabetes and monogenic forms of diabetes from type 1 diabetes16,17. Within the UFDI cohort, we identified five type 1 diabetes subjects with GRS values below the 99th percentile prediction band of GRS versus age at diagnosis (Fig. 4A). Initially, we observed 3 additional type 1 diabetes subjects with exceptionally low GRS that were all AAb- at onset (data not shown). Clinical follow-up revealed that two of these subjects (subject 1: age at diagnosis (yrs) = 10, BMI = 28.0, GRS = 0.118; subject 2: age at diagnosis = 15, BMI = 19.3, GRS = 0.153) were undergoing MODY testing (awaiting patient compliance), and the third (age at diagnosis = 16, BMI = 42.0, GRS = 0.184) has been re-diagnosed as having type 2 diabetes. These results support the utility of GRS in aiding in the differential diagnosis of diabetes forms when used in combination with standard clinical assessments and AAb detection.
Current GRS models are less robust for assessing type 1 diabetes risk in U.S. racial minority groups
We next examined the utility of GRS to discriminate type 1 diabetes subjects from controls or relatives within the Asian American and African American (includes Hispanic/Latino African Americans, Supplemental Table 2) subsets of the UFDI cohort. This notion emanates from previous HLA associations in African American type 1 diabetes subjects29, in addition to clear alterations in the allele frequencies of racial groups for the putative risk loci reported in the 1000 Genomes project30. Similar to Caucasian subjects (Fig. 2), GRS was significantly higher in Asian American type 1 diabetes subjects compared to controls (Fig. S1A). GRS appeared to accurately discriminate type 1 diabetes patients from controls (AUROC = 0.92; P = 0.0002) and from relatives with (AUROC = 0.86; P = 0.04) (Fig. S1B), although this cohort is insufficiently powered to draw conclusive results at a population scale (Supplemental Table 1). Additionally, no multiple AAb+ at-risk Asian American subjects were enrolled in this study; hence, there is a need to validate these findings in a larger cohort.
Once again, GRS was significantly higher in African American type 1 diabetes patients (n = 84) compared to controls (n = 63) as well as relatives (n = 118), but the study was not sufficiently powered to detect significant differences from multiple AAb+ at-risk African American subjects (n = 6, Fig. 6A; Supplemental Table 1). Within the African American cohort, we found GRS was less robust in discerning type 1 diabetes patients from controls (63.0% sensitivity, 85.3% specificity, AUROC = 0.75) or from first-degree relatives (63.0% sensitivity, 61.5% specificity, AUROC = 0.63) (Fig. 6B). Peak balanced accuracy was 68.98% at GRS = 0.233 for classifying African American subjects as type 1 diabetes patients or controls and 60.30% at GRS = 0.233 for classifying subjects as patients or relatives (Fig. 6C,D). Additionally, the HLA-mediated association between GRS and age of diagnosis observed in Caucasian patients was lost in the African American cohort (Figs S2–S3; Supplemental Table 5). HLA associated with the highest risk in Caucasians were detected in lower frequencies in African Americans, where the three highest risk HLA (HLA-DR3/DR4, -DR4/DR4, and -DR3/DR3) were only detected in African American patients and not in controls (Fig. S3; Table 2). Importantly, the SNP array utilized herein did not impute the African American-derived HLA haplotypes shown to confer type 1 diabetes risk or protection29. Though only modestly powered, several non-HLA alleles tested for GRS did not confer risk in African Americans to the same degree as in Caucasians (Table 2). Notable risk differences were observed for three SNPs tested herein: SH2B3 conferred higher risk in UFDI African Americans (OR = 2.93 [95% CI, 1.22–7.03], P = 0.013) than UFDI Caucasians (OR = 1.30 [1.06–1.59], P = 0.014); CTRB1/2 was protective in African Americans, though it did not achieve significance (OR = 0.55 [0.97–3.44], P = 0.08) in contrast to Caucasians where CTRB1/2 was clearly associated with risk (OR = 1.56 [95% CI, 1.12–2.17], P = 0.008); GAB3 only conferred risk in African Americans (OR = 1.82 [1.09–3.04], P = 0.028) and not in Caucasians (OR = 0.89 [0.88–1.44], P > 0.1) (Table 2).
Discussion
Focused genetic testing is relatively inexpensive, non-invasive, and may be scaled for population screening efforts. The implementation of such tests may be useful for refining efforts to identify subjects who would benefit from more costly AAb and interventional screening efforts that may need to be repeated over time. Given that type 1 diabetes has known genetic components conferring susceptibility, several models designed to stratify subjects as high- and low-risk have been developed in recent years. One model was recently shown to assist in the differential diagnosis of type 1 diabetes from early-onset type 2 diabetes and from monogenic diabetes16,17. We emulated this model to assess its capacity to stratify subgroups (i.e., controls, low- and high-risk relatives, type 1 diabetes patients) using cumulative genetic risk in our regional cross-sectional cohort. While the Oram et al. and the Winkler et al. models both report similar AUROC, we chose to use the Oram et al. model because it employs published, accessible OR to set weights for T1D risk loci. Using this approach, we were able to segregate Caucasian type 1 diabetes subjects from controls with 79.0% accuracy and an AUROC = 0.86. As a comparison, in a European Caucasian cohort roughly 7 times larger, Winkler et al. reported a patient versus control ROC AUC = 0.8731. Of note, GRS of our UFDI cohort were comparable to those previously scored for the WTCCC16,17,24 (Supplemental Table 5). As expected, the GRS was higher in subjects with type 1 diabetes compared to first-degree relatives, second-degree relatives, and controls. Most importantly, the GRS was also significantly higher in relatives at the highest-risk for disease development (≥2 AAb+) compared lower-risk relatives (≤1 AAb+); this was the case even in subjects under 20 years of age (0.277 ± 0.03). We note that this current study did not measure anti-insulin autoantibodies, which may have affected subject assignment as ≤1 AAb+ relatives and ≥2AAb+ at-risk relatives. These data support the notion that genotyping a limited number of selected SNPs allows for the identification of subjects at elevated-risk for developing disease. This notion has important implications for GRS use for subject enrollment into mechanistic and natural history studies of type 1 diabetes. It also highlights potential for large-scale population screening efforts for clinical diagnostics, particularly as per sample genotyping costs decline over time. We acknowledge that our genotyping is by no means comprehensive, and the potential may exist to improve prediction and ROC values as additional validated loci and causative SNPs are defined. This may be particularly true regarding ROC for type 1 diabetes subjects versus relatives sharing an appreciable portion of the genome. Ultimately, long-term longitudinal studies such as TEDDY, DAISY, TrialNet Natural History Study, and BABYDIAB will be most informative for such analyses14,15,32.
Genetic screening may not only identify high-risk individuals, but may also indicate appropriate ages to implement other screening regimens, such as AAb testing. We found a significant negative correlation between GRS and age of type 1 diabetes diagnosis, which was nearly completely accounted for by HLA diplotype. While the highest-risk HLA-DR3/DR4 diplotype was associated with the earliest age of diagnosis, as has been previously shown25,26,27, diagnosis occurred significantly later and coincided with puberty in subjects carrying the HLA-DR3/DR3 diplotype. The genes that comprise GRS account for a major proportion of the heritability of type 1 diabetes but explain much less of the variation of the heterogeneity of age of diagnosis. Improvements in this latter capacity may require better powered approaches or GWAS designed to identify genetic associations of type 1 diabetes characteristics (such as diagnosis age33 and rate of β-cell decline), which may be distinct from the variants associated with disease development.
The SEARCH for Diabetes in Youth Study recently reported that type 1 diabetes prevalence was 1 in 392 for Caucasian Americans under 20 years of age, 1 in 617 for African Americans, and 1 in 1667 for Asian Americans34,35. Currently, the vast majority of type 1 diabetes genetics studies are limited to Caucasian cohorts. However, the figures above imply that for African Americans, type 1 diabetes prevalence is almost 2/3 that of Caucasian Americans, while for Asian Americans it is almost 1/4. Thus, closer examinations of type 1 diabetes genetics within these underrepresented racial minorities in the U.S. must be performed. A limitation of the current study was our reliance on self-reported race and ethnicity, as our SNP array lacked informative ancestral markers commonly utilized in high density genome-wide arrays for imputing and assigning race and ethnicity. Nevertheless, our analysis indicated that GRS could discriminate type 1 diabetes subjects from controls in a small cohort of subjects identifying as Asian American, but larger studies are need to validate and extend these findings. For African Americans however, GRS was less effective in discerning type 1 diabetes subjects from controls, and the association between a higher GRS with early disease onset was lost. These observations are likely related to known differences in HLA-conferred disease risk or protection in the context of race27,29,36. Thus, GRS models suitable for African Americans would likely need to impute these haplotypes. Additionally, the current set of type 1 diabetes risk loci, which were identified in predominantly Caucasian cohorts, may be less effective for assessing risk in non-Caucasian individuals. These findings underscore the need to perform type 1 diabetes incidence studies and GWAS in non-Caucasian groups, enabling development of a GRS model that accounts for heterogeneity in populations.
Type 1 diabetes risk-loci that significantly predict disease in African Americans but not Caucasian Americans may underlie pathophysiologic differences in disease processes between races and may explain how African Americans accrue risk without the classical high-risk HLA types originally defined in Caucasians. Here we found that risk variants of two genes, SH2B3 and GAB3, were more predictive of type 1 diabetes in African American subjects. Interestingly, both of these genes encode proteins that affect myeloid cell development and activation. SH2B3 encodes the protein LNK, which modules cytokine signaling in myeloid cells via the signaling adapter, JAK237,38,39,40,41. Indeed, the risk variant of SH2B3/Lnk (rs3184504) is associated with altered expression of key elements of IFNγ signaling42. Thus, it is likely that SH2B3/Lnk variants modulate myeloid innate immune cells through altered sensitivity to various cytokines. The GAB3 protein product interacts with the M-CSF receptor and drives macrophage differentiation43. How the risk variant of GAB3 affects this process remains unknown. As these functional studies advance, it will be critical that investigators consider the race of study subjects as well as the presence of additional gene variations that may affect the same cells/pathways.
An important aspect of our study assessed the effect of Hispanic/Latino ethnicity within southeastern U.S. Caucasians on GRS. We found that the GRS robustly discriminates type 1 diabetes patients from controls in Hispanic/Latino Caucasian to the same degree as non-Hispanic/Latino Caucasian cohorts. The prevalence of type 1 diabetes in Hispanic/Latino American youth is roughly half that of non-Hispanic/Latino Caucasians34, yet is increasing at a greater annual rate (4.2%) versus non-Hispanic/Latino Caucasian populations (1.2%)44. Given the concurrent increase in type 2 diabetes in non-Caucasian American youth44, our findings may have significant implications for utilization of GRS in both research and clinical settings in these understudied populations.
Although our cross-sectional cohort does not include routine follow-up, we were able to utilize GRS to identify type 1 diabetes subjects whose diagnoses were questionable or have been changed subsequent to their enrollment, which further demonstrates the clinical utility of GRS as a tool to improve differential diagnoses of type 1 diabetes from early-onset type 2 and monogenic diabetes16,17. This may justify the use of GRS as a screening tool at diagnosis in order to promote the concept of precision medicine when determining which therapies may be best suited for a particular patient. Notwithstanding, since GRS only modestly discerns type 1 diabetes patients from first-degree relatives, there may be a capacity to improve the current model. The log-additive model for non-HLA risk may not be the most accurate method for computing GRS, possibly resulting in decreased specificity or loss of age-associated non-HLA risk45. Additionally, there may be more comprehensive methods to capture all HLA-associated type 1 diabetes-risk with more HLA variants (reviewed in1). Since several loci contain genes that are predicted to confer overlapping functional effects (e.g., CD25, IL2, PTPN2 in the IL-2 signaling pathway), one may expect a GRS model to include computations that account for such genetic synergies. However, this level of genetic risk modeling remains elusive, as Winkler et al. were unable to identify genetic interactions using a more extensive genotyping panel on a much larger cohort15. Moreover, models using genetics alone are not expected to predict type 1 diabetes with 100% accuracy since environmental, epigenetic, and stochastic factors (e.g., immunoreceptor V(D)J gene recombination) are also thought to impact overall risk. Perhaps even more confounding is the notion that genetic and environmental risk interactions may not be static phenomena. This may be most evident by the concomitant trends of decreasing proportion of high risk HLA in type 1 diabetes patients and increasing overall type 1 diabetes prevalence46,47. All of these aforementioned factors that are missing from this GRS model may contribute an unknown amount of bias negatively impacting GRS selectivity. Ultimately, improved accuracy of diabetes prediction models will likely require a better understanding of epistatic genetic and environmental risk interactions.
The results of this and other studies imply GRS could represent a low-cost means to assist in general population screening to identify patients who have increased risk of developing type 1 diabetes. We therefore envision the utilization of GRS to guide future trial recruitment and cohort stratification efforts. These observations strengthen the argument for focused genetic screening to monitor progression in the clinic, improve functional studies, facilitate biomarker identification, and optimize subject selection for interventional and natural history trials.
Methods
Subject enrollment and sample collection
Informed consent was obtained from subjects enrolled from outpatient clinics of the University of Florida, Gainesville, Florida; Nemours Children’s Hospital, Orlando, Florida; and Emory University, Atlanta, Georgia, under Institutional Review Board (IRB)-approval at each facility (IRB #201400709). All experiments were performed in accordance with relevant guidelines and regulations. Genomic DNA and serum samples were collected and stored at −20 °C from 1,946 research participants together termed the University of Florida Diabetes Institute (UFDI) cohort. This collection included control subjects [type 1 diabetes-unaffected and non-first- or -second-degree relatives of type 1 diabetes patients] (n = 405), first-degree relatives with ≤1 type 1 diabetes-relevant autoantibody (AAb) (n = 790), second-degree relatives with ≤1 AAb (n = 68), multiple AAb positive at-risk relatives (≥2AAb + , n = 46), and type 1 diabetes patients (n = 637). Type 1 diabetes status was assigned according to clinician diagnosis. Subjects self-reported race as Caucasian/White, African American/Black, Asian, Pacific Islander/Hawaiian, Native American/Alaskan, or Multiple/Other and separately indicated ethnicity as Hispanic/Latino or non-Hispanic/Latino (Supplemental Tables 1 and 2; Fig. 1A). The geometric mean ± SD for the type 1 diabetes diagnosis age was 8.94 ± 2.18 years (Fig. 1B).
AAb measurement
AAbs against type 1 diabetes-related autoantigens [i.e., glutamic acid decarboxylase (GAD), insulinoma-associated protein 2 (IA-2), and zinc transporter 8 (ZnT8)] were measured from serum samples via ELISA kits (KRONUS Inc., Star, ID) according to the manufacturer’s instructions48.
DNA preparation
DNA was prepared via QiaCube high-throughput nucleic acid purification system according to manufacturer’s recommendations (Qiagen, Hilden, Germany). Purified DNA was genotyped on either the custom array or manually, as described below. Samples missing HLA SNP calls or with <90% of non-HLA risk measured were excluded. The OR of the type 1 diabetes-risk alleles were derived from Immunobase.org (Supplemental Table 3, Fig. 1D).
Imputing HLA-DR-DQ diplotypes
HLA was imputed and odds ratios (OR) were computed as previously described16,17 (Fig. 1C; Supplemental Table 3). Following HLA imputation, samples were assigned one of six HLA diplotype categories: DR3-DQ2/DR4-DQ8, DR4-DQ8/DR4-DQ8, DR3-DQ2/DR3-DQ2, DR4-DQ8/X, DR3-DQ2/X, X/X, where X = non-DR3-DQ2 or non-DR4-DQ8. In addition, the highly protective HLA-DR15DQ6 haplotype was imputed, as well as HLA class-I A24 and B57, which were shown to confer susceptibility and protection, respectively, when conditioned on HLA class-II1,49.
Single nucleotide polymorphism (SNP) selection and genotyping
The HLA-DR and HLA-DQ region plus additional loci with known associations for type 1 diabetes-risk2 were considered for inclusion in a custom Taqman SNP genotyping array (ThermoFisher, Carlsbad, CA). Since the list of risk loci changes as more GWAS and meta-analyses are completed, the loci in this study are limited to those that are curated on immunobase.org as of October 2017. SNP assays passed quality control (QC) when they generated >95% successful call rates and <5% intra-sample discordance. SNPs that failed QC were excluded. Some key SNPs that either failed QC on the array or were not included on the array (rs2187668, rs7454108, rs3129889, rs1264813, rs2395029, and rs2292239) were manually genotyped using validated Taqman assays (ThermoFisher, Carlsbad, CA)16. 32 SNPs passed QC (Supplemental Table 3). The Taqman genotyping array and individual taqman assays were performed according to manufacturer instructions.
Calculating GRS
The GRS calculation emulates a previously reported multivariate logistic regression model employed by Oram et al. and Patel et al.16,17:
where β is the natural log of the OR and s is the number of risk alleles (0, 1, or 2) carried for SNP i of n loci tested. Chromosome X SNPs in male subjects were counted as 0 or 2, which assumes a dominant risk effect in the hemizygous state. H l is the HLA diplotype risk for combinations of DR3-DQ2, DR4-DQ8, and X. The summed risk was then divided by the number of alleles tested. This method used identical SNP imputing for class I and class II HLA as Oram et al. and Patel et al., and a partially overlapping set SNPs to compute non-HLA risk (compare Supplemental Table 3 to Oram et al.16).
Statistics
Data were graphed and analyses performed using GraphPad Prism software version 7 (San Diego, CA). Data are presented as ROC curve with AUC, as Tukey box and whisker plots or mean ± SD bar graphs compared via Kruskal-Wallis with Dunn’s multiple comparisons testing, scatter plots with linear regression and Pearson Correlation, or in tabular form and compared via Fisher’s exact test. Fisher’s exact test was performed using the Scipy package (version 0.18.1, https://scipy.org/) in Python3. Balanced accuracy was calculated for thresholds across the GRS range as [(predicted T1D/actual T1D) + (predicted non-T1D/actual non-T1D)]/2. Significance was defined as P < 0.05.
References
Noble, J. A. Immunogenetics of type 1 diabetes: A comprehensive review. J Autoimmun 64, 101–112, https://doi.org/10.1016/j.jaut.2015.07.014 (2015).
Pociot, F. et al. Genetics of type 1 diabetes: what’s next? Diabetes 59, 1561–1571, https://doi.org/10.2337/db10-0076 (2010).
Noble, J. A. et al. The role of HLA class II genes in insulin-dependent diabetes mellitus: molecular analysis of 180 Caucasian, multiplex families. Am J Hum Genet 59, 1134–1148 (1996).
Risch, N. Assessing the role of HLA-linked and unlinked determinants of disease. Am J Hum Genet 40, 1–14 (1987).
Lambert, A. P. et al. Absolute risk of childhood-onset type 1 diabetes defined by human leukocyte antigen class II genotype: a population-based study in the United Kingdom. J Clin Endocrinol Metab 89, 4037–4043, https://doi.org/10.1210/jc.2003-032084 (2004).
Leslie, R. D., Palmer, J., Schloot, N. C. & Lernmark, A. Diabetes at the crossroads: relevance of disease classification to pathophysiology and treatment. Diabetologia 59, 13–20, https://doi.org/10.1007/s00125-015-3789-z (2016).
Buzzetti, R., Zampetti, S. & Maddaloni, E. Adult-onset autoimmune diabetes: current knowledge and implications for management. Nat Rev Endocrinol 13, 674–686, https://doi.org/10.1038/nrendo.2017.99 (2017).
Nathan, B. M. et al. Dysglycemia and Index60 as Prediagnostic End Points for Type 1 Diabetes Prevention Trials. Diabetes Care 40, 1494–1499, https://doi.org/10.2337/dc17-0916 (2017).
Battaglia, M. et al. Understanding and preventing type 1 diabetes through the unique working model of TrialNet. Diabetologia 60, 2139–2147, https://doi.org/10.1007/s00125-017-4384-2 (2017).
Ziegler, A. G., Bonifacio, E. & Group, B.-B. S. Age-related islet autoantibody incidence in offspring of patients with type 1 diabetes. Diabetologia 55, 1937–1943, https://doi.org/10.1007/s00125-012-2472-x (2012).
Giannopoulou, E. Z. et al. Islet autoantibody phenotypes and incidence in children at increased risk for type 1 diabetes. Diabetologia 58, 2317–2323, https://doi.org/10.1007/s00125-015-3672-y (2015).
Redondo, M. J., Oram, R. A. & Steck, A. K. Genetic Risk Scores for Type 1 Diabetes Prediction and Diagnosis. Curr Diab Rep 17, 129, https://doi.org/10.1007/s11892-017-0961-5 (2017).
Winkler, C. et al. A strategy for combining minor genetic susceptibility genes to improve prediction of disease in type 1 diabetes. Genes Immun 13, 549–555, https://doi.org/10.1038/gene.2012.36 (2012).
Steck, A. K. et al. Improving prediction of type 1 diabetes by testing non-HLA genetic variants in addition to HLA markers. Pediatr Diabetes 15, 355–362 (2014).
Winkler, C. et al. Feature ranking of type 1 diabetes susceptibility genes improves prediction of type 1 diabetes. Diabetologia 57, 2521–2529, https://doi.org/10.1007/s00125-014-3362-1 (2014).
Oram, R. A. et al. A Type 1 Diabetes Genetic Risk Score Can Aid Discrimination Between Type 1 and Type 2 Diabetes in Young Adults. Diabetes Care 39, 337–344, https://doi.org/10.2337/dc15-1111 (2016).
Patel, K. A. et al. Type 1 Diabetes Genetic Risk Score: A Novel Tool to Discriminate Monogenic and Type 1 Diabetes. Diabetes 65, 2094–2099, https://doi.org/10.2337/db15-1690 (2016).
Evans, D. M., Visscher, P. M. & Wray, N. R. Harnessing the information contained within genome-wide association studies to improve individual prediction of complex disease risk. Hum Mol Genet 18, 3525–3531, https://doi.org/10.1093/hmg/ddp295 (2009).
Mayer-Davis, E. J. et al. Incidence Trends of Type 1 and Type 2 Diabetes among Youths, 2002-2012. N Engl J Med 376, 1419–1429, https://doi.org/10.1056/NEJMoa1610187 (2017).
Erlich, H. et al. HLA DR-DQ haplotypes and genotypes and type 1 diabetes risk: analysis of the type 1 diabetes genetics consortium families. Diabetes 57, 1084–1092, https://doi.org/10.2337/db07-1331 (2008).
Ziegler, A. G. et al. Seroconversion to multiple islet autoantibodies and risk of progression to diabetes in children. Jama 309, 2473–2479, https://doi.org/10.1001/jama.2013.6285 (2013).
Auton, A. et al. A global reference for human genetic variation. Nature 526, 68–74, https://doi.org/10.1038/nature15393 (2015).
Bryc, K., Durand, E. Y., Macpherson, J. M., Reich, D. & Mountain, J. L. The genetic ancestry of African Americans, Latinos, and European Americans across the United States. Am J Hum Genet 96, 37–53, https://doi.org/10.1016/j.ajhg.2014.11.010 (2015).
Consortium, W. T. C. C. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447, 661–678, https://doi.org/10.1038/nature05911 (2007).
Komulainen, J. et al. Clinical, autoimmune, and genetic characteristics of very young children with type 1 diabetes. Childhood Diabetes in Finland (DiMe) Study Group. Diabetes Care 22, 1950–1955 (1999).
Gillespie, K. M., Gale, E. A. & Bingley, P. J. High familial risk and genetic susceptibility in early onset childhood diabetes. Diabetes 51, 210–214 (2002).
Valdes, A. M. et al. Use of class I and class II HLA loci for predicting age at onset of type 1 diabetes in multiple populations. Diabetologia 55, 2394–2401, https://doi.org/10.1007/s00125-012-2608-z (2012).
Awa, W. L. et al. HLA-DR genotypes influence age at disease onset in children and juveniles with type 1 diabetes mellitus. Eur J Endocrinol 163, 97–104, https://doi.org/10.1530/EJE-09-0921 (2010).
Noble, J. A., Johnson, J., Lane, J. A. & Valdes, A. M. HLA class II genotyping of African American type 1 diabetic patients reveals associations unique to African haplotypes. Diabetes 62, 3292–3299, https://doi.org/10.2337/db13-0094 (2013).
Abecasis, G. R. et al. An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56–65, https://doi.org/10.1038/nature11632 (2012).
Winkler, C. et al. Feature ranking of type 1 diabetes susceptibility genes improves prediction of type 1diabetes. Diabetologia 57, 2521–2529, https://doi.org/10.1007/s00125-014-3362-1 (2014).
Törn, C. et al. Role of Type 1 Diabetes-Associated SNPs on Risk of Autoantibody Positivity in the TEDDY Study. Diabetes 64, 1818–1829, https://doi.org/10.2337/db14-1497 (2015).
Inshaw, J. R. J., Walker, N. M., Wallace, C., Bottolo, L. & Todd, J. A. The chromosome 6q22.33 region is associated with age at diagnosis of type 1 diabetes and disease risk in those diagnosed under 5 years of age. Diabetologia. https://doi.org/10.1007/s00125-017-4440-y (2017).
Dabelea, D. et al. Prevalence of type 1 and type 2 diabetes among children and adolescents from 2001 to 2009. JAMA 311, 1778–1786, https://doi.org/10.1001/jama.2014.3201 (2014).
Pettitt, D. J. et al. Prevalence of diabetes in U.S. youth in 2009: the SEARCH for diabetes in youth study. Diabetes Care 37, 402–408, https://doi.org/10.2337/dc13-1838 (2014).
Erlich, H. et al. HLA DR-DQ haplotypes and genotypes and type 1 diabetes risk: analysis of the type 1 diabetes genetics consortium families. Diabetes 57, 1084–1092, https://doi.org/10.2337/db07-1331 (2008).
Bersenev, A. et al. Lnk constrains myeloproliferative diseases in mice. J Clin Invest 120, 2058–2069, https://doi.org/10.1172/JCI42032 (2010).
Gery, S. et al. Lnk inhibits myeloproliferative disorder-associated JAK2 mutant, JAK2V617F. J Leukoc Biol 85, 957–965, https://doi.org/10.1189/jlb.0908575 (2009).
Yoshida, K. et al. The landscape of somatic mutations in Down syndrome-related myeloid disorders. Nat Genet 45, 1293–1299, https://doi.org/10.1038/ng.2759 (2013).
Gonzales, A. J. et al. Oclacitinib (APOQUEL(®)) is a novel Janus kinase inhibitor with activity against cytokines involved in allergy. J Vet Pharmacol Ther 37, 317–324, https://doi.org/10.1111/jvp.12101 (2014).
Waters, M. J. & Brooks, A. J. JAK2 activation by growth hormone and other cytokines. Biochem J 466, 1–11, https://doi.org/10.1042/BJ20141293 (2015).
Westra, H. J. et al. Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat Genet 45, 1238–1243, https://doi.org/10.1038/ng.2756 (2013).
Wolf, I. et al. Gab3, a new DOS/Gab family member, facilitates macrophage differentiation. Mol Cell Biol 22, 231–244 (2002).
Mayer-Davis, E. J. et al. Incidence Trends of Type 1 and Type 2 Diabetes among Youths, 2002–2012. N Engl J Med 376, 1419–1429, https://doi.org/10.1056/NEJMoa1610187 (2017).
Abraham, G., Kowalczyk, A., Zobel, J. & Inouye, M. Performance and robustness of penalized and unpenalized methods for genetic prediction of complex human disease. Genet Epidemiol 37, 184–195, https://doi.org/10.1002/gepi.21698 (2013).
Steck, A. K., Armstrong, T. K., Babu, S. R. & Eisenbarth, G. S. & Consortium, T. D. G. Stepwise or linear decrease in penetrance of type 1 diabetes with lower-risk HLA genotypes over the past 40 years. Diabetes 60, 1045–1049, https://doi.org/10.2337/db10-1419 (2011).
Atkinson, M. A., Eisenbarth, G. S. & Michels, A. W. Type 1diabetes. Lancet 383, 69–82, https://doi.org/10.1016/S0140-6736(13)60591-7 (2014).
Wasserfall, C. et al. Validation of a Rapid Type 1 Diabetes Autoantibody Screening Assay for Community Based Screening of Organ Donors to Identify Subjects at Increased Risk for the Disease. Clin Exp Immunol, Epub ahead of print, https://doi.org/10.1111/cei.12797 (2016).
Noble, J. A. et al. HLA class I and genetic susceptibility to type 1 diabetes: results from the Type 1Diabetes Genetics Consortium. Diabetes 59, 2972–2979, https://doi.org/10.2337/db10-0699 (2010).
Acknowledgements
We thank the blood donors who graciously participated in these studies. We are grateful for the efforts of the physicians, nursing staff, and administrators who facilitate human sample research, specifically Jennifer Hosford and Jamie Thomas. We thank members of the Brusko and Atkinson Laboratories for helpful discussions. These efforts were supported by grants from the National Institutes of Health P01 AI42288 (MAA), R01 DK106191 (TMB), UC4 DK104194 (CEM), and from the JDRF Career Development Award (2–2012–280 to TMB). RAO is supported by a Diabetes UK Harry Keen Fellowship. DJP is supported by the JDRF Postdoctoral Fellowship Award (2-PDF-2016-207-A-N).
Author information
Authors and Affiliations
Contributions
D.J.P. conceived of the study, researched the data, and wrote the manuscript; C.H.W. conceived of the study, researched the data, and reviewed/edited the manuscript; R.A.O. researched the data, contributed to discussion, and reviewed/edited the manuscript; M.D.W. researched the data and reviewed/edited the manuscript; A.L.P. wrote the manuscript and contributed to discussion; A.B.M., M.J.H., D.A.S., M.A.W., C.E.M. and M.A.A. contributed to discussion and reviewed/edited the manuscript, T.M.B. conceived of the study and reviewed/edited the manuscript. All authors approved the final version of the manuscript. Todd M. Brusko is the guarantor of this work and as such, assumes full responsibility for the ethical acquisition and reporting of data.
Corresponding author
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Perry, D.J., Wasserfall, C.H., Oram, R.A. et al. Application of a Genetic Risk Score to Racially Diverse Type 1 Diabetes Populations Demonstrates the Need for Diversity in Risk-Modeling. Sci Rep 8, 4529 (2018). https://doi.org/10.1038/s41598-018-22574-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-018-22574-5
This article is cited by
-
Clinical utility of polygenic risk scores: a critical 2023 appraisal
Journal of Community Genetics (2023)
-
Utility of genetic risk scores in type 1 diabetes
Diabetologia (2023)
-
A genomic data archive from the Network for Pancreatic Organ donors with Diabetes
Scientific Data (2023)
-
Microbial risk score for capturing microbial characteristics, integrating multi-omics data, and predicting disease risk
Microbiome (2022)
-
Epidemiology of Type 1 Diabetes
Current Cardiology Reports (2022)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.