Introduction

The US aging population, defined as individuals 65 years of age and older, is expanding rapidly and is expected to grow for the next several decades, reaching 88 million by 2050 (ref. 1). Correspondingly, the prevalence and number of age-related diseases, including Alzheimer’s disease (AD), are anticipated to increase and further burden our healthcare system1. AD is the sixth leading cause of death in the US, and while the prevalence of other leading causes of death in the US has decreased or remained about the same, the number of deaths due to AD increased significantly from the year 2000 to 2019 (ref. 1).

AD is a fatal neurodegenerative disease attributed to neuronal damage and accumulating amyloid-β (Aβ) plaques in the brain, first affecting thinking, learning, and cognitive function1,2. The late-onset class of AD is sporadic, multifactorial, genetically complex (i.e., AD heritability has been estimated between 60% and 80% and is highly polygenic)3, and represents the majority (~90%) of total AD cases4. Moreover, it has been established that the progression of AD operates on a continuum from asymptomatic to AD-related dementia, with no distinct event denoting its onset; this progression is reflective of underlying accumulations of systemic and brain-specific pathology1,5,6. Early in progression there are two stages, known as preclinical AD and mild cognitive impairment (MCI) due to AD, which identify individuals with AD brain changes without and with associated symptoms, respectively1.

Two of the most prominent pathological changes observed in AD are the accumulation of extracellular Aβ peptides producing senile plaques that block cell-cell signaling at synapses and the accumulation of intracellular hyperphosphorylated tau protein resulting in neurofibrillary tangles that inhibit the transportation of essential molecules1,2,3,4,7. However, the molecular and etiological events initiating AD pathologies remain to be determined7. Additional pathological changes exhibited in AD are mitochondrial dysfunction, chronic inflammation, and excess oxidative stress (OS)4,7,8,9. Mitochondrial stress, OS, and mitochondrial dysfunction are theorized to enhance AD pathology and play important roles in its pathogenesis4,10 and impaired mitochondrial function has been implicated in both AD and metabolic disease2,4,7,8,11.

Type-2 diabetes (T2D) has been shown to share similar pathological features with AD such as impaired glucose utilization, reduced mitochondrial activity, and both metabolic and mitochondrial dysfunction7,11. T2D is characterized by hyperglycemia caused by insulin resistance leading to insulin deficiency7. Pathophysiological features of T2D include islets of Langerhans cells presenting β cell loss and/or dysfunction, and spontaneous islet amyloid polypeptide aggregation7. Furthermore, there are reports that insulin regulates Aβ and tau protein metabolism, and there are numerous reviews discussing the established connections between insulin resistance, diabetes, and AD7,12,13,14.

Besides the existing general AD healthcare problems1, there are gaps in the scientific literature characterizing race/ethnicity-specific risk for disease development and progression of AD15. Recently, it has been recognized that ethnic/racial factors significantly impact biological and medical risk factors for AD1,15,16. In the US, there are more non-Hispanic whites (NHWs) living with AD than other racial/ethnic groups, although, per capita, Hispanics are more likely to have AD1,17. Hispanic is a broad term, as this population encompasses a variety of ethnic subgroups that exhibit geographical and cultural differences17. The Hispanic population is represented by varying proportions of European, African, and Native American ancestry, and previous studies indicated the overall increased risk in Hispanics may be driven by a specific ethnic subgroup17. The presence of comorbid conditions (e.g., cardiovascular disease and diabetes) may explain, in part, the disparity in AD prevalence1. In the US, Mexican Americans (MAs) represent the majority of the Hispanic population and are one of the fastest-growing aging groups; it is projected that by 2050 the number of aging MAs will triple, while rates of AD will grow six-fold among Hispanics16,18,19.

AD pathophysiology in the MA population seems to be distinct from NHWs. For example, the apolipoprotein E (APOE) allele ε4, which confers the largest risk for AD in NHWs, is far less significant in MAs. This may be in part due to the decreased frequency of the ε4 allele, combined with a smaller effect size1,18,19. Correspondingly, a recent study determined that APOE ε4 allele carrier status did not confer risk for MCI in MAs20. MAs clearly suffer from significant AD health disparities when compared to NHWs, including (1) earlier onset (~10 years) of cognitive impairment, (2) higher rates of missed diagnosis, (3) later diagnosis, and (4) increased prevalence of modifiable risk factors1,16,18,19. Depression, stroke, T2D, and obesity in the MA population are common risk factors for developing cognitive impairment that is more common in MAs, although the etiology remains unclear16,21. Lifestyle and/or metabolic health may contribute directly to age-related neurodegeneration1. Combined, these data emphasize the importance of conducting further studies to improve the diagnosis, treatment, and prevention of AD in the MA population.

Recently, there is growing evidence suggesting a correlation between common pathological changes in AD and oxidative damage to nucleic acids4,8,22. Mitochondria are highly vulnerable to oxidative DNA damage because they are predominant generators of reactive oxygen species (ROS), and their mitochondrial genome lacks histones and has a reduced capacity to repair DNA7. OS is a prominent contributor to Aβ aggregation and hyperphosphorylated tau, and numerous studies have provided evidence suggesting OS contributes to tau pathology because fatty acid oxidation accelerates tau polymerization4,8,22. In the central nervous system and peripheral tissues of AD patients, accumulation of ROS modifies the function and expression of antioxidant enzymes4,8. Also, high levels of DNA strand breaks were found in the hippocampus and cerebral cortex of AD brains4.

Mitochondrial dysfunction causes an increased mtDNA somatic mutation rate, reduced energy metabolism, and increased ROS, and it intensifies the mitochondrial oxidative environment2. The most common forms of oxidative damage observed in AD brains are 8-oxo-2’-deoxyguanosine (8oxodG) and 8-oxo-guanine (8oxoG)4. In the cortex and cerebellum of AD patients compared to controls, significantly higher levels of 8oxodG were observed in the ventricular CSF23. Elevated levels of both forms of oxidatively modified guanine have been demonstrated in the nDNA of AD brains when compared to age-matched controls4. Interestingly, Aβ is an important factor in mitochondrial dysfunction and increases ROS production in AD4. Mitochondrial dysfunction and excessive levels of Aβ can activate the mitochondrial permeability transition pore, leading to the destruction of neurons with defective mitochondria10. Furthermore, it has been demonstrated in the hippocampal neurons of AD patients that Aβ decreases the activity of essential ETC enzymes and alters mitochondrial dynamics4,8. These enzymes are highly susceptible to oxidative damage, and the reduced activity of key enzymes involved in intermediate metabolism is a characteristic of abnormal cerebral glucose utilization7. Mitochondrial-induced OS may play an important role in the progression and pathophysiological changes in the AD brain because neurons and mitochondria are sensitive to OS-inducing mitochondrial dysfunction (Fig. 1).

Fig. 1: Graphical overview of global working hypothesis for risk factors and cellular/molecular processes that contribute to neurodegeneration.

Modifiable and unmodifiable risk factors such as age, genetics, and lifestyle/environmental factors can induce elevated levels of ROS which could lead to mitochondrial and/or metabolic pathophysiology. This pathophysiology can contribute to and exacerbate an oxidative environment, neuroinflammation, and amyloid-beta accumulation that could ultimately promote neurodegeneration. This figure was created with BioRender.com.

Previously, our lab investigated the role of mitochondria in T2D and cognitive impairment in MAs by analyzing blood-based features of mitochondrial abnormalities (i.e., mtDNA copy number and cell-free mtDNA)11. The data suggested that mitochondrial dysfunction, assessed by mtDNA copy number, was closely related to both T2D and cognitive impairment11. Here, our objective was to determine if abnormal mitochondrial function, indicated by oxidative DNA damage, differs between populations (MA vs. NHW), as well as to evaluate the effects of sex, cognitive impairment, and T2D on AD risk. Using Illumina-based next-generation sequencing, we quantified oxidatively modified guanine residues in mtDNA. Our data show that 8oxoG mutational load is significantly higher in MAs than in NHWs and is associated with cognitive function, sex, and education. In particular, the sex effect observed was moderated by population. Stratified analysis of 8oxoG mutational load in MAs suggests significant elevation when comparing MAs with AD to normal controls.

Results

The descriptive statistics of the cohort are provided in Table 1. In both populations, MMSE, CDR sum, and years of education differed significantly between cognitive phenotypes, as expected. Age differed significantly by cognitive diagnosis, and years of education was lower in the MA population. 8oxoG variant count was not significantly correlated with age in the total cohort (Pearson correlation; Supplementary Fig. 3).

Table 1 Descriptive statistics of participants by population group and cognitive phenotype in the Texas Alzheimer’s Research and Care Consortium.

Total 8oxoG variant count is significantly higher in MA females

Total 8oxoG variant count was significantly higher in the MA population compared to NHWs; mean = 7.46 and 5.96, respectively (Fig. 2). In addition, female subjects had a higher 8oxoG variant count than males; mean = 7.06 and 6.43, respectively (Fig. 3). The more comprehensive multiple linear regression model (Table 2) pointed to a significant interaction effect between population and sex related to 8oxoG variant count; p = 0.01458, MA females being higher; p = 0.0297 (Fig. 4); in addition, years of education was identified as a significant factor (positive association). No other variables included in the multiple linear regression model demonstrated associated statistical significance (BMI, APOE, diabetes, cognition, age, population × education).

Fig. 2: 8oxoG variant count is significantly higher in Mexican American population.

a Total 8oxoG variant count was assessed by population using a two-tailed Welch’s t-test (n = 559, t-statistic = 4.794, df = 558). Error bars represent standard error of the mean. b Violin plot showing the distribution of 8oxoG variant counts in Mexican American and non-Hispanic whites (n = 559) with effect size and confidence interval plotted on right y-axis. Dashed lines indicate the mean and dotted lines represent the 1st and 3rd quartile. The triangle represents the difference of the means, and the associated bar indicates the confidence interval.
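As a minimal sketch of the test named in this legend, the Welch statistic and its Welch-Satterthwaite degrees of freedom can be computed from two samples as shown here. The function name `welch_t` and the input counts are hypothetical and purely illustrative; they are not the study data or the authors' analysis code. Note that the Welch-Satterthwaite approximation generally yields non-integer degrees of freedom.

```python
import math
from statistics import mean, variance


def welch_t(a, b):
    """Return (t, df) for Welch's unequal-variance t-test on two samples."""
    n_a, n_b = len(a), len(b)
    # Squared standard error of each group mean (sample variance / n).
    se2_a = variance(a) / n_a
    se2_b = variance(b) / n_b
    t = (mean(a) - mean(b)) / math.sqrt(se2_a + se2_b)
    # Welch-Satterthwaite approximation to the degrees of freedom.
    df = (se2_a + se2_b) ** 2 / (
        se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1)
    )
    return t, df


# Hypothetical per-subject 8oxoG variant counts (illustrative only).
ma_counts = [9, 7, 8, 6, 10, 7]
nhw_counts = [6, 5, 7, 6, 5, 6]

t_stat, df = welch_t(ma_counts, nhw_counts)
print(f"t = {t_stat:.3f}, df = {df:.1f}")
```

A two-tailed p-value would then be read from the t distribution with `df` degrees of freedom, which in practice is usually delegated to a statistics package.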

Fig. 3: 8oxoG variant count is significantly higher in females.

a Sex differences in 8oxoG variant count were determined using a two-tailed Welch’s t-test (n = 559, t-statistic = 1.968, df = 558). Error bars represent standard error of the mean. b Violin plot showing the distribution of 8oxoG variant counts in females and males (n = 559) with effect size and confidence interval plotted on right y-axis. Dashed lines indicate the mean and dotted lines represent the 1st and 3rd quartile. The triangle represents the difference of the means, and the associated bar indicates the confidence interval.

Table 2 8oxoG variant count and cognitive status (NC vs. MCI or AD) multiple linear regression model prediction considering population interaction effect with both sex and education.
Fig. 4: Population-by-sex interaction associated with total 8oxoG variant count shows MA females have elevated 8oxoG counts.

a Bar graph representing total 8oxoG variant count by population and sex as tested using a two-way ANOVA (n = 559, p = 0.0297, F-statistic = 4.75, df = 557) to determine if a population × sex interaction existed. b Interaction plot of predicted 8oxoG variant counts by sex in NHWs and MAs. Error bars represent standard error of the mean.
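To make the population × sex interaction concrete, the following sketch computes it as a difference of differences between cell means. In a 2 × 2 design with dummy-coded population, sex, and their product (and no other covariates), the ordinary least squares interaction coefficient equals this contrast. The model in Table 2 includes additional covariates, so this is a simplification; the function name `interaction_contrast` and all counts are hypothetical.

```python
from statistics import mean


def interaction_contrast(cells):
    """Difference of differences for a 2 x 2 population-by-sex design.

    `cells` maps (population, sex) to a list of per-subject counts.
    """
    m = {key: mean(values) for key, values in cells.items()}
    # Female-minus-male gap within each population.
    sex_gap_ma = m[("MA", "F")] - m[("MA", "M")]
    sex_gap_nhw = m[("NHW", "F")] - m[("NHW", "M")]
    # The interaction is how much the sex gap differs between populations.
    return sex_gap_ma - sex_gap_nhw


# Hypothetical 8oxoG variant counts per cell (illustrative only).
cells = {
    ("MA", "F"): [9, 8, 10],
    ("MA", "M"): [7, 6, 8],
    ("NHW", "F"): [6, 5, 7],
    ("NHW", "M"): [6, 6, 6],
}

contrast = interaction_contrast(cells)
print(contrast)
```

A contrast of zero would mean the sex difference is the same in both populations; a nonzero value indicates that the sex effect depends on population, which is what the interaction term tests.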

In a subsequent multiple linear regression analysis, we investigated the potential interaction between diabetes and cognitive status and did not observe significant effects (Supplementary Table 1). We also derived, for each individual, the count of variants corresponding to 8oxoG “hotspots” (i.e., frequently observed variants at certain locations within the mitochondrial genome), shown in Supplementary Fig. 4. In these “hotspot” analyses, we did not observe the same trends, and thus the metric proved to be generally less informative (Supplementary Figs. 4–5 and Supplementary Tables 6–13). In addition, in the NHW population we observed associations between 8oxoG “hotspot” variant count and APOE status (Supplementary Tables 12–13), which were not observed in the MA population (Supplementary Tables 10 and 11).

Population-specific effects on 8oxoG variant count

As expected from the previous multiple linear regression analyses, 8oxoG variant count was significantly associated with sex (females higher) in MAs, as shown in Table 3. Interestingly, AD cognitive status was marginally associated with 8oxoG variant count in MAs (shaded row, Table 3; bar graph provided in Fig. 5), but this trend was not observed in NHWs (shaded row, Table 4). Two-way ANOVAs in NHWs did not show significance; however, in MAs there was a significant effect of sex, F(1,295) = 5.8, p = 0.0166 (Fig. 5). No other variables were associated with 8oxoG variant count in the MA population. BMI and age were marginally significant (both positive) in association with 8oxoG variant count in NHWs (Table 4); no other variables were associated with 8oxoG variant count. Another intriguing result is the significant positive association of education with 8oxoG variant count that is limited to the MA population (Table 3).

Table 3 Multiple linear regression results for 8oxoG variant count within Mexican Americans.
Fig. 5: 8oxoG variant count by cognitive phenotype in each population.

a Bar graph of 8oxoG count by cognition in NHWs tested using a two-way ANOVA (n = 260). b Bar graph of 8oxoG count by cognition in MAs tested using a two-way ANOVA (n = 299) to determine if a cognition × sex interaction existed in each population. Error bars represent standard error of the mean.

Table 4 Multiple linear regression results for 8oxoG variant count within non-Hispanic whites.

Additional multiple linear regression analyses using cognition as a binary predictive variable (where MCI and AD are combined into cognitive impairment, CI, and NC is normal controls) were conducted (Supplementary Tables 2–5, 7, 9, 11, and 13); the higher 3-category resolution shown in Table 3 (AD/MCI/NC) revealed a potential effect of AD on 8oxoG variant count in MAs, but the effect is not observable in the CI/NC regression analyses since it is diluted by the presence of MCI (Supplementary Table 4).
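The dilution described above can be illustrated with a small sketch: when MCI subjects resemble controls, pooling them with AD subjects into a single CI group shrinks the apparent difference from NC. The helper names `collapse` and `group_means`, and all counts, are hypothetical and chosen only to show the arithmetic; they do not reproduce the study data.

```python
from statistics import mean


def collapse(label):
    """Recode the 3-category diagnosis (AD/MCI/NC) to binary (CI/NC)."""
    return "NC" if label == "NC" else "CI"


def group_means(records, recode=lambda label: label):
    """Mean count per diagnostic group after an optional recoding."""
    groups = {}
    for label, count in records:
        groups.setdefault(recode(label), []).append(count)
    return {label: mean(counts) for label, counts in groups.items()}


# Hypothetical (diagnosis, 8oxoG variant count) pairs (illustrative only).
records = [
    ("NC", 6), ("NC", 7), ("NC", 6), ("NC", 7),
    ("MCI", 6), ("MCI", 7), ("MCI", 6),
    ("AD", 9), ("AD", 8),
]

three_level = group_means(records)
two_level = group_means(records, recode=collapse)

ad_gap = three_level["AD"] - three_level["NC"]  # AD vs. NC
ci_gap = two_level["CI"] - two_level["NC"]      # pooled CI vs. NC
print(ad_gap, ci_gap)
```

In this toy example the pooled CI-versus-NC gap is smaller than the AD-versus-NC gap because the MCI values pull the CI mean toward the control mean.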

Haplogroup-associated elevation and depression of 8oxoG variant count

Based on the Welch two-sided t-test, we observed haplogroup effects on 8oxoG variant burden within the combined cohort. Haplogroups A and C exhibited elevated 8oxoG variant counts (Fig. 6a and Table 5). Conversely, haplogroups I and K exhibited lower 8oxoG variant counts (Fig. 6a and Table 5). For population stratified inference, in the NHW population, using Welch’s t-test, our results demonstrate haplogroup H displayed higher 8oxoG variant counts (Fig. 6b and Table 5). Haplogroup I among NHWs showed reduced 8oxoG variant counts (Fig. 6b and Table 5). In the MA population, Welch’s t-test reported haplogroup L had significantly reduced 8oxoG variant counts when compared to all other haplogroups observed in the MA population (Fig. 6c and Table 5).
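The haplogroup comparisons described here are of the one-versus-rest kind: each haplogroup is contrasted with all other haplogroups pooled. A minimal sketch of that partitioning step follows; the function name `one_vs_rest_means` and the records are hypothetical, and the subsequent Welch's t-test on each partition is omitted.

```python
from statistics import mean


def one_vs_rest_means(records):
    """For each haplogroup, return (mean within group, mean of all others).

    `records` is a list of (haplogroup, count) pairs.
    """
    haplogroups = sorted({hap for hap, _ in records})
    result = {}
    for hap in haplogroups:
        inside = [count for group, count in records if group == hap]
        rest = [count for group, count in records if group != hap]
        result[hap] = (mean(inside), mean(rest))
    return result


# Hypothetical (haplogroup, 8oxoG variant count) pairs (illustrative only).
records = [
    ("A", 9), ("A", 8),
    ("H", 6), ("H", 7),
    ("K", 4), ("K", 5),
]

comparison = one_vs_rest_means(records)
for hap, (inside_mean, rest_mean) in comparison.items():
    print(hap, inside_mean, rest_mean)
```

Because the reference group changes with every haplogroup tested, a haplogroup can appear elevated in one stratum and unremarkable in another simply because the composition of the "rest" differs.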

Fig. 6: 8oxoG variant count by mitochondrial haplogroup.

a Differences in total 8oxoG variant count by mitochondrial haplogroup in the cohort were assessed using Welch’s t-test (n = 560). b Total 8oxoG variant count by mitochondrial haplogroup in NHW participants was assessed using Welch’s t-test (n = 261). c Differences in 8oxoG variant count between mitochondrial haplogroups in the MA population were determined using Welch’s t-test (n = 299). Pink bars indicate significantly higher 8oxoG variant count and blue bars indicate significantly lower 8oxoG variant count. Error bars represent standard error of the mean. The mitochondrial haplogroup tree was illustrated based on the RSRS-oriented mtDNA tree build 17 from PhyloTreemt to include only macrohaplogroups and sub-macrohaplogroups represented in our cohort61.

Table 5 Mitochondrial DNA haplogroup-associated 8oxoG variant count mean within the combined cohort (NHW + MA; n = 560), MAs alone (n = 299), and NHWs alone (n = 261).

Discussion

AD was discovered over a century ago, and through research, our understanding of the disease has exponentially grown. However, there are many gaps in our knowledge, particularly with respect to how this disease affects individuals from different ethnic/racial backgrounds. Our group investigated peripheral levels of mitochondrial 8oxoG, a characteristic of mitochondrial dysfunction, and its association with cognitive impairment, T2D, and comorbidity (cognitive impairment and T2D) within the MA population compared to NHWs. We hypothesized the MA population would demonstrate higher levels of mitochondrial oxidative damage due to the number of comorbid conditions burdening this population, such as cardiovascular disease, diabetes, and depression1. Overall, our results demonstrate that 8oxoG variant count was significantly higher in MAs compared to NHWs, and this effect was largely driven by MA females. In subsequent regression analyses, we observed that 8oxoG variant count is suggestively associated with AD cognitive status (compared to control), particularly in MAs. Intriguingly, this analysis also revealed a positive association of 8oxoG variant count with education, warranting further investigation of biological and/or environmental influencers of 8oxoG.

The level of 8oxoG variant count in the mitochondrial genome was significantly higher in MAs compared to NHWs, which may be because MAs are at increased risk for metabolic disorders. Metabolic syndrome and obesity are associated with increased OS, which can lead to genomic instability such as increased levels of oxidative DNA damage24,25. Metabolic syndrome is a collection of conditions such as deficient glucose tolerance, fatty liver, and increased body weight, adiposity, and triglyceride levels25. Thus, metabolic health risk could account for the observed significant difference in levels of mitochondrial 8oxoG count. Furthermore, base excision repair (BER) is a predominant DNA repair pathway for oxidative DNA damage; failure of this system allows features of genomic instability to persist and accumulate22. Higher 8oxoG levels in MAs may be influenced by differences in DNA repair machinery expression due to the population’s associated metabolic burden and/or population-specific variants that impact DNA repair efficiency.

Interestingly, recent evidence suggests that DNA damage repair is necessary for metabolic health, based on observations that the mtDNA repair glycosylase OGG1, an essential enzyme for BER, may influence metabolic phenotypes under high-fat diet exposure24,25,26. Functional OGG1 prevents obesity and metabolic dysfunction24,25 through altered PGC-1α expression and fatty acid oxidation25; reduced levels of PGC-1α have been reproducibly observed in T2D patients2,27,28 and have been related to increased levels of ROS and decreased levels of β-oxidation enzymes29. The metabolic burden in MAs may be associated with metabolic dysfunction that could alter OGG1 function, causing elevated levels of 8oxoG. Interestingly, a genetic polymorphism in OGG1 (Ser[326]Cys) has been associated with T2D risk in MAs26, further suggesting that an insufficient response to oxidative DNA damage may be implicated in metabolic disease in the MA population.

There were significantly higher 8oxoG counts for MA females compared to MA males. In the literature, there is no clear consensus on whether levels of DNA damage differ significantly by biological sex, which may be due to different sample types, techniques, and/or methods of detection across studies30,31. In 2014, a meta-analysis indicated no sex differences in DNA damage30. Conversely, a recent review determined that men have higher levels than women; however, inconsistency among reports indicates that other factors such as lifestyle may contribute to the sex effect on the prevalence of such lesions31. Furthermore, most studies to date have not explicitly compared oxidative damage among different racial/ethnic groups in an aging population. Elevated 8oxoG variant count in MA females may be partially explained by the fact that MA women have a higher frequency of T2D32. OS and mitochondrial dysfunction are well documented in T2D pathophysiology, and a restrictive diet reduces OS2. In addition, there is accumulating evidence of sex differences in mitochondrial function and activity, and in levels of OS, in an age-dependent manner33,34,35. Silaidos et al. observed that PBMCs from females exhibited significantly higher ATP levels, citrate synthase activity, uncoupled respiration, and ETC complex and system capacity when compared to PBMCs from males33. Recent evidence shows sex hormone status may be involved36. For example, studies of mitochondrial function in mice revealed that younger females display lower OS levels than males and that subsequent ovariectomy limited this apparent protection against DNA damage; the protection was eliminated in aged female mice35. The lack of consistent data in the literature regarding sex differences in aging and age-related diseases emphasizes the need for further work to better understand sex-associated disease risk, especially in ethnic populations that are rapidly expanding.

Our results from multiple linear regression analyses are suggestive of an AD-effect on 8oxoG variant count in the MA population. In the literature, there is accumulating evidence supporting the implication of mitochondrial dysfunction as a primary and/or secondary factor contributing to AD partially because of the significant levels of oxidative damage observed in various organs and tissues of individuals with cognitive impairment4,8. In particular, previous studies report significantly higher levels of 8oxoG and/or DNA damage in patients diagnosed with MCI or AD compared to controls, suggesting that (1) OS and subsequent DNA damage are features of AD pathophysiology, (2) accumulating oxidative DNA damage may be an early marker of AD, and (3) 8oxoG could potentially serve as a biomarker for MCI and/or AD22,37,38,39,40. However, there is little information regarding ethnic/racial differences in levels of oxidative DNA damage, and particularly peripheral levels of 8oxoG in the context of cognitive decline. Here we demonstrate population-specific variation in peripheral levels of mitochondrial oxidative DNA damage—the associations observed in the MA cohort were non-significant in the NHW cohort and had effect sizes in opposite directions; these findings emphasize the importance of future replication studies. As previously mentioned, it is possible that the MA population has a more pronounced effect due to their metabolic burden and potential genetic variation in DNA repair machinery. In addition, cognitive impairment has been well documented in T2D, which increases the risk for AD by two-fold and has been associated with the progression of more severe forms of cognitive impairment7,12,13,14,29. 
Moreover, OS is particularly related to amyloid and tau pathology through stimulating a vicious cycle of pathophysiology that provokes mitochondrial dysfunction and metal toxicity, ultimately resulting in an increased mutational load and a neurotoxic environment that contributes to neuronal loss8,22,40. This accumulating evidence may explain, to an extent, the suggestive association observed between 8oxoG variant count and AD in the MA population. Correspondingly, the stronger association reported in MA females could be attributed to the extended lifespan of women and the age-related decline in sex hormones, which diminishes their protective effects on antioxidant defenses and mitochondrial capacity34,41,42,43. Mitochondria are responsible for steroidogenesis, and their interaction with sex steroids plays an important role in the brain42. Brain levels of sex hormones are known to decline with age; therefore, lifestyle factors, metabolic health, and age may be of particular importance in accounting for the vulnerability of MA females to cognitive decline and associated pathophysiology42.

Interestingly, the positive association of 8oxoG variant count in MAs extended to years of education. Fletcher et al. reported associations between educational attainment and cognition in older age, after controlling for family background and genetic factors, and an interaction demonstrating those with an increased risk for AD mildly benefit from a higher educational background44. Educational attainment has been moderately studied in MAs with evidence indicating the disparity in cognitive impairment and dementia is due to genetic, behavioral, and socioeconomic factors45. Socioeconomic factors were found to be especially important in the disparity, highlighting the inequity in educational attainment among underrepresented or immigrant populations that may contribute to their risk for cognitive decline46. In addition, there are several reports indicating the protective effect of education on cognitive impairment does not entirely translate to MAs47. Data from a previous study suggests MAs may only benefit from cognitive-protective effects when years of education exceeds 12 years (i.e., education beyond high school)47. The reason for the observed positive association of 8oxoG with years of education in MAs is unclear; further studies investigating the effect of educational attainment on cognitive function and the paradoxical increase in 8oxoG in the MA population are warranted.

In the whole cohort, we observed that mitochondrial haplogroups A and C had significantly higher 8oxoG variant counts, while haplogroups K and I showed significantly reduced levels, each independently compared to all other haplogroups. Previous data have shown haplogroup K to have a protective effect against AD in European populations48. The significantly lower 8oxoG variant count in haplogroup K may be related to its apparent low risk for developing AD, which is associated with increased oxidative damage. In the NHW population, haplogroup H was found to have significantly higher 8oxoG variant counts compared to all other haplogroups observed in the population. Established features of European ancestry include mtDNA haplogroups associated with the largest oxygen consumption, ineffective oxygen utilization, and slightly deficient DNA repair capacity, causing elevated levels of ROS that could subsequently cause elevated levels of oxidative DNA damage49. Furthermore, a study recently demonstrated synergism between APOE ε4 carrier status and mitochondrial haplogroup H; when combined, individuals were at higher risk for AD50. Therefore, the elevated 8oxoG variant count exhibited by haplogroup H here was not surprising, given its associated altered mitochondrial capacity. Conversely, in the MA population haplogroup H did not demonstrate a significant elevation in 8oxoG variant count; however, as previously mentioned, the APOE risk allele appears to have less of an effect in the MA population. This observation further suggests that MAs are differentially affected by established risk factors for cognitive impairment compared to their NHW counterparts. Nonetheless, studies investigating mitochondrial haplogroup risk in neurodegeneration are very limited, and thus it is difficult to comment on whether there is evidence to confirm or refute our findings suggesting haplogroup-specific 8oxoG variant count differences (refer to the review by Ienco et al. for a comprehensive assessment of the literature)51. Furthermore, due to the limited sample size (i.e., various haplotypes are observed less in one population compared to the other), our power to detect rare mitochondrial haplotype effects is limited, which presumably explains the lack of overlap between the mitochondrial haplogroups associated with 8oxoG variant count in the two cohorts. Through the historic geographical migration of certain groups and the maternal nature of mtDNA inheritance, there are observed variations in haplotype frequencies between societal-based ethnic/racial groups52. There is accumulating evidence indicating mitonuclear allelic interactions considerably alter the expression of important health-related phenotypes by influencing the quality of oxidative phosphorylation and metabolic function52. Gene flow of the mitochondrial genome differs from that of the nuclear genome, and considering the generation of differing genetic variation throughout populations, it is hypothesized that the course of mitonuclear coadaptation may be population specific52. This is likely relevant to the MA population, as they are considered an admixed population. Therefore, limited overlap in significant results is to be expected when the two populations are analyzed separately, especially in relation to the whole cohort.

While our results are potentially insightful, there are several limitations to note. First, it is important to acknowledge that the methods employed here are indirect measures/indicators of oxidative damage; however, this limitation is difficult to overcome since methods for detecting oxidative damage at per-base resolution specific to mtDNA generally have (1) technical artifacts that arise during library preparation, (2) low sequencing resolution, (3) higher detection limits, and/or (4) a requirement for specific and sensitive enzymes, proteins, or antibodies53. Another obvious limitation is the lack of data regarding metabolic disease in this cohort; our study was limited to self-described diabetes, which is likely an oversimplification given the highly heterogeneous nature of metabolic syndrome in the MA population. Furthermore, the inclusion of additional markers of metabolic health could have helped with establishing an association. In general, it is challenging to interpret these results from a biological/mechanistic perspective, but importantly, they open avenues of research that may prove highly relevant to addressing and resolving MA health disparities in age-related disease, namely, risk for AD.

Future studies will aim to increase the sample size and improve subject characterization of metabolic phenotypes to better resolve causal aspects of oxidative damage in MAs, specifically with respect to female vulnerability. We acknowledge that our data are only suggestive of an association with AD; however, future studies utilizing quantitative cognitive measures such as MMSE, CDR sum, and other measures of neuropsychological testing and cognitive function may improve our power to support the implication. Furthermore, additional biochemical and genetic studies would solidify these results and aid in drawing conclusions. Such studies may include correlation analyses between 8oxoG variant load and expression of DNA repair machinery and ROS response systems, as well as genetic variant analysis of nuclear-encoded DNA repair genes and mitonuclear epistatic effects. Ideally, the studies conducted and proposed here would be recapitulated in matched blood and brain tissue to validate the potential application of these peripheral phenotypes as biomarkers for brain pathology. In addition, future studies will aim to include another population cohort and validate mitochondrial oxidative load using an alternative method such as liquid chromatography-tandem mass spectrometry.

To conclude, the work we present here describes a differential effect of oxidative mitochondrial damage that is associated with cognitive decline among MA females. We also describe a unique approach for sensitive quantification of putative oxidative damage in blood, a highly accessible tissue, and its potential relevance to cognitive aging in MAs. Furthermore, we identify a potential role for mtDNA-based haplogroup risk in 8oxoG accumulation. The systemic elevation of 8oxoG load specifically in MA females may point to an underlying source of risk for cognitive decline in this vulnerable group, revealing avenues for more precise prevention, diagnosis, and treatment of cognitive dysfunction.

Methods

Cohort design and samples

Cohort

TARCC is the Texas Alzheimer’s Research and Care Consortium, a longitudinal collaborative research initiative between ten Texas medical research institutions. The goal of TARCC is to investigate factors involved in the development and progression of AD in the MA population compared to NHWs.

Participants

This study was approved under the University of North Texas Health Science Center IRB #1330309-1; before data collection, informed written consent was obtained from participants (or their legally authorized proxies) to take part in the study and to allow the publication of findings. Aging subjects enrolled in TARCC (N = 559; Table 1) who were diagnosed with AD (n = 104), MCI (n = 127), or normal cognition (n = 328) were selected to optimize matching with respect to age, sex, and T2D distribution across MA and NHW fractions. An annual standardized assessment, which included a medical evaluation, neuropsychological testing, an interview, and a blood draw, was conducted for each participant at one of the five original participating sites. Buffy coat samples from 261 NHWs and 299 MAs with the aforementioned cognitive phenotypes were analyzed in this work.

Measurement of mtDNA mutational load indicative of oxidative damage from buffy coat

DNA extraction

DNA was extracted from 200 μL of buffy coat sample using the Mag-Bind® Blood & Tissue DNA HDQ 96 kit (Omega Bio-tek, Norcross, GA) on the Hamilton Microlab STARlet automated liquid handler (Hamilton Company, Reno, NV).

Whole mtDNA amplification

The whole mitochondrial genome of each sample was amplified using the REPLI-g® Human Mitochondrial DNA kit (Qiagen, Venlo, Netherlands) following the manufacturer’s protocol. This approach is based on phi29 polymerase-driven rolling circle and multiple displacement amplification. The purpose of mitochondrial genome amplification was to increase the amount of mtDNA relative to nuclear DNA, providing enough mtDNA for adequate coverage in whole-genome sequencing.

mtDNA sequencing

The Nextera XT™ DNA Library Preparation kit (Illumina, San Diego, CA) was used to prepare the library for sequencing following the manufacturer’s protocol. The samples were sequenced on the NextSeq 550 Sequencer (Illumina) platform generating paired-end reads of 200 bp with an average read depth of 1855X.

Sequence mapping/alignment and variant calling

Raw mtDNA reads were aligned to the reference genome hg38 via BWA-MEM (v0.7.17) using the default mapping parameters54. The resulting SAM files were processed with SAMtools (v.1.9) to produce BAM files that were sorted by coordinate, indexed, and statistically assessed55. All reads in the processed BAM files were assigned to a single new read group with the Picard tool AddOrReplaceReadGroups (http://broadinstitute.github.io/picard). The Spark implementation of the Picard tool MarkDuplicates, run through GATK4, was then applied to the single read-group BAM files to remove duplicate reads that may have resulted from sample preparation or the sequencing instrument56. The deduplicated BAM files were indexed with SAMtools (v.1.9)55 and used to call somatic mutations with low allelic fractions for each sample, excluding read orientation base qualities below 30. Variants were called with the GATK4 variant caller Mutect2 in its mitochondria mode, which automatically sets parameters for high-depth mitochondrial variant calling56,57.
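The sequence of tools described above can be sketched as a list of command lines. This is only an illustrative outline, not the authors' actual invocation: the file names, read-group values, and the `build_commands` helper are hypothetical, the Picard read-group step is assumed to be run through the `gatk` wrapper, and the base-quality threshold of 30 is deliberately left out because the exact option used is not stated in the text.

```python
from pathlib import Path


def build_commands(sample, reference, fastq1, fastq2, outdir="."):
    """Return the per-sample command lines as lists of arguments.

    Nothing is executed here; each list could be passed to subprocess.run().
    All file names and read-group values are hypothetical placeholders.
    """
    out = Path(outdir)
    sam = str(out / f"{sample}.sam")
    sorted_bam = str(out / f"{sample}.sorted.bam")
    rg_bam = str(out / f"{sample}.rg.bam")
    dedup_bam = str(out / f"{sample}.dedup.bam")
    vcf = str(out / f"{sample}.vcf.gz")

    return [
        # Align paired-end reads with default BWA-MEM parameters
        ["bwa", "mem", "-o", sam, reference, fastq1, fastq2],
        # Coordinate-sort, index, and summarize the alignment
        ["samtools", "sort", "-o", sorted_bam, sam],
        ["samtools", "index", sorted_bam],
        ["samtools", "flagstat", sorted_bam],
        # Assign every read to one read group (values are assumptions)
        ["gatk", "AddOrReplaceReadGroups", "-I", sorted_bam, "-O", rg_bam,
         "--RGID", sample, "--RGLB", "lib1", "--RGPL", "ILLUMINA",
         "--RGPU", "unit1", "--RGSM", sample],
        # Remove (rather than only flag) duplicate reads
        ["gatk", "MarkDuplicatesSpark", "-I", rg_bam, "-O", dedup_bam,
         "--remove-all-duplicates", "true"],
        ["samtools", "index", dedup_bam],
        # Call low-fraction variants in mitochondria mode
        ["gatk", "Mutect2", "-R", reference, "-I", dedup_bam,
         "--mitochondria-mode", "-O", vcf],
    ]


if __name__ == "__main__":
    for cmd in build_commands("S1", "hg38.fa", "S1_R1.fastq.gz", "S1_R2.fastq.gz"):
        print(" ".join(cmd))
```

Keeping the steps as data rather than shell strings makes the order of operations explicit and lets each step be logged or rerun independently.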

Oxidation artifact assessment

Oxidative somatic mutations typically have a low allelic fraction, which reflects their prevalence and can also be affected by tissue heterogeneity (among other factors). To distinguish them from technical artifacts, we used CollectOxoGMetrics from Picard (http://broadinstitute.github.io/picard), a tool that calculates Phred-scaled probability scores based on low allelic frequency, sequence base context, and read orientation to separate alternative basecalls likely resulting from a true variant from those that may result from technical oxidative damage, specifically 8oxoG (Supplementary Fig. 1). Such technical artifacts arise when 8oxoG base-pairs with cytosine or adenine during library preparation, leading to G>T or C>A transversions during PCR amplification (https://support.illumina.com). See Costello et al. for a comprehensive analysis of next-generation sequencing 8oxoG artifact generation and detection58. The text file output for each sample was manually reviewed to exclude technical oxidative artifacts. Thus, before the total 8oxoG variant count was determined, all detected somatic variants for each subject were checked for technical oxidative variants that may have been incorrectly identified as true variants.
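To make the Phred-scaled score concrete, the sketch below reproduces the arithmetic as we understand it from the Picard metric definitions, where the oxidation error rate is max(ALT_OXO_BASES - ALT_NONOXO_BASES, 1) / TOTAL_BASES and the score is -10 * log10 of that rate. The function name and the example counts are hypothetical, and this is not a substitute for running the tool itself.

```python
import math


def oxidation_q(alt_oxo_bases, alt_nonoxo_bases, total_bases):
    """Phred-scaled oxidation score for one sequence context.

    Assumes the Picard-style definition: the excess of alternative bases in
    the artifact-consistent read orientation over the opposite orientation
    (floored at 1), divided by all bases observed in that context.
    """
    if total_bases <= 0:
        raise ValueError("total_bases must be positive")
    error_rate = max(alt_oxo_bases - alt_nonoxo_bases, 1) / total_bases
    return -10.0 * math.log10(error_rate)


if __name__ == "__main__":
    # Hypothetical counts: 110 artifact-orientation vs 10 opposite-orientation
    # alternative bases among one million bases observed in this context
    print(round(oxidation_q(110, 10, 1_000_000), 2))
```

Higher scores correspond to a lower estimated rate of orientation-biased errors, so low-scoring contexts are the ones that warrant manual review.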

Identification of variants indicative of oxidative damage

From the variant call format (VCF) files, we aimed to identify the specific mutational events that would result from oxidative damage to the template mtDNA. Sample VCF files were converted to tab-delimited text files with vcflib, a collection of tools to manipulate and describe sequence variation59. The variant call data were imported into Excel for manual processing to remove indels, transitions, and non-oxidative transversions, leaving only oxidative variants. Oxidative variants were selected based on the mutagenic property of 8oxoG mispairing with adenine, which ultimately results in the signature oxidative transversion mutations (i.e., a G, T, C, or A alternative allele call where the reference allele call was a T, G, A, or C, respectively) shown in Supplementary Fig. 2. The remaining variants indicative of oxidative damage were then further filtered by removing variant calls with a read depth below 250, removing individual SNPs (variants called in >90% of reads), and removing variants whose calls were limited to one orientation (forward or reverse; i.e., requiring coverage from both strands). Variants indicative of oxidative damage were summed for each sample and normalized for read depth (variant count per 1000 read depth) in both populations to test for group differences by cognitive function, sex, T2D, and comorbidity (T2D and cognitive impairment). Oxidative “hotspots” were defined as 8oxoG variant locations that occurred in at least 25 participants in the cohort.
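The filtering and normalization steps above were performed manually in Excel; the following is a minimal programmatic sketch of the same logic, not the authors' procedure. The `Variant` record and its field names are hypothetical, "coverage from both strands" is interpreted here as at least one alternative-allele read in each orientation, and normalization is assumed to be the variant count divided by the sample's mean read depth, times 1000.

```python
from collections import Counter
from dataclasses import dataclass

# (reference, alternative) pairs named in the text as the oxidative signature
OXIDATIVE_PAIRS = {("T", "G"), ("G", "T"), ("A", "C"), ("C", "A")}


@dataclass(frozen=True)
class Variant:
    pos: int             # mtDNA position
    ref: str             # reference allele
    alt: str             # alternative allele
    depth: int           # reads covering the site
    alt_fraction: float  # fraction of reads supporting the alternative allele
    alt_fwd: int         # alternative-allele reads on the forward strand
    alt_rev: int         # alternative-allele reads on the reverse strand


def is_oxidative_candidate(v, min_depth=250, max_fraction=0.90):
    """Apply the filters described in the text to a single variant call."""
    if len(v.ref) != 1 or len(v.alt) != 1:
        return False                      # indel
    if (v.ref, v.alt) not in OXIDATIVE_PAIRS:
        return False                      # transition or other transversion
    if v.depth < min_depth:
        return False                      # insufficient read depth
    if v.alt_fraction > max_fraction:
        return False                      # individual (germline-like) SNP
    return v.alt_fwd > 0 and v.alt_rev > 0  # seen in both orientations


def normalized_load(variants, mean_depth):
    """Oxidative variant count per 1000 read depth (assumed definition)."""
    count = sum(is_oxidative_candidate(v) for v in variants)
    return 1000.0 * count / mean_depth


def hotspots(per_sample, min_participants=25):
    """Positions carrying an oxidative variant in at least min_participants."""
    seen = Counter()
    for variants in per_sample.values():
        seen.update({v.pos for v in variants if is_oxidative_candidate(v)})
    return sorted(pos for pos, n in seen.items() if n >= min_participants)
```

For example, `normalized_load(sample_variants, mean_depth=1855)` would give one sample's load at the average depth reported for this study, and `hotspots({"S1": [...], "S2": [...]})` takes a mapping from sample identifier to that sample's variant calls.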

Haplogroup assessment

To assess whether background mitochondrial variants may be implicated in 8oxoG variant count, we used the NGS sequence data to derive haplogroups for statistical testing of group differences. Variant data were imported into Excel for manual processing to generate a list of individual SNPs for each sample (variants called in >90% of reads). Each individual profile of mtDNA variants was imported into HaploGrep 2 (v.2.4.0), an online haplogroup classification tool60. Haplogroups were defined in our statistical analyses based on the individual’s identified macrohaplogroup or submacrohaplogroup. The sample size for this analysis was n = 560; one additional individual of unknown cognitive phenotype (specified as “other” and omitted from previously described analyses) was included here since this analysis is independent of cognitive phenotype.

Statistical analyses

Statistical analyses were performed using Microsoft Excel, IBM SPSS software (v.24.0), and R software (v. 4.0.3). Welch’s t-test (two-tailed) and two-way ANOVA were performed to compare 8oxoG mutational load between the two population groups and among haplogroups. Multiple linear regression analysis was performed to evaluate the relationship of cognition, sex, age, education, and diabetes with 8oxoG variant count, both within the whole study cohort and in stratified analyses of MAs and NHWs.
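As an illustration of the group comparison, the sketch below computes the Welch t statistic and the Welch-Satterthwaite degrees of freedom using only the Python standard library. The analyses reported here were run in Excel, SPSS, and R; the function name and the example values are hypothetical and are not study data.

```python
import math
from statistics import mean, variance


def welch_t(a, b):
    """Return (t, df) for Welch's unequal-variance t-test on two samples."""
    na, nb = len(a), len(b)
    if na < 2 or nb < 2:
        raise ValueError("each group needs at least two observations")
    # Squared standard error contributed by each group
    se2_a = variance(a) / na
    se2_b = variance(b) / nb
    t = (mean(a) - mean(b)) / math.sqrt(se2_a + se2_b)
    # Welch-Satterthwaite approximation to the degrees of freedom
    df = (se2_a + se2_b) ** 2 / (se2_a ** 2 / (na - 1) + se2_b ** 2 / (nb - 1))
    return t, df


if __name__ == "__main__":
    # Made-up normalized 8oxoG loads for two groups, for illustration only
    group_1 = [1.0, 2.0, 3.0, 4.0, 5.0]
    group_2 = [2.0, 4.0, 6.0, 8.0, 10.0]
    t, df = welch_t(group_1, group_2)
    print(f"t = {t:.3f}, df = {df:.2f}")
```

A two-tailed p-value follows from the t distribution with `df` degrees of freedom; in practice a statistics package (for example `scipy.stats.ttest_ind` with `equal_var=False`, or `t.test` in R) would return it directly.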