Introduction

Leprosy is one of the oldest known human infectious diseases1. Primarily caused by the infectious agent Mycobacterium leprae, but also caused by the infectious agent M. lepromatosis, this chronic granulomatous bacterial disease presents symptoms that mainly involve the skin, the peripheral nervous system and the reticuloendothelial system, but other systems, such as the upper respiratory tract, bones and joints, eyes and adrenal glands can also be affected1,2,3,4. The Ridley–Jopling system classifies leprosy into polar groups: Lepromatous (LL) and Tuberculoid (TT); and unstable borderline spectrum: Borderline Tuberculoid (BT), Borderline Borderline (BB) and Borderline Lepromatous (BL)1,5.

Notably, M. leprae is dependent on the production of energy and nutritional products by the host, resulting from the host–pathogen interaction, undoubtedly involving the main function of mitochondria of energy generation, due to the cellular signaling pathways in which these organelles participate and connect their metabolism to meet their nutrient demands6,7. Furthermore, due to the decay of the M. leprae genome, which leads to strain uniformity, it is suggested that the genetic background of the host, and not bacterial variability, is a central aspect of susceptibility to leprosy1.

In this sense, mitochondria play a key role in the production of energy in the form of Adenosine Triphosphate (ATP), both by the tricarboxylic acid cycle (TCA) and mainly by oxidative phosphorylation (OXPHOS), in which the electrons produced in the TCA are pumped from the mitochondrial matrix into the intermembrane space (IMS) and transferred through the respiratory complexes in the electron transport chain (ETC)8,9,10,11,12. The ETC consists of five protein complexes (I–V) and two electron carriers (ubiquinone and cytochrome c), with Complex I being the main entrance and the largest component of this respiratory chain and of crucial importance in cell metabolism for reducing quinone by oxidation of NADH and regeneration of NAD+ (nicotinamide adenine dinucleotide, oxidized) in the mitochondrial matrix10,12,13,14.

Abnormalities in the formation and function of OXPHOS complexes are associated with genetic mutations and can lead to various defects, including synapse damage, axonal degeneration, excessive ROS production, apoptosis and cell death, influencing the process of complex diseases, such as cancer, neurodegenerative diseases and infectious diseases6,10,15.

In fact, variations specifically in Complex I genes encoded by mtDNA have been associated with several diseases in humans, such as metabolic diseases, neurodegenerative disorders and bacterial infections14,16. Furthermore, it is important to mention that different cells have a variable number of mtDNA copies, with the presence of two or more genotypes of the same mutation in varying proportions15,17. This phenomenon is called mitochondrial heteroplasmy and, despite being a normal part of healthy human biology, it is also relevant in pathological processes18,19.

Importantly, these mutations and their levels of heteroplasmy may influence the host response to leprosy, leading to differences in the evolution of this infection. In this study, we investigated the presence of variants in mtDNA genes encoding Complex I regarding leprosy, in search of new potential biomarkers of development and progression of this disease, while also assessing the nuclear and mitochondrial genomic ancestry of our cohort.

Results

Genomic ancestry

It is widely known that the Brazilian population is ethnically diverse, so we analyzed the genomic ancestry composition of our cohort through a set of nuclear markers previously developed by our research group20,21. We observed a similar pattern of nuclear genomic ancestry between case and control (Fig. 1), with European contribution more present in both groups (EUR; case mean = 0.613; control mean = 0.558), followed by Native American (NAT; case mean = 0.234; control mean = 0.246) and African (AFR; case mean = 0.136; control mean = 0.236) contributions. When comparing the paired case and control ancestries, there was no significant difference between the groups; however, when the ancestries were compared to each other in the case group, all of them were statistically significant (Mann–Whitney test, NAT vs. EUR, p-value = 4.26e−09; NAT vs. AFR, p-value = 1.08e−02; EUR vs. AFR, p-value = 7.98e−11), which demonstrates heterogeneity in this sample group. In addition, although our cohort is made up of an intrinsically admixed population, both cases and controls present similarly distributed ancestry, making this a good endemic control for further analysis. Recently, we demonstrated that, by analyzing the mitochondrial ancestry in the same cohort, the European haplogroup H2 was the most frequent in both groups; however, the Native American haplogroups A, B, C and D together accounted for nearly half of the maternal ancestry contribution22.

Figure 1
figure 1

Distribution of the three nuclear ancestries (African-AFR, European-EUR and Native American-NAT) was found in both case and control groups.

Then, considering the mitochondrial ancestry, we analyzed the distribution of mitochondrial variants of Complex I genes in the three leprosy subtypes and the control group, filtering the mutations for those with heteroplasmy > 0.05 and < 0.95 (Fig. 2). It is possible to observe a different pattern between the number of heteroplasmic variants among leprosy subtypes. Despite European ancestry being the most frequent in both groups, it did not correlate with the proportion of heteroplasmic variants between the subtypes and the control group.

Figure 2
figure 2

Number of Complex I variants in the three ancestries (African-AFR, European-EUR, Native American-NAT) in each leprosy subtype (BL, BT, and LL) and control group.

Finally, we stratified the macrohaplogroups belonging to the control group and the case subtypes (BT, BL, LL) and analyzed the levels of heteroplasmy of the variants in both groups according to each macrohaplogroup (Fig. 3). After this processes, it was possible to observe that the BT subgroup presented a higher level of heteroplasmy in the macrohaplogroups A and L3 that belong to Native American and African ancestry, respectively, in addition to presenting a more prominent heteroplasmic profile than the BL and LL subgroups, which, in turn, presented a slightly more similar pattern to each other. This heterogeneity between the subgroups suggests that the levels of mitochondrial heteroplasmy may differ depending on the polarization of the clinical forms of leprosy, thus being able to influence the oxidative process and the development of the disease.

Figure 3
figure 3

Heteroplasmy levels of variants according to each macrohaplogroups in the control group and the case subgroups (BT, BL, and LL).

Distribution and characterization of mitochondrial variants

Furthermore, we analyzed the number of variants that were exclusive to the leprosy poles (Pole L being constituted by subtypes BL and LL; Pole T consisting of subtype BT) and to control, as well as those shared between such groups, that is, the intersection between groups, according to the heteroplasmy filtering of > 0.05 and < 0.95 (Fig. 4). Interestingly, the control contains the higher number of exclusive variants (n = 86), which might be protective against leprosy development, and 74 variants were exclusive of only one leprosy pole (either Pole T or Pole L), suggesting that these variants might influence the onset of specific clinical forms of leprosy.

Figure 4
figure 4

Distribution of found mitochondrial variants in the healthy control group and the leprosy poles (Pole T, and Pole L). Each dark dot indicates the group with the respective number of variants and each line represents the intersection between groups. The set size is the overall number of variants.

Remarkably, when analyzing the predictions of the pathogenicity of the mutations exclusive to one of the poles (Pole T = 42, Pole L = 32), it is possible to observe a predominance of synonymous mutations in leprosy poles (Fig. 5). It is interesting to note that there is a more generalized distribution of mutations by genes in the pole T, with MT-ND1, MT-ND5 and MT-ND6 presenting the greatest number of variants (Fig. 5A). Furthermore, when analyzing the mutations per gene, MT-ND4 and MT-ND5 stand out for having a greater number of variants in pole L when compared to the other analyzed Complex I genes, indicating that mutations in these genes may particularly influence susceptibility to this disease (Fig. 5B).

Figure 5
figure 5

Distribution of shared variants in leprosy poles by gene with their predicted consequence. (A) Variants found only in the leprosy Pole T. (B) Variants found only in the leprosy Pole L.

Notably, six variants were shared between both clinical poles of leprosy, but not by the control (Fig. 4), suggesting these variants might play a role in leprosy development, regardless of the subtype. The pathogenicity characterization of these variants is shown in Table 1. Five variants were characterized as synonyms and one as missense (MT-ND1).

Table 1 In silico pathogenicity characterization of the variants found only in the leprosy poles (Pole T and Pole L).

Discussion

Although leprosy is considered a relevant public health problem in multiple countries, it is still understudied and neglected, especially concerning mitochondrial genetic aspects and, particularly, in the northern region of Brazil22,23. The Brazilian population is highly mixed, with a genetic contribution from different ethnic groups, mainly European, Native American and African groups24,25. In this perspective, the introduction of leprosy in Brazil may be associated with the period of colonization of the country by the Portuguese and, to a lesser extent, by the Dutch, French and Spanish, as well as the slave trade from Africa26,27,28.

Extensive research indicates with a high level of confidence that the ancestral genetic composition significantly influences the susceptibility and/or resistance to various diseases, including cancer and diabetes, and it is strongly associated with the immune response to infectious diseases, including leprosy6,8,26. In addition, in case–control association studies, genetic ancestry is of great relevance in mixed populations, given that this genetic diversity can influence genotypic distribution and phenotypic heterogeneity due to population stratification24.

Importantly, our findings on nuclear and mitochondrial ancestry corroborate the historical processes of colonization and population formation in northern Brazil, with a prevalence of European and Native American ancestry in the studied group (Fig. 1)22,29. Da Silva et al. demonstrated in their study that the contribution of different ethnic groups in the genetic composition of the Amazonian population can influence the risk of developing leprosy, with European ancestry having a great influence on this susceptibility24. Furthermore, Pinto et al., when analyzing different INDEL markers for nuclear ancestry, indicated that an increase in the European contribution to the population increases the risk of developing leprosy, while an increase in the African contribution decreases the risk of developing leprosy, and the analysis of Native American contribution did not result in any statistically significant difference27.

However, mitochondrial ancestry is still an interesting matter to be debated, as there are still few studies on whether mitochondrial haplogroups can confer genetic susceptibility to mycobacterial infection. One of the only studies in this regard showed that, in the Han Chinese population, haplogroups did not confer susceptibility to leprosy6. However, in our study, it was possible to observe that the Native American macrohaplogroup was more prevalent in all subtypes of the disease, which suggests that this ancestry may influence the development of distinct leprosy subtypes, particularly BL after M. leprae infection22. Furthermore, the different profiles of heteroplasmy found in our study between mitochondrial genes in the case subgroups (especially BT) and the control group suggest that heteroplasmy can influence the dysregulation of mitochondrial oxidative processes and the type of host immune response to M. leprae, dividing them into the clinical forms of leprosy22,30,31,32.

Regarding Complex I of OXPHOS, Tió-Coma et al. analyzed gene expression by RNA sequencing in blood samples between patients before the presence of leprosy symptoms and contact control patients, as well as differential longitudinal gene expression between each patient. Among the results of that study, five mitochondrial genes (MT-ND2, MT-ND4, MT-ND5, MT-CO1 and MT-CYB) were found involved in OXPHOS and negatively regulated in leprosy patients, thus presenting a disadvantage for the successful elimination of M. leprae33. It is worth noting that three of these genes (MT-ND2, MT-ND4, MT-ND5) encode Complex I, with MT-ND2 and MT-ND6 being essential subunits for the formation of the NADH dehydrogenase respiratory chain in the mitochondrial membrane, playing a critical role in oxidative phosphorylation33.

It is necessary to highlight that several studies indicate that variants in mtDNA genes are involved in various infectious processes, cancer and neurodegenerative diseases9,34. In the study by Tonsing et al., the authors analyzed the entire mitochondrial genome of blood samples from patients with tuberculosis to understand the mtDNA variants that may predispose the analyzed population to the disease, and observed a total of 83 variants in non-coding regions, in addition to two mutations with deleterious/damaging effects on MT-ND2 genes (m.4824G>A) and MT-ND6 (m.14180C>T)35. In addition, these variants are close to a highly conserved position of the protein, which can cause an empty space inside the molecule, breaking hydrogen bonds and disturbing the correct folding of the protein35. In our study, we found the same mutations, with the m.4824G>A observed in two patients with the BT subtype and one in the control group, which may indicate that, in the case of M. leprae infection, that mutation could influence susceptibility to the disease due to its prevalence in the case group.

Importantly, as seen in Table 1, one of the variants that were shared between all leprosy poles, but not found in the control, is a missense variant with a benign prediction. The other five variants are synonymous, and they are present in MT-ND5. Although most of these variants have identification in the dbSNP, as far as we know, there are no previous reports in the global literature on these six variants, nor reports on the ClinVar database in association with other diseases.

Here, we investigated variants in mitochondrial genes encoding Complex I subunits in leprosy patients and contact controls from a Brazilian population. It was possible to observe a significant number of shared mutations by all groups and mutations present only in the case group, suggesting that mitochondrial dysfunction, highlighted by the energy generation of OXPHOS, is implicated in the pathogenesis of leprosy and the viability of M. leprae in the host.

In addition, in the analyses of the genomic ancestry of this population, it was possible to observe that the mitochondrial haplogroups, mainly the European and African haplogroups, could influence the increased risk or protection against leprosy, respectively. This is particularly important because, to date, there are no studies in the literature that associate mitochondrial ancestry with the development of leprosy. Thus, more clinical studies with larger cohorts should be carried out, using robust statistical tests, to investigate the mitochondrial genetic profile of individuals affected by the disease and the influence of the mitochondrial genes that encode OXPHOS complexes, to expand the understanding of the pathogenesis of M. leprae and suggest potential biomarkers for early detection of leprosy.

Methods

Sampling

Blood samples were obtained from patients affected by leprosy (case group), composed of borderline lepromatous (BL) leprosy (n = 12), lepromatous (LL) leprosy (n = 11) and borderline tuberculoid (BT) leprosy (n = 10), in addition to unrelated household individuals (n = 37, control group), all residents of Pará state, Brazil. For each patient, both case and control groups, one blood sample (5 mL) was collected. The clinical and laboratory diagnosis of the patients was established by the parameters previously described5. This study adhered to the Declaration of Helsinki and was approved by the Ethics Committee of the Institute of Health Sciences at the Federal University of Pará (CEP-ICS/UFPA n. 197/07), and all participants read and signed an informed consent form.

DNA extraction and quantification

Total DNA was extracted by phenol–chloroform method based on Sambrook et al.36. Quantification to verify the concentration and purity of the extracted material was performed with NanoDrop 1000 spectrophotometer (Thermo Fisher Scientific, Wilmington, DE, USA).

Amplification and sequencing

Amplification of mtDNA from the total DNA was performed by conventional PCR with specific primers, as described by Cavalcante et al.8, to cover the entire mitochondrial genome. To verify the quality of the amplification, the samples were applied to a 1% agarose gel and, later, measured in a Qubit 2.0 fluorometer for the library preparation (Thermo Fisher Scientific).

The sequencing of the complete mitochondrial genome was previously described by our research group22. All samples (n = 70) were qualified for the whole mitochondrial genome sequencing. Nextera XT DNA Library Preparation Kit (Illumina Inc., Chicago, IL, USA) was used to prepare the libraries and MiSeq Reagent Kit V3 (600-cycles) (Illumina) for sequencing on the MiSeq System (Illumina), according to the manufacturer's instructions. During the preparation of the libraries, High Sensitivity D1000 ScreenTape was used on the Agilent 2200 TapeStation System (Agilent Technologies, Santa Clara, CA, USA) to assess the quality of the genetic material.

Bioinformatics and statistical analyses

After sequencing, we updated the pipeline for bioinformatics analysis previously described8. Paired-end sequencing reads were trimmed to remove leading and trailing low quality and to scan reads with a 3-base wide sliding window. After trimming, the reads were aligned with the human reference mtDNA sequence (Revised Cambridge Reference Sequence, rCRS) and used to map and classify sequences. For the SNP calling, annotation and heteroplasmy detection, we used mutserve for each sample with the quality parameters as previously described22. Finally, for this study, we selected only the seven mitochondrial genes that encode Complex I proteins to further investigate the mutation profile: MT-ND1, MT-ND2, MT-ND3, MT-ND4, MT-ND4L, MT-ND5 and MT-ND6.

The analysis of the prediction of the pathogenicity of the mutations was performed using Mitochondrial Disease Sequence Data Resource (MSeqDR, https://mseqdr.org/index.php)37, the Genome Aggregation Database (gnomAD, https://gnomad.broadinstitute.org/)38, the Single Nucleotide Polymorphism database (dbSNP, https://www.ncbi.nlm.nih.gov/snp/)39 and the ClinVar database (https://www.ncbi.nlm.nih.gov/clinvar/)40.

To analyze the genomic consequence, impact, and pathogenicity prediction of these variants, MSeqDR was used. As a double check and for lack of information on the variants on the platform described above, they were also sought on the gnomAD and dbSNP platforms. Finally, once the prediction of mutations was inferred, their clinical consequences were analyzed based on the data currently available at ClinVar.

All statistical analyses were performed in R (R Core Team)41 or JASP (JASP Team, 2022)42. R packages ggplot243 and UpSetR44 were used for graphic representations. In all analyses, the p-value was considered statistically significant when ≤ 0.05.

Ethics approval and consent to participate

The study was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee of Institute of Health Sciences at the Federal University of Pará (CEP-ICS/UFPA n. 197/07), and all participants read and signed an informed consent form.

Inclusion and diversity

We support inclusive, diverse, and equitable conduct of research.