Main

Biologists have long been fascinated by the question of how new species are formed and the mechanisms that maintain isolation between them. One key factor in the formation and maintenance of new species is the emergence of genetic incompatibilities that reduce viability or fertility in hybrids1. As originally described by the Dobzhansky–Muller model of hybrid incompatibility2,3, the unique sets of mutations that accumulate in diverging species may interact poorly when they are brought together in hybrids, given that they have never been tested against one another by selection. Owing to the technical challenges of identifying these interactions4, only around a dozen genes involved in hybrid incompatibilities have been precisely mapped5 and exploration of the functional and evolutionary causes of hybrid incompatibilities has been limited to a small number of cases in model organisms4.

This knowledge gap leaves key predictions about the evolutionary processes that drive the emergence of hybrid incompatibilities untested. For one, theory suggests that incompatibilities should be more common within dense gene networks, both because genes involved in such interactions are expected to be tightly co-evolving and because the number of potentially incompatible genotypes explodes as the complexity of the genetic interaction increases7,8. Consistent with this prediction, mutagenesis experiments have highlighted the sensitivity of multi-protein interactions to changes in any of their components8. However, genetic interactions are notoriously difficult to detect empirically9, and this problem is exacerbated with complex genetic interactions10. Such technical challenges may explain the rarity of incompatibilities involving three or more genes in the empirical literature8 (but see refs. 9,11,12,13,14).

Another open question is the degree to which the genes that become involved in hybrid incompatibilities are predictable from their molecular or evolutionary properties. Researchers have proposed that rapid molecular evolution will increase the rate at which incompatibilities accumulate between species4,5,6. Although several known incompatibilities involve genes showing signatures of positive selection, it is unclear how unusual rates of protein evolution are in these genes relative to the genomic background5,6. The mitochondrial genome, in particular, has been proposed as a hotspot for the accumulation of genetic incompatibilities15,16, owing to substitution rates up to 25 times higher than the nuclear genome in many animals17,18 and the potential for sexually antagonistic selection driven by its predominantly maternal inheritance19,20, among other factors21. At the same time, nuclear and mitochondrial proteins must interact with each other in key steps of ATP synthesis, increasing the likelihood of coevolution between these genomes22,23. These factors suggest that interactions between mitochondrial- and nuclear-encoded proteins could have an outsized role in the emergence of hybrid incompatibilities15, consistent with results from numerous species24,25,26.

As we begin to identify the individual genes underlying hybrid incompatibilities, the next frontier is evaluating the processes that drive their evolution. Over the past two decades, it has become abundantly clear that hybridization is exceptionally common in species groups where it was once thought to be rare27,28. As a result, it is now appreciated how frequently species derive genes from their relatives29,30,31. The effects of historical hybridization on the evolution of hybrid incompatibilities have been poorly investigated32, since the foundational theory in this area was developed before the ubiquity of hybridization was fully appreciated7.

Here we use an integrative approach to precisely map the genetic basis and physiological effects of a lethal mitonuclear hybrid incompatibility in swordtail fish and uncover its evolutionary history. The sister species Xiphophorus birchmanni and Xiphophorus malinche began hybridizing approximately 100 generations ago in multiple river systems33 after premating barriers were disrupted by habitat disturbance34, and are a powerful system to study the emergence of hybrid incompatibilities in young species. Despite their recent divergence35 (around 250,000 generations; 0.5% divergence per basepair), some hybrids between X. birchmanni and X. malinche experience strong selection against incompatibilities35,36. One incompatibility that causes melanoma has been previously mapped in this system and population genetic patterns suggest that dozens may be segregating in natural hybrid populations35,36,37,38. Moreover, the ability to generate controlled crosses39,40 and the development of high-quality genomic resources38,41 makes this system particularly tractable for studying hybrid incompatibilities in natural populations. Leveraging data from controlled laboratory crosses and natural hybrid populations, we pinpoint two nuclear-encoded X. birchmanni genes that are lethal when mismatched with the X. malinche mitochondria in hybrids, explore the developmental and physiological effects of this incompatibility, and trace its evolutionary history.

Mapping mitonuclear incompatibilities

To identify loci under selection in X. birchmanni × X. malinche hybrids, we generated approximately 1× low-coverage whole-genome sequence data for 943 individuals from an F2 laboratory cross and 359 wild-caught hybrid adults, and applied a hidden Markov model to data at more than 600,000 ancestry-informative sites along the genome to infer local ancestry (approximately 1 informative site per kilobase37,42; Methods and Supplementary Information 1.1.11.1.4). Using these results, we found evidence for a previously unknown incompatibility between the nuclear genome of X. birchmanni and the mitochondrial genome of X. malinche (Supplementary Information 1.1.51.1.10). Our first direct evidence for this incompatibility came from controlled laboratory crosses (Methods and Supplementary Information 1.1.1). Because the cross is largely unsuccessful in the opposite direction, all laboratory-bred hybrids were the offspring of F1 hybrids generated between X. malinche females and X. birchmanni males and harboured a mitochondrial haplotype derived from the X. malinche parent species. Offspring of F1 intercrosses are expected to derive on average 50% of their genome from each parent species. This expectation is satisfied genome wide and locally along most chromosomes in F2 hybrids (on average 50.3% X. malinche ancestry; Supplementary Fig. 1). However, we detected six segregation distorters genome wide40, with the most extreme signals falling along a 6.5 Mb block of chromosome 13 and a 4.9 Mb block of chromosome 6 (Fig. 1a,d).

Fig. 1: Admixture mapping pinpoints mitonuclear incompatibility in Xiphophorus.
figure 1

a, Average ancestry of F2 hybrids on chromosome (chr.) 13 reveals a large region of segregation distortion towards X. malinche ancestry. The region shaded blue shows the 99% quantiles of X. malinche ancestry at all ancestry informative sites genome wide. The dashed line represents expected X. malinche ancestry for this cross. The purple arrow points to the position of ndufs5. b, Results of ABC simulations estimating the strength of selection on X. malinche mitochondria combined with X. birchmanni ancestry at ndufs5. Shown is the posterior distribution from accepted simulations; the vertical line indicates the maximum a posteriori estimate for selection coefficient (s). c, Observed genotype frequencies of different genotype combinations of ndufs5 and mitochondrial haplotypes in admixture mapping population. d, Average ancestry of F2 individuals on chromosome 6 reveals a large region of segregation distortion towards X. malinche ancestry. The region shaded blue shows the 99% ancestry quantiles and expected ancestry as in a, and the red arrow points to position of ndufa13. e, Results of ABC simulations estimating the strength of selection on X. malinche mitochondria combined with X. birchmanni ndufa13, as in b. f, Observed genotype frequencies of different genotype combinations of ndufa13 and mitochondrial haplotypes in the admixture mapping population. g, Admixture mapping of associations between nuclear ancestry and mitochondrial haplotype in natural hybrids using a partial correlation approach, controlling for genome-wide ancestry. The blue line indicates the 10% false-positive rate genome-wide significance threshold determined by simulations. The peak visible on chromosome 15 is driven by interactions with the X. birchmanni mitochondria and an unknown nuclear gene (Supplementary Information 1.1.10 and 1.2.1). h, Schematic of identified interactions with the X. malinche mitochondrial genome from mapping data. The dashed line indicates a subtle interaction between ndufs5 and ndufa13 (see text, Fig. 2 and Supplementary Information 1.2.5).

Closer examination of genotypes in the chromosome 13 region showed that almost none of the surviving individuals harboured homozygous X. birchmanni ancestry in a 3.75 Mb subregion (Fig. 1c and Supplementary Fig. 2; 0.1% observed versus 25% expected). This pattern is unexpected even in the case of a lethal incompatibility involving only nuclear loci (see Supplementary Information 1.1.1), but is consistent with a lethal mitonuclear incompatibility. Using approximate Bayesian computation (ABC) approaches we inferred the strength of selection against X. birchmanni ancestry in this region that was consistent with the observed genotypes and ancestry deviations. We estimated posterior distributions of selection and dominance coefficients and inferred that selection on this genotype in F2 is largely recessive and essentially lethal (maximum a posteriori estimate h = 0.12 and s = 0.996, 95% credible interval h = 0.010–0.194 and s = 0.986–0.999; Fig. 1b, Extended Data Fig. 1, Methods and Supplementary Information 1.2.11.2.2).

The degree of segregation distortion observed in F2 individuals on chromosome 6 is also surprising (Fig. 1d). Only 3% of individuals harbour homozygous X. birchmanni ancestry in this region (compared with 0.1% in the chromosome 13 region and 25% on average at other loci across the genome; Fig. 1f), which is again lower than expected for a nuclear–nuclear hybrid incompatibility (Supplementary Information 1.1.1). ABC approaches indicate that selection on homozygous X. birchmanni ancestry on chromosome 6 is also severe (maximum a posteriori estimate h = 0.09 and s = 0.91, 95% credible interval interval h = 0.01–0.21 and s = 0.87–0.94; Fig. 1e, Extended Data Fig. 1 and Supplementary Information 1.2.2). Thus, our F2 data show that homozygous X. birchmanni ancestry in regions of either chromosome 13 or chromosome 6 is almost completely lethal in hybrids with X. malinche mitochondria (Fig. 1h).

To formally test for the presence of a mitonuclear incompatibility involving chromosome 13 and chromosome 6, or elsewhere in the genome, we leveraged data from natural hybrid populations. Most naturally occurring X. birchmanni × X. malinche hybrid populations are fixed for mitochondrial haplotypes from one parental species (Supplementary Information 1.1.2 and 1.1.6). However, a few populations segregate for the mitochondrial genomes of both parental types, and we focused on one such population (the ‘Calnali low’ population, hereafter referred to as the admixture mapping population). Admixture mapping for associations between nuclear genotype and mitochondrial ancestry (after adjusting for expected covariance due to genome-wide ancestry36) revealed two genome-wide significant peaks and one peak that approached genome-wide significance (Fig. 1g and Supplementary Tables 13). The strongest peak of association spanned approximately 77 kb and fell within the region of chromosome 13 identified using F2 crosses (Fig. 1g). This peak was also replicated in another hybrid population (Methods, Supplementary Fig. 3 and Supplementary Information 1.1.5) and contains only three genes: the NADH dehydrogenase ubiquinone iron–sulfur protein 5 (ndufs5), E3 ubiquitin–protein ligase, and microtubule–actin cross-linking factor 1. Of these three genes, only ndufs5 forms a protein complex with mitochondrially encoded proteins, which along with other evidence implicates it as one of the interacting partners in the mitonuclear incompatibility (Fig. 1c, Extended Data Fig. 2 and Supplementary Fig. 4; see Supplementary Information 1.1.61.1.9).

We also identified a peak on chromosome 6 that approached genome-wide significance (Fig. 1g, Supplementary Fig. 5, Supplementary Table 2 and Supplementary Information 1.1.10) and fell precisely within the segregation distortion region previously mapped in F2 hybrids (Fig. 1d and Supplementary Information 1.1.1). This peak contained 20 genes, including the mitochondrial complex I gene ndufa13 (Extended Data Fig. 2, Supplementary Fig. 5, Supplementary Information 1.1.10 and Methods). Depletion of non-mitochondrial parent ancestry at ndufa13 was unidirectional (Fig. 1f), consistent with selection acting only against the combination of the X. malinche mitochondria with homozygous X. birchmanni ancestry at ndufa13 (see Supplementary Information 1.2.31.2.4). Genomic analyses in natural hybrid populations confirmed this asymmetry (Extended Data Fig. 2).

Together, these results indicate that at least two X. birchmanni nuclear genes cause incompatibility when they are mismatched in ancestry with the X. malinche mitochondria (Fig. 1h and Supplementary Information 1.2.5). These genes, ndufs5 and ndufa13, belong to a group of proteins and assembly factors that form respiratory complex I (ref. 43) (see Supplementary Table 1 for locations of the 51 annotated complex I genes in the Xiphophorus genome). Complex I is the first component of the mitochondrial electron transport chain that ultimately enables the cell to generate ATP. Both nuclear proteins interface with several mitochondrially derived proteins at the core of the complex I structure, hinting at the possibility that physical interactions could underlie this multi-gene mitonuclear incompatibility.

Interactions with X. birchmanni mitochondrial DNA

Admixture mapping analysis also identified a strong peak of mitonuclear association on chromosome 15, which we briefly discuss here and in Supplementary Information 1.1.10 and 1.2.1. This peak was associated with X. birchmanni mitochondrial ancestry (Extended Data Fig. 3), indicating that it has a distinct genetic architecture from the incompatibility involving the X. malinche mitochondria and X. birchmanni ndufs5 and ndufa13. Specifically, analysis of genotypes at the admixture mapping peak indicates that X. birchmanni mitochondrial ancestry is incompatible with homozygous X. malinche ancestry on chromosome 15 (Fig. 1c and Extended Data Fig. 3). This region did not contain any members of complex I, but dozens of genes in this interval interact with known mitonuclear genes (see Supplementary Table 3 and Supplementary Information 1.1.10). The fact that we detect incompatible interactions with both the X. malinche mitochondria (at ndufs5 and ndufa13) and the X. birchmanni mitochondria (ndufs5 and chromosome 15) in our admixture mapping results supports the idea that mitonuclear interactions can act as ‘hotspots’ for the evolution of hybrid incompatibilities15.

Lethal effects in early development

The combination of X. birchmanni ndufs5 or ndufa13 with the X. malinche mitochondria appears to be lethal by the time individuals reach adulthood. To investigate the developmental timing of the incompatibility, we genotyped pregnant females from the admixture mapping population and recorded the developmental stages of their embryos44 (swordtails are livebearing fish; Methods). We found a significant interaction between developmental stage and ndufs5 genotype, whereas ndufa13 genotype did not affect developmental stage (Fig. 2a,b, Supplementary Figs. 69 and Supplementary Information 1.3.1). Genotyping results revealed that embryos with homozygous X. birchmanni ancestry at ndufs5 and X. malinche mitochondria are present at early developmental stages, but that these embryos did not develop beyond a phenotype typical of the first seven days of gestation (the full length of gestation is 21–28 days in Xiphophorus; Fig. 2a,b,d,e). Individuals with mismatched ancestry at ndufs5 whose siblings were fully developed still had a detectable heartbeat but had consumed less yolk than their siblings and remained morphologically underdeveloped (Fig. 2d, Extended Data Fig. 4 and Supplementary Figs. 1014). Unlike other species, in Xiphophorus this developmental lag could itself cause mortality, since embryos that do not complete embryonic development inside the mother do not survive more than a few days after birth (Supplementary Information 1.3.1 and Supplementary Table 4). Given that complex I inhibition lethally arrests development in zebrafish embryos45,46, we also tested the effects of complex I inhibition on X. birchmanni and X. malinche fry, and found a similar level of sensitivity (Supplementary Information 1.3.2).

Fig. 2: Effect of incompatibility on Xiphophorus hybrid embryos.
figure 2

a, Developmental stage and ndufs5 genotypes of hybrid embryos with X. malinche (Xm) mitochondria. b, Developmental lag of embryos with X. malinche mitochondria with varying ndufs5 genotypes compared with their most developed broodmate. c, Frequency of homozygous X. birchmanni ndufa13 ancestry over F2 hybrid development. Dots and lines represent observed frequency ± 2 × s.e.m. (n = 208 embryos, 74 juveniles, 932 adults). d, F2 siblings showing different phenotypes as a function of ndufs5 genotype: a heterozygote (top) and an ndufs5-incompatible sibling with matched scale (bottom left) and magnified (bottom right). e, Frequency of homozygous X. birchmanni ancestry along chromosome 13 in embryos with X. malinche mitochondrial DNA (mtDNA) that lagged their siblings by at least 1 developmental stage (red) versus those without developmental lag (blue) (Supplementary Information 1.1.7). Dashed line indicates ndufs5 location. Corresponding analyses of chromosomes 6 and 15 are shown in Supplementary Figs. 79. fh, Respiratory and morphometric measurements in F2 embryos. To control for length, residuals of each variable regressed against length are plotted (Supplementary Information 1.3.4 and 1.3.5). Grey dots denote individual measurements, coloured points and bars show mean ± 2 × s.e.m. for each genotype, and brackets with asterisks denote significant differences from Tukey’s honest significant difference test. f, Relationship between ndufs5 genotype and respiration rate in hybrids (n = 40 X. birchmanni, 102 heterozygotes, 47 X. malinche). Xb, X. birchmanni. g, Relationship between ndufa13 genotype and heart rate in hybrids (n = 39 X. birchmanni, 95 heterozygotes, 46 X. malinche). h, Relationship between ndufa13 genotype and width of sinu-atrium, a peristaltic canal between the yolk and embryonic atrium, in hybrids (n = 37 X. birchmanni, 82 heterozygotes, 42 X. malinche). i, Frequency of heterozygotes at ndufs5 among juveniles and adults of varying ndufa13 genotypes. Dots and lines represent observed frequency ± 2 × s.e.m. The dashed line represents expected frequency under additive selection (Supplementary Information 1.2.5).

In contrast to individuals with mismatched ancestry at ndufs5, those with ndufa13 mismatch survived embryonic development but suffered mortality in the early post-natal period (Fig. 2c). We tracked 74 F2 fry from 24 h post birth to adulthood (Supplementary Information 1.3.3). We found that most fry with incompatible genotypes at ndufa13 had already suffered mortality by the time tracking began, with only 7 individuals found 24 h post birth that were homozygous X. birchmanni at ndufa13 (versus 19 expected; binomial P = 0.0005). No natural mortality was observed between 1 day and 3 months post birth (Supplementary Information 1.3.3).

Physiology and complex fitness effects

Our analysis of developing embryos indicates that individuals with the ndufs5 incompatibility exhibited abnormal embryonic development, whereas those with the ndufa13 incompatibility did not. This suggests that these genes may drive lethality through partially distinct mechanisms. Thus, we chose to further investigate the effects of ndufs5 and ndufa13, and their possible interactions. We sampled 235 F2 embryos at a range of developmental stages and measured their overall rates of respiration (Supplementary Information 1.3.41.3.5). We also used imaging of these embryos to track cardiovascular phenotypes as these have been associated with ndufa13 defects in mammals47. We found that incompatible genotypes at ndufs5 and ndufa13 affected a range of phenotypes, including heart rate, length relative to compatible siblings, and length-corrected head size (Extended Data Figs. 4 and 5, Supplementary Figs. 1117 and Supplementary Tables 57). ndufa13 mismatch has a large effect on cardiovascular phenotypes, including heart rate and the size of the sinu-atrium (an embryo-specific heart chamber; Fig. 2g,h, Supplementary Figs. 14 and 16 and Supplementary Tables 6 and 8), whereas ndufs5 affects only heart rate (Supplementary Fig. 14 and Supplementary Table 6). We find initial evidence that cardiac defects persist into adulthood in surviving individuals with ndufa13 mismatch (Extended Data Fig. 5 and Supplementary Information 1.3.3). By contrast, ndufs5 mismatch has a major effect on rates of respiration and yolk consumption during development (Fig. 2f, Supplementary Figs. 1820 and Supplementary Tables 911).

Naively, the separable impacts of incompatible genotypes at ndufs5 and ndufa13 could indicate that even though these proteins are in physical contact in complex I (see below), they represent two distinct hybrid incompatibilities. We investigated this question by taking advantage of rare survivors of the ndufa13 incompatibility in an expanded dataset of 1,010 F2 hybrids. Using this dataset, we were able to identify dozens of survivors of the ndufa13 incompatibility (3.4% of individuals) and found that genotypes at ndufa13 and ndufs5 were not independent (χ2 association test P = 0.032; Supplementary Information 1.2.5). Upon further investigation, we found that the majority of survivors of the ndufa13 incompatibility had homozygous X. malinche ancestry at ndufs5, suggesting that harbouring even one X. birchmanni allele at ndufs5 may sensitize fry to the ndufa13 incompatibility. Indeed, we found that individuals that had heterozygous ancestry at ndufs5 were significantly under-represented among surviving ndufa13 incompatible individuals (Permutation test P = 0.015; Fig. 2i and Supplementary Information 1.2.5). These findings highlight a subtle but significant non-additive effect of ndufs5 and ndufa13 on survival.

Mitochondrial biology in heterozygotes

Because few individuals homozygous for incompatible genotypes at ndufs5 or ndufa13 survive past birth, our previous experiments focused on embryos. However, the small size of Xiphophorus embryos prevents us from using assays that directly target complex I. To further explore the effects of the hybrid incompatibility on complex I function in vivo, we turned to adult F1 hybrids (Fig. 3a). Since F1 hybrids that derive their mitochondria from X. malinche and are heterozygous for ancestry at ndufs5 and ndufa13 are fully viable, we tested whether there was evidence for compensatory regulation that might be protective in F1 hybrids. We found no evidence for significant differences in expression of ndufs5 or ndufa13 (Supplementary Information 1.3.6 and Supplementary Figs. 21 and 22) or in mitochondrial copy number (Fig. 3b and Supplementary Information 1.3.7) between F1 hybrids and parental species.

Fig. 3: Physiology and proteomics of viable heterozygotes.
figure 3

a, Schematic of ancestry at loci of interest in X. birchmanni × malinche F1 hybrids (the incompatibility is not lethal in F1; Supplementary Information 1.2.2). b, Real-time quantitative PCR analysis of differences in mitochondrial copy number in liver tissue by genotype. The ratio is derived from the difference in Ct between the single-copy nuclear gene Nup43 and the mitochondrial gene nd1 (n = 3 X. birchmanni, 6 F1, 6 X. malinche). ce, Results of Oroboros O2K respirometer assay for adult X. birchmanni, X. malinche and hybrid individuals (n = 7 fish per genotype), with X. malinche mitochondria and heterozygous ancestry at ndufs5 and ndufa13. c, Complex I efficiency in F1 hybrids and parental species. d, Time to reach the maximum rate of complex I-driven respiration after ADP addition in F1 hybrids and parental species (see Supplementary Fig. 25 for example time-to-peak curves). For corresponding analysis of time to reach peak complex I- and complex II-driven respiration after addition of succinate, see Supplementary Fig. 26. e, Maximum respiration rates during full O2K protocol in F1 hybrids and parental species. f, Allelic balance of ndufs5 in the F1 hybrid transcriptome and proteome. Left, allele-specific expression of ndufs5 in adult F1 hybrids (n = 3 fish). Right, results of quantitative mass spectrometry analysis of ndufs5 peptides in mitochondrial proteomes derived from adult F1 hybrids (n = 5 fish). Dots show the proportion of area under the spectral curve contributed by the X. malinche allele in each individual. The data on the left are for endogenous peptides present in F1 hybrids, and the data on the right are results for heavy isotope-labelled peptide controls. The peptides quantified for each species are shown on the graph. af, Coloured dots and bars show the mean ± 2 × s.e.m., and grey dots show individual data.

With no indication of a compensatory regulatory response, we reasoned that we might be able to detect reduced mitochondrial complex I function in hybrids heterozygous for ancestry at ndufs5 and ndufa13. We quantified respiratory phenotypes in isolated mitochondria using an Oroboros O2K respirometer in adult hybrids and parental species (Methods, Supplementary Fig. 23 and Supplementary Information 1.3.8). We found that complex I efficiency was lower in hybrids (Fig. 3c and Supplementary Fig. 24, orthogonal contrast t = −2.53, P = 0.023, n = 7 per genotype), and that the time required for hybrids to reach maximum complex I-driven respiration was around 2.5 times longer (orthogonal contrast t = 4.303, P < 0.001; Fig. 3d and Supplementary Fig. 25). Conversely, overall levels of mitochondrial respiration were unaffected by genotype (Fig. 3e, orthogonal contrast t = 0.078, P = 0.94, n = 7 per genotype; Supplementary Information 1.3.8) as were other measures of mitochondrial integrity and function (Supplementary Figs. 26 and 27 and Supplementary Information 1.3.8 and 1.3.9). Together, these data point to reduced function of complex I without broader phenotypic consequences in individuals that are heterozygous for incompatible alleles48.

Given the physiological evidence for some reduction in complex I function in hybrids heterozygous at ndufs5 and ndufa13, we predicted that there might be an altered frequency of protein complexes incorporating both X. malinche mitochondrial proteins and X. birchmanni proteins at ndufs5 and ndufa13 in F1 hybrids. To test this prediction, we took a mass spectrometry-based quantitative proteomics approach. We used stable isotope-labelled peptides to distinguish between the X. birchmanni and X. malinche ndufs5 and ndufa13 peptides in mitochondrial proteomes extracted from F1 hybrids (n = 5; see Methods and Supplementary Information 1.4.11.4.4). Although endogenous ndufa13 peptides were not observed frequently enough to quantify accurately, we found consistent deviations from the expected 50:50 ratio of X. birchmanni to X. malinche peptides for ndufs5 in F1 hybrids, with a significant overrepresentation of matched ancestry at ndufs5 in the mitochondrial proteome (t = 3.96, P = 0.016; Fig. 3f, Supplementary Fig. 28 and Supplementary Information 1.4.5). Since we did not observe allele-specific expression of ndufs5 (Fig. 3f and Supplementary Information 1.3.6), this result is consistent with disproportionate degradation of X. birchmanni-derived ndufs5 peptides in the mitochondrial proteome or differences in translation of ndufs5 transcripts derived from the two species.

Mitonuclear substitutions in complex I

To begin to explore the possible mitochondrial partners of ndufs5 and ndufa13 among the 37 non-recombining genes in the swordtail mitochondrial genome, we turned to protein modelling, relying on high-quality cryo-electron microscopy (cryo-EM)-based structures49,50,51. Although these structures are only available for distant relatives of swordtails, the presence of the same set of supernumerary complex I subunits and high sequence similarity suggest that using these structures is appropriate (Supplementary Tables 12 and 13, Supplementary Figs. 2931 and Supplementary Information 1.4.6).

Barring a hybrid incompatibility generated by regulatory divergence (see Supplementary Information 1.3.6), our expectation is that hybrid incompatibilities will be driven by amino acid changes in interacting proteins52. We used the program RaptorX53 to generate predicted structures of X. birchmanni and X. malinche Ndufs5, Ndufa13 and nearby complex I proteins encoded by mitochondrial and nuclear genes, which we aligned to a mouse cryo-EM complex I structure49 (Fig. 4a, Supplementary Figs. 2931 and Methods). Using these structures, we visualized amino acid substitutions between X. birchmanni and X. malinche at the interfaces of Ndufs5, Ndufa13 and mitochondrial-encoded proteins (Extended Data Fig. 6 and Supplementary Figs. 32 and 33). Whereas there are dozens of substitutions in the four mitochondrial-encoded proteins that are in close physical proximity to Ndufs5 or Ndufa13 (Supplementary Fig. 29; Nd2, Nd3, Nd4l and Nd6), there are only five cases where amino acid substitutions in either nuclear-encoded protein are predicted to be close enough to contact substitutions in any mitochondrial-encoded protein, all of which involve Nd2 or Nd6 (Fig. 4a and Extended Data Table 1; see Supplementary Fig. 33 for pairwise visualizations of interacting proteins). These paired substitutions in regions of close proximity between mitochondrial- and nuclear-encoded proteins suggest that nd2 and nd6 are the genes most likely to be involved in the mitochondrial component of the hybrid incompatibility (Fig. 4a,b Extended Data Fig. 6 and Supplementary Figs. 3335), and will be promising candidates for functional validation when such approaches become possible in swordtails.

Fig. 4: Predicted structures of Xiphophorus respiratory complex I and evolutionary rates of incompatible alleles.
figure 4

a, Left, Xiphophorus respiratory complex I structures generated by RaptorX using alignment to a template mouse cryo-EM structure. Coloured protein structures include Ndufs5 and Ndufa13 and the four mitochondrial-encoded nd gene products in contact with Ndufs5 or Ndufa13. Right, expanded view showing the surface of predicted contacts between these proteins. Solid black lines highlight two areas predicted to be in close contact between interspecific substitutions (alpha carbon distance ≤ 10 Å for all models), and dashed lines show three additional areas in which there was weaker evidence for a predicted contact based on computational analyses (side chain distance ≤ 12 Å in at least one model). Asterisks denote residues with substitutions in X. birchmanni computationally predicted to affect protein function (Extended Data Table 3). b, Detailed view of the interface between Ndufs5, Nd2 and Nd6. Spheres highlight substitutions between X. birchmanni and X. malinche. For substitutions predicted to be in close proximity, residues are labelled with letters denoting the X. malinche allele, the residue number, and the X. birchmanni allele, respectively (Extended Data Table 3 and Supplementary Information 1.4.6). c, Gene tree for ndufs5 generated with RAxML, highlighting an excess of substitutions along the X. birchmanni branch. The scale bar represents the number of nucleotide substitutions per site. Derived non-synonymous substitutions are indicated by red ticks along the phylogeny; spacing between ticks is arbitrary. d, Distri bution of log10(dN/dS) between X. birchmanni and X. malinche across all nuclear genes with values for ndufs5 and ndufa13 highlighted. e, Gene tree for ndufa13 (as in c), highlighting an excess of substitutions along the X. birchmanni branch.

Rapid evolution of complex I proteins

Theory predicts that hybrid incompatibilities are more likely to arise in rapidly evolving genes4,5,6,7. Consistent with this hypothesis, ndufs5 is among the most rapidly evolving genes genome-wide between X. birchmanni and X. malinche (Fig. 4c,d). Aligning the ndufs5 coding sequences of X. birchmanni, X. malinche and 12 other swordtail species revealed that all 4 amino acid substitutions that differentiate X. birchmanni and X. malinche at ndufs5 were derived on the X. birchmanni branch (Fig. 4c). Phylogenetic tests indicate that there has been accelerated evolution of ndufs5 on this branch (inferred ratio of non-synonymous substitutions per non-synonymous site to synonymous substitutions per synonymous site (dN/dS) > 99, N = 4, S = 0, codeml branch test P = 0.005; Fig. 4c). Similar patterns of rapid evolution are observed at ndufa13, which also showed evidence for accelerated evolution in X. birchmanni (Fig. 4e; dN/dS = 1.2, N = 3, S = 1, codeml branch test P = 0.002). Although explicit tests for adaptive evolution at ndufs5 and ndufa13 could not exclude a scenario of relaxed selection (Extended Data Table 2 and Supplementary Information 1.5.1 and 1.5.2), our comparisons across phylogenetic scales highlight strong conservation in some regions of the proteins and rapid turnover in others, complicating our interpretation of this test (Supplementary Fig. 36).

Rapid evolution of ndufs5 and ndufa13 could be driven by coevolution with mitochondrial substitutions, a mechanism that has been proposed to explain the outsized role of the mitochondria in hybrid incompatibilities15,54. Indeed, there is an excess of derived substitutions in the X. birchmanni mitochondrial protein Nd6, one of the proteins that physically contacts Ndufs5 and Ndufa13 (Extended Data Fig. 7 and Extended Data Table 2; codeml branch test P = 0.005). Moreover, several of the substitutions observed in both mitochondrial and nuclear genes are predicted to have functional consequences (Extended Data Table 3 and Supplementary Information 1.5.1), including ones predicted to be in contact between Ndufs5, Ndufa13, Nd2 and Nd6 (Fig. 4a,b and Extended Data Fig. 6).

Introgression of incompatibility genes

The presence of a mitonuclear incompatibility in Xiphophorus is especially intriguing, given previous reports that mitochondrial genomes may have introgressed between species29. While X. malinche and X. birchmanni are sister species based on the nuclear genome, they are mitochondrially divergent, with X. malinche and Xiphophorus cortezi grouped as sister species based on the mitochondrial phylogeny29 (Fig. 5a,b). As we show, all X. cortezi mitochondria sequenced to date are nested within X. malinche mitochondrial diversity (Fig. 5b, Supplementary Fig. 37 and Supplementary Information 1.5.3 and 1.5.4). Simulations indicate that gene flow, rather than incomplete lineage sorting, drove replacement of the X. cortezi mitochondria with the X. malinche sequence (P < 0.002 by simulation; Fig. 5c and Supplementary Information 1.5.4).

Fig. 5: Phylogenetic analysis and ancestry mapping suggest that genes underlying the mitonuclear incompatibility have introgressed from X. malinche into X. cortezi.
figure 5

a, Nuclear phylogeny of Xiphophorus species, showing that X. birchmanni and X. malinche are sister species29. b, Phylogeny constructed from whole mitochondrial DNA sequences showing that X. cortezi mitochondria are nested within X. malinche mitochondrial diversity. c, Results of simulations modelling expected mitochondrial divergence between X. malinche and X. cortezi in a scenario with no gene flow. The first set of simulations used the average mitochondrial substitution rate observed between Xiphophorus species (red), and the second used the minimum mitochondrial substitution rate observed (teal). The dotted line shows observed divergence between mitochondrial haplotypes in X. malinche and X. cortezi, indicating that past mitochondrial introgression is more likely than incomplete lineage sorting. d, Clustal alignment of Ndufs5 sequences shows that X. malinche and X. cortezi (Xc) have identical amino acid sequences at Ndufs5, hinting at possible introgression of the nuclear ndufs5 gene, whereas X. birchmanni is separated from them by four substitutions. Similar patterns are observed for ndufa13. Colours indicate properties of the amino acid, and asterisks indicate locations where the amino acid sequences are identical. e, Non-mitochondrial parent ancestry is strongly depleted in natural X. cortezi×X. birchmanni hybrid populations fixed for the X. cortezi mitochondrial haplotype at ndufs5 and ndufa13. Upward arrowheads along x-axes show the locations of ndufa13 (red) and ndufs5 (purple) on chromosomes 6 and 13, respectively, which fall in minor parent ancestry deserts in both independently formed populations, as expected for strong hybrid incompatibilities. Step curves show average X. birchmanni ancestry in 250-kb windows, and horizontal lines show the genome-wide average for each population.

The introgression of the mitochondrial genome from X. malinche into X. cortezi raises the possibility that other complex I genes may have co-introgressed55. Indeed, the nucleotide sequence for ndufs5 is identical between X. malinche and X. cortezi, and the sequence of ndufa13 differs by a single synonymous mutation (although conservation of both genes is high throughout Xiphophorus; Supplementary Figs. 38 and 39). The identical amino acid sequences of the proteins suggest that hybrids between X. cortezi and X. birchmanni are likely to harbour the same mitonuclear incompatibility we observe between X. malinche and X. birchmanni, as a result of ancient introgression between X. malinche and X. cortezi (Fig. 5d and Supplementary Information 1.5.31.5.5).

This inference is supported by analysis of three contemporary X. birchmanni × X. cortezi hybrid populations40 (Supplementary Fig. 40). We find that all known X. birchmanni × X. cortezi hybrid populations are fixed for the mitochondrial genome from X. cortezi (that originated in X. malinche) and show a striking depletion of X. birchmanni ancestry at ndufs5 and ndufa13 (Fig. 5e and Supplementary Fig. 41). This replicated depletion is not expected by chance (Fig. 5e and Supplementary Information 1.5.6, P = 0.0001) and instead indicates that selection has acted on these regions. These results suggest that the mitonuclear incompatibility observed in X. birchmanni × X. malinche is also active in hybridizing X. birchmanni×X. cortezi populations. This exciting finding shows that genes underlying hybrid incompatibilities can introgress together, transferring incompatibilities between related species.

Discussion

Here we investigate the genetic and evolutionary forces that drive the emergence of hybrid incompatibilities. Theory predicts that hybrid incompatibilities involving multiple genes should be common7,8, but with few exceptions9,11,12,13, they remain almost uncharacterized at the genic level8. We have identified incompatible interactions in mitochondrial complex I that cause hybrid lethality in laboratory and wild populations. Our findings in naturally hybridizing species echo predictions from theory and studies in laboratory models9,11,12,13 suggesting that protein complexes may be a critical site of hybrid breakdown.

Researchers have proposed mitonuclear interactions as hotspots for the emergence of hybrid incompatibilities, given that mitochondrial genomes often experience higher substitution rates between species17,18,56, yet must intimately interact with nuclear proteins to perform essential cellular functions22,23. Our findings support this prediction, identifying incompatible interactions with both the X. malinche and X. birchmanni mitochondria. We also show that there has been exceptionally rapid evolution in both mitochondrial and interacting nuclear genes in X. birchmanni (Fig. 4). Whether driven by adaptation or some other mechanism, our findings support the hypothesis that the coevolution of mitochondrial and nuclear genes could drive the overrepresentation of mitonuclear interactions in hybrid incompatibilities22,23,54. More broadly, our results are consistent with predictions that rapidly evolving proteins are more likely to become involved in hybrid incompatibilities than their slowly evolving counterparts4,5,6.

Characterizing the incompatibility across multiple scales of organization enabled us to explore the mechanisms through which it acts57,58,59. Our results suggest that in the case of the X. malinche mitochondria hybrid lethality is mediated through arrested development in utero of individuals with mismatched ancestry at ndufs5, whereas individuals with ndufa13 mismatch have vascular defects and typically die shortly after birth. Intriguingly, individuals with ndufa13 mismatch that do survive are much less likely to harbour any X. birchmanni alleles at ndufs5 (Fig. 2i). Together, our results indicate that a subtle three-way interaction overlays two strong pairwise mitonuclear incompatibilities at ndufs5 and ndufa13. Evolutionary biologists have been fascinated by the idea that hybrid incompatibilities may commonly involve three or more genes following theoretical work by Orr7 nearly 30 years ago, but this question has been challenging to address empirically. Our results highlight how the nuances of actual fitness landscapes may defy simplifying assumptions.

Finally, this mitonuclear incompatibility provides a new case in which the same genes are involved in incompatibilities across multiple species30,38,60. However, tracing the evolutionary history of the genes that underlie it adds further complexity to this phenomenon: we found that introgression has resulted in the transfer of genes underlying the incompatibility from X. malinche to X. cortezi, and evidence from X. birchmanni×X. cortezi hybrid populations indicates that the incompatibility is probably under selection in these populations as well. The possibility that hybridization could transfer incompatibilities between species has not been previously recognized, perhaps due to an underappreciation of the frequency of hybridization. The impact of past hybridization on the structure of present-day reproductive barriers between species is an exciting area for future inquiry.

Methods

Biological materials

Wild parental and hybrid individuals used in this study were collected from natural populations in Hidalgo, Mexico (permit no. PPF/DGOPA-002/19). Artificial F1 and F2 hybrids were generated using mesocosm tanks as described previously39. Caudal fin clips were used as the source for all DNA isolation and for flow cytometry, and liver tissue for RNA-seq, respirometry, and proteomic assays were collected following Stanford Administrative Panel on Laboratory Animal Care (APLAC) protocol no. 33071.

Genotyping and local ancestry calling

Genomic DNA was extracted from fin clips and individually barcoded tagmentation-based libraries were generated (Supplementary Information 1.1.3). Hybrids were genotyped with low-coverage whole-genome sequencing followed by local ancestry inference across the 24 Xiphophorus chromosomes and the mitochondrial genome using the ancestryinfer pipeline38,39,42,61 (Supplementary Information 1.1.3 and 1.1.4). We converted posterior probabilities for each ancestry state to hard calls for downstream analysis, using a posterior probability threshold of 0.9, and analysed ancestry variation across the genome.

QTL and admixture mapping

The regions interacting with the mitochondrial genome were first identified based on analysis of segregation distortion in 943 F2 hybrids generated from F1 crosses between X. malinche females and X. birchmanni males (Supplementary Information 1.1.1 and Langdon et al.40). Since all hybrids in this artificial cross harboured the X. malinche mitochondria, we scanned for regions of exceptionally high X. malinche ancestry along the genome (>60% X. malinche ancestry), identifying one such region on chromosome 13 and one on chromosome 6 (Fig. 1; see also ref. 40). Evidence for interactions between these regions and the mitochondrial genome were confirmed using admixture mapping in two hybrid populations that segregated for the mitochondrial haplotype of both species (Supplementary Information 1.1.2): the Calnali Low hybrid population (n = 359) and the Chahuaco falls hybrid population (n = 244). In brief, we used a partial correlation analysis to identify regions of the genome strongly associated with mitochondrial ancestry, after regressing out genome-wide ancestry to account for covariance in ancestry due to population structure (see ref. 36 and Supplementary Information 1.1.5 and 1.1.9). Significance thresholds were determined using simulations (Supplementary Information 1.1.5; see also Supplementary Information 1.1.6).

Estimating selection on the incompatibility

We used an ABC approach to estimate the strength of selection against the incompatible interaction between the X. malinche mitochondrial haplotype and X. birchmanni ancestry at the two nuclear genes involved in the hybrid incompatibility: ndufs5 and ndufa13 (Supplementary Information 1.2.2). For these simulations, we asked what selection coefficients (0–1) and dominance coefficients (0–1) could generate the observed deviations from the expectation of 50:50 X. birchmanni–X. malinche ancestry in F2 hybrids at ndufs5 and ndufa13 after two generations of selection. We performed 500,000 simulations for each interaction and accepted or rejected simulations based on comparisons to the real data using a 5% tolerance threshold (Supplementary Information 1.2.2). We also evaluated evidence for incompatible interactions with the X. birchmanni mitochondrial haplotype (Supplementary Information 1.2.11.2.4).

Embryo staging and genotyping

To pinpoint when in development the incompatibility between the X. malinche mitochondria and X. birchmanni nuclear genotypes causes lethality, we collected a dataset on the developmental stages of embryos with different genotype combinations. Whole ovaries were removed from pregnant females and embryos were individually dissected. Each embryo was assigned a developmental stage ranging from 1–11 based on established protocols for poeciliid embryos44. Unfertilized eggs were excluded from analysis. Following staging, individual embryos (n = 296) were genotyped as described above and in Supplementary Information 1.3.1. We tested for significant differences in developmental stage between siblings with compatible and incompatible genotype combinations using a two-sided two-sample t-test (Supplementary Information 1.3.1) and examined differences in ancestry between large groups of siblings that varied in their developmental stages (Supplementary Information 1.1.7). We also collected data on embryonic stage and variability between siblings in embryonic stage from both pure parental species (Supplementary Information 1.3.1). We used a different approach to pinpoint the timing of ndufa13 lethality given that it appeared to act postnatally (Supplementary Information 1.3.3).

Embryo respirometry and morphometrics

To study the mechanisms of ndufs5- and ndufa13-driven lethality, we performed oxygen consumption measurements on F2 embryos in a Loligo plate respirometer (Supplementary Information 1.3.4). Embryos were dissected from mothers and transferred to wells of a 24-well plate, where their oxygen consumption was measured over 60 min. The measurement was then repeated in media dosed with 5 μM rotenone to test sensitivity to complex I inhibition, after which the embryos were video recorded and photographed for morphometrics in ImageJ. We used linear models to test the effect of ndufs5 genotype, ndufa13 genotype, and individual standard length on a number of variables, controlling for batch effects (Supplementary Information 1.3.4 and 1.3.5).

Mitochondrial respirometry

To further evaluate mitochondrial function in individuals heterozygous for the mitonuclear incompatibility (Supplementary Information 1.3.6 and 1.3.7), we conducted respirometry assays on X. birchmanni, X. malinche, and hybrid individuals that had the X. malinche mitochondria and were heterozygous for the nuclear components of the hybrid incompatibility (n = 7 of each genotype). Mitochondria were isolated from whole liver tissue and mitochondrial respiration was quantified using the Oroboros O2K respirometry system62 (Supplementary Fig. 23). A step-by-step description of this protocol and methods used to calculate respiratory flux control factors is outlined in Supplementary Information 1.3.8. We complemented the results of these respirometry experiments with measures of mitochondrial membrane potential using a flow cytometry-based approach (Supplementary Information 1.3.9).

Parallel reaction monitoring proteomics

For parallel reaction monitoring (PRM) with mass spectrometry, we used a similar approach to that used for respirometry to isolate whole mitochondria from five F1 hybrids (which harboured X. malinche mitochondria). This approach is described in detail in Supplementary Information 1.4.1. In brief, we designed heavy isotope-labelled peptides to distinguish between the X. birchmanni and X. malinche copies of Ndufs5 and Ndufa13, facilitating quantification of the peptides of interest in the mitochondrial proteome (Supplementary Information 1.4.2). Mitochondrial isolates were prepared for mass spectrometry and combined with heavy isotope-labelled peptides in known quantities (see Supplementary Information 1.4.3), then submitted to Orbitrap mass spectrometry with separation with ultra performance liquid chromatography and PRM for ion selection. The protocol for mass spectrometry and PRM is described in detail in Supplementary Information 1.4.4.

To analyse the results, the focal peptide’s spectral peak was identified based on the peak of the heavy isotope-labelled spike-in peptide. We focused analysis on the Ndufs5 peptide WLL[L/P]QSGEQPYK, since other endogenous peptides were below the expected sensitivity limits of our PRM protocol (Supplementary Information 1.4.5). We normalized the intensity of the Ndufs5 peptide based on the known spike-in quantity, and quantified the proportion of Ndufs5 in each F1 individual derived from X. malinche versus X. birchmanni (Supplementary Information 1.4.5). We tested whether these ratios significantly deviated from the 50:50 expectation for F1 hybrids using a two-sided one-sample t-test.

Complex I protein modelling

Mapping results allowed us to identify ndufs5 and ndufa13 as X. birchmanni genes that interact negatively with X. malinche mitochondrial genes. We used a protein modelling-based approach with RaptorX (http://raptorx.uchicago.edu) to identify the mitochondrial genes most likely to interact with ndufs5 and ndufa13 (see Supplementary Information 1.4.6). Using the mouse cryo-EM structure (Protein Data Bank (PDB) ID 6G2J) of complex I, we identified proteins in contact with Ndufs5 and Ndufa13, which included several mitochondrial (Nd2, Nd3, Nd4l and Nd6) and nuclear (Ndufa1, Ndufa8, Ndufb5 and Ndufc2) proteins. We then used RaptorX to predict structures for both the X. birchmanni and X. malinche versions of the proteins. In addition, we evaluated the robustness of these predictions to choice of cryo-EM template; see Supplementary Information 1.4.6.

Analysis of evolutionary rates

Comparison of predicted protein sequences encoded by ndufs5, ndufa13 and mitochondrial genes of interest (nd2 and nd6) revealed a large number of substitutions between X. birchmanni and X. malinche. We calculated dN/dS between X. birchmanni and X. malinche for all annotated protein coding genes throughout the genome and found that both ndufs5 and ndufa13 have rapid protein evolution (Fig. 4d and Supplementary Information 1.5.1). Examining these mutations in a phylogenetic context revealed that many substitutions in ndufs5, nudfa13 and nd6 were derived in X. birchmanni. We tested for significant differences in evolutionary rates on the X. birchmanni lineage and for predicted functional impacts of these substitutions; these analyses are described in Supplementary Information 1.5.1.

Tests for ancient introgression

Previous work had indicated that the mitochondrial phylogeny in Xiphophorus is discordant with the whole-genome species tree29. Specifically, although X. birchmanni and X. malinche are sister species based on the nuclear genome, X. malinche and X. cortezi are sister species based on the mitochondrial genome. We used a combination of PacBio amplicon sequencing of 10 individuals (2 or more per species, Supplementary Information 1.5.3) and newly available whole-genome resequencing data to confirm this result and polarize the direction of the discordance by constructing maximum likelihood mitochondrial phylogenies with the program RAxML63. We performed similar phylogenetic analyses of the nuclear genes that interact with the X. malinche mitochondria (ndufs5 and ndufa13; Supplementary Information 1.5.3). Combined with phylogenetic results, simulation results suggest that gene flow from X. malinche into X. cortezi is the most likely cause of the discordance we observe between the mitochondrial and nuclear phylogenies (Supplementary Information 1.5.3 and 1.5.4). Since X. malinche and X. cortezi are not currently sympatric, this suggests ancient gene flow between them (Supplementary Information 1.5.5).

X. birchmanni × X. cortezi hybridization

To investigate the possibility that hybrids between X. birchmanni and X. cortezi share the same mitonuclear incompatibility as observed in hybrids between X. birchmanni and X. malinche (Supplementary Information 1.5.6), we took advantage of genomic data from recently discovered hybrid populations between X. birchmanni and X. cortezi64. Using data from three different X. birchmanni × X. cortezi populations and a permutation-based approach, we tested whether ancestry at ndufs5 and ndufa13 showed lower mismatch with mitochondrial ancestry than expected given the genome-wide ancestry distribution. This analysis is described in detail in Supplementary Information 1.5.6.

Animal care and use

All methods were performed in compliance with Stanford APLAC protocol no. 33071.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.