The reunion of two lineages of the Neotropical brown stink bug on soybean lands in the heart of Brazil

Soares, Patricia L.; Cordeiro, Erick M. G.; Santos, Frederico N. S.; Omoto, Celso; Correa, Alberto S.

doi:10.1038/s41598-018-20187-6

Article
Open access
Published: 06 February 2018

Scientific Reports volume 8, Article number: 2496 (2018)

Abstract

The rapid pace of conversion of natural areas to agricultural systems is highly concerning, and the consequences for conservation and pest management are not yet fully understood. We examined mitochondrial (COI and Cytb) and nuclear (ITS1) gene regions of 21 populations of the stink bug Euschistus heros, to investigate the genetic diversity, genetic structure, and demographic history of this emerging soybean pest in South America. Two deep lineages that diverged in the Pliocene (4.5 Myr) occur over wide areas of Brazil. Historical changes during the Plio-Pleistocene led to significant genetic differences between E. heros populations, which differentiated further in several biomes. The northern lineage is older, more diverse, and prevalent in the Amazon and Caatinga, while the southern lineage is younger, less diverse, and prevalent in the Atlantic Forest and Chaco biomes. Euschistus heros populations are expanding in size and range but at different rates, strongly affected by environmental variables. Secondary contact between the main lineages is now occurring, mainly in areas of intensive farming and particularly in the Cerrado, an important agricultural frontier. Individuals adapted to different environmental conditions and to large monocultures might currently be combining into a panmictic and hard-to-control pest population.

Introduction

The drastic climate changes during the Plio-Pleistocene have been considered the main cause of high levels of diversification in many areas in Brazil^1,2,3. The four major Brazilian biomes, the Atlantic Forest (AF), Cerrado (central savannas), Amazon and Caatinga (seasonally dry tropical forest), have undergone profound changes during glacial and interglacial cycles⁴. Due to this complexity and landscape composition, the processes of diversification by vicariance and habitat refugia are frequently invoked to explain the high levels of species endemism and diversity in this part of the planet^5,6,7,8. Recent findings suggest that forest expansion and contraction and ‘historically stable areas’ may also have played a major role in the differentiation of lineages⁹.

Today, the Amazon and Atlantic forests are separated by a unique mosaic composed of savannas and woodlands that covers a large area between the two forest biomes, from Argentina and Paraguay (the Argentinean and Paraguayan Chaco) and continues through the central Brazilian Cerrado to the Caatinga in northeastern Brazil. This belt of mostly sparse dry vegetation is known as the ‘major South American disjunction’^3,10,11 and forms a natural barrier preventing the movement of organisms between the northern Brazilian biomes and the Atlantic Forest^12,13,14. Recent human impacts on these biomes, particularly the expansion of agricultural areas, have drastically changed the landscape and likely the connection between ecosystems, rearranging patterns that began to be formed millions of years ago.

The expansion of agricultural frontiers necessitates conversion of the native habitat to agriculture¹⁵. Fragments of native vegetation become embedded in a matrix of cropland and pasture that will eventually affect the species and ecosystem dynamics¹⁶. In some instances, habitat loss can limit the geographic range of endemic species, although certain species may thrive in their new surroundings. Cropland areas can provide exploitable resources, e.g. the expansion of soybean cultivation in Brazil in recent decades¹⁷. Soybean crops were formerly limited to southern Brazil (Atlantic Forest); in the early 1970s, advances in farming methods and new varieties allowed soybean farmers to expand into a new and important frontier, the Cerrado^18,19,20.

Farming in the Cerrado has had both negative and positive impacts over the last 40 years. The expansion of soybean crops caused extensive environmental impacts such as fragmentation and the loss of natural areas, a matter of great concern for ecologists and conservationists^2,21,22. On the other hand, Brazil is the second largest soybean producer after the USA, and soybeans account for an important share of its GDP²³. For these reasons, it is important to understand both the impacts of soybean expansion on connecting natural populations, and the influx of insect pests from natural areas into soybean croplands.

The Neotropical brown stink bug, Euschistus heros (Fabr. 1798) (Hemiptera: Pentatomidae), is one of the most important pests of soybeans²⁴. This Neotropical native is widespread in South America, living in markedly different environments including the Amazon Forest, Cerrado, Caatinga and Atlantic Forest²⁵. The dispersal ability of E. heros is not well known, but is considered to be poor, perhaps because of its limited flight activity and diapause behavior^26,27. This polyphagous pest feeds on members of Fabaceae, Solanaceae, Brassicaceae, Compositae, and Malvaceae, and often reaches high population densities on soybeans^28,29,30,31. Rarely reported before the 1970s^24,32,33, E. heros has since increased in abundance and is now found in all major soybean-producing regions²⁴. Recently, this pest was recorded in Argentina³⁴ and Paraguay²⁴, raising concerns regarding a possible range expansion to other locations in South, Central and North America²⁴.

Here we present a genealogy of mitochondrial and nuclear DNA sequences from E. heros. We addressed several questions regarding the genetic diversity, population structure, and demographic history of E. heros populations in Brazil and Paraguay, examining the potential role of past events in the differentiation of lineages, and the recent events (i.e. soybean expansion) promoting the admixture of ancient lineages. Our first objective was to determine the genetic distribution, studying population divergence and population structure. Second, we investigated the demographic history of E. heros in different biomes in South America. Finally, we used a modelling approach to explore how environmental variables and soybean expansion can explain the genetic pattern of current populations of E. heros in Brazil.

Results

Genealogical inferences

Genealogical relationships among 111 mitochondrial haplotypes indicate two well-supported E. heros lineages separated by 52 mutational steps, and an estimated genetic distance of D = 0.042 (Fig. 1a). The southern lineage (S) haplotypes occur mainly in southern regions of South America (Fig. 1b). Small percentages of lineage S haplotypes were also found in central and northeastern Brazil (Fig. 1b), in a wide range of habitats (Table 1). The northern lineage (N) occurs mainly in northern and northeastern South America and was not present in Paraguay or southern Brazil (Table 1). A total of 91 haplotypes were identified as private haplotypes; the most frequent variants were H2 and H38 (n = 8), both from lineage S (Table 1).

Table 1 Sampling localities of Euschistus heros, with code, biomes, mitochondrial haplotype from two concatenated genes (COI-Cytb), haplotype nuclear ITS1 region, and geographic coordinates.

Analyses of the ITS1 region revealed single-nucleotide polymorphism variation separating the haplotypes. The six haplotypes were separated from one another by one-step mutations (Fig. 1c and Supplementary material S2). To recreate the heterozygotes, we created two alternative ITS1 sequences for every individual that showed ambiguity at the polymorphic site. The ITS1 network had lower haplotype diversity than the mitochondrial network. Haplotype HA was the most frequent (70.16%) and was widely distributed across all regions (Table 1). Haplotype HC was found in southern and central South America and was associated with individuals previously identified as mitochondrial lineage S (Table 1 and Supplementary Fig. S2). Haplotype HD (21.77%) was found only in northern South America and was associated with individuals previously identified as mitochondrial lineage N or S (Table 1). The single haplotypes HB, HE and HF were found in populations RS1, MT1, and MT2, respectively (Table 1). The sharing of nuclear haplotypes by specimens from both mitochondrial lineages may indicate that these lineages can interbreed (Fig. 1c).
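The heterozygote-reconstruction step can be sketched as follows. This is a minimal illustration rather than the procedure actually used: it assumes that each individual carries at most one ambiguous site, written with an IUPAC two-base code, and the function name and example sequences are hypothetical.

```python
# IUPAC two-base ambiguity codes and the two nucleotides each one can represent.
IUPAC_PAIRS = {
    "R": ("A", "G"), "Y": ("C", "T"), "S": ("C", "G"),
    "W": ("A", "T"), "K": ("G", "T"), "M": ("A", "C"),
}

def split_heterozygote(seq):
    """Return the two alleles implied by a sequence with at most one ambiguous site."""
    seq = seq.upper()
    sites = [i for i, base in enumerate(seq) if base in IUPAC_PAIRS]
    if not sites:
        return (seq, seq)  # homozygote: both alleles are identical
    if len(sites) > 1:
        # With several ambiguous sites the phase is unknown; this sketch does not guess it.
        raise ValueError("phase is unknown with more than one ambiguous site")
    i = sites[0]
    first, second = IUPAC_PAIRS[seq[i]]
    return (seq[:i] + first + seq[i + 1:], seq[:i] + second + seq[i + 1:])
```

With more than one ambiguous site per individual, a dedicated phasing method would be needed, which is why the sketch raises an error instead of guessing.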

Diversity statistics

These South American E. heros populations showed extensive mitochondrial diversity. Of 159 concatenated mitochondrial sequences of COI and Cytb analyzed, 111 haplotypes were found; most (82%) of these haplotypes were private and only 18% were shared among individuals. The overall haplotype diversity, nucleotide diversity and mean number of nucleotide differences were h = 0.991, π = 0.03312 and K = 32.892, respectively (Table 2). The haplotype diversity was similar in lineage N (h = 0.984) and lineage S (h = 0.982); however, lineage N (π = 0.0090) had higher nucleotide diversity than lineage S (π = 0.0062). Haplotype diversity varied little among biomes (groups), ranging from 0.967 in the Amazon to 1.000 in the Chaco. The nucleotide diversity varied more widely among biomes, from 0.00627 in the Atlantic Forest to 0.03147 in the Amazon Forest (Table 2). Locations where both lineages were present had the highest levels of nucleotide diversity and mean numbers of nucleotide differences. Sequence analysis of the ITS1 region identified six haplotypes with a haplotype diversity of 0.461, nucleotide diversity of 0.0008 and mean number of nucleotide differences of 0.499 (Table 2). The haplotype diversity and nucleotide diversity were higher in lineage N (h = 0.503; π = 0.0008) than in lineage S (h = 0.355; π = 0.0006), with lineages as previously defined by the mitochondrial network. Among biomes, the highest diversity was observed in the Chaco (h = 0.600; π = 0.0009) and the lowest diversity in the Atlantic Forest (h = 0.315; π = 0.0005) (Table 2). The Amazon Forest, Caatinga and Cerrado have higher nucleotide diversity due to mixing of the two lineages in these areas.
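For readers unfamiliar with these summary statistics, the sketch below shows how h, K and π are conventionally computed from aligned sequences of equal length (Nei's estimators). It is an illustration only; the function names are hypothetical and this is not the software used in the study.

```python
from collections import Counter
from itertools import combinations

def haplotype_diversity(seqs):
    """h = n/(n-1) * (1 - sum of squared haplotype frequencies)."""
    n = len(seqs)
    counts = Counter(seqs)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1 - sum_sq)

def mean_pairwise_differences(seqs):
    """K: average number of differing sites over all pairs of sequences."""
    pairs = list(combinations(seqs, 2))
    total = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    return total / len(pairs)

def nucleotide_diversity(seqs):
    """pi: K divided by the alignment length."""
    return mean_pairwise_differences(seqs) / len(seqs[0])
```

Because π = K / L for an alignment of length L, the overall values reported here are mutually consistent: K = 32.892 over the 993 bp concatenated alignment (607 bp COI + 386 bp Cytb) gives π ≈ 0.0331.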

Table 2 Measures of genetic diversity for Euschistus heros based on two concatenated mitochondrial genes (COI-Cytb) and ITS1 region.

Mitochondrial divergence dating

The estimated age of origin of the lineage S (southern lineage) clade was 4.5 Myr (95% CI 2.801–6.453 Myr), during the Pliocene, with intense diversification in the Pleistocene and Holocene (Fig. 2b).

Population Structure

Lineage N haplotypes were associated mainly with the Amazon Forest and Caatinga, with one, more recent clade (CW) associated with central Brazil, a transitional region among the Cerrado, Caatinga and Amazon Forest (Fig. 2a,c). Lineage S predominates in the Atlantic Forest and Chaco, with a lower frequency in the Cerrado, Caatinga and Amazon Forest (Fig. 2a,c).

At the regional scale, the E. heros populations showed high genetic structure, as assessed by the Analysis of Molecular Variance (AMOVA). Differences among populations accounted for most of the genetic variance in mtDNA (56.57%, P < 0.001) and for a smaller but significant share in the ITS1 region (20.18%, P < 0.001) (Table 3). A test of the hypothesis that the genetic variation is structured by biomes showed that 45.07% of the total mtDNA variance was distributed among biomes (Φ_CT = 0.450, P < 0.001). Furthermore, the large portion of genetic variation found within populations (39.61%, Φ_ST = 0.603) indicates overall genetic differentiation in these populations (Table 3). Analysis of the ITS1 region supported the mitochondrial data, showing a significant structuring by biome (Φ_CT = 0.173, P < 0.001), in which most of the genetic variation was within populations (Table 3).
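The relationship between the variance percentages and the Φ statistics can be illustrated with the standard hierarchical AMOVA definitions. In the sketch below, the among-biome and within-population percentages are the mtDNA values quoted in the text; the among-populations-within-biomes component is not quoted and is assumed here to be the remainder. The function name is hypothetical.

```python
def phi_statistics(among_groups, among_pops_within_groups, within_pops):
    """Convert AMOVA variance components (e.g. % of total) into Phi_CT, Phi_SC, Phi_ST."""
    total = among_groups + among_pops_within_groups + within_pops
    phi_ct = among_groups / total
    phi_sc = among_pops_within_groups / (among_pops_within_groups + within_pops)
    phi_st = (among_groups + among_pops_within_groups) / total
    return phi_ct, phi_sc, phi_st

# mtDNA percentages quoted in the text.
among_biomes = 45.07
within_populations = 39.61
# ASSUMPTION: the middle component is taken as the remainder; it is not given in the text.
among_pops_within_biomes = 100.0 - among_biomes - within_populations

phi_ct, phi_sc, phi_st = phi_statistics(
    among_biomes, among_pops_within_biomes, within_populations
)
```

Under this assumption Φ_CT ≈ 0.451 and Φ_ST ≈ 0.604, close to the reported 0.450 and 0.603, and the three indices satisfy (1 − Φ_ST) = (1 − Φ_SC)(1 − Φ_CT).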

Table 3 Analysis of molecular variance (AMOVA) for genetic structure of Euschistus heros based on two concatenated mitochondrial genes (COI-Cytb) and ITS1 region.

Demographic statistics inferred for mitochondrial genes

Considering the two lineages, significant negative values were found in both the Tajima’s D and Fu’s Fs neutrality tests, indicating population expansion or purifying selection. Considering the biomes, the neutrality test statistics did not fully agree with one another. Fu’s Fs statistic was significantly negative for all biomes, but only the Atlantic Forest biome had a significant negative Tajima’s D value (Table 4).
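As a reminder of what the first of these tests measures, the sketch below implements the textbook form of Tajima's D, which compares the mean number of pairwise differences with the number of segregating sites. It is an illustration under simple assumptions (aligned sequences of equal length, no missing data), not the implementation used in the study, and it does not compute significance.

```python
import math
from itertools import combinations

def tajimas_d(seqs):
    """Tajima's D for a list of aligned, equal-length sequences."""
    n = len(seqs)
    # S: number of segregating (polymorphic) alignment columns
    seg_sites = sum(1 for column in zip(*seqs) if len(set(column)) > 1)
    if n < 4 or seg_sites == 0:
        raise ValueError("need at least 4 sequences and 1 segregating site")
    # k: mean number of pairwise differences
    pairs = list(combinations(seqs, 2))
    k = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs) / len(pairs)
    # Constants from Tajima (1989)
    a1 = sum(1 / i for i in range(1, n))
    a2 = sum(1 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n ** 2 + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    return (k - seg_sites / a1) / math.sqrt(variance)
```

A sample dominated by rare variants, as expected after a population expansion, yields k smaller than S / a1 and therefore a negative D.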

Table 4 Neutrality test statistics and mismatch distribution analysis for Euschistus heros based on two concatenated mitochondrial genes (COI-Cytb).

For the lineages, the mismatch distribution analysis resulted in a nonsignificant SSD (P > 0.05), indicating a recent demographic expansion of lineage S but not lineage N (P = 0.04). For the biomes, a nonsignificant SSD (P > 0.05) was also found for the E. heros populations in all biomes but the Caatinga (P = 0.03) (Table 4). The nonsignificant raggedness index (P > 0.17) supports the spatial-expansion model of populations of lineages, biomes, and the entire group (all populations combined) (Table 4). The τ values were higher in the Cerrado and the Amazon Forest, τ = 57.5 and τ = 58.7, respectively, compared to the other three biomes, Atlantic Forest (τ = 6.4), Chaco (τ = 6.0) and Caatinga (τ = 7.3) (Table 4).
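The quantities in this paragraph can be illustrated with a short sketch. The first function builds the observed mismatch distribution (the frequency of each number of pairwise differences); the second uses the standard relation τ = 2ut to convert τ into generations since expansion, where u is the mutation rate of the whole fragment per generation. The per-site mutation rate is a user-supplied assumption and is not given in the text; both function names are hypothetical.

```python
from collections import Counter
from itertools import combinations

def mismatch_distribution(seqs):
    """Map each pairwise-difference count to the fraction of sequence pairs showing it."""
    diffs = [
        sum(a != b for a, b in zip(s1, s2))
        for s1, s2 in combinations(seqs, 2)
    ]
    counts = Counter(diffs)
    return {d: counts[d] / len(diffs) for d in sorted(counts)}

def generations_since_expansion(tau, mu_per_site, n_sites):
    """Solve tau = 2 * u * t for t, with u = mu_per_site * n_sites (rate is an assumption)."""
    return tau / (2 * mu_per_site * n_sites)
```

Under this relation, and for a common mutation rate, the much larger τ values in the Cerrado and the Amazon Forest would correspond to older expansion signals than those in the other three biomes, although the mixing of two divergent lineages in those biomes may also inflate pairwise differences.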

The expansion of populations in the Amazon Forest, Chaco, Caatinga, Cerrado and Atlantic Forest occurred within the last 500 years, corresponding to a recent expansion during the Quaternary according to the Bayesian skyline plot analysis (Fig. 3). The Chaco and Atlantic Forest populations remained stable during the past 100 years, while the Caatinga and Cerrado populations are still expanding. According to the effective population size (Ne), the Atlantic Forest population is the largest, followed by the Caatinga. The Cerrado, Amazon Forest and Chaco populations have similar sizes, but the Cerrado population is still expanding very rapidly, while the Chaco population is expanding slowly and the Amazon population is now contracting (Fig. 3).

Environmental features and soybean expansion modelling the current mitochondrial lineage distribution

Three models passed the cutoff (i.e. models that were less than 2 units away from the “best” model) to explain the presence (%) of the southern lineage at a given location (i.e. the probability of finding an individual from the southern lineage). The best predictors were the ‘max temperature of the warmest month’, ‘latitude’, and ‘annual mean temperature’ (Fig. 4 and Table 5). The most important variables were the ‘max temperature of the warmest month’ and ‘latitude’, which received the highest score in all top models. The model performance improved when ‘annual mean temperature’ was excluded, lowering AIC_c from 36.22 to 35.35 and raising w_i from 0.09 to 0.14 (Table 5). The best model selected (AIC_c = 35.35) was 22.29 units away from the null model (AIC_c = 57.64). The variable ‘latitude’ (0.50) was more important than ‘max temperature of the warmest month’ (0.43) in the best model according to the z-scored beta (null deviance = 18 on 18 d.f., residual deviance = 4.04 on 16 d.f.) (Table 5). Latitude (0.9) was also the most important variable in the second-best model compared to the ‘max temperature of the warmest month’ (0.7) and ‘annual mean temperature’ (0.65) (null deviance = 18 on 18 d.f., residual deviance = 3.469 on 15 d.f.) and in the third-best model (0.66) compared to ‘mean temperature of wettest quarter’ (0.35) and ‘max temperature of the warmest month’ (0.56) (null deviance = 18 on 18 d.f., residual deviance = 3.531 on 15 d.f.). None of the three best models included soybean variables. The two soybean variables, ‘time since first soybean harvest’ and ‘soybean expansion rate’, ranked 7th and 16th in overall importance. The ‘time since first soybean harvest’ was strongly correlated with ‘latitude’ (r = 0.90, d.f. = 17, P < 0.001), ‘max temperature of the warmest month’ (r = 0.61, d.f. = 17, P = 0.005), and ‘annual mean temperature’ (r = 0.76, d.f. = 17, P < 0.001).
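The model-comparison arithmetic can be sketched as follows, using the standard small-sample correction for AIC and the usual definition of relative likelihood. The AIC_c values are those quoted in the text; the function names are hypothetical. Akaike weights (w_i) are normalised over the full candidate set, which is not reproduced here, so only the ratio of two weights can be checked.

```python
import math

def aicc(log_likelihood, k, n):
    """Small-sample corrected AIC for a model with k parameters fitted to n observations."""
    aic = 2 * k - 2 * log_likelihood
    return aic + (2 * k * (k + 1)) / (n - k - 1)

def deltas_and_relative_likelihoods(aicc_values):
    """Return the delta-AICc of each model and its likelihood relative to the best model."""
    best = min(aicc_values)
    deltas = [value - best for value in aicc_values]
    relative = [math.exp(-delta / 2) for delta in deltas]
    return deltas, relative

# AICc values quoted in the text: best model, second-best model, null model.
deltas, relative = deltas_and_relative_likelihoods([35.35, 36.22, 57.64])
```

The second-best model is 0.87 units from the best, giving a relative likelihood of about 0.65, which matches the ratio of the reported weights (0.09 / 0.14 ≈ 0.64). The null model, 22.29 units away, has essentially no support.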

Table 5 Model-selection results showing the top three models for the response variable ‘presence of the southern lineage’.

Discussion

Our results revealed two divergent deep lineages of E. heros in South America. The two COI-Cytb haplotype groups are separated by 52 mutational steps and have an estimated genetic distance of 4.2% (K2P). The number of mutation steps separating the two E. heros lineages is exceptionally high, raising the question of the possible presence of cryptic species. Rolston³⁵ described a species in Middle America that is morphologically similar to E. heros, Euschistus atrox (Westwood) 1837, distributed in Colombia, Guiana, and Panama. However, examination of the external morphology, the sharing of ITS1 alleles by both lineages, and the admixture of lineages in the laboratory support the hypothesis of a single species that encompasses the two divergent lineages. Our results reveal the need for a thorough systematic review of the genus Euschistus.
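The Kimura two-parameter (K2P) distance quoted above corrects the observed proportion of differences for multiple substitutions, treating transitions and transversions separately. The sketch below shows the standard formula for a single pair of aligned sequences; it is an illustration with a hypothetical function name, not the software used to obtain the 4.2% estimate.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}

def k2p_distance(seq1, seq2):
    """Kimura two-parameter distance between two aligned sequences."""
    # Keep only sites where both sequences have an unambiguous base.
    pairs = [
        (a, b)
        for a, b in zip(seq1.upper(), seq2.upper())
        if a in "ACGT" and b in "ACGT"
    ]
    n = len(pairs)
    # Transitions: A<->G or C<->T; every other mismatch is a transversion.
    transitions = sum(
        1 for a, b in pairs
        if a != b and ({a, b} <= PURINES or {a, b} <= PYRIMIDINES)
    )
    transversions = sum(1 for a, b in pairs if a != b) - transitions
    p = transitions / n
    q = transversions / n
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))
```

For closely related sequences the correction is small, so the K2P distance stays close to the raw proportion of differing sites.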

The two E. heros lineages are geographically separated from one another, with one clade clustering the northernmost populations (i.e. northern and northeastern Brazil), and a second clade clustering the southernmost populations (i.e. southern and southeastern regions). Both mitochondrial lineages expanded to form a mixed zone upon secondary contact in the central and southwestern regions. It is not clear when the reunion occurred, but the formerly isolated populations seem to have come into contact before reproductive isolation was complete^36,37. A related point to consider is that all but one of the northern haplotypes found in the Cerrado (CW) were phylogenetically grouped together in one clade, indicating a subgroup differentiation. The central-western (CW) subgroup likely occupied the region much earlier than the southern lineage arrived and before the first soybean fields were established. This is strong evidence against the hypothesis that the E. heros expansion was purely associated with the expansion of soybean cultivation during the 1970s.

The divergence time of the two main clades is estimated as occurring during the Pliocene (i.e. 4.5 Myr). This divergence seems to be associated with a cooling and drying of the global environment, which caused the separation of the Amazon Forest from the southern part of the Atlantic Forest and the consequent expansion of grasslands and savannas³⁸. Temperature cycles were also associated with more recent diversification events during the Pleistocene (i.e. differentiation of the CW group). Deep sequence divergence dating to the Pliocene is also reported for other organisms in the Neotropics^39,40, and phylogeographic structure has been found in amphibians in the Atlantic Forest³, reptiles in the Cerrado⁴¹, and plants^4,42.

Spatial genetic structuring by biomes was also found among subpopulations of E. heros. Separation into the Amazon Forest, Caatinga, Cerrado, Atlantic Forest and Chaco biomes seems to be the best way to explain the genetic variance hierarchically. Thus, separating insects by biomes can help us to understand the pattern of lineage mixing, diversity and demographic history. The haplotype diversity of E. heros was high and similar among biomes and lineages. This pattern is the result of the high number of private haplotypes found in E. heros populations in all biomes. The higher nucleotide diversity of lineage N compared to lineage S can be explained under the ‘historical climate’ stability models, where a stable environment such as the Amazon Forest can offer conditions for a population to persist, resulting in elevated intraspecific genetic diversity^43,44. Unstable regions, on the other hand, would be associated with recent or multiple-event colonization, resulting in lower intraspecific genetic diversity and signatures of expansion⁴. Therefore, the northern biomes (Amazon Forest and Caatinga) were the most stable environment, while the Atlantic Forest was the least stable environment. Another consideration is that lineage S is associated with areas that have undergone intense transformation due to agricultural practices, and has experienced population dynamics linked with farming cycles and control tactics.

Although E. heros’ limited dispersal capacity likely helped to preserve the pattern formed during the late Tertiary and Quaternary as an outcome of the climate changes, the last 100 years were an important turning point for E. heros populations (i.e. soybean introduction and expansion of farming starting at the end of the 19th century). It is plausible that farming and trade routes have increased the admixture process in certain areas, especially the Cerrado and connecting areas, even though there are still large areas where the two lineages have not yet encountered each other, showing that the pattern is still well preserved.

Recent signals of expansion were detected for E. heros lineages and in all biomes sampled. The inferences regarding population growth were supported by the neutrality tests, the unimodal mismatch distribution and the demographic expansion parameters (τ). Spatial expansion is also occurring, given that no significant Raggedness values were found. Apart from differences in test sensitivity, the lack of full agreement between tests for E. heros in each biome might indicate a more complex scenario. Multiple processes affecting local diversity and the noise from human intervention causing population reduction, population subdivision, bottlenecks, and facilitation of dispersal resulting in the secondary contact might affect the precise demographic estimates for a species^45,46,47.

We also conducted a Bayesian Skyline analysis to test the hypothesis of recent expansion in all biomes and to determine how the effective sizes of the populations behaved over time. The period of E. heros population growth in all areas overlaps with the period of intense changes caused by the increase of urban occupation and agricultural area in South America⁴⁸. It may be that the resulting habitat loss not only did not affect E. heros populations negatively, but has even been advantageous. One possible hypothesis to explain the success of E. heros is shifting hosts from natural areas to agricultural fields, especially soybeans but also cotton and bean fields³¹. A second hypothesis is that one or more traits occur in a latitudinal cline^49,50,51. The species’ association with environmental gradients should also be considered, given the possibility of differences in traits and adaptations such as reproductive diapause^27,52,53.

We used environmental and soybean variables to make phylogeographic inferences to predict the predominance of lineage S over lineage N at a given location. Selected models had similar AICc scores and considerably reduced the number of variables, down to 4 for the percentage of lineage S models. The two most important variables were the maximum temperature of the warmest month and the latitude. Temperature and photoperiod both affect this species, and might induce quiescence behavior and other possible differences in physiological responses. Latitude, on the other hand, can be correlated with geographic distance, environmental gradients, and agricultural gradients, as in the case of the soybean expansion. Our data support the predictions of the latitudinal-gradient hypothesis, even though distinct demographic scenarios can be expected at different times of E. heros’ evolutionary history. The time since the first soybean harvest correlates with latitude and other bioclimate variables, which likely decreased the importance of this variable in the model.

The reunion of the two long-separated lineages might have unforeseen consequences for one of the largest soybean-producing regions in the world. The two lineages are united again in central Brazil, where an agricultural revolution started in the 1970s and continues today, pushing soybean fields northward⁵⁴. It is possible that the northern and southern populations of E. heros are exchanging adaptations in admixture zones. However, knowledge of the differences between the two lineages is limited, because their presence was unknown until this point^55,56. The changing status of E. heros from a secondary to a primary pest in soybean crops and the reasons for this are poorly understood. In recent years, the increase of population densities in soybean fields, the shorter quiescence period, larger host range (i.e. damage in cotton crops) and pesticide tolerance/resistance have been frequently reported in E. heros populations^30,31,57. These concerns increase in a scenario of GM soybean introduction, no-till management, and expansion to diversity-hotspot areas.

Material and Methods

Sample collection and DNA extraction

One hundred fifty-nine specimens of E. heros were collected between December 2015 and July 2016 from 21 localities across five South American biomes. Twenty sampling sites were in Brazil and one was in Paraguay. Specimens were collected as adults from the soybean canopy, using a beating cloth placed under the plants. Individuals were preserved in ethanol (>95%) at –20 °C until laboratory manipulation, after which the remaining tissue from all specimens was stored at –80 °C. DNA was extracted from the head tissue of each adult specimen, using the modified CTAB protocol⁵⁸.

PCR amplification and DNA sequencing

Fragments of two mitochondrial regions and one nuclear region were amplified by polymerase chain reaction (PCR), using specific mitochondrial primers developed for this project and previously developed ITS1 primers⁵⁹. The Cytochrome c Oxidase Subunit 1 (COI) fragment was amplified using the forward primer (5′-ACCGCACATGCATTTGTAATAA-3′) and the reverse primer (5′-GTGGCTGATGTGAAGTATGCTC-3′), and the Cytochrome b (Cytb) fragment was amplified using the forward primer (5′-GGATATGTTTTACCTTGAGGACA-3′) and the reverse primer (5′-GGAATTGATCGTAAGATTGCGTA-3′). To amplify the ITS1 rDNA region (18S partial – ITS1 complete – 5.8S partial) we used the forward primer CAS18SF1 (5′-TACACACCGCCCGTCGTACTA-3′) and the reverse primer CAS5p8sB1d (5′-ATGTGCGTTCRAAATGTCGATGTTCA-3′). The PCR reactions were performed in a total volume of 25 μL containing 80 ng total DNA, 1.5 mM MgCl₂, 0.1 mM dNTPs, 0.4 pmol/μL of each primer, 1 U of Taq DNA Polymerase (Synapse Inc.) and Buffer (10× Taq DNA Buffer). The PCR program consisted of an initial denaturation at 95 °C for 3 min, followed by 35 cycles of denaturation at 95 °C for 30 s, annealing at 54 °C for 40 s and polymerization at 72 °C for 1.5 min, and a final extension at 72 °C for 10 min. Subsequently, the PCR products were separated on agarose gel (1.5% w/v) and observed under ultraviolet light. The amplicons were purified using 0.33 μL EXO I, 0.33 μL FastAP and 0.34 μL of ultra-pure water together with 10 μL of each PCR product, held at 37 °C for 30 min, then at 80 °C for 15 min. Sanger sequencing of the PCR products was performed by the Animal Biotechnology Laboratory at ESALQ, University of São Paulo.

Assembly of sequence datasets

All sequences were aligned and edited manually using the software Sequencher 4.0.1 (Gene Codes Corp., Ann Arbor, MI, USA). To eliminate missing data, sequences were trimmed to 607 bp for the COI gene, 386 bp for the Cytb gene and 638 bp (18S partial – 52 bp; ITS1 complete – 416 bp; 5.8S partial – 170 bp) for the ITS1 region. There were no insertions or deletions in the sequences obtained. All sequences (datasets) obtained in this study were deposited in NCBI-GenBank (https://www.ncbi.nlm.nih.gov/genbank/) with the accession numbers MG651970 - MG652128 (COI), MG652129 - MG652287 (Cytb), and MG654513 - MG654636 (ITS1).
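The trimming described above, together with the COI-Cytb concatenation used in the later analyses, can be sketched as follows. The two lengths come from the text; everything else is an assumption made for illustration, in particular that the aligned sequences of each gene start at the same position (so trimming removes only the 3′ end) and that sequences are held in dictionaries keyed by a hypothetical specimen identifier.

```python
COI_LENGTH = 607   # bp retained for COI, as stated in the text
CYTB_LENGTH = 386  # bp retained for Cytb, as stated in the text

def concatenate_mito(coi_by_specimen, cytb_by_specimen):
    """Trim each gene to its retained length and join COI + Cytb per specimen.

    Specimens missing either gene, or with a sequence shorter than the
    retained length, are skipped so the result contains no missing data.
    """
    concatenated = {}
    for specimen, coi in coi_by_specimen.items():
        cytb = cytb_by_specimen.get(specimen)
        if cytb is None or len(coi) < COI_LENGTH or len(cytb) < CYTB_LENGTH:
            continue
        concatenated[specimen] = coi[:COI_LENGTH] + cytb[:CYTB_LENGTH]
    return concatenated
```

Each concatenated sequence is 993 bp long (607 + 386).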

The presence of nuclear paralogs of mitochondrial origin (termed numts)⁶⁰ was inspected in the mitochondrial gene fragments, using the software MEGA v.5.2⁶¹. Three signatures of numts were searched: (i) indels that introduce frameshifts, (ii) out-of-place in-frame stop codons that lead to premature termination of protein translation, and (iii) lack of codon position substitution bias toward the 3rd position, which leads to a higher rate of non-synonymous mutations. The presence of signatures (i) and (ii) is enough to consider a given sequence a numt. No numts were detected in the COI or Cytb sequences; therefore, we included all mitochondrial sequences in our analysis. Subsequent analyses were performed using concatenated mitochondrial genes (COI-Cytb).
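Signatures (i) and (ii) lend themselves to a simple automated check, sketched below. This is not the MEGA workflow described above. It assumes that indels appear as gap characters ('-') in the alignment, that the reading frame of the fragment is known and passed in by the user, and that stop codons follow the invertebrate mitochondrial genetic code (NCBI translation table 5: TAA and TAG). The function name is hypothetical, and signature (iii) is not covered.

```python
import re

# Stop codons under the invertebrate mitochondrial genetic code (NCBI translation table 5).
INVERT_MITO_STOPS = {"TAA", "TAG"}

def numt_signatures(seq, frame=0):
    """Report numt signatures (i) and (ii) for one aligned protein-coding fragment."""
    seq = seq.upper()
    # (i) a run of gaps whose length is not a multiple of 3 shifts the reading frame
    gap_runs = re.findall(r"-+", seq)
    frameshift = any(len(run) % 3 != 0 for run in gap_runs)
    # (ii) any in-frame stop codon inside an internal gene fragment is premature
    coding = seq.replace("-", "")[frame:]
    codons = [coding[i:i + 3] for i in range(0, len(coding) - 2, 3)]
    premature_stop = any(codon in INVERT_MITO_STOPS for codon in codons)
    return {"frameshift_indel": frameshift, "premature_stop": premature_stop}
```

A sequence for which both flags are false shows neither signature, which is the outcome reported here for all COI and Cytb sequences.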