Phylogenomics of the Hyalella amphipod species-flock of the Andean Altiplano

Zapelloni, Francesco; Pons, Joan; Jurado-Rivera, José A.; Jaume, Damià; Juan, Carlos

doi:10.1038/s41598-020-79620-4

Article
Open access
Published: 11 January 2021

Scientific Reports volume 11, Article number: 366 (2021)

Abstract

Species diversification in ancient lakes has enabled essential insights into evolutionary theory, as these lakes embody an evolutionary microcosm compared to continental terrestrial habitats. We have studied the high-altitude amphipods of the Andean Altiplano using mitogenomic, nuclear ribosomal and single-copy nuclear gene sequences obtained from 36 Hyalella genomic libraries, focusing on species from Lake Titicaca and other water bodies of the Altiplano northern plateau. Results show that early Miocene South American lineages have recently (late Pliocene or early Pleistocene) diversified in the Andes with a striking morphological convergence among lineages. This pattern is consistent with the ecological opportunities (access to unoccupied resources, initial relaxed selection on ecologically-significant traits and low competition) offered by the lacustrine habitats established after the Andean uplift.

Introduction

Lakes with an uninterrupted history of more than 100,000 years (ancient lakes) may be considered as natural laboratories for evolutionary research as they constitute hotspots of aquatic animal speciation and phenotypic diversity¹. Changes in lake size and episodes of desiccation are considered to be critical factors in the speciation and extinction of lake faunas, with the creation of new habitats after lake expansions as the primary driver of intra-lake diversification^2,3,4. For instance, cichlid radiations in the East African Lakes seem to have been triggered by lake expansions after periods of intense desiccation, with the surviving species filling up empty niches after lake refilling².

Lake Titicaca, located in the Andean high plateau in the central Andes of Perú, Bolivia and Argentina at an elevation of 3806 m, is part of an extensive intermontane endorheic area of about 200,000 km² that also includes Lake Poopó, and the salt flats of Coipasa and Uyuni⁵ (Fig. 1). The Titicaca is the only ancient lake present in South America, with a presumed age of between 3 and 2 million years (Ma)^5,6, following the end of the uplift of the northern Andean Altiplano (at 5.4 ± 1.0 Ma⁷). The Lake consists of two main sub-basins: the northern one (Lake Chucuito), which attains a maximum depth of 285 m, is separated from the southern sub-basin (Lake Huiñaimarca) by the Tiquina Strait (Fig. 1). Like many other ancient lakes such as the East African ones, the Altiplano lakes have experienced a complex palaeo-environmental history, as the water level was subjected to at least three significant expansions from the Early to Middle Pleistocene. These shifts resulted in the joining of the different lake basins of the Altiplano into a single hydrological unit^8,9,10. In addition, during global interglacial periods, the water level dropped considerably, resulting in an increase in water salinity and a closed-basin configuration of the Titicaca¹¹. These significant lake-level fluctuations split the Titicaca basin into three palaeolakes in recent times (8000 years before present), presumably influencing the population dynamics of the lacustrine taxa found therein¹².

Several aquatic species-flocks (e.g. gastropod molluscs, crustacean amphipods, cyprinodontid fish) dwell in the Titicaca and its peripheral water bodies^4,6,13,14,15. The genus Hyalella S. I. Smith, 1874 (Fam. Hyalellidae Bulycheva, 1957) constitutes the only epigean amphipod lineage present in continental waters of South America¹⁶, with at least 18 species recorded in the Andean Altiplano, including Lake Titicaca^17,18,19,20. Previous studies using mitochondrial cytochrome c oxidase subunit I (cox1) DNA sequences supplemented with a nuclear marker demonstrated that the Hyalella in the Titicaca are polyphyletic, and divide into five major genetic lineages embedded in the broader South American Hyalella phylogeny^14,15. Several independent colonizations deriving from South American lineages that diverged from each other in the last 15–20 My seem to have populated the Andean Altiplano, with evidence of intra-lacustrine diversification detected in at least two of the clades^14,15. In a previous work, we applied molecular species delimitation criteria and concluded that at least one-third of the Hyalella species diagnosed in that study are likely endemic to the Titicaca and neighbouring water bodies¹⁵. We also uncovered a disagreement between morphology and genetic data in the Titicacan Hyalella, with cases of the same morpho-species displaying very distinct mitochondrial haplotypes, while other morpho-species were diagnosed as belonging to the same cox1 molecular operational taxonomic unit (MOTU). No significant discordances between mitochondrial and nuclear data were observed in Titicacan amphipods by either the Adamowicz et al.¹⁴ or Jurado-Rivera et al.¹⁵ surveys. Morphological conflict with molecular phylogenies has also been detected in amphipods of ancient Lake Baikal²¹. These disagreements were proposed to derive from rapid morphological and ecological differentiation coupled with phenotypic convergence^21,22,23.

The use of mitochondrial genomes for phylogenetics in metazoans, including crustaceans, has exploded in recent years owing to advances in genomic techniques²⁴. Although the usefulness of mitochondrial sequences as a sole genetic marker has been questioned²⁵, they are still widely used in phylogenetic inference, species delimitation or identification and, in conjunction with nuclear multilocus data, to detect mito-nuclear discrepancies derived from hybridization/introgression events²⁶. In the present study, we contribute the mitogenomes and the SSU and LSU nuclear ribosomal DNA sequences of 32 Hyalella High Andes amphipods. We also identify nuclear orthologous genes present as a single copy in most species (orthogroups) to investigate the evolutionary origin and phylogenetic relationships of these amphipods.

Our main goal is to establish an evolutionary scenario for amphipods of the Andean plateau by achieving the following objectives:

1. to infer a robust phylogeny of Hyalella in the Altiplano, sampling their full geographic distribution in the two primary Titicaca sub-basins, main satellite lakes, and other high-altitude water bodies in the area;
2. to estimate tree node ages and date of colonization of the Lake by the different lineages, testing the hypothesis of occurrence of recent intra-lacustrine radiation in particular clades; and
3. to test the occurrence of mito-nuclear discordances and the congruence of sequence data with morphological characters under a phylogenetic framework.

The results obtained should shed light on questions such as: (1) did clade diversification occur synchronously among different lineages? (2) are diversifications concordant with the time-frame of the establishment of endorheic basins following the northern Andean Altiplano uplift? and (3) does the emergence of morphological key-innovation traits explain the Andean amphipod radiation, or were the ecological opportunities provided by the emergence of island-like mountain lacustrine habitats the main factor involved in their diversification?

Results

Mitogenome phylogeny

Thirty-five new Hyalella complete or partial mitogenomes representing species from each major clade in the Titicaca Altiplano area plus samples from Ecuador and Chile were obtained (Table 1). Details of mitogenome size, sequence completeness, gene order and A + T content are given in Supplementary Text 1 and Supplementary Table 1. The alignment of the 13 mitochondrial protein-coding genes (PCGs) of the South American Hyalella species plus the North American H. azteca and three outgroup taxa (Parhyale hawaiensis, Platorchestia japonica and Platorchestia parapacifica) comprised 11,073 bp (available at https://github.com/Frazapel/Hyalella-amphipod-Phylogenomics). Xia’s test showed low levels of substitution saturation except at third codon positions, where moderate levels of saturation were detected (Supplementary Text 1). The Maximum Likelihood (ML) phylogenetic tree robustly supports Hyalella as a monophyletic clade divided into two major lineages. The first comprises H. franciscae (from southern Chile) and all the representatives of clade C, which consists of one not yet formally described species of the Titicaca with smooth body integument (H. “krolli”) that is sister to the heavily armoured Titicacan H. armata, and another sister lineage sampled at the western border of the Altiplano (Laguna Súchez-Huaytire) represented by a species with smooth body integument. The second highly supported major lineage encompasses the North American species H. azteca and all the remaining South American species (Titicaca + Altiplano clades A, B, D and E plus the taxa from Ecuador clade F), with all subclades supported with maximum bootstrap values except the node relating clades A and B (which received a bootstrap support value of 95; Fig. 2a). None of the alternative topologies obtained under ML could be rejected by the Approximately Unbiased Test of Phylogenetic Tree Selection (AU test) (Supplementary Table 2) except for the one showing H. azteca as sister to the rest of South American Hyalella, although with a marginal P value (P = 0.0294). However, this hypothesis could not be rejected when the analysis did not include outgroups (P = 0.6230). The trees were also explored under a Bayesian framework at both nucleotide and amino acid levels, implementing various nucleotide and codon substitution models (Supplementary Fig. 2a–j). The results of these analyses converged to the same or similar mitogenome tree topologies with minor differences in support values. Phylogenies based on single mitochondrial genes were less resolved and less well supported than those based on the concatenated genes, with the more informative atp6, cob, cox1, cox3, nad1, nad3 and nad4 genes rendering the same topology as the all-genes dataset. In contrast, cox2 and nad5 supported an alternative topology in which H. azteca and the South American Hyalella are reciprocally monophyletic (Supplementary Fig. 2).

Table 1 Taxa included in the analysis.

Nuclear rDNA phylogeny

The alignments of the small and large nuclear ribosomal DNA sequences were 2395 and 5437 bp in length, respectively, after removal of long species-specific insertions present in the outgroups and H. azteca (available at https://github.com/Frazapel/Hyalella-amphipod-Phylogenomics). The ML trees obtained showed a backbone similar to the mitochondrial phylogeny, whether based on MAFFT alignments, on alignments considering secondary RNA structures, or on Bayesian analyses implementing the doublet model. However, some differences affected terminal tips, and lower support was obtained at some deep nodes (Fig. 2b). In particular, the nuclear rDNA phylogeny provides relatively low support for the relationship between H. azteca and the South American clades excluding clade C (bootstrap value = 93), with low confidence for the node relating the two main Hyalella lineages (clade C and the remaining Hyalella lineages) (bootstrap value = 81) (Fig. 2b). The removal of poorly aligned positions and divergent regions, and the recoding of gaps as binary characters irrespective of their length, produced nearly the same phylogenetic relationships, with slight variations at the tips of the tree.

Nuclear phylogeny based on single-copy nuclear genes

A total of 76 single-copy nuclear gene segments present in the 36 low-coverage Hyalella genomic libraries were retrieved using Orthofinder²⁷. BLAST similarity searches showed that 53 orthologous gene fragments matched known or predicted proteins of H. azteca (Supplementary Text 1 and Supplementary Table 3). The ML phylogeny obtained from the concatenated supermatrix (34,557 bp with a mean of 10.7% missing data, mainly due to missing orthologs in H. azteca) displayed the same six highly supported clades as the mitochondrial and nuclear ribosomal analyses, with clade C and H. azteca being part of a polytomy (Fig. 3a). The comparison of the nuclear protein-coding and mitogenome tree topologies showed that they are roughly concordant (Baker’s index = 0.915; cophenetic index = 0.987), except for disagreement in the respective relationships of clades A, D and E (Supplementary Fig. 3). The Shimodaira-Hasegawa test (SH test) showed that a high number of gene trees were significantly preferred over the species tree (P < 0.05), i.e. they conflict with the concatenated tree topology (Supplementary Table 4). This was further confirmed by gene and site-concordance factors²⁸ except for the nodes defining main clades, where bootstrap and concordance factors were in complete agreement (Fig. 3a). The relative positions at the base of the tree of the Ecuador (F) and Altiplano (B) clades have maximum bootstrap support but moderate gene and site-concordance factors, suggesting that for these nodes there is a genuinely discordant signal in the single-locus trees. All other nodes tend to have different gene (gCF) and site-concordance factors (sCF), often with low values, suggesting a limited phylogenetic signal with short tree branches in most gene trees, in particular at tip nodes. The species tree based on multispecies coalescent models (MSC) was similar to the concatenated ML tree (Fig. 3b). 
The multispecies coalescent analyses that included the mitochondrial PCGs as another linkage group added to the orthologous nuclear sequences produced an alternative tree topology with maximum posterior probabilities at all relevant nodes (Fig. 3c). This topology supports the reciprocal monophyly of North and South American Hyalella and within the latter, distinguishes between two well-defined species-groups: clade C as sister to the clade embracing A + B + D + E + F. Finally, the species tree obtained with ASTRAL differed from the former phylogenetic hypotheses at various relevant nodes (Supplementary Fig. 4).
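The idea behind the gene concordance factor used above can be illustrated with a minimal sketch. It rests on simplifying assumptions that are not taken from the study: trees are rooted nested tuples, every gene tree contains all taxa (so every gene tree is treated as decisive), and a branch is identified by the set of leaves below it. The toy taxa and trees are hypothetical, and published implementations work on unrooted trees with incomplete sampling.

```python
def clades(tree):
    """Return (leaf set, set of clades) for a rooted nested-tuple tree."""
    if not isinstance(tree, tuple):
        return frozenset([tree]), set()
    below = frozenset()
    found = set()
    for child in tree:
        child_leaves, child_clades = clades(child)
        below |= child_leaves
        found |= child_clades
    found.add(below)
    return below, found


def gene_concordance(reference, gene_trees):
    """Percentage of gene trees containing each non-root clade of the reference."""
    ref_leaves, ref_clades = clades(reference)
    gene_clades = [clades(g)[1] for g in gene_trees]
    result = {}
    for clade in ref_clades:
        if clade == ref_leaves:
            continue  # the root clade is trivially present in every tree
        hits = sum(clade in gc for gc in gene_clades)
        result[clade] = 100.0 * hits / len(gene_trees)
    return result


# Hypothetical four-taxon example (not data from the study).
reference = ((("A", "B"), "C"), "D")
gene_trees = [
    ((("A", "B"), "C"), "D"),
    (("A", "C"), ("B", "D")),
    (("A", "B"), ("C", "D")),
    ((("A", "B"), "C"), "D"),
]
gcf = gene_concordance(reference, gene_trees)
for clade in sorted(gcf, key=len):
    print(sorted(clade), gcf[clade])
```

A node can therefore have full bootstrap support in the concatenated analysis while only a modest share of individual gene trees recover it, which is the pattern described for clades F and B.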

Estimation of divergence times

Molecular clock analyses using mitogenomic data and Bayes Factors favoured the reciprocal monophyly of North and South American Hyalella, as shown in the robust MSC tree based on mitogenomes and single nuclear genes. The two independent calibration schemes applied rendered similar age estimates along the phylogeny, showing congruence with each other as deduced from the result of cross-validation analysis (Fig. 4 and Table 2). The mean time for the first divergence within the South American taxa was estimated at ca. 19 Ma (node 2 in Fig. 4; 95% HPD interval 13–25 Ma). It represents the divergence time of the common ancestor of clade C and the remaining clades. The average time of the initial divergence within the Most Recent Common Ancestor (MRCA) of the A, B, E and D lineages was estimated at 11.7 Ma (95% HPD interval 8–15 Ma using calibration 1) (node 6 in Fig. 4 and Table 2). For clades A, D and E the mean times of the first divergence were estimated to be almost synchronous, within a narrow interval of 2.1–2.8 Ma (nodes 9, 10 and 12), with 95% HPD intervals largely overlapping. Clade B was much younger (estimated age 0.4 Ma; node 11 in Fig. 4 and Table 2). These results suggest that species diversification of the different lineages in the Altiplano started mainly at the end of the Pliocene. The age of clade C is older (6.2–6.3 Ma; 95% HPD interval 4.4–8.4 Ma using calibration 1), although the age of the Altiplano species-group in this clade is concordant with that obtained in the other lineages (2.6 Ma; 95% HPD 1.8–3.5, not shown).

Table 2 Estimated ages and 95% highest posterior densities (HPD) for each node in the best mitochondrial tree according to Bayes Factor scores obtained in BEAST under the two alternative calibration schemes (MRCA of node A and tree root after Adamowicz et al.¹⁴; see text for details).

Discussion

Phylogenomics of the Andean Altiplano Hyalella

Nuclear and mitochondrial phylogenomic analyses strongly support the hypothesis that the Titicaca species-flock originated from multiple colonization events from independent Hyalella ancestral lineages, with clades not showing marked relationships to particular sub-basins or regions in the main Lake or adjacent water bodies. Previous results using a short mitochondrial gene fragment (cox1) on a broad South American sampling of the genus pointed to the occurrence of at least five major distant monophyletic clades in the Titicaca and other waterbodies in the northern Altiplano^14,15. Clades A, B, C and E were shown to be part of lineages displaying a wider South American distribution, while members of clade D were exclusive of the Altiplano¹⁵. The mitogenome phylogeny reported herein is fully compatible with the tree topology obtained based on the cox1 fragment, showing that the South American Hyalella species sampled are divided into two distant ancestral lineages. A highly supported clade comprises clade C^14,15, a lineage that includes the species examined from southern Chile (H. franciscae) plus two taxa from the northern Andean Altiplano (the heavily armoured Titicacan species H. armata, and another species with smooth body integument found at Laguna Súchez-Huaytire, on the western limit of the Altiplano). The other major lineage is composed of four northern Altiplano clades (clades A, B, D and E) plus a clade from the southern Ecuador highlands (clade F). The former clades have a wide distribution in the study area of Peru and Bolivia, as samples from each clade were collected in the two major basins of the Titicaca but also from peripheral lakes, lagoons and streams outside the main lake (Table 1; Fig. 2a and see Jurado-Rivera et al.¹⁵). The low sequence divergence, and the lack of resolution within clades A, B, D and E suggest the occurrence of rapid and recent diversification in the northern Altiplano.

The phylogenetic relationship of the South American clades with the North American H. azteca is ambiguous in the mitogenomic trees, as the AU test could not confidently reject two competing tree topologies: (1) North and South American Hyalella as reciprocally monophyletic (as in Adamowicz et al.¹⁴) versus (2) clade C sister to the remaining clades (including H. azteca). This ambiguity may be caused by the long tree branch leading to H. azteca, compared to the shorter branches displayed by the South American clades, and by a highly unbalanced representation of the North and South American Hyalella species in the phylogeny, as only the mitogenome of H. azteca is currently available for the North American lineages of the genus. The SSU and LSU ribosomal phylogenetic hypotheses were mostly consistent with the mitogenome-based tree concerning the main clades. However, less support was obtained for the interclade relationships (Fig. 2b). In particular, support for the relationship of clade B to the other clades was weak. Nevertheless, the mitogenome and nuclear ribosomal trees agreed on the monophyly of the node from which all clades derive with the exclusion of the distant clade C. Trees obtained using single-copy nuclear genes, either from concatenated data or applying the multispecies coalescent model, confirmed previous results. However, interclade relationships were still sensitive to the method used. A comparison of the mitochondrial and nuclear phylogenies showed that discrepancies were also concentrated at the tip nodes in clades A, E and D, likely reflecting shallow divergences and low phylogenetic signal at this level, in particular for highly conserved single-copy nuclear genes (Supplementary Fig. 3). 
The most robustly supported phylogeny resulted from applying the multispecies coalescent model to the dataset including the mitochondrial protein-coding genes as a single linkage group plus the single-copy nuclear gene sequences (a total of 45.6 kb of DNA sequence information). This topology supports North American (H. azteca) and South American Hyalella as reciprocally monophyletic, with clade C as sister to all other clades (Fig. 3c).

Divergence times and palaeohydrology

Calibration of the molecular clock by two independent methods provided similar estimates for the split of the North and South American Hyalella at around 25 Ma, with Andean Altiplano lineages (node 2) inferred to date back to the early Miocene (Fig. 4 and Table 2). The diversification of Altiplano species within clades is much more recent in comparison, with three lineages showing time frames consistent with the presumed formation of Titicaca (between 3 and 2 Ma^6,29). Thus, clades A, D and E seem to have diversified recently and relatively rapidly by intra-lacustrine diversification. Their estimated ages postdate the final uplift of the northern Andes during the late Pliocene or early Pleistocene (2–4 Ma), being coeval with palaeolake Mataro, ancestor of current Lake Titicaca^30,31,32.

The palaeohydrology of ancient lakes has played an essential role in the assembly of their endemic biota, with significant lake-level fluctuations dramatically changing the outline, chemistry and ecological conditions of these basins^3,6. The Titicaca has captured satellite lakes at higher altitudes in the Altiplano during major hydrological high stand phases of its geological history³². Extreme high water stands occurred during the Pleistocene (the above-mentioned palaeolake Mataro, c. 1.5–1.6 Ma), when the water level reached 140 m higher than at present, flooding most of the Altiplano³². Conversely, the Lake was significantly reduced as recently as ca. 90,000 years ago, when the water table presumably lowered to −240 m, resulting in a water column of only 45 m at the deepest portions of the Chucuito basin. Other lake-level regressions took place 12,000 years ago (−110 m) and between 8000 and 3600 years ago, when the lake level stood at −90 m relative to its present stand³². These cycles of expansion and retreat, with the consequent changes in water salinity, may have exerted a major impact on the speciation and diversification of the Titicacan fauna^4,6,13,14,15. The evolution of lacustrine species in the Altiplano may therefore have been governed by an intricate pattern of cycles of colonization followed by population expansions and intra-lacustrine diversification. In contrast, extinction and allopatric vicariance within sub-basins may have prevailed during regressions, with subsequent dispersal to different lakes and water streams of the Altiplano¹⁴.

Morphological convergence and replicated radiations

The mitogenomic phylogeny agrees with previous results in uncovering a striking discordance with morphological taxonomic determinations, suggesting a high incidence of morphological convergence even among distant lineages¹⁵. This convergence is particularly notable in generalist taxa with smooth body integument, which are polyphyletic as they show divergent mitogenomes placed in two or more distinct clades (e.g. in lineages A, B, C, D and E in the mitochondrial ML tree; see Fig. 2a). No significant association between morphology and nuclear phylogenetic relationships is detected either, suggesting that this pattern may fit better with replicated radiations than with reticulate evolution³³.

Habitat specialization and trophic regimes of Andean amphipods are mostly unknown. However, a considerable trophic overlap, omnivorous opportunist feeding habits and a high dispersal ability, such as those described elsewhere for the Eulimnogammarus amphipods of Lake Baikal³⁴, could explain the high degree of convergence of generalist morphotypes observed in the Titicacan Hyalella. In addition, armoured spiny morphologies have evolved independently multiple times in the Titicacan Hyalella¹⁵, a result confirmed here with phylogenomic data, as armoured body forms have episodically appeared in all Andean clades except clade F (Figs. 2a,b and 3a). Spiny morphologies are frequent in several marine amphipod families. However, they are rare among epigean continental water forms, except for members of the Lake Baikal species-flock, with at least four independent transitions from non-spiny to spiny forms and two reversals²¹. Spines are also known in Caspian gammaroids, and in members of the genus Fuxiana (Lake Fuxian, China)^35,36. Armoured spiny morphologies in the Titicaca have been related to the predation pressure exerted by the cyprinodontid killifish endemic to the lake^19,37. Thus, these morphologies can be envisaged as the result of a coevolutionary arms race with predators in replicated radiations, resulting in convergence rather than representing key-innovation traits.

Well-known cases of replicated radiations leading to morphological convergence occur in ancient lacustrine systems, such as the cichlid fishes of Africa's rift lakes that radiated in parallel^40,41 or the amphipod assemblage of Lake Baikal^21,42. Trophic phenotypes of phylogenetically distinct clades of cichlids endemic to different lakes are textbook examples of morphological convergence⁴³. This convergence derives from ecological opportunities and similar adaptive landscapes facilitating the evolution of similar suites of ecomorphs despite independent evolutionary histories⁴⁴. Ecological opportunity, lack of competition and open habitats are factors that have been related to rapid diversification on islands and island-like systems such as lakes and mountains^45,46. The ecological opportunities offered by the emergence of island-like habitats have been suggested as the leading cause of rapid diversification of Lupinus and other plant genera after the Andean uplift, with diversification rates not dissimilar to cichlid fish radiations in east African lakes⁴⁷. Empty lacustrine habitats may have been repeatedly colonized by South American Hyalella lineages after the Andean uplift and the formation of endorheic basins in the area, with the more successful generalist morphotypes rapidly diversifying in parallel.

Concluding remarks

In summary, our results show that Andean amphipods derive from older South American lineages and have experienced recent diversification episodes linked to the uplift of the northern Andean Altiplano and the formation of high-altitude endorheic basins, which have undergone repeated cycles of expansion and retreat. The lack of resolution of the phylogenetic relationships among the different Altiplano Hyalella clades, the very shallow divergences displayed within most of the clades, and the discordance between gene trees and morphology-based species denominations caused by the convergence of body forms point to a diversification driven by the ecological opportunities offered by the island-like lacustrine habitats established after the Andean uplift. These results, added to recent developments in the study of evolutionary patterns of endemic Titicacan gastropods and fish^4,6,13, establish an emergent study system for understanding species diversification.

Methods

Taxon sampling

We selected samples as a set of crucial representative taxa based on a cox1 species delimitation analysis of material from Lake Titicaca and nearby high-altitude water bodies in Peru and Bolivia (see Jurado-Rivera et al.¹⁵). We also included congeneric taxa from southern Ecuador (high-altitude lakes at El Cajas Massif; Azuay) and southernmost Chile (Madre de Dios Island; Magallanes Region) for comparative purposes. A total of 35 Hyalella specimens were used for this study, of which 19 were collected in the Titicaca, 12 at other surrounding water bodies in the Altiplano, three from the Ecuador Andes and one from the Austral region of Chile (Table 1). Thirteen of the specimens fall within clade A of Adamowicz et al.¹⁴ and include four of the molecular operational taxonomic units delimited in Jurado-Rivera et al.¹⁵; another seven fall in clade D (representing two delimited spp.); seven in clade E (from four delimited spp.); three in clade C (MOTUs C1 and C2); and two in clade B (representing the delimited MOTU B1) (Table 1). Samples were collected using a hand-held plankton net or with a small dredge operated from the lakeshore or a boat. Specimens were preserved in the field in 96% ethanol immediately after collection. We also included in the analyses the mitogenomes of a Titicaca species reported in a previous study⁴⁸ and of H. azteca, retrieved from the genome project of this species (Bioproject accession PRJNA243935 Hyalella azteca U.S. Lab Strain⁴⁹).

Mitochondrial genomes

Genomic DNA from each sample was purified from a single specimen using the Qiagen DNeasy Blood & Tissue kit (Qiagen, Hilden, Germany) following the manufacturer's instructions, and RNA was removed using 60 μg of RNAse A solution (Promega, Madison, WI, USA). Individual shotgun genomic libraries were constructed using the Hyper Library construction kit from Kapa Biosystems (Wilmington, Massachusetts, USA) from 100–500 ng of genomic DNA, with fragments of about 480 bp indexed with Illumina TruSeq adapters. After quantification by qPCR, up to 13 libraries, each corresponding to a particular sample/species, were pooled in equimolar concentrations and paired-end sequenced (2 × 150 bp) in a single lane of Illumina HiSeq2500. Fastq files were demultiplexed with bcl2fastq software (Illumina), and adapter sequences and low-quality bases (< Q30) were removed in Trimmomatic v0.33⁵⁰. We assembled both paired and unpaired clean reads in SPAdes (v3.13)⁵¹ using three kmers (21, 35 and 47 nucleotides) to maximize assembly yield. We compared the contig sequences obtained in the previous step using BLASTn (e-value 30) against the mitogenome of H. lucifugax (ENA acc. number LT594767) as a reference to filter contigs containing mitochondrial sequences. The completion of the mitochondrial contigs was assessed using the script circularizationCheck.py (mitoMaker)⁵² (script available from https://github.com/RemiAllio/MitoFinder/blob/master/circularizationCheck.py), with contigs extended by mapping reads during several iteration steps in GENEIOUS 11.1.5 when needed⁵³. The mitogenomes were annotated using the MITOS2 webserver⁵⁴ and genes were manually curated in GENEIOUS, particularly at the 5′ and 3′ ends.
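The contig-filtering step described above could be sketched as follows. This is only an illustration under stated assumptions: it presumes that the BLASTn search was saved in the default 12-column tabular format (`-outfmt 6`), with contigs as queries and the reference mitogenome as subject, and the function name `mito_contigs`, the example rows and the threshold passed to it are all hypothetical rather than the values used in the study.

```python
import csv
import io


def mito_contigs(blast_tsv: str, max_evalue: float) -> list[str]:
    """Return ids of contigs with at least one hit at or below max_evalue.

    Assumes default BLAST tabular output, where column 1 is the query id
    and column 11 is the e-value.
    """
    keep = []
    seen = set()
    for row in csv.reader(io.StringIO(blast_tsv), delimiter="\t"):
        if len(row) < 12:
            continue  # skip blank or malformed lines
        contig, evalue = row[0], float(row[10])
        if evalue <= max_evalue and contig not in seen:
            seen.add(contig)
            keep.append(contig)
    return keep


# Made-up hits for illustration only.
example = "\n".join([
    "NODE_1\tLT594767\t88.2\t950\t100\t4\t1\t950\t2001\t2950\t1e-120\t420",
    "NODE_2\tLT594767\t75.0\t60\t15\t0\t5\t64\t300\t359\t0.002\t38.1",
    "NODE_1\tLT594767\t85.0\t400\t55\t2\t1000\t1399\t5000\t5399\t3e-80\t290",
])
print(mito_contigs(example, max_evalue=1e-10))
```

Contigs retained this way would still need the circularization check and manual curation described in the text.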

Ribosomal DNA sequences and nuclear single-copy orthologous genes

We used a BLASTn approach similar to that described above to retrieve the contigs from each library matching the nuclear SSU and LSU ribosomal sequences of H. azteca⁴⁹ and Drosophila melanogaster (GenBank acc. number M21017). To search for Hyalella single-copy nuclear orthologous genes we used Orthofinder v2.3.11⁵⁵ to explore the contig sequences of each genome library and identify orthologous groups. First, all possible Open Reading Frames (ORFs) with a minimum length of 225 bp between stop codons on both strands of each contig sequence were identified for each Hyalella library using getorf (EMBOSS v6.6.0.0⁵⁶). The sequences obtained were then translated into protein, and the Hyalella specimen-specific protein-sequence files were used as input for searching in Orthofinder using the dendroblast and diamond options. Subsequent analyses were based on the DNA version of the protein sequences found, and the results for one randomly chosen species were used to perform similarity searches against the H. azteca RefSeq genome database⁴⁹ using tblastn to retrieve the corresponding orthologous sequences of the congeneric reference taxon.
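The ORF-detection criterion above (stretches of at least 225 bp between stop codons, on both strands) can be illustrated with a small stand-alone sketch. It is not getorf itself: it assumes the standard genetic code, unambiguous A/C/G/T input, and that the ends of a contig count as ORF boundaries; the function names are hypothetical.

```python
STOPS = {"TAA", "TAG", "TGA"}  # standard genetic code (an assumption)
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    return seq.translate(COMPLEMENT)[::-1]


def stop_to_stop_orfs(seq: str, min_len: int = 225) -> list[tuple[str, int, str]]:
    """List (strand, frame, orf) for stop-free stretches of >= min_len nt."""
    orfs = []
    forward = seq.upper()
    for strand, s in (("+", forward), ("-", reverse_complement(forward))):
        for frame in range(3):
            current = []
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if codon in STOPS:
                    if 3 * len(current) >= min_len:
                        orfs.append((strand, frame, "".join(current)))
                    current = []
                else:
                    current.append(codon)
            # a stretch running to the end of the contig is also reported
            if 3 * len(current) >= min_len:
                orfs.append((strand, frame, "".join(current)))
    return orfs


# Toy contig: an 80-codon stop-free stretch flanked by stop codons.
toy = "TAA" + "GCT" * 80 + "TAG"
for strand, frame, orf in stop_to_stop_orfs(toy):
    print(strand, frame, len(orf))
```

Because every stop-free stretch in all six frames is reported, most candidates are not real genes; in the workflow described above it is the orthogroup search that selects the informative ones.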

Phylogenetic analyses of mitochondrial genomes

Individual mitochondrial protein-coding genes (PCGs) were aligned at the amino-acid level using MUSCLE⁵⁷, with the corresponding DNA sequences concatenated using Phyutility⁵⁸. The best partitioning scheme, starting from a maximum of 39 possible partitions (i.e. splitting by 13 PCGs and the three codon positions), and the best-fitting nucleotide substitution models for each partition were estimated in IQ-TREE v1.6.10^59,60. The partitioning scheme selected consisted of subdividing by codon position, with the exception that atp8 2nd sites were included in the 1st position partition. The alternative, simpler hypothesis of including atp8 2nd sites in the corresponding 2nd position partition retrieved a marginally larger BIC value (258,833.3737), so this option was selected for all subsequent analyses. GTR + I + G4 was selected as the best substitution model for each mitogenome partition. The total cophenetic index of the phylogenetic tree obtained under the best partitioning scheme indicated that the tree is balanced (not asymmetrical; TCI = 2441, from a possible value range from 619 to 9139). For analyses in IQ-TREE we applied the parameter-rich “edge-unlinked partition model”, i.e. each partition has its own set of branch lengths, thus accounting for the possibility of a non-constant evolutionary rate through time for particular nucleotide positions (heterotachy). Nucleotide substitution saturation was explored using the Xia test in DAMBE6⁶¹, and the balance of the phylogenetic tree obtained under the best partitioning scheme was checked with the total cophenetic index⁶². Maximum Likelihood (ML) and Bayesian analyses were performed in IQ-TREE and MrBayes v3.2⁶³, respectively. We also explored the implementation of codon-based substitution models⁶⁴ through the analysis of the dataset at the protein level and using mixed models to accommodate data heterogeneity⁶³.
The mitogenomes of the distant amphipods Parhyale hawaiensis (NC_039402), Platorchestia japonica (MG010370) and Platorchestia parapacifica (MG010371) were included in the analyses, and the first of these species was used to root the trees. Resulting ML topologies were compared with the Approximately Unbiased Test of Phylogenetic Tree Selection as implemented in IQ-TREE.
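
The total cophenetic index used above as a balance measure is the sum, over all pairs of leaves, of the depth of their most recent common ancestor (with the root at depth 0). A minimal sketch follows, assuming trees are written as nested Python tuples with strings as leaves; this representation and the function names are illustrative choices, not part of any cited software.

```python
def total_cophenetic_index(tree):
    """Total cophenetic index of a rooted tree given as nested tuples."""

    def walk(node, depth):
        # Returns (number of leaves below node, index contribution below node).
        if not isinstance(node, tuple):
            return 1, 0
        sizes = []
        total = 0
        for child in node:
            n_leaves, sub_total = walk(child, depth + 1)
            sizes.append(n_leaves)
            total += sub_total
        n_here = sum(sizes)
        # Leaf pairs that have this node as their most recent common ancestor.
        pairs_here = (n_here * n_here - sum(s * s for s in sizes)) // 2
        return n_here, total + depth * pairs_here

    return walk(tree, 0)[1]


def caterpillar(n):
    """Fully asymmetrical (comb-like) rooted binary tree with n >= 2 leaves."""
    tree = ("t1", "t2")
    for i in range(3, n + 1):
        tree = (f"t{i}", tree)
    return tree


balanced = (("a", "b"), ("c", "d"))
tci_balanced = total_cophenetic_index(balanced)
tci_comb_39 = total_cophenetic_index(caterpillar(39))
```

For a caterpillar tree with n leaves the index equals the binomial coefficient C(n, 3); with 39 leaves this is 9139, the upper bound of the range quoted above.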

Phylogenetic analyses of nuclear ribosomal DNA sequences

We investigated the impact of considering ribosomal SSU and LSU secondary structures on both sequence alignments and phylogenetic hypotheses. First, we aligned Hyalella and outgroup sequences in MAFFT v7.450⁶⁵ using the default FFT-NS-1 algorithm (i.e., without considering secondary structures). In other analyses, the secondary structures of the two nuclear ribosomal RNAs of Anopheles albimanus (L78065) were used as guides in RNAsalsa v0.8.1⁶⁶ to define stem and loop regions at the highest stringency (i.e. 1.00), thus ensuring that only conserved loops were included in the alignment. We also analyzed ribosomal sequences partitioned independently as stems and loops to assess bias in nucleotide composition and substitution rates⁶⁷. The doublet model, which assumes that paired bases in RNA secondary structures are not phylogenetically independent, was also explored in MrBayes. Finally, to assess the impact of both poorly aligned positions and highly divergent regions on the phylogenetic signal, we either excluded gapped positions using Gblocks v0.91b under default parameters⁶⁸ or, alternatively, coded gaps as binary characters irrespective of their length⁶⁹ in SeqState v1.4⁷⁰. BIC scores indicated that a single partition merging the LSU and SSU sequences was preferred over other schemes such as considering the two genes as different partitions or partitioning by stems and loops, so the two ribosomal markers were concatenated for downstream phylogenetic analyses. ML and Bayesian analyses were performed as described above by implementing the substitution model selected (GTR + I + G4). The SSU and LSU sequences of Parhyale hawaiensis (obtained from its genome project, BioProject accession PRJNA306836⁷¹) and the SSU of Platorchestia japonica (EF582936.1) were used as outgroups. The A. albimanus guide sequences were removed from the final alignment.

Phylogenetic analyses of nuclear single-copy orthologous genes

The orthologous gene regions sequenced in all samples were aligned using MAFFT, with poorly aligned and gappy regions subsequently removed using the gappyout algorithm in trimAl v1.4⁷². Sliding windows of six bp were applied to identify regions showing divergences higher than 1.5× the average across the sequences using R and Perl scripts (available from https://github.com/brunonevado/trimming)⁷³. These regions were treated as missing data. Furthermore, sequences shorter than 100 bp were excluded from the analysis after alignment. Individual ortholog alignments were concatenated in Phyutility and an ML phylogeny was obtained using IQ-TREE, implementing the best partition scheme (1st + 2nd codon sites GTR + I + G; 3rd sites GTR + G) with 1000 fast bootstrap support replicates. To explore congruence among individual gene trees and the concatenated alignment tree topology we performed the Shimodaira-Hasegawa test (SH test)⁷⁴ using IQ-TREE v1.6.10. In addition, gene- and site-concordance factors²⁸, which give the proportion of genes (gCF) or nucleotide sites (sCF) that are concordant with any particular branch in the supermatrix (concatenated) tree, were computed using IQ-TREE v2.0.5⁶⁰. As concatenation of genomic DNA sequence data has been shown to be prone to systematic biases⁷⁵, we also inferred the species trees based on the multispecies coalescent model (MSC). The StarBeast2 package⁷⁶ in BEAST2 v2.6⁷⁷ was used to implement the MSC using the 76 orthologous sequences. For this analysis, the 37 terminals were classified into 18 putative species following the results obtained in a previous molecular delimitation study using mitochondrial cox1 sequences¹⁵. Bayesian analyses were run for 2 × 10⁹ generations sampling every 5000 using a single partition with a GTR + G model and the nucleotide substitution rate estimated from geological time constraints (see below).
A strict clock was implemented for gene trees and a relaxed uncorrelated log-normal distribution prior for the species tree. The convergence of the runs and node age densities were assessed in Tracer v1.7.0. Species tree, gene trees, posterior probabilities and other parameters were estimated with the TreeAnnotator package included in the BEAST2 distribution, discarding the first 10% as burn-in. A second MSC analysis included the mitochondrial protein-coding genes as a single “gene” added to the 76 orthologous nuclear sequences, assuming a ploidy value of 0.5 and an appropriate model and substitution rate for the mitochondrial sequences. We also estimated the nuclear gene species tree in agreement with the largest number of quartet trees within a set of unrooted gene trees with ASTRAL-III v5.7.3⁷⁸. Comparison of mitochondrial and nuclear single-copy trees was based on Baker’s Gamma correlation coefficient⁷⁹, a measure of association between two dendrograms. To calculate the statistical significance of the index we performed a permutation test (100 replicates) by randomly shuffling the labels of one of the trees and calculating the distribution under the null hypothesis of fixed tree topologies.
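
The label-shuffling logic of the permutation test can be sketched as follows. Note the substitution: instead of Baker's Gamma, this illustration uses the Pearson correlation between the depths of the most recent common ancestor of every leaf pair in the two trees, which is simpler to compute but is not the index used in the study. Trees are assumed to be nested tuples with identical leaf sets, and all function names are hypothetical.

```python
import random
from itertools import combinations


def mrca_depths(tree):
    """Map each unordered leaf pair to the depth of its common ancestor."""
    depths = {}

    def walk(node, depth):
        if not isinstance(node, tuple):
            return [node]
        groups = [walk(child, depth + 1) for child in node]
        for g1, g2 in combinations(groups, 2):
            for a in g1:
                for b in g2:
                    depths[frozenset((a, b))] = depth
        return [leaf for group in groups for leaf in group]

    leaves = sorted(walk(tree, 0))
    return depths, leaves


def pearson(xs, ys):
    """Pearson correlation; assumes neither list is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


def association(depths_a, depths_b, leaves, relabel):
    """Correlation of pairwise depths after relabelling the leaves of tree B."""
    pairs = list(combinations(leaves, 2))
    xs = [depths_a[frozenset(p)] for p in pairs]
    ys = [depths_b[frozenset((relabel[p[0]], relabel[p[1]]))] for p in pairs]
    return pearson(xs, ys)


def permutation_test(tree_a, tree_b, replicates=100, seed=0):
    """Return (observed association, one-sided permutation p-value)."""
    depths_a, leaves = mrca_depths(tree_a)
    depths_b, leaves_b = mrca_depths(tree_b)
    if leaves != leaves_b:
        raise ValueError("both trees must have the same leaf labels")
    identity = {leaf: leaf for leaf in leaves}
    observed = association(depths_a, depths_b, leaves, identity)
    rng = random.Random(seed)
    at_least_as_high = 0
    for _ in range(replicates):
        shuffled = leaves[:]
        rng.shuffle(shuffled)  # topology stays fixed, only labels move
        relabel = dict(zip(leaves, shuffled))
        if association(depths_a, depths_b, leaves, relabel) >= observed - 1e-12:
            at_least_as_high += 1
    p_value = (at_least_as_high + 1) / (replicates + 1)
    return observed, p_value
```

Adding one to both numerator and denominator counts the observed labelling as one of the permutations, so with 100 replicates the smallest attainable p-value is 1/101.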

Estimation of divergence times

Molecular dating analyses were performed on the mitochondrial nucleotide sequence dataset after removing the divergent sequences of the outgroup P. hawaiensis and the two Platorchestia species. Topologies of the trees, model parameter values and node ages were co-estimated and optimized in BEAST v1.10.4⁸⁰. Bayesian analyses were run for 100 million generations sampling every 10,000 generations. The convergence of the runs and node age densities were assessed in Tracer v1.7.0, and the tree topology and parameters were summarized in TreeAnnotator. Clock models (strict vs uncorrelated log-normal), diversification models (Yule vs Birth–Death), and alternative tree topologies were compared based on Bayes Factors calculated with marginal likelihood values estimated from path-sampling analyses in BEAST (40 steps with 5 million generations each). As the reciprocal monophyly of North and South American Hyalella was not resolved in the previous analyses, we used Bayes Factors to contrast the two alternative hypotheses (i.e. H. azteca or South American clade C at the base of the tree). This analysis favoured the former hypothesis with moderate to strong support (Bayes Factor = 11.438). Consequently, for estimating Hyalella divergence times based on the mitogenome dataset in BEAST we used this tree topology as a constraint, with model parameter values and node ages optimized and co-estimated in the analysis from the data. A second Bayes Factor analysis selected a relaxed clock with an uncorrelated log-normal distribution even when the distant amphipod outgroups were excluded from the mitochondrial dataset (Bayes Factor = 55.8). The ages of two nodes were implemented separately as constraints to calibrate the relaxed molecular clock and compared by cross-validation. Each calibration point was specified as a log-normal distribution prior with 95% confidence intervals.
Firstly, we assumed that the MRCA of clade A (a species group that likely derives from an intra-lacustrine diversification^14,15) cannot precede the formation of the paleolake that gave rise to the present Lake Titicaca. The oldest and highest of the paleolakes in the Altiplano (Lake Mataro) lies just above an ash bed with an estimated age of 2.8 ± 0.4 Ma³⁰. This age estimate and its confidence limits were used as the age constraint for clade A. Alternatively, the age of the root of the tree was defined based on the cox1 rate (0.0189 nucleotide substitutions per site per million years) obtained by Adamowicz et al.¹⁴ (20.63 Ma with a 95% confidence interval of 15.615–27.086 Ma). Mean values and confidence intervals of parameters and ages were estimated discarding 10% of the run as burn-in in both calibrations.
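
How a calibration interval translates into log-normal parameters can be shown with a short sketch. It assumes the quoted 95% interval is the central, equal-tailed interval of an unshifted log-normal distribution; the parameterisation actually entered in BEAST (for example an offset, or a mean specified in real space) is not stated here and may differ. The function name is hypothetical.

```python
from math import exp, log
from statistics import NormalDist


def lognormal_from_interval(lower, upper, coverage=0.95):
    """Return (mu, sigma) of ln(age) so that [lower, upper] is the central interval."""
    z = NormalDist().inv_cdf(0.5 + coverage / 2)  # about 1.96 for 95%
    mu = (log(lower) + log(upper)) / 2
    sigma = (log(upper) - log(lower)) / (2 * z)
    return mu, sigma


# Root-age interval quoted in the text (Ma).
mu, sigma = lognormal_from_interval(15.615, 27.086)
median_age = exp(mu)                  # geometric midpoint of the interval
mean_age = exp(mu + sigma ** 2 / 2)   # mean of the log-normal distribution
```

Under these assumptions the median is the geometric midpoint of the two bounds, and the mean lies slightly above it because the distribution is right-skewed.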

Data availability

The DNA sequences generated during the current study are available in GenBank with the following accession numbers: MT672015-MT672049 (mitochondrial genomes), MT823207-MT823242 (small subunit nuclear ribosomal sequences), MW047111-MW047146 (large subunit ribosomal nuclear sequences) and MW234509-MW237207; MW039618-MW039653 (single-copy orthologous nuclear sequences). The phylogenetic trees and DNA sequence alignments obtained in this study are available in nexus format in the GitHub repository (https://github.com/Frazapel/Hyalella-amphipod-Phylogenomics).

References

Cristescu, M. E., Adamowicz, S. J., Vaillant, J. J. & Haffner, D. G. Ancient lakes revisited: from the ecology to the genetics of speciation. Mol. Ecol. 19, 4837–4851 (2010).
Sturmbauer, C., Husemann, M. & Danley, P. D. Explosive speciation and adaptive radiation of East African cichlid fishes. In Biodiversity Hotspots (eds Zachos, F. & Habel, J.) 333–362 (Springer, Berlin Heidelberg, 2011).
Miura, O., Urabe, M., Nishimura, T., Nakai, K. & Chiba, S. Recent lake expansion triggered the adaptive radiation of freshwater snails in the ancient Lake Biwa. Evol. Lett. 3, 43–54 (2019).
Wolff, C., Albrecht, C. & Wilke, T. Recovery from interglacial-related bottleneck likely triggered diversification of Lake Titicaca gastropod species flock. J. Great Lakes Res. 46, 1199–1206 (2019).
Dejoux, C. & Iltis, A. Lake Titicaca: A Synthesis of Limnological Knowledge (Springer, Berlin, 1992).
Kroll, O. et al. The endemic gastropod fauna of Lake Titicaca: correlation between molecular evolution and hydrographic history. Ecol. Evol. 2, 1517–1530 (2012).
Kar, N. et al. Rapid regional surface uplift of the northern Altiplano plateau revealed by multiproxy paleoclimate reconstruction. Earth Planet. Sci. Lett. 447, 33–47 (2016).
Fornari, M., Risacher, F. & Féraud, G. Dating of paleolakes in the central Altiplano of Bolivia. Palaeogeogr. Palaeoclimatol. Palaeoecol. 172, 269–282 (2001).
Baker, P. A. & Fritz, S. C. Nature and causes of Quaternary climate variation of tropical South America. Quat. Sci. Rev. 124, 31–47 (2015).
Nunnery, J. A., Fritz, S. C., Baker, P. A. & Salenbien, W. Lake-level variability in Salar de Coipasa, Bolivia during the past ∼40,000 yr. Quat. Res. 91, 881–891 (2019).
Fritz, S. C., Baker, P. A., Tapia, P., Spanbauer, T. & Westover, K. Evolution of the Lake Titicaca basin and its diatom flora over the last ~370,000 years. Palaeogeogr. Palaeoclimatol. Palaeoecol. 317–318, 93–103 (2012).
Mourguiart, P. Historical changes in the environment of Lake Titicaca: evidence from ostracod ecology and evolution. Adv. Ecol. Res. 31, 497–520 (2000).
Takahashi, T. & Moreno, E. A RAD-based phylogenetics for Orestias fishes from Lake Titicaca. Mol. Phylogenet. Evol. 93, 307–317 (2015).
Adamowicz, S. J. et al. The Hyalella (Crustacea: Amphipoda) species cloud of the ancient Lake Titicaca originated from multiple colonizations. Mol. Phylogenet. Evol. 125, 232–242 (2018).
Jurado-Rivera, J. A., Zapelloni, F., Pons, J., Juan, C. & Jaume, D. Morphological and molecular species boundaries in the Hyalella species flock of Lake Titicaca (Crustacea: Amphipoda). Contrib. Zool. 89, 353–372 (2020).
Väinölä, R. et al. Global diversity of amphipods (Amphipoda; Crustacea) in freshwater. Hydrobiologia 595, 241–255 (2008).
González, E. R. & Watling, L. Three new species of Hyalella from Chile (Crustacea: Amphipoda: Hyalellidae). Hydrobiologia 464, 175–199 (2001).
González, E. R. & Watling, L. Two new species of Hyalella from Lake Titicaca, and redescriptions of four others in the genus (Crustacea: Amphipoda). Hydrobiologia 497, 181–204 (2003).
González, E. R. & Coleman, C. O. Hyalella armata (Crustacea, Amphipoda, Hyalellidae) and the description of a related new species from Lake Titicaca. Org. Divers. Evol. 2, 271–273 (2002).
Coleman, C. O. & Gonzalez, E. R. New hyalellids (Crustacea, Amphipoda, Hyalellidae) from Lake Titicaca. Org. Divers. Evol. 6, 218–219 (2006).
Naumenko, S. A. et al. Transcriptome-based phylogeny of endemic Lake Baikal amphipod species flock: fast speciation accompanied by frequent episodes of positive selection. Mol. Ecol. 26, 536–553 (2017).
Gante, H. F. et al. Genomics of speciation and introgression in Princess cichlid fishes from Lake Tanganyika. Mol. Ecol. 25, 6143–6161 (2016).
Pinho, C., Cardoso, V. & Hey, J. A population genetic assessment of taxonomic species: the case of Lake Malawi cichlid fishes. Mol. Ecol. Resour. 19, 1164–1180 (2019).
Jurado-Rivera, J. A. et al. Phylogenetic evidence that both ancient vicariance and dispersal have contributed to the biogeographic patterns of anchialine cave shrimps. Sci. Rep. 7, 2852 (2017).
Galtier, N., Nabholz, B., Glémin, S. & Hurst, G. D. D. Mitochondrial DNA as a marker of molecular diversity: a reappraisal. Mol. Ecol. 18, 4541–4550 (2009).
Allio, R., Donega, S., Galtier, N. & Nabholz, B. Large variation in the ratio of mitochondrial to nuclear mutation rate across animals: implications for genetic diversity and the use of mitochondrial DNA as a molecular marker. Mol. Biol. Evol. 34, 2762–2772 (2017).
Emms, D. M. & Kelly, S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 20, 1–14 (2019).
Minh, B. Q. et al. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 37, 1530–1534 (2020).
Lavenu, A. Origins. In Lake Titicaca (eds Dejoux, C. & Iltis, A.) 405–448 (Springer, Berlin, 1992).
Blodgett, T. A., Isacks, B. L. & Lenters, J. D. Constraints on the origin of paleolake expansions in the central Andes. Earth Interact. 1, 1–28 (1997).
Gregory-Wodzicki, K. M. Uplift history of the Central and Northern Andes: a review. Bull. Geol. Soc. Am. 112, 1091–1105 (2000).
Blard, P. H. et al. Lake highstands on the Altiplano (Tropical Andes) contemporaneous with Heinrich 1 and the Younger Dryas: new insights from 14C, U-Th dating and δ18O of carbonates. Quat. Sci. Rev. 30, 3973–3989 (2011).
Losos, J. B. & Ricklefs, R. E. Adaptation and diversification on islands. Nature 457, 830–836 (2009).
Morino, H., Kamaltynov, R. M., Nakai, K. & Mashiko, K. Phenetic analysis, trophic specialization and habitat partitioning in the Baikal amphipod genus Eulimnogammarus (Crustacea). Adv. Ecol. Res. 31, 355–376 (2000).
Sars, G. O. Crustacea Caspia. Contributions to the knowledge of the carcinological fauna of the Caspian Sea. Part III. Amphipoda.e. Bull. Acad. Imp. Sci. Saint Pétersbourg. 1, 179–242 (1894).
Sket, B. Fuxiana yangi g.n., sp.n. (Crustacea: Amphipoda), a “Baikaloid” amphipod from the depths of Fuxian Hu, an ancient lake in the karst of Yunnan, China. Arch. Hydrobiol. 147, 241–255 (1999).
Lauzanne, L., Loubens, G. & Osorio, F. Fish fauna. In Lake Titicaca (eds Dejoux, C. & Iltis, A.) 405–448 (Springer, Berlin, 1992). https://doi.org/10.1007/978-94-011-2406-5_10
Kocher, T. D., Conroy, J. A., McKaye, K. R. & Stauffer, J. R. Similar morphologies of cichlid fish in Lakes Tanganyika and Malawi are due to convergence. Mol. Phylogenet. Evol. 2, 158–165 (1993).
Kocher, T. D. Adaptive evolution and explosive speciation: the cichlid fish model. Nat. Rev. Genet. 5, 288–298 (2004).
Meyer, A., Kocher, T. D., Basasibwaki, P. & Wilson, A. C. Monophyletic origin of Lake Victoria cichlid fishes suggested by mitochondrial DNA sequences. Nature 347, 550–553 (1990).
Rüber, L. & Adams, D. C. Evolutionary convergence of body shape and trophic morphology in cichlids from Lake Tanganyika. J. Evol. Biol. 14, 325–332 (2001).
Macdonald, K. S., Yampolsky, L. & Duffy, J. E. Molecular and morphological evolution of the amphipod radiation of Lake Baikal. Mol. Phylogenet. Evol. 35, 323–343 (2005).
Hulsey, C. D. Function of a key morphological innovation: fusion of the cichlid pharyngeal jaw. Proc. R. Soc. B Biol. Sci. 273, 669–675 (2006).
Burress, E. D. et al. Island- and lake-like parallel adaptive radiations replicated in rivers. Proc. R. Soc. B Biol. Sci. 285, 20171762 (2018).
Schluter, D. The Ecology of Adaptive Radiation (Oxford University Press, Oxford, 2000).
Gavrilets, S. & Vose, A. Dynamic patterns of adaptive radiation. Proc. Natl. Acad. Sci. U.S.A. 102, 18040–18045 (2005).
Hughes, C. & Eastwood, R. Island radiation on a continental scale: exceptional rates of plant diversification after uplift of the Andes. Proc. Natl. Acad. Sci. U.S.A. 103, 10334–10339 (2006).
Juan, C. et al. The mitogenome of the amphipod Hyalella lucifugax (Crustacea) and its phylogenetic placement. Mitochondrial DNA Part B 1, 755–756 (2016).
Poynton, H. C. et al. The toxicogenome of Hyalella azteca: a model for sediment ecotoxicology and evolutionary toxicology. Environ. Sci. Technol. 52, 6009–6022 (2018).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Bankevich, A. et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 19, 455–477 (2012).
Schomaker-Bastos, A. & Prosdocimi, F. mitoMaker: A Pipeline for Automatic Assembly and Annotation of Animal Mitochondria Using Raw NGS Data. 1–10 (2018) https://doi.org/10.20944/preprints201808.0423.v1.
Geneious | Bioinformatics Software for Sequence Data Analysis.
Bernt, M. et al. MITOS: Improved de novo metazoan mitochondrial genome annotation. Mol. Phylogenet. Evol. 69, 313–319 (2013).
Emms, D. M. & Kelly, S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 20, 238 (2019).
Rice, P., Longden, L. & Bleasby, A. EMBOSS: The European molecular biology open software suite. Trends Genet. 16, 276–277 (2000).
Edgar, R. C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 1792–1797 (2004).
Smith, S. A. & Dunn, C. W. Phyutility: a phyloinformatics tool for trees, alignments and molecular data. Bioinformatics 24, 715–716 (2008).
Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K. F., von Haeseler, A. & Jermiin, L. S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat. Methods 14, 587–589 (2017).
Nguyen, L.-T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Xia, X. DAMBE6: New tools for microbial genomics, phylogenetics, and molecular evolution. J. Hered. 108, 431–437 (2017).
Mir, A., Rosselló, F. & Rotger, L. A. A new balance index for phylogenetic trees. Math. Biosci. 241, 125–136 (2013).
Ronquist, F. & Huelsenbeck, J. P. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19, 1572–1574 (2003).
Yang, Z., Nielsen, R. & Hasegawa, M. Models of amino acid substitution and applications to mitochondrial protein evolution. Mol. Biol. Evol. 15, 1600–1611 (1998).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Stocsits, R. R., Letsch, H., Hertel, J., Misof, B. & Stadler, P. F. Accurate and efficient reconstruction of deep phylogenies from structured RNAs. Nucleic Acids Res. 37, 6184–6193 (2009).
Schöniger, M. & Von Haeseler, A. A stochastic model for the evolution of autocorrelated DNA sequences. Mol. Phylogenet. Evol. 3, 240–247 (1994).
Talavera, G. & Castresana, J. Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst. Biol. 56, 564–577 (2007).
Simmons, M. P. & Ochoterena, H. Gaps as characters in sequence-based phylogenetic analyses. Syst. Biol. 49, 369–381 (2000).
Müller, K. SeqState: primer design and sequence statistics for phylogenetic DNA datasets. Appl. Bioinform. 4, 65–69 (2005).
Kao, D. et al. The genome of the crustacean Parhyale hawaiensis, a model for animal development, regeneration, immunity and lignocellulose digestion. Elife 5, e20062 (2016).
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009).
Nevado, B., Wong, E. L. Y., Osborne, O. G. & Filatov, D. A. Adaptive evolution is common in rapid evolutionary radiations. Curr. Biol. 29, 3081-3086.e5 (2019).
Shimodaira, H. & Hasegawa, M. Multiple comparisons of log-likelihoods with applications to phylogenetic inference. Mol. Biol. Evol. 16, 1114–1116 (1999).
Gadagkar, S. R., Rosenberg, M. S. & Kumar, S. Inferring species phylogenies from multiple genes: concatenated sequence tree versus consensus gene tree. J. Exp. Zool. Part B Mol. Dev. Evol. 304, 64–74 (2005).
Heled, J. & Drummond, A. J. Bayesian inference of species trees from multilocus data. Mol. Biol. Evol. 27, 570–580 (2010).
Bouckaert, R. et al. BEAST 2: a software platform for Bayesian evolutionary analysis. PLoS Comput. Biol. 10, e1003537 (2014).
Zhang, C., Rabiee, M., Sayyari, E. & Mirarab, S. ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees. BMC Bioinform. 19, 153 (2018).
Baker, F. B. Stability of two hierarchical grouping techniques case 1: sensitivity to data errors. J. Am. Stat. Assoc. 69, 440 (1974).
Suchard, M. A. et al. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol. 4, vey016 (2018).

Acknowledgements

We thank Oliver Kroll for providing specimens and Christian Albrecht, Tom Wilke, Geoff Boxshall, Franck Bréhier and Edmundo Moreno for access to samples and helpful discussions. Thanks also to Miguel Alonso for assisting with fieldwork. D. Jaume produced the drawing of Hyalella solida. Our work is supported by the Spanish MINECO Grant CGL2016-76164-P, financed by the Agencia Española de Investigación (AEI) and the European Regional Development Fund (FEDER). F.Z. is supported by grant BES-2017-081069 of the Spanish Ministerio de Ciencia, Innovación y Universidades.

Author information

These authors contributed equally: Francesco Zapelloni and Joan Pons.

Authors and Affiliations

Department of Biology, University of the Balearic Islands, Ctra. Valldemossa km 7’5, 07122, Palma de Mallorca, Balearic Islands, Spain
Francesco Zapelloni, José A. Jurado-Rivera & Carlos Juan
IMEDEA (CSIC-UIB), Mediterranean Institute for Advanced Studies, C/ Miquel Marquès 21, 07190, Esporles, Balearic Islands, Spain
Joan Pons, Damià Jaume & Carlos Juan

Contributions

C.J. and D.J. conceived and designed the research and collection of samples. F.Z., J.P. and J.A.J-R. performed laboratory work, mitogenome editing and phylogenetic analyses. F.Z. and J.A.J-R. performed figure design. C.J. wrote the manuscript with contributions from all authors.

Corresponding author

Correspondence to Carlos Juan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Zapelloni, F., Pons, J., Jurado-Rivera, J.A. et al. Phylogenomics of the Hyalella amphipod species-flock of the Andean Altiplano. Sci Rep 11, 366 (2021). https://doi.org/10.1038/s41598-020-79620-4

Received: 18 September 2020
Accepted: 10 December 2020
Published: 11 January 2021
DOI: https://doi.org/10.1038/s41598-020-79620-4
