Abstract
The class Branchiopoda, whose origin dates back to Cambrian, includes ~ 1200 species which mainly occupy freshwater habitats. The phylogeny and systematics of the class have been debated for long time, until recent phylogenomic analyses allowed to better clarify the relationships among major clades. Based on these data, the clade Anostraca (fairy and brine shrimps) is sister to all other branchiopods, and the Notostraca (tadpole shrimps) results as sister group to Diplostraca, which includes Laevicaudata + Spinicaudata (clam shrimps) and Cladoceromorpha (water fleas + Cyclestherida). In the present analysis, thanks to an increased taxon sampling, a complex picture emerges. Most of the analyzed mitogenomes show the Pancrustacea gene order while in several other taxa they are found rearranged. These rearrangements, though, occur unevenly among taxa, most of them being found in Cladocera, and their taxonomic distribution does not agree with the phylogeny. Our data also seems to suggest the possibility of potentially homoplastic, alternative gene order within Daphniidae.
Similar content being viewed by others
Introduction
Species belonging to the class Branchiopoda are distributed worldwide and mainly occupy freshwater habitats, including lakes and ephemeral or temporary ponds, with a few species inhabiting marine environments1. The origin of branchiopods is estimated dating back to the Cambrian2,3 and the class is currently recognized to include at least 1200 aquatic species (including brine shrimps, tadpole shrimps, fairy shrimps, water fleas)4. Branchiopods comprise four extinct and nine extant orders. The superorder Sarsostraca is formed by the extinct order Lipostraca and the extant Anostraca5, the latter including around 300 species distributed into the two suborders Anostracina (fairy shrimps) and Artemiina (brine shrimps)6. The superorder Calmanostraca comprises the extinct order Kazacharthra and the extant Notostraca, the well-known tadpole shrimps, often considered living fossils due to their morphological stasis. The superorder Cladocera (water fleas) is the most diverse taxon within Branchiopoda and includes the extinct orders Cryptopoda and Proanomopoda, and the extant orders Anomopoda, Ctenopoda, Haplopoda and Onychopoda5,7. The order Anomopoda, in particular, includes the genus Daphnia O. F. Müller, 1785 whose members are well-known model organisms in ecological and evolutionary studies8. The three clam shrimp orders Laevicaudata (which only includes the family Lynceidae), Spinicaudata and Cyclestherida were once included in the group Conchostraca, recently proved to be paraphyletic5,9. Cyclestherida and the super order Cladocera are sister clades, forming the taxon Cladoceromorpha which, together with Spinicaudata, belongs to the taxon Onychocaudata10. The order Laevicaudata groups with Onychocaudata in the clade Diplostraca. Finally, Notostraca and Diplostraca form the group Phyllopoda, with Anostraca as sister clade11,12.
Mitochondrial DNA contains popular molecular genetic markers, widely used for both phylogenetic (mainly for relatively recent divergences) and population genetics studies especially because of quite conserved gene compositions, obvious gene orthology and relative ease of sequencing13. The metazoan mitochondrial DNA molecule is generally considered a quite stable part of the genome, about 16 Kb in size and harbouring 37 genes: 13 protein coding genes (PCGs) which encode subunits of oxidative phosphorylation enzymes, two rRNA genes, 22 tRNA genes, and a non-coding control region14. However, there are many exceptions to this paradigm, mainly brought to the attention since the advent of massive mitogenome sequencing in the last decades15. The most interesting feature of mitogenomes is, probably, the gene order (GO) rearrangement, where the sequential order of genes along the molecule can change, also switching to the opposite DNA strand, and can be subject to gene duplications/deletions13,14. An interesting use of GO rearrangement is to help reconstructing the phylogenetic history of organisms, exploiting its relatively rare occurrence and the unlikely event of parallel evolution of identical GOs16,17. For example, the close evolutionary relationship between crustaceans and insects, which are in fact both included in the Pancrustacea clade18,19, has been further confirmed by the sharing of the same mitogenomic GO (the PanGO pattern), which differs from the so-called ancestral arthropod GO (AAGO pattern) by a single tRNA displacement20,21. Given the many examples of GO rearrangements in pancrustaceans at different taxonomic levels22,23,24,25,26,27,28,29,30,31, the analysis of GO synapomorphy, which may be particularly helpful in phylogenetic reconstruction, may be difficult25. In addition to this, several examples of GO parallel evolution have been reported in hexapods32.
The first branchiopod mitogenomic study analyzed 24 sequences from 20 species of three anostracan families (Artemiidae, Streptocephalidae and Thamnocephalidae), of the notostracan family Triopsidae, of the spinicaudatan family Limnadiidae and of two cladoceran families, Sididae (Ctenopoda) and Daphniidae (Anomopoda)33. This study, on one hand, confirmed the phylogenetic informativeness of the sequence but, on the other hand, it also highlighted significant differences in mitogenome nucleotide composition and substitution rates among the analyzed clades. However, only a small number of GO rearrangements were found, likely due to the limited taxonomic sampling. Since the first study of GO rearrangements in Branchiopoda, though, new mitogenomes have become available in public databases for several lineages, in particular for Cladocera. Along with the previously analyzed dataset, we further considered five new mitogenomes, de novo assembled from genome sequencing data34,35, and 37 mitogenomes which become available from NCBI GenBank (39 sequences from recently published papers listed in Suppl. Table S1 and three from the DNAmark Project [https://dnamark.ku.dk/]). Therefore, with respect to the previous analysis33, additional 39 species are presently considered, belonging to the following families: Chirocephalidae (Anostraca), Triopsidae (Notostraca), Leptestheriidae (Spinicaudata), Polyphemidae (Onychopoda), Bosminidae, Moinidae and Daphniidae (Anomopoda). With this dataset we highlight a more dynamic nature of branchiopod mitogenome GOs, with some Cladocera showing more structural instability. Moreover, we found identical GOs shared by a few species scattered among different genera within the Daphniidae family.
Results
Newly sequenced mitogenomes assembly and annotation
We obtained new mitogenomes from ongoing genome projects of four species35: Lepidurus couesii Packard, 1875, Lepidurus apus apus (Linnaeus, 1758), Triops longicaudatus (LeConte, 1846) and Leptestheria dahalacensis (Rüppel, 1837). The length of mitogenomes ranges from 15,105 bp (L. dahalacensis) to 15,292 bp (L. couesii), and they all show 13 PCGs, 22 tRNAs and two rRNAs genes. The T. longicaudatus mitogenome has already been characterized in other five samples36 and the one presently obtained does not show any peculiar structural variation. The assembly of Eulimnadia texana (Packard, 1871) mitogenome resulted in a 14,884 bp long molecule, whose annotation evidenced the same gene content as the newly sequenced ones.
Annotation of mitogenomes downloaded from GenBank was confirmed for all entries, except for the trnM duplications in Diaphanosoma celebensis Stingelin, 1900 and Diaphanosoma excisum G.O. Sars, 1885, and the omission of unassigned regions (URs).
Dataset construction and phylogenetic analyses
In addition to the five newly assembled mitogenomes, already assembled mitogenomes were further extracted from both genome-wide studies (such as, for example37,38) or from single mitogenome sequencing (see Suppl. Table S1 for a full list).
Phylogenetic reconstructions based on the available 66 mitogenomes were performed using three datasets: nucleotide alignments of the thirteen concatenated mitochondrial PCGs, both including and excluding 3rd codon positions to account for possible substitution saturation, plus the two ribosomal RNA genes, and on the aligned PCGs’ amino acids. The three maximum likelihood trees built on the three datasets generally agree with each other, except for a few minor differences (Fig. 1; Supplementary Figure S1). In all phylogenetic analyses the deepest split separates the Anostraca and Phyllopoda clades, which are both recovered with maximum support in the Bayesian inference and high support in maximum likelihood trees with bootstrap values ranging from 96 to 100 (Fig. 1; Supplementary Figure S1).
Within the Anostraca clade, the family Chirocephalidae, here represented by the species Eubranchipus grubii (Dybowski, 1860), is a sister group to all the other Anostraca when the nucleotide datasets are considered. The topology changes when analyzing the amino acids dataset, as E. grubii results as a sister group to presently considered Thamnocephalidae and Streptocephalidae taxa only (Supplementary Figure S1). On the other hand, the paraphyly of analyzed Thamnocephalidae remains well supported in all analyses (Fig. 1; Supplementary Figure S1). Samples of the Artemia genus (family Artemidae) are recovered in a monophyletic clade with maximum nodal support, although the relationship between Artemia monica (Verrill, 1869) (formerly indicated as Artemia franciscana Kellogg, 1906 but recently revised39) and the clade including Artemia sinica (Cai, 1989), Artemia urmiana Günter, 1899 and Artemia tibetiana Abatzopoulos et al., 1998 shows lower support in the nucleotide’s maximum likelihood analyses. Moreover, Artemia salina (Linnaeus, 1758) results in sister relationship with all other Artemia species in the nucleotide dataset including PCGs 3rd codon positions and in the amino acid dataset, but it is recovered as sister to A. monica when 3rd codon positiona are not considered (Fig. 1; Supplementary Figure S1).
Within Phyllopoda, Notostraca is supported with maximum values in all analyses while Onychocaudata is fully supported in the Bayesian inference and highly supported in maximum likelihood trees (bootstrap = 98–99) (Fig. 1; Supplementary Figure S1). The tree topologies within Notostraca are fully overlapping and all main nodes receive maximum support; samples of the genera Triops Schrank, 1803 and Lepidurus Leach, 1819 are correctly separated in genus-specific monophyletic clades. Triops cancriformis (Bosc, 1801) results as sister to all other Triops species, and the clade formed by Triops granarius (Lucas, 1864) and Triops sp. EPP is sister to the clade T. longicaudatus + Triops australiensis (Spencer and Hall, 1895) (Fig. 1; Supplementary Figure S1). In the Lepidurus clade, L. couesii and Lepidurus arcticus (Pallas, 1793) result the most similar to each other, while Lepidurus apus lubbocki (Brauer, 1873) is sister to all other Lepidurus taxa (Fig. 1; Supplementary Figure S1).
Within Onychocaudata, samples of Cladocera and Spinicaudata are grouped in monophyletic clades with maximum support. In the latter clade, members of Limnadiidae and Leptestheriidae are correctly separated in two monophyletic clusters (Fig. 1; Supplementary Figure S1). Within Cladocera, the three Diaphanosoma Fischer, 1850 species (family Sididae) are grouped together and is sister relationship with the other Cladocera considered. Taxa belonging to Daphniidae, the most represented family in our dataset, form a monophyletic clade, with maximum nodal support in all analyses. Its sister relationship with Moina brachiata (Jurine, 1820) (family Moinidae) is well supported only in the Bayesian inference, being poorly supported in the maximum likelihood analyses on full nucleotide and amino acid dataset (bootstrap = 57–74), and lacking support when 3rd codon positions are not considered (Fig. 1; Supplementary Figure S1). The genus Daphnia and the two subgenera included in the analyses (Daphnia and Ctenodaphnia) are clearly supported. Phyletic relationships within the subgenera are maintained throughout datasets and analyses, except for two differences. In the subgenus Daphnia, the relative placement of Daphnia obtusa (Kurz, 1874) and Daphnia retrocurva Forbes, 1882 is switched in the amino acid data set, and the relative placement of the Daphnia cristata (G.O. Sars, 1862) + Daphnia longispina (O.F. Müller, 1776) clade and the Daphnia galeata G.O. Sars, 1864 + D. longispina clade is switched when PCGs 3rd codon positions are not considered (Fig. 1; Supplementary Figure S1).
Mitogenomes annotations, gene order and rearrangements
All analyzed mitogenomes include the PCGs and rRNA genes described in the putative ancestral arthropod mitogenome, although the tRNA content could vary to some extent. There are, however, four mitogenomes obtained from the NCBI database which are not complete, namely Daphnia melanica Hebert, 1995, Da. obtusa, Scapholeberis mucronata (O.F. Müller, 1776) and Polyphemus pediculus (Linnaeus, 1761).
Among the analyzed 66 branchiopod mitogenomes we found evidence of 13 distinct GOs (Fig. 2). The most common one characterizes the majority of Phyllopoda taxa: all Spinicaudata, all Notostraca, but L. apus lubbocki showing an unassigned region (UR)33, and 17 out 32 Cladocera. This pattern is identical to the PanGO pattern, which differs from the AAGO one because of the reverse transposition of the trnL2 gene20,21.
The other 15 cladoceran taxa showed nine different GOs, three being found in the family Daphniidae. In the genus Daphnia, the PanGO pattern can be found in all species belonging to the subgenus Daphnia, as well as in three species of the subgenus Ctenodaphnia. The remaining Ctenodaphnia taxa exhibited three distinct GOs. The most common one, hence referred to as Clado-I, is shared between Daphnia lumholtzi G.O. Sars, 1885, Daphnia barbata Weltner, 1898 and Daphnia dolichocephala G.O. Sars, 1895, and it was also found in the other daphniids Ceriodaphnia reticulata (Jurine, 1820) and Simocephalus serrulatus (Koch, 1841). This GO differs from the PanGO pattern because of the duplication and reverse transposition of the trnI gene. Another GO, hence termed Clado-II, characterizes the two species Daphnia magna Straus, 1820 and Daphnia hispanica Glagolev & Alonso, 1990 and showed a reverse transposition of the trnI with respect to the PanGO pattern. The third GO can be observed only in Daphnia similis Claus, 1876 and differs from the PanGO pattern by the reverse transposition of the trnI and trnQ-trnM-nad2-trnW reversal (Fig. 2).
As far as the other cladoceran taxa are concerned, the GOs pattern looks even more complex. Species representative of Moinidae (M. brachiata), Bosminidae (Bosmina coregoni Baird, 1857) and Polyphemidae (P. pediculus) all show different GOs with respect to the PanGO pattern. M. brachiata shows the trnH reverse transposition; B. coregoni presents the trnW transposition and the trnQ reversal flanked by a 50 bp-long UR; finally, P. pediculus exhibits the transposition of trnI, trnQ and trnW (although, being only partially sequenced, further rearrangements cannot be excluded). The incomplete mitogenome of Sc. mucronata showed a trnI located between the rrnS and the CR, as in the Clado-I and Clado-II patterns; however, since it lacks the portion between CR and nad2 gene, it is not possible to attribute it to one of the two patterns.
In the family Sididae, the three analyzed samples, all belonging to the genus Diaphanosoma, showed three different GOs. Because of the high GO diversity and the consequent complexity in inferring the rearrangement pattern, these mitogenomes were compared with each other and with the PanGO pattern in a phylogenetic framework using TreeREx40. Unfortunately, data are not clear enough to result in a highly consistent ancestral reconstruction scenario (Supplementary Figure S2). According to the phylogenetic relationships, the two most similar GO are those of Di. celebensis and Di. excisum which share a duplication of the trnM and only differ for (i) the lack of trnS1 in the former species and (ii) the transposition of the control region in the latter one (Fig. 2; Supplementary Figure S2). The scenario of possible rearrangements between these two GOs and the one of the sister species Diaphanosoma dubium Manujlova, 1964 includes several events of transposition, reverse transposition, and reversal (Supplementary Figure S2). Finally, rearrangements leading from the putative ancestral PanGO pattern to that of Sididae are inferred as even more complex, comprising both a transposition and two possible tandem duplication and random loss events (Supplementary Figure S2).
The Anostraca GO pattern is slightly different from the PanGO pattern, showing a transposition of the trnM-nad2-trnW and a reversal of trnI (Fig. 2), and characterizes all anostracan taxa but the thamnocephalid Branchinella kugenumaensis (Ishikawa, 1895). The latter mitogenome, in fact, shows a reversal rearrangement involving the segment trnM-nad2-trnW-trnI (Fig. 2).
The MLGO method reconstructed the PanGO pattern as the ancestral one, except for the Anostraca clade (Fig. 2). Furthermore, the PanGO pattern was reconstructed at all Phyllopoda nodes with only two exceptions. The first one concerns the Ctenodaphnia clade, for which Clado-I was indicated as the ancestral pattern (Fig. 2). It is to be noted, though, that the common ancestor of Daphnia and Ctenodaphnia resulted to have the PanGO pattern (Fig. 2). The second exception concerns internal nodes of the Diaphanosoma clade, for which it was inferred a GO identical to the one of Di. celebensis but with a trnS1 gene instead of the duplicated trnM gene.
Discussion
In the present work we report on the analysis of mitogenomes from some Branchiopoda lineages by providing new data and including data from the NCBI public database. Overall, the evolution of 66 mitogenomes has been investigated through maximum likelihood and Bayesian phylogenetic analyses based on PCGs and rRNA genes, and gene order structural analysis. Despite the lack of some important lineages, such as Laevicaudata and Cyclestherida, data presented here point out interesting features of branchiopod mitogenomes.
The phylogenetic relationships among major lineages of branchiopods, after being debated for a long time, have been recently better clarified through phylotranscriptomic and phylogenomic approaches10,12. Our analyses of the three datasets (i.e., PCGs with and without 3rd codon positions + rRNAs; PCGs amino acids) are in agreement with each other, the only exception being the possible paraphyly of Anostracinae (here represented by Chirocephalidae, Thamnocephalidae and Streptocephalidae taxa) using the nucleotide dataset, which was also found in the phylotranscriptomics analysis, and are in line with the current accepted phylogeny. Moreover, our phylogenetic analyses are in good agreement with most recent molecular studies. For example, results pointed out in previous studies on Notostraca41,42,43 are hereby confirmed, including the close relationship between the sample Triops sp. EPP and T. granarius44, and the same holds for Spinicaudata45. As far as Cladocera is concerned, the phylogenetic relationships between orders are in line with previous analyses10,12,38,46, although Haplopoda representatives are missing here. Moreover, our results are consistent with Daphniidae mitogenome phylogenies obtained by Cornetti and colleagues38 and with a very recent mitogenomic analyses which was published during the writing of the present work46. Though, it is worth noting that nuclear genes phylogenomic analyses pointed out a different clustering pattern of the genera Megafenestra Dumont & Pensaert, 1983, Scapholeberis and Simocephalus, suggesting a possible limitation of mitochondrial genes phylogenetic signal10,38.
The most striking feature of our new analysis is the finding of several different GOs and their taxonomic distribution. As previously noticed, the GO in Branchiopoda was considered rather stable, with just few exceptions33. In the present analysis, thanks to the increased taxon sampling, even though still limited with respect to the entire branchiopod diversity, a more complex picture emerges. First of all, even if most mitogenomes show the PanGO pattern, we found several rearrangements. In the second place, we found GO rearrangements occurring unevenly among taxa, with Cladocera showing most of them. Finally, the taxonomic distribution of GO rearrangements does not generally agree with the phylogeny.
It has been recently pointed out that published problematic mitogenomes are increasing, with potential bias with respect to species identification, bad annotation, or assembly issues47, suggesting caution when reusing literature data. In the present work, rearranged mitogenomes have been taken from literature studies which, however, implemented several methods to avoid potential assembly bias (as in37,38) or gave the same GO although being produced in independent studies (like D. hispanica38 and D. magna48). Moreover, annotations were further validated in the present study by using the same pipeline for all analyzed mitogenomes and we did not find signs of potential artifact in genome assembly. Therefore, potential biases were reduced as much as possible, although they cannot be entirely excluded.
Mitogenome gene rearrangements are frequently observed in many metazoan taxa17. Within Pancrustacea, for example, several different GOs have been recorded at various taxonomic levels. In a recent survey in Hexapoda, one of the most studied groups of metazoans, about 47% of mitogenomes were found rearranged with respect to the PanGO pattern, which is the prevalent GO in the class32. In non-branchiopod crustacean taxa, although much less studied, still several variations with respect to the PanGO pattern have been observed: for example, in Decapoda26,28,29, Amphipoda23,31, Copepoda24,27 and Ostracoda22,30.
Interestingly, the observed rearrangements were also found unevenly distributed, again in line with data on Hexapoda32 and other crustaceans28. Reasons behind GO rearrangements are still debated, as there is no general evidence of association with a specific driving force. Yet, either molecular mechanisms49,50,51,52 or life history and ecological traits27,28 have been tentatively proposed, although none of them seem generalizable13,28,53. During the writing of the present paper, a large-scale mitogenomic analysis focusing on Cladocera was published which highlighted the same GOs instability in this clade46 as we found in the present analysis. At the moment, though, it is difficult to formulate a reliable hypothesis to explain the observed distribution of GO rearrangements on the basis of clade-specific traits.
The decoupling of the phylogenetic pattern and the GOs distribution pattern is somehow surprising, especially considering that Clado-I and Clado-II GO patterns appear, in the present study, as potentially homoplastic54. In the analysis of Xu and collaborators46 this went unnoticed probably due to the differences in the Ctenodaphnia dataset, which is wider in the present analysis as we included the broad taxon sampling of this subgenus from Cornetti et al.38. The MLGO analysis indicated the PanGO pattern as ancestral to Cladocera and, more in particular, it was also found ancestral to the whole Daphniidae family, to the whole Daphnia genus and to the most recent ancestor of C. reticulata and of Si. serrulatus. On the other hand, the same analysis indicated the Clado-I pattern as ancestral to the subgenus Ctenodaphnia, although this clade includes three taxa exhibiting the PanGO pattern and further three taxa with different GOs (two with the Clado-II one, and Da. similis with its own GO). The occurrence of the Clado-I pattern in phylogenetically separate lineages, such as Ctenodaphnia spp., C. reticulata and Si. serrulatus, that started to diverge more than 150 million years ago10, together with the MLGO inference of PanGO pattern as ancestral to Daphniidae, could be explained as the result of a multiple, independent evolution of the same GO pattern or because of collateral evolution, i.e. the differential inheritance of polymorphic characters evolved in an ancestral population or shared after hybridization55. Independent evolution of GOs has been found occurring frequently in Hexapoda and in other metazoans, both between closely related and distantly related species32,56. Data collected for this study do not allow to test whether collateral evolution may have had a role in determining the observed GOs pattern of distribution. Though, it is interesting to note that presently observed, potentially homoplastic GOs differ from each other and from the PanGO pattern by the duplication and/or the positioning of the trnI gene(s). In the insect order Hymenoptera, tRNA genes are known to be subject to rearrangement more than other genes, thus suggesting the selective neutrality of their position along the mitochondrial GO57. Therefore, it is possible to hypothesize that the ease with which tRNA gene positions rearrange could make more likely their multiple, independent occurrences. In line with these findings, the MLGO inference of Clado-I pattern as ancestral to the Ctenodaphnia clade may have two further possible implications: (i) the Clado-II pattern and the Da. similis GO were derived from the Clado-I one, likely through the loss of the trnI gene in the ancestral position (i.e., downstream the control region in the PanGO pattern) and a further rearrangement in Da. similis; (ii) three Ctenodaphnia species, that do not cluster together, actually showed the PanGO pattern, suggesting independent reversals to the ancestral GO possibly obtained after the trnI loss at the derived position (i.e., upstream the control region in the PanGO pattern).
The observed intriguing pattern of GO evolution in Cladocera still remains, therefore, to be clarified: although it clearly appears to derive from the ancestral PanGO pattern, its evolution in some parts of the tree cannot be reconstructed because of uncertainty likely linked to missing data. As previously suggested46, the analysis of more mitogenomes and widening the taxon sampling in this region of the branchiopods’ tree of life, to include other major lineages and high quality mitogenome assemblies, will allow to better elucidate this issue.
Materials and methods
New mitogenome assembly and annotation
Total DNA was extracted from individual specimens of L. couesii, L. apus apus, T. longicaudatus and L. dahalacensis using the DNA extraction kit (STRATEC), after dissection for gut removal. Whole genome sequencing has been carried out on Illumina HiSeqX platform (Macrogen Inc., South Korea), paired end (2 × 150 bp) and library insert size = 350 bp. Raw reads were quality trimmed and deprived of adapters using Trimmomatic v. 0.3558, with default settings. A random subset of 4 million read pairs per sample was de novo assembled from using SPAdes v. 3.11.1 with default parameters59. No reference sequences have been used to guide the assembly. For Eulimnadia texana 4 four million random read pairs from genome sequencing raw reads were used34 (SRA accession number SRR4787251; mitogenome available in Suppl. Data S1). For each assembly, the contig containing the mitogenome was found through a BLAST search using as query previously sequenced cox1 genes for L. couesii (NCBI GenBank acc. no. DQ148287), L. apus apus (DQ834543), T. longicaudatus (JX110647) and L. dahalacensis (DQ872786), and the rrnS gene for E. texana (AY779667). Mitogenome annotation was performed de novo using MITOS260, with RefSeq 89 Metazoa as reference parameter. Where necessary, gene boundaries were manually corrected by similarity with published branchiopod mitochondrial genomes33. Genes for tRNAs were further confirmed by means of ARWEN v. 1.2.361.
Dataset construction and phylogenetic analysis
All available branchiopod mitogenomes were obtained from NCBI GenBank (last accessed in April 2021), therefore producing, together with the presently de novo assembled mitogenomes, a dataset of 66 sequences for 57 species (Supplementary Table S1). Two hexapod mitogenomes, belonging to Thermobia domestica Packard, 1873 and Reticulitermes flaviceps (Oshima, 1908) (Supplementary Table S1), were used as outgroup for phylogenetic analyses. Annotations of mitogenomes from the database were checked with the same pipeline to verify possible mistakes and to better define boundaries of possible Unassigned Regions (UR). The latter were considered only when longer than 50 bp.
All the thirteen protein-coding mitochondrial genes (PCGs) and the two ribosomal RNAs were aligned using MAFFT v. 7.39762 using respectively the L-INS-i and E-INS-i algorithms for PCGs and rRNAs, respectively, and single gene alignments were concatenated. Maximum likelihood phylogenetic reconstruction was performed on the IQ-TREE web server63 using the automatic evaluation of the partitioning scheme and substitution model (Supplementary Table S2), edge unlinked partition and FreeRate heterogeneity. Ten runs were performed with 1000 Ultrafast bootstrap replicates each and the tree with the best likelihood score was chosen. Bayesian inference for tree reconstruction was performed only on the full nucleotide dataset using MrBayes v3.2.764, with best partition scheme and substitution models determined using PartitionFinder2 v2.1.165 (Supplementary Table S2). Two runs, each including 10,000,000 generations, were performed with sampling every 1000 trees. Convergence was assessed using Tracer v1.766 and by checking PSRF values and average standard deviation of split frequencies.
Gene order analysis
Gene order (GO) rearrangements were pairwise compared through the analysis of common intervals, i.e. group of consecutive genes occurring in multiple genomes, using the software CReX67 with default parameters. The software TreeREx40 was also used with designers’ suggested settings (available at http://pacosy.informatik.uni-leipzig.de/185-0-TreeREx.html). It performs the same analysis of CReX but in a phylogenetic framework, by means of ancestral GOs reconstruction: this is particularly useful when pairwise GO comparison is difficult because of high rearrangement levels. On the other hand, both CReX and TreeREx do not manage GOs with duplicated or missing genes: since in our branchiopod dataset there are several duplicated or missing genes, especially regarding tRNA genes, gene duplications/deletions were treated manually and TreeREx was limited to the analysis of the family Sididae.
The attempt to reconstruct ancestral GOs was carried out by means of the MLGO (Maximum Likelihood for Gene Order analysis68) method, using the presently obtained phylogenetic tree, based on PCGs (including 3rd codon positions) + rRNAs nucleotide alignment, as backbone tree.
Data availability
Newly sequenced and assembled mitogenomes have been deposited in NCBI GenBank under the following accession numbers: Lepidurus couesii, OL757477; Lepidurus apus apus, OL757476; Triops longicaudatus, OL757475; Leptestheria dahalacensis, OL757474.
References
Brendonck, L., Rogers, D. C., Olesen, J., Weeks, S. & Hoeh, W. R. Global diversity of large branchiopods (Crustacea: Branchiopoda) in freshwater. Hydrobiologia 595, 167–176 (2008).
Walossek, D. The upper Cambrian Rehbachiella, its larval development, morphology and significance for the phylogeny of Branchiopoda and Crustacea. Hydrobiologia 298, 1–13 (1995).
Sun, X. Y. et al. Dating the origin of the major lineages of Branchiopoda. Palaeoworld 25, 303–317 (2016).
Adamowicz, S. & Purvis, A. How many branchiopod crustacean species are there? Quantifying the components of underestimation. Global Ecol. Biogeogr. 14, 455–468 (2005).
Van Damme, K. & Kotov, A. A. The fossil record of the Cladocera (Crustacea: Branchiopoda): Evidence and hypotheses. Earth Sci. Rev. 163, 162–189 (2016).
Rogers, D.C. Branchiopoda (Anostraca, Notostraca, Laevicaudata, Spinicaudata, Cyclestherida) in Encyclopedia of inland waters vol. 2 (ed. Likens, G. E.) 242–249 (Oxford: Elsevier Science, 2009).
Forró, L., Korovchinsky, N.M., Kotov, A.A., Petrusek, A.,. Global diversity of cladocerans (Cladocera; Crustacea) in freshwater. Hydrobiologia 595, 177–184 (2008).
Miner Brooks, E., De Meester, L., Pfrender, M. E., Lampert, W. & Hairston, N. G. Linking genes to communities and ecosystems: Daphnia as an ecogenomic model. Proc. Soc. B 279, 1873–1882 (2012).
Negrea, S., Butnariuc, N. & Dumont, H. J. On the phylogeny, evolution and classification of the Branchiopoda (Crustacea). Hydrobiologia 412, 191–212 (1999).
Van Damme, K., Cornetti, L., Fields, P. D. & Ebert, D. Whole-genome phylogenetic reconstruction as a powerful tool to reveal homoplasy and ancient rapid radiation in waterflea evolution. Syst. Biol. 2, 94 (2021).
Olesen, J. & Richter, S. Onychocaudata (Branchiopoda: Diplostraca), a new high-level taxon in branchiopod systematics. J. Crust. Biol. 33, 62–65 (2013).
Schwentner, M., Richter, S., Rogers, D. C. & Giribet, G. Tetraconatan phylogeny with special focus on Malacostraca and Branchiopoda: Highlighting the strength of taxon-specific matrices in phylogenomics. Proc. R. Soc. B. 285, 20181524 (2018).
Gissi, C., Iannelli, F. & Pesole, G. Evolution of the mitochondrial genome of Metazoa as exemplified by comparison of congeneric species. Heredity 101, 301–320 (2008).
Boore, J. L. Animal mitochondrial genomes. Nucleic Acids Res. 27, 1767–1780 (1999).
Formaggioni, A., Luchetti, A. & Plazzi, F. Mitochondrial genomic landscape: A portrait of the mitochondrial genome 40 years after the first complete sequence. Life 11, 663 (2021).
Dowton, M., Castro, L. R. & Austin, A. D. Mitochondrial gene rearrangements as phylogenetic characters in the invertebrates: The examination of genome “morphology”. Invertebr. Syst. 16, 345–356 (2002).
Boore, J. L. & Brown, W. M. Big trees from little genomes: Mitochondrial gene order as a phylogenetic tool. Curr. Opin. Genet. Dev. 8, 668–674 (1998).
Friedrich, M. & Tautz, D. Ribosomal DNA phylogeny of the major extant arthropod classes and the evolution of myriapods. Nature 376, 165–167 (1995).
Regier, J. et al. Arthropod relationships revealed by phylogenomic analysis of nuclear protein-coding sequences. Nature 463, 1079–1083 (2010).
Boore, J. L., Lavrov, D. V. & Brown, W. M. Gene translocation links insects and crustaceans. Nature 392, 667–668 (1998).
Rota-Stabelli, O. et al. Ecdysozoan mitogenomics: Evidence for a common origin of the legged invertebrates, the Panarthropoda. Genome Biol. Evol. 2, 425–440 (2010).
Ogoh, K. & Omiya, Y. Complete mitochondrial DNA sequence of the sea-firefly, Vargula hilgendorfii (Crustacea, Ostracoda) with duplicate control regions. Gene 327, 131–139 (2004).
Bauzà-Ribot, M. M., Jaume, D., Juan, C. & Pons, J. The complete mitochondrial genome of the subterranean crustacean Metacrangonyx longipes (Amphipoda): A unique gene order and extremely short control region. Mitochondr. DNA 20, 1–12 (2009).
Minxiao, W., Song, S., Chaolun, L. & Xin, S. Distinctive mitochondrial genome of Calanoid copepod Calanus sinicus with multiple large non-coding regions and reshuffled gene order: useful molecular markers for phylogenetic and population studies. BMC Genom. 12, 73 (2011).
Cameron, S. L. Insect mitochondrial genomics: implications for evolution and phylogeny. Annu. Rev. Entomol. 59, 95–117 (2014).
Basso, A. et al. The highly rearranged mitochondrial genomes of the crabs Maja crispata and Maja squinado (Majidae) and gene order evolution in Brachyura. Sci. Rep. 7, 4096 (2017).
Weydmann, A. et al. Mitochondrial genomes of the key zooplankton copepods Arctic Calanus glacialis and North Atlantic Calanus finmarchicus with the longest crustacean non-coding regions. Sci. Rep. 7, 13702 (2017).
Tan, M. H., Gan, H. M., Lee, Y. P., Poore, G. C. B. & Austin, C. M. Digging deeper: new gene order rearrangements and distinct patterns of codons usage in mitochondrial genomes among shrimps from the Axiidea, Gebiidea and Caridea (Crustacea: Decapoda). PeerJ 5, e2982 (2017).
Tan, M. H. et al. ORDER within the chaos: Insights into phylogenetic relationships within the Anomura (Crustacea: Decapoda) from mitochondrial sequences and gene order rearrangements. Mol. Phylogenet. Evol. 127, 320–331 (2018).
Kakui, K., Munakata, M., Tanaka, H. & Hiruta, C. Mismatch between similarity of mitochondrial gene order and phylogenetic distance in Podocopa (Crustacea: Ostracoda). Zool. Anz. 293, 145–148 (2021).
Zapelloni, F., Jurado-Rivera, J. A., Jaume, D., Juan, C. & Pons, J. Comparative mitogenomics in Hyalella (Amphipoda: Crustacea). Genes 12, 292 (2021).
Moreno-Carmona, M., Cameron, S. L. & Prada Quiroga, C. F. How are the mitochondrial genomes reorganized in Hexapoda? Differential evolution and the first report of convergences within Hexapoda. Gene 791, 14571 (2021).
Luchetti, A., Forni, G., Skaist, A. M., Wheelan, S. J. & Mantovani, B. Mitochondrial genome diversity and evolution in Branchiopoda (Crustacea). Zool. Lett. 5, 15 (2019).
Baldwin-Brown, J. G., Weeks, S. C. & Long, A. D. A new standard for crustacean genomes: the highly contiguous, annotated genome assembly of the clam shrimp Eulimnadia texana reveals HOX gene order and identifies the sex chromosome. Genome Biol. Evol. 10, 143–156 (2018).
Luchetti, A. et al. Comparative genomics of tadpole shrimps (Crustacea, Branchiopoda, Notostraca): Dynamic genome evolution against the backdrop of morphological stasis. Genomics 113, 4163–4172 (2021).
Horn, R. L. & Cowley, D. E. Evolutionary relationships within Triops (Branchiopoda: Notostraca) using complete mitochondrial genomes. J. Crust. Biol. 34, 795–800 (2014).
Beninde, J., Möst, M. & Meyer, A. Optimized and affordable high-throughput sequencing workflow for preserved and nonpreserved small zooplankton specimens. Mol. Ecol. Resour. 20, 1632–1646 (2020).
Cornetti, L., Fields, P. D., Van Damme, K. & Ebert, D. A fossil-calibrated phylogenomic analysis of Daphnia and the Daphniidae. Mol. Phylogenet. Evol. 137, 250–262 (2019).
Sainz-Escudero, L., López-Estrada, E. K., Rodríguez-Flores, P. C. & García-París, M. Settling taxonomic and nomenclatural problems in brine shrimps, Artemia (Crustacea: Branchiopoda: Anostraca), by integrating mitogenomics, marker discordances and nomenclature rules. PeerJ 9, e10865 (2021).
Bernt, M., Merkle, D. & Middendorf, M. An algorithm for inferring mitochondrial genome rearrangements in a phylogenetic tree in Comparative Genomics International Workshop, RECOMB-CG 2008, Lecture Notes in Bioinformatics vol. 5267 (eds. Nelson, C.E., Vialette, S.) 143–157 (Springer, 2008).
Mantovani, B., Cesari, M. & Scanabissi, F. Molecular taxonomy and phylogeny of Italian Lepidurus taxa (Branchiopoda: Notostraca). It. J. Zool. 76, 358–365 (2009).
Vanschoenwinkel, B. et al. Toward a global phylogeny of the “living fossil" crustacean order of the notostraca. PLoS ONE 7, e34998 (2012).
Korn, M., Rabet, N., Ghate, H. V., Marrone, F. & Hundsdoerfer, A. K. Molecular phylogeny of the Notostraca. Mol. Phylogenet. Evol. 69, 1159–1171 (2013).
Gan, H. M., Wasserman, R. J., Dalu, T. & Rogers, D. C. The complete mitogenome of a South African cryptic species of tadpole shrimp within the Triops granarius (Lucas, 1864) species group. Mitochondr. DNA Part B Resour. 4, 455–456 (2019).
Schwentner, M. et al. Phylogeny and biogeography of Spinicaudata (Crustacea: Branchiopoda). Zool. Stud. 59, 44 (2020).
Xu, S.-L. et al. Mitogenomics of Cladocera (Branchiopoda): Marked gene order rearrangements and independent predation roots. Mol. Phylogenet. Evol. 164, 107275 (2021).
Sangster, G. & Luksenburg, J. A. Sharp increase of problematic mitogenomes of birds: Causes, consequences, and remedies. Genome Biol. Evol. 13, 210 (2021).
Cheng, R. et al. Complete mitochondrial genome sequence of Daphnia magna (Crustacea: Cladocera) from Huaihe in China. J. Lake Sci. 28, 414–420 (2016).
Mueller, R. L. & Boore, J. L. Molecular mechanisms of extensive mitochondrial gene rearrangement in plethodontid salamanders. Mol. Biol. Evol. 22, 2104–2112 (2005).
Kurabayashi, A. et al. Phylogeny, recombination, and mechanisms of stepwise mitochondrial genome reorganization in mantellid frogs from Madagascar. Mol. Biol. Evol. 25, 874–891 (2008).
Shao, R., Dowton, M., Murrell, A. & Barker, S. C. Rates of gene rearrangement and nucleotide substitution are correlated in the mitochondrial genomes of insects. Mol. Biol. Evol. 20, 1612–1619 (2003).
Plazzi, F., Puccio, G. & Passamonti, M. Comparative large-scale mitogenomics evidences clade-specific evolutionary trends in mitochondrial DNAs of Bivalvia. Genome Biol. Evol. 8, 2544–2564 (2016).
Tan, M. H. et al. Comparative mitogenomics of the Decapoda reveals evolutionary heterogeneity in architecture and composition. Sci. Rep. 9, 10756 (2019).
Wake, D. B., Wake, M. H. & Specht, C. D. Homoplasy: From detecting pattern to determining process and mechanism of evolution. Science 331, 1032–1035 (2011).
Stern, D. The genetic causes of convergent evolution. Nat. Rev. Genet. 14, 751–764 (2013).
Mindell, D. P., Sorenson, M. D. & Dimcheff, D. E. Multiple independent origins of mitochondrial gene order in birds. PNAS USA 95, 10693–10697 (1998).
Dowton, M., Cameron, S. L., Dowavic, J. I., Austin, A. D. & Whiting, M. F. Characterisation of 67 mitochondrial gene rearrangements in the Hymenoptera suggests that mitochondrial tRNA gene position is selectively neutral. Mol. Phylogenet. Evol. 26, 1607–1617 (2009).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Nurk, S. et al. Assembling genomes and mini-metagenomes from highly chimeric reads. In Research in Computational Molecular Biology. RECOMB 2013. Lecture Notes in Computer Science (eds Deng, M. et al.) 158–170 (Springer, 2013).
Donath, A. et al. Improved annotation of protein-coding genes boundaries in metazoan mitochondrial genomes. Nucleic Acids Res. 47, 10543–10552 (2019).
Laslett, D. & Canbäck, B. ARWEN, a program to detect tRNA genes in metazoan mitochondrial nucleotide sequences. Bioinformatics 24, 172–175 (2008).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Trifinopoulos, J., Nguyen, L. T., von Haeseler, A. & Minh, B. Q. W-IQ-TREE: A fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res. 44, 232–235 (2016).
Ronquist, F. et al. MrBayes 3.2: Efficient bayesian phylogenetic inference and model choice across a large model space. Syst. Biol. 61, 539–542 (2012).
Lanfear, R., Frandsen, P. B., Wright, A. M., Senfeld, T. & Calcott, B. PartitionFinder 2: New methods for selecting partitioned models of evolution for molecular and morphological phylogenetic analyses. Mol. Biol. Evol. 34, 772–773 (2016).
Rambaut, A., Drummond, A. J., Xie, D., Baele, G. & Suchard, M. A. Posterior summarization in bayesian phylogenetics using tracer 1.7. Syst. Biol. 67, 901–904 (2018).
Bernt, M. et al. CREx: Inferring genomic rearrangements based on common intervals. Bioinformatics 23, 2957–2958 (2007).
Hu, F., Lin, Y. & Tang, J. MLGO: Phylogeny reconstruction and ancestral inference from gene-order data. BMC Bioinform. 15, 354 (2014).
Acknowledgements
Authors are grateful to the DNAmark Danish National DNA Reference Database (https://dnamark.ku.dk/) for providing us part of the presently analyzed mitogenomes. This work has been supported by Canziani funding to AL and BM; the PhD grant to FC was co-funded by Canziani and by the Natural History Museum of Denmark. Authors want to thank the Editor and two anonymous Reviewers whose comments and suggestions greatly improved the manuscript.
Author information
Authors and Affiliations
Contributions
F.C. and A.L. performed the sequencing, assemblies, analyses and interpretated results; A.L. and B.M. conceived the study, planned, and coordinated the work; B.M. participated in interpreting results; F.C., A.L. and B.M. wrote the paper. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Castellucci, F., Luchetti, A. & Mantovani, B. Exploring mitogenome evolution in Branchiopoda (Crustacea) lineages reveals gene order rearrangements in Cladocera. Sci Rep 12, 4931 (2022). https://doi.org/10.1038/s41598-022-08873-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-022-08873-y
This article is cited by
-
The median eyes of trilobites
Scientific Reports (2023)
-
Daphnia sp. (Branchiopoda: Cladocera) Mitochondrial Genome Gene Rearrangement and Phylogenetic Position Within Branchiopoda
Biochemical Genetics (2023)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.