Abstract
Transmembrane conductance of small uncharged solutes such as glycerol typically occurs through aquaglyceroporins (Glps), which are commonly encoded by multiple genes in metazoan organisms. To date, however, little is known concerning the evolution of Glps in Crustacea or what forces might underly such apparent gene redundancy. Here, we show that Glp evolution in Crustacea is highly divergent, ranging from single copy genes in species of pedunculate barnacles, tadpole shrimps, isopods, amphipods and decapods to up to 10 copies in diplostracan water fleas although with monophyletic origins in each lineage. By contrast the evolution of Glps in Copepoda appears to be polyphyletic, with surprisingly high rates of gene duplication occurring in a genera- and species-specific manner. Based upon functional experiments on the Glps from a parasitic copepod (Lepeophtheirus salmonis), we show that such lineage-level gene duplication and splice variation is coupled with a high rate of neofunctionalization. In the case of L. salmonis, splice variation of a given gene resulted in tissue- or sex-specific expression of the channels, with each variant evolving unique sites for protein kinase C (PKC)- or protein kinase A (PKA)-regulation of intracellular membrane trafficking. The combined data sets thus reveal that mutations favouring a high fidelity control of intracellular trafficking regulation can be a selection force for the evolution and retention of multiple Glps in copepods.
Similar content being viewed by others
Introduction
Aquaglyceroporins (Glps) are a phylogenetically distinct grade of water channels (aquaporins) that facilitate the transmembrance conductance of small uncharged solutes such as glycerol, urea or metalloids in addition to water1,2,3,4,5. In contrast to the water-selective branches of aquaporins, which typically display narrow selectivity filters composed of four aromatic arignine (ar/R) residues6,7,8,9, the cross-sectional sizes of the Glp selectivity filters are broader and thus facilitate the passage of larger molecules10,11,12. Evolutionary studies have shown that Glps are widespread in both prokrayotic and eukaryotic organisms, but are not ubiquitous, having been lost in certain lineages of protists, plants and insects2,4,13,14,15. In plants and insects, Glps were supplanted by other members of the aquaporin superfamily, either via horizontal gene transfer of nodulin 26-like integral proteins (NIPs) and GlpF-like intrinsic proteins (GIPs) in plants2,4,13,16,17 or through functional co-option and molecular supplantation by the entomoglyceroporins (Eglps) in hemipteran and holometabolous insects14,18. The absence of classical Glps in model organisms such as Drosophila has thus obfuscated a deeper understanding of their evolution and function in the arthropod lineage.
Arthropods are a highly diverse, yet monophyletic phylum of joint-legged molting animals that are classified into four major subphyla, the Chelicerata (e.g., sea spiders, horseshoe crabs and arachnids), and a subclade of Mandibulata comprised of the Myriapoda (e.g., centipedes and millipedes), the Crustacea (e.g., waterfleas, tadpole shrimps, barnacles, copepods and decapods) and the Hexapoda (e.g., entognathans and insects)19. Glps have been identified in selected species of each subphylum, including multiple genes in arachnids, a single copy in a centipede (Strigamia maritima), multiple genes in water fleas and copepods and one or two genes in more basal lineages of Hexapoda2,14,20,21,22,23,24. However taxon sampling has remained limited, particularly for Crustacea due to the abscence of genomic and transcriptomic data, and it has thus not been possible to determine whether crustacean Glps are paraphyletic as indicated for Chelicerata or monophyletic as indicated for insects14,23. Deciphering the basis for such relationships is an important step toward understanding the origin and divergence of Glp function within each class of organism.
In our previous study, we identified three glp genes (glp1, −2 and −3) in the parasitic copepod Lepeophtheirus salmonis, which expresses the glp1 and −3 genes as splice variants to form the N-terminal protein isoforms Glp1_v1/v2 and Glp3_v1/v2, respectively23. RNA expression profiling revealed that the glp1_v1 isoform is expressed in pre-adult and adult males, which also occurs in a related species of caligid copepod, Caligus rogercresseyi25,26, while the other transcripts are detected in all stages of the life cycle23. In addition, functional analyses of the proteins showed that cAMP was required to promote glycerol transport of the Glp1_v1 isoform23. These observations implied that glp gene duplication and splice variation may have promoted stage- and sex-specific expression in copepods, and that cAMP-dependent phosphorylation of certain N-terminal residues may be involved in the membrane trafficking regulation of the channels.
To determine whether the molecular regulation of the L. salmonis Glps has a common ancestral origin within Copepoda and/or other crustaceans, we used Bayesian inference to re-infer the phylogenetic interrelationships of the L. salmonis glps with coding sequences (CDS) assembled from the genomes or transcriptomes of 120 crustaceans including 32 species of copepod together with CDS assembled from myriapod and insect genomes. To further understand whether the derived proteins and/or isoforms have evolved new functions, we developed paralog- and isoform-specific antibodies to determine the cell-type expression sites of the L. salmonis Glps and used site-directed mutagenesis and heterologous expression of the proteins to identify specific residues involved in the Glp intracellular trafficking regulation.
The combined data sets reveal that glp gene evolution is highly divergent in Crustacea, ranging from single copies in many species of isopods, amphipods and decapods to up to 10 copies in cladoceran water fleas although with monophyletic origins in each lineage. By contrast the evolution of glps in Copepoda appears to be polyphyletic, with surprisingly high rates of gene duplication occurring in a genera- and species-specifc manner. The functional data for the L. salmonis Glps further reveal that such lineage-level gene duplication and splice variation can be coupled with a high rate of neofunctionalization. In the case of L. salmonis, splice variation of a given gene resulted in tissue- or sex-specific expression of the channels, with each variant evolving unique sites for protein kinase C (PKC)- or protein kinase A (PKA)-regulation of membrane trafficking.
Results
A high copy number variation between crustacean Glps
To reassess the diversity of crustacean Glps in relation to those of other arthropods, we focused on glp CDS assembly in the Mandibulata (Fig. 1A), since our previous analysis revealed that mandibulatan glps form a separate clade to those of the Chelicerata14. Bayesian analysis of an initial codon data set, which included 212 crustacean, four myriapod and 25 insect CDS, revealed that for each lineage, except the Copepoda, glp gene evolution appears to be monophyletic with high posterior probabilities (pp) supporting nodes at the level of the order (Fig. 1B; Supplementary Fig. 1A). This is despite high copy number variation between the lineages. For example, within Pancrustacea, single copy genes are found in the majority of taxa from the orders Pedunculata, Notostraca, Isopoda, Amphipoda and Decapoda as well as the class Insecta. Conversely, two paralogs are found in the Euphausiacea, and up to three are encoded in the genomes of Arguloida, Sessilia, and some caridean shrimps and prawns within Decapoda. In the latter case, a separate analysis revealed that the glp gene duplication in Caridea may only have occurred within the Palaemonidae family to form the three paralogs (Supplementary Fig. 1B). In addition, glp gene copy numbers have rapidly increased in the Daphnidae family of diplostracan water fleas, primarily through tandem duplication. Based upon the gene complement of D. carinata, we named the genes A1–A9 and B1–B3 in accordance with their phylogenetic distributions and their genomic loci in two linkage groups (Supplementary Fig. 1C). Although multiple glps were also detected in Myriapoda, they form a separate clade to those of the Pancrustacea, and consequently glp gene evolution in the Myriapoda initially appeared to be monophyletic. However, a separate analysis of glp CDS from 28 species of myriapods, indicates that within the classes Pauropoda, Symphyla and Diplopoda, glp evolution is polyphyletic with genes separated into two potential subclades A and B (Supplementary Fig. 1D).
In contrast to the monophylic gene families observed in most Crustacea, glp gene evolution in the Copepoda appears to be polyphyletic (Fig. 1B) with one clade of genes clustering as a sister branch to the Thecostraca and Arguloida (pp = 0.85), and a second clade clustering between Insecta and Branchiopoda (pp = 0.5). Hence, although nodes at the level of organismal order were well supported with pp > 0.95, statistical support values between the classes of organisms was low, indicating incomplete taxon sampling.
To further investigate the evolution of glp genes within Copepoda, we assembled 98 glp CDS from 32 species within 16 families from four orders (Siphonostomatoida, Cyclopoida, Calanoida and Harpacticoida), and computed their interrelationshps via Bayesian inference. The resultant tree generated the two clades identified in the crustacean analysis with high statistical support (pp = 1.0) and we therefore named them A and B (Fig. 1C). The tree topology reveals that although glp gene copy numbers vary considerably, gene origins within each clade are probably monophyletic for the Siphonostomatoida, Cyclopoida and Harpacticoida orders, but polyphyletic for the Calanoida order within the A clade. A surprising feature revealed by the analysis is nevertheless the high level of gene duplication in the different lineages. With the exception of the ancestral gene duplication that gave rise to the A and B clades, we detected 31 duplications within the four copepod orders and the presence of splice-variants within each of the orders except Cyclopoida (Fig. 1C; Supplementary Fig. 1A). The data thus show that glp gene expansions within Copepoda are mostly occurring in a family- and genera-specific manner, and that the L. salmonis glp gene and isoform complement thus evolved specifically within the genus. Indeed, although gene duplicates and splice variants also exist in the closely related caligid copepod Caligus rogercresseyi, there appear to be only two genes, rather than three26 and the glp repertoires are thus not fully conserved between the genera.
Expression and cellular localization of L. salmonis Glps
To investigate the expression and cellular localization of the L. salmonis Glps, we produced affinity-purified antibodies against each paralog and isoform. The specificity of each antibody was tested by Western blot analysis on total membrane protein extracts from Xenopus laevis oocytes expressing the full complement of the L. salmonis aquaporins (Bib, PripL, Aqp12L1; Aqp12L2, Glp1_v1; Glp1_v2, Glp2, Glp3_v1 and Glp3_v2; Fig. 2A–E, Supplementary Fig. S2A–E). The results show that each of the L. salmonis Glp antisera generated specifically recognized its corresponding antigen, therefore likely indicating that these antibodies do not cross-react with any of the other aquaporins. For some channel variants, including Glp1_v1, Glp1_v2 and Glp3_v1, bands of ~18–22 kDa were detected, indicating some degradation in the X. laevis oocyte expression system (Fig. 2A, B, D). A higher band of ~60 kDa was detected with the Glp3_v2 antibody indicating that the Glp3_v2 isoform is also present as a dimer (Fig. 2E).
Western blots for Glp1_v1, Glp1_v2, Glp2, Glp3_v1 and Glp3_v2 on protein extracts from adult male and female L. salmonis showed strongly reactive bands for each of the antibodies of approximately the same molecular mass as the predicted monomers (Glp1_v1: 34.4; Glp1_v2: 32.4; Glp2: 28.6; Glp3_v1: 34.5; Glp3_v2: 35.8 kDa) (Fig. 2A–E). Antibodies that were preadsorbed by the antigenic peptides were negative for the same bands (Fig. 2F–J, Supplementary Fig. S2F–O). These data reveal that Glp1_v1 is specific to adults males (Fig. 2F) as previous noted for the RNA23, while Glp1_v2, Glp2, Glp3_v1 and Glp3_v2 are expressed in both sexes (Fig. 2 G–J). In the case of Glp1_v2, however, a more intense reaction in the female compared to the male could be related to an additonal sex-specific expression in the oocytes (see below).
The cellular localizations of the Glps in adult male and female L. salmonis were subsequently determined by immunofluorescence microscopy using the affinity-purified antibodies. The male-specifc expression of Glp1_v1 was detected in type 3 tegumental glands that are bilaterally located in the subepidermal tissue of the cephalothorax (Fig. 3A–F). The glands specifically develop in preadults and adults and extend posterolaterally along the cephalothorax27. The long duct structures, which discharge secretions far from the main gland tissues, are indicated by the smaller area of positive staining as the gland extends posterolaterrally (Fig. 3D, F). Preadsorption of the antiserum with the peptide antigen led to a complete absence of staining in the same tissues (Fig. 3G, H). In contrast to Glp1_v1, the Glp1_v2 isoform is expressed in the enterocytes throughout the lengths of the intestines of both males (Fig. 4A–E) and females (Fig. 4F, Supplementary Fig. S3A, B). In these experiments, the channel was localized intracellularly, and preadsorbed antibodies yielded no signal (Supplementary Fig. S3C–F; K–L). In females, Glp1_v2 was also localized intacellulary within ovarian oocytes (Fig. 4G, H) and the cortical cytoplasm of the stacked oocytes in the immature egg strings (Fig. 4I, J). Preadsorbed antibodies gave no signal in the oocytes (Supplementary Fig. S3G, H). Glp2 was found in the epithelia lining type 1 tegumental glands, which are numerous in both males (Fig. 5A–C) and females (Fig. 5D–F). No signal was detected by preadsorbed antibodies (Fig. 5G, H). Glp3_v1 is specifically located in the apical membrane throughout the intestinal brush border of both males (Fig. 6A–C) and females (Fig. 6F–J), with no signal detected by preadsorbed antibodies (Fig. 6D, E; Supplementary Fig. S3I, J). Finally Glp3_v2 is localized in the the epithelia of blood vessels and blood sinuses surrounding the intestines of both males (Fig. 7A–C) and females (Fig. 7F–H) with no staining observed when using preadsorbed antibodies (Fig 7D, E, I, J). These data thus show that the closely related Glp2 and Glp3 paralogs, including both splice variants of the Glp3 channel are each expressed in different tissues. Conversely, isoforms of the Glp1 and Glp3 paralogs (Glp1_v2 and Glp3_v1), which are distantly related in the A and B tree clusters, are both expressed in the enterocytes, but with differential subcellular localization. These observations reveal that the signal transduction pathways regulating the intracellular channel trafficking of Glp1_v2 and Glp3_v1 in the enterocytes may be different.
Protein kinases regulate the intracellular trafficking of L. salmonis Glps
To investigate whether PKC or PKA signal transduction pathways are involved in the intracellular trafficking of the L. salmonis Glps, we expressed each paralog and variant in X. laevis oocytes exposed to either the PKC activator phorbol 12-myristate 13-acetate (PMA) or the cAMP-PKA activator, forskolin (FSK), respectively. In the latter instance, the FSK-exposed oocytes were preincubated with the phosphodiesterase inhibitor 3-isobutyl-1-methylxanthine (IBMX). We then visualized the changes in plasma membrane channel content via immunofluorescence and image analysis of the frog oocytes, as well as by Western blots of the total and plasma membrane extracts using the isoform-specific antibodies (Fig. 8; Supplementary Data 2). Injection of Glp1_v1 resulted in significant increases (p < 0.001; one-way ANOVA with Dunnett’s multiple comparison test) of the channel in the plasma membrane fraction compared to the controls controls (treated with the DMSO vehicle) when exposed to PMA or FSK (Fig. 8A–C; Supplementary Fig. S4A–B). Conversely, the experiments with Glp1_v2 (Fig. 8D–F: Supplementary Fig. S4C–D), Glp2 (Fig. 8G–I; Supplementary Fig. S4E–F) and Glp3_v1 (Fig. 8J–L; Supplementary Fig. S4G–H) only resulted in significant increases (p < 0.001; one-way ANOVA with Dunnett’s multiple comparison test) of the channels in the plasma membrane fractions when the frog oocytes were exposed to FSK. However, in oocytes expressing Glp3_v2, no changes in the amount of the channel in the plasma membrane were observed after PMA or FSK treatments (Fig. 8M–O; Supplementary Fig. S4I–J).
To corroborate the above results we investigated the changes in osmotic water (Pf) and glycerol (Pgly) permeability of oocytes expressing each paralog and variant under the same experimental conditions (Fig. 9; Supplementary Data 2). These data show that the Pf and Pgly of Glp1_v1-injected oocytes were significantly increased (p < 0.05; one-way ANOVA with Dunnett’s multiple comparison test) compared to controls in the presence of PMA or FSK (Fig. 9A). In fact, exposure of the oocytes to either activator was required in order to detect the increase in Pf or Pgly with respect to water-injected controls. This was also the case for the Glp1_v2 isoform, although in this latter case, a significant increase (p < 0.05; one-way ANOVA with Dunnett’s multiple comparison test) in Pf and Pgly was only detected in the presence of FSK (Fig. 9B). The Pf and Pgly of Glp2 and Glp3_v1-injected oocytes also increased significantly (p < 0.05; one-way ANOVA with Dunnett’s multiple comparison test) with respect to controls in the presence of FSK (Fig. 9C–D). As in the previous immunofluoresence experiments, no changes in the Pf or Pgly of Glp3_v2-injected oocyte were detected in relation to the controls when the oocytes were exposed to either PMA or FSK (Fig. 9E). Taken together these independent experiments reveal that the intracellular trafficking of the Glp1_v1 isoform is activated by both the PKC and PKA signal transduction pathways, yet only the PKA and not the PKC signal transduction pathway is involved in the intracellular trafficking induction of the Glp1_v2, Glp2 and Glp3_v1 channels in oocytes. Conversely, neither the PKC nor the PKA signal transduction pathways appear to regulate the intracellular trafficking of the Glp3_v2 isoform.
Isoform-specific sites regulate the membrane trafficking of L. salmonis Glps
To test the hypothesis that intracellular trafficking of Glp1_v1, Glp1_v2, Glp2 and Glp3_v1 channels is controlled by PKC and/or PKA phosphorylation of the channels, we initially conducted in silico searches for relevant phosphorylation sites in the intracellular domains of each channel. This yielded several potential sites in the N-termini or loop B (Fig. 10A). To determine whether such sites are functional for either of the kinases, we mutated each to an aspartate (D), which mimics constitutive phosphorylation, and re-examined the Pf of oocytes expressing each mutant under exposure to DMSO, PMA or FSK as above. The equivalent expression of each mutant in relation to the wild type was validated via Western blots of total membrane protein extracts using the isoform-specific antibodies.
For oocytes expressing wild-type Glp1_v1, the Pf was further elevated in response to PMA or FSK, as observed previously, while the Pf of the Glp1_v1-T3D mutant oocytes was not increased with PMA, but remained stimulated by FSK (Fig. 10B; Supplementary Data 2). This reveals that Glp1_v1 T3 is a functional PKC site, but that another PKA site appears to exist. In contrast, oocytes expressing the Glp1_v1-T14D mutant showed the same changes in Pf after PMA or FSK treatment as those expressing the wild-type, whereas the Pf of the Glp1_v1-S111D oocytes was positively affected by PMA but not by FSK (Fig. 10B). We therefore concluded that T3 and S111 are the functional PKC and PKA sites in Glp1_v1, respectively.
For Glp1_v2, oocytes expressing the Glp1_v2-S6D mutant mimmicked the effect of PMA and FSK on the wild-type (Fig. 10C; Supplementary Data 2). However the Glp1_v2-S94D mutant oocytes showed a constitutively elevated Pf with respect to the wild-type, while FSK had no effect (Fig. 10C). Consequently, S94 appears to be the functional PKA site in Glp1_v2.
For Glp2, only one potential PKA site (S10) was identified in the N-terminus of the channel. Oocytes expressing the Glp2-S10D mutant showed an increased Pf with respect to that of the wild-type, while the positive effect of FSK observed in the wild-type was abolished in the mutant (Fig. 10E; Supplementary Data 2). This demonstrates that S10 is the PKA functional site in Glp2.
Finally, of the two potential PKA sites (S5 and S43) found in the N-terminus of Glp3_v1, only the oocytes expressing the Glp3_v1-S5D mutant showed an elevated Pf with respect to the wild-type, which was not affected further by FSK (Fig. 10F; Supplementary Data 2). Conversely, the oocytes expressing the Glp3_v1-S43D mutant mimicked the effect of FSK on the wild-type (Fig. 10F), revealing that S5 and not S43 is the functional PKA site.
For all of the paralogs and their isoforms, immunoblotting experiments showed that oocytes expressed equivalent amounts of the wild-type and mutants (Fig. 10D and G; Supplementary Fig. 5A–D), indicating that the observed effects were not caused by differential expression mechanisms. These experiments thus confirm that the intracellular trafficking of four of the five L. salmonis Glp channels can be induced by the PKC and/or PKA signal transduction pathways.
Discussion
The present phylogenetic analysis of glp CDS in Crustacea is to the best of our knowledge the first to reveal the striking variability in glp gene copy number between the different lineages. It is surprising to find only single-copy glps in the the species of pedunculate thecostracans (barnacles), as well as the majority of species of isopods, amphipods and decapods investigated. This contrasts the moderate to high levels of glp gene redundancy in other crustacean lineages, such as the euphausiid krills with two copies, the sessilian thecostracans and palaemonidan prawns with up to three copies, and the diplostracan water fleas and calanoid copepods with up to nine or ten paralogs in a given species. These latter levels of glp gene redundancy have, however, been reported in other lineages of Ecdysozoa. For example, Nematoda and Tardigrada encode between five to eight glp paralogs in their genomes, whereas diverse lineages of Chelicerata, including arachnid ticks, scorpions and spiders evolved between three to five glp paralogs14,28,29. A review of the glp gene complement in a more distantly related chelicerate, the Atlantic horseshoe crab (Limulus polyphemus) also reveals eight paralogs encoded in the genome of this species. As previously reported for vertebrates2,15,30, the basis for some of the higher glp copy numbers in chelicerates such as L. polyphemus and the arachnids is partially rooted in ancestral whole genome duplications (WGD)31,32. However, although polyploidy is recognized in the isopod Trichonicus sp., the amphipod Pontoporeia affinis, and a parthenogenic strain of the anacostracan brine shrimp (Artemia sp.)33, WGD is not widely known to have occurred in Crustacea despite large variations in the sizes of their genomes34,35. Consequently other mechanisms of gene duplication must have generated the glp redundancy in this lineage.
The analysis of the glp complements in the diplostracan water fleas revealed a high level of gene linkage in the two species D. carinata and D. pulex with sequences assigned to chromosomes. Although the syntenic relationships are not fully conserved due to block rearrangements between the species (see Supplementary Fig. 1B), it seems likely that tandem duplication was a major driver of glp expansion in this lineage. This is consistent with the high prevalence of tandem gene clusters in the genomes36,37. Conversely, the increased repertoire of glp channels in the palaemonidan prawns seems to be associated with the burst of transposon activity that shaped their genomes38. This latter mechanism is also thought to have shaped the very large genomes of the euphausiid krills, which are between ~4 – 14 times longer than the human genome34. Hence tandem duplication and transposon activity can explain the increases in glp copy numbers in several lineages of Crustacea. Since such redundancy is thought to buffer phenotypes from genomic variations and thus confer advantages for an organism´s ability to evolve39, it is surprising to note the lack of glp redundancy in so many species.
This is not the case for the Copepoda. In contrast to the other lineages of Crustacea studied, the copepod glps appear to have polyphyletic origins. This would also imply asymmetrical gene loss in the other lineages of Crustacea. If long-branch attraction is discarded, then the lack of glp redundancy may be due to asymmetric loss of the genes that are orthologous to the two copepod clusters. Such gene losses are suggested to have occurred on a large-scale in the Insecta, which tend to have single-copy glps or have lost them completely14, yet are considered to have experienced multiple rounds of WGD during their evolution40. In the present analysis, however, data were only available for four orders of Copepoda, and it is thus too early to draw conclusions on the definitive origins of the copepod glps. What is clear, however, is that within the four orders of Copepoda analyzed, there are broad levels of species- and genera-specific duplications of the glps. In addition, within three of the orders, Calanoida, Harpacticoida and Siphonostomatoida, we also found evidence that the functional repertoires are further increased through splice variation.
To gain insight into the molecular basis for the retention and function of the different Glp paralogs and isoforms in copepods, we investigated the cellular localization of each variant in the hematophagous L. salmonis copepod. We selected this model, since we had previously shown that the transcripts are expressed in the adults and each translated protein functions as a Glp23. The immunolocalization data revealed that the L. salmonis Glps are expressed in five different tissues, the type 3 tegumental glands of males (Glp1_v1), and the type 1 tegumental glands (Glp2), the blood vessels and blood sinuses surrounding the intestine (Glp3_v2), and the enterocytes (Glp1_v2, Glp3_v1) of both sexes. In addition, the Glp1_v2 splice variant is expressed in the oocytes of females. Such divergent tissue localizations indicate that the Glps have in most instances neofunctionalized rather than subfunctionalised to play specific roles in the fluid and nutrient homeostasis of L. salmonis. However, in the case of Glp1_v2 and Glp3_v1 variants, which derive from the distantly related A and B clusters, respectively, there is redundant expression in the same enterocytes, but they are not colocated in the plasma membrane. Indeed the intracellular localization of Glp1_v2 in the enterocytes was also seen in the female oocytes in a pattern that is highly reminiscent of Aqp1ab in marine teleost oocytes41,42,43. The respective intracellular and apical location of the Glp1_v2 and Glp3_v1 variants in the enterocytes might therefore represent a form of subfunctionalization.
These observations promted us to investigate the intracellular trafficking regulation of the different Glp channels from L. salmonis. Since reversible phosphorylation of specific amino acid sites induced by vasopressin- or vasotocin-related neuropeptides activating PKA and PKC signal transduction pathways is a well-established mechanism governing the intracellular trafficking of aquaporins44,45,46, we initially tested whether such pathways can regulate the L. salmonis Glps. Independent experiments that examined the fractional change in plasma membrane content and the Pf of X. laevis oocytes expressing the Glps when exposed to PMA and FSK provided consistent evidence that the PKC and PKA pathways activate the membrane insertion of Glp1_v1, while plasma membrane trafficking of Glp1_v2, Glp2 and Glp3_v1 is regulated by PKA only. In contrast, neither of these two pathways regulate the intracellar trafficking of Glp3_v2. The most direct evidence was obtained from the site-directed mutagenesis experiments, which demonstrated that phosphorylation of specific channel residues by PKC or PKA indeed differentially regulates the intracellular trafficking of the L. salmonis Glps. However, the data futher revealed that not all of the predicted sites are functional, which precludes definitive comparisons with the Glps of other species based solely upon in silico predictions. The data for L. salmonis nevertheless reveal that there is isoform-specific pathway regulation of the channels with PKC and PKA regulating the Glp1_v1 variant, but only PKA regulating the Glp1_v2 variant. Conversely, the PKA pathway regulates the Glp3_v1 variant, but not the Glp3_v2 variant. As a result, the PKA pathway can regulate four channels that are each expressed in different tissues, with extra controls added by the PKC pathway for the male-specifc Glp1_v1 isoform. Intriguingly, we found that the two Glp paralogs expressed in enterocytes (Glp1_v2 and Glp3_v1) show differential subcellular localization. Since both of these paralogs are regulated by the PKA signaling pathway, other yet unknown mechanisms must also be involved in the intracellular trafficking regulation of the Glp1_v2 and Glp3_v1 channels.
In conclusion, we find that the evolution of Glps within the Crustacea is highly divergent, with large variances in gene copy numbers between the lineages. Species within the orders Pedunculata, Notostraca, Isopoda, Amphipoda and Decapoda typically retain single copy genes, while those within the orders Sessilia, Diplostraca, Euphausiacea, the class Copepoda, and the Palaemonidae family of decapod prawns have signifcantly expanded their glp gene repertoires. Gene expansion is associated with tandem duplications and bursts of transposon activities, rather than WGD. The highest copy numbers are currently found in the Daphnidae family of diplostracan water fleas, but the highest diversity is observed in Copepoda with large-scale genera- or species-specific duplications within two distantly related clusters. Based upon experimental evidence of the Glp proteins in a parastic copepod, we find that glp gene duplication and splice variation has not resulted in functional redundancy of the channels. On the contrary, due to the evolution of unique regulatory sites for the PKA- and PKC-signal transduction pathways within the N-terminal or loop B domains of each paralog and isoform, the increased repertoire of Glps affords a high fidelity control over the channel membrane trafficking even when expressed in the same cell. These findings therefore suggest that neofunctionalization or subfunctionalization associated with intracellular trafficking represents an important selective force for Glp evolution in Copepoda.
Methods
Biological samples
A laboratory strain of L. salmonis was raised on Atlantic salmon (Salmon salar)47. Prior to sampling of L. salmonis specimens, the fish were sedated with a mixture of benzocaine (60 mg/L) and methomidate (5 mg/L) and euthanised with a blow to the head. All experiments were conducted in accordance with the regulations approved by the governmental Norwegian Animal Research Authority (http://www.fdu.no/fdu/).
Sequence, phylogenetic and syntenic analyses
Contiguous peptide sequences were identified and assembled following tblastn queries of open source whole genome shotgun (WGS), transcriptome shotgun (TSA) and nucleotide databases (NCBI [blast.ncbi.nlm.nih.gov]). The corresponding nucleotide sequences were then retrieved from the respective databases and trimmed to match each peptide fragment, and finally concatenated to construct a coding sequence (CDS) for each gene or transcript15,45. Nucleotide sequence data reported are available in the Third Party Annotation Section of the DDBJ/ENA/GenBank databases under the accession numbers TPA: BK034896-BK035227. Prior to Bayesian (Mr Bayes v3.2.2)48 analyses, data sets of the deduced amino acids were aligned using the L-INS-I or G-INS-I algorithms of MAFFT v7.45349, and converted to codon alignments using Pal2Nal15,45,50. Bayesian phylogenetic analyses with model parameters nucmodel = 4by4, nst = 2, rates = gamma were performed on the codon alignments following removal of the N- and C-termini and gapped regions containing less than three sequences. Two separate data sets of glp codons from Mandibulata (N = 241) and Copepoda (N = 98) were constructed (Supplementary Figs. 2 and 3) and analyzed with 15 and 5 million Markov chain Monte Carlo (MCMC) generations, respectively. Each run consisted of three heated and one cold chain with the resulting posterior distributions examined for convergence and an effective sample size >1400 using Tracer version 1.751. Majority rule consensus trees were summarized with a burnin of 25%, processed with Archaeopteryx52 and rendered with Geneious (Biomatters Ltd, New Zealand). Alignment files together with the accession numbers are provided in Supplementary data 1: Files S1-S4. The syntenic analyses of the Daphnia genes were conducted via tblastn searches of WGS databases. In silico searches for potential phosphorylation sites were carried out using the NetPhos 3.1 Server (http://www.cbs.dtu.dk/services/NetPhos/)53.
L. salmonis Glp antibodies
N-terminal or extracellular loop-C peptide sequences (Glp1_v1: MSTDLDKPYHSRLT; Glp1_v2: MSKKGSFD; Glp2: GYRSGPFVAG; Glp3_v1: KPVKGLLYKSFDFE; Glp3_v2: HSEGEGQNKDLEAT) were synthesized and injected in rabbits to raise paralog- and isoform-specific polyclonal antibodies (Agrisera AB, Sweden). The antisera were affinity purified against the synthetic peptides54, and their specificity confirmed by ELISA, as well as by immunofluorescence microscopy and immunoblotting of X. laevis oocytes
Functional characterization of L. salmonis Glps
Constructs for heterologous expression in X. laevis oocytes were generated by subcloning the full-length L. salmonis glp cDNAs into pT7T expression vectors23. Point mutations in the wild-type sequences were introduced using the QuikChange Lightning Site-Directed Mutagenesis Kit (Agilent Technologies). All constructs in pT7T vectors were resequenced to validate that the correct mutations were present. The cRNA synthesis and isolation of stage V–VI oocytes were performed55. Oocytes were transferred to modified Barth’s solution [MBS: 88 mM NaCl, 1 mM KCl, 2.4 mM NaHCO3, 0.82 mM MgSO4, 0.33 mM Ca(NO3)2, 0.41 mM CaCl2, 10 mM HEPES and 25 µg/ml gentamycin, pH 7.5] and injected with 50 nl of distilled water (control) or 50 nl of water containing 20 ng (Glp1_v1, Glp1_v2 and Glp3_v1), 2–10 ng (Glp3_v2) or 0.25 ng (Glp2) cRNA.
The Pf of water-injected and Glp-expressing oocytes was determined using a swelling assay at pH 7.555,56. The Pgly was determined volumetrically in isotonic MBS at pH 7.5, where NaCl was replaced by 160 mM glycerol57. The osmolarity of the solution was measured with a vapor pressure osmometer (Vapro 5600, Wescor, USA), and adjusted to 200 mOsm with NaCl if necessary. The effect of the PKC activator PMA or the cAMP-PKA activator FSK on oocyte Pf and Pgly was respectively tested by preincubating the oocytes with 100 nM PMA for 30 min, or with 100 µM IBMX for 1 h and then with 100 µM FSK for 30 min, before the time-course determination of oocyte swelling.
Protein extraction and immunoblotting
Total and plasma membrane fractions of X. laevis oocytes were isolated58. L. salmonis specimens were dissociated with a glass dounce homogenizer in ice-cold RIPA buffer containing 150 mM NaCl, 50 mM Tris-HCl, pH 8, 1% Triton X-100, 0.5% sodium deoxycholate, 1 mM EDTA, 1 mM EGTA, EDTA-free protease inhibitors (Roche Applied Science, Mannheim, Germany), 1 mM Na3VO4 and 1 mM NaF, and centrifuged at 14,000 g for 10 min at 4 °C. The supernatant was mixed with 2× Laemmli sample buffer containing 200 µM dithiothreitol, heated at 95 °C for 15 min, deep frozen in liquid nitrogen, and stored at −80 °C. Protein extracts were deglycosylated with PNGase F (New England Biolabs) at 37 °C for 2 h and the reaction was blocked at 95 °C for 10 min before SDS-PAGE. Immunoblotted45 membranes were incubated with Glp1_v1 (1:500), Glp1_v2 (1:500), Glp2 (1:500), Glp3_v1 (1:1000) or Glp3_v2 (1:1000) antibodies overnight at 4 °C. The specificity of the reactions was determined by incubation of duplicate membranes with the antiserum preadsorbed with the corresponding antigenic peptide.
Immunofluorescence microscopy
Fixation of X. laevis oocytes and L. salmonis specimens, and processing for immunostaining on histological sections55. Sections were incubated with 1:300 dilutions for each primary antibody and 1.1000 dilution of Cy3-conjugated anti-rabbit antibody. Labeled sections were photographed at 63x magnification with a Zeiss Axio Imager Z1/ApoTome fluorescence microscope (Carl Zeiss Corp., Belcodène, France). Images from negative control sections were taken with the same fluorescence intensity and exposure times than those used for the positives. In X. laevis oocytes, the relative abundance of each Glp at the oocyte surface, in the presence or absence of PMA and FSK, was semiquantified using the ImageJ open-source software (version 1.46r). A section of the oocyte with fixed dimensions enclosing the plasma membrane and cytoplasm was generated, and the pixel intensity within each region was recorded. The dimensions of the oocyte section were kept constant for all images from oocytes expressing the same Glp. The pixel values from one image from six oocytes per treatment were analyzed.
Statistics and reproducibility
Data (mean ± SEM) on the percentage of Glp in the oocyte plasma membrane, Pf and Pgly were statistically analyzed by one-way ANOVA, followed by the Dunnett’s multiple comparison tests, or by an unpaired Student’s t tests. Data were tested for normal distribution (Shapiro-Wilk test) and homogeneity of variances (Forsythe and Barlett test) prior to conducting parametric tests. Statistical analyses were carried out using the SigmaPlot software v12.0 (Systat Software Inc.) and GraphPad Prism v8.4.3 (686) (GraphPad Software). In all cases, statistical significance was defined as P < 0.05 (*), P < 0.01 (**), or P < 0.001 (***).
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
All data generated or analyzed during this study are included in this manuscript (and its supplementary information files). The complete alignments shown in Fig. 1 and Supplementary Fig. 1 are provided in Supplementary Data 1. The source data underlying plots shown in figures are provided in Supplementary Data 2. All other data is available from authors upon reasonable request.
Code availability
Mr Bayes software used for phylogenetic inference is available at https://github.com/NBISweden/MrBayes/releases. MAFFT used for multiple sequence alignments is available at https://mafft.cbrc.jp/alignment/software. PAL2NAL used for codon conversion is available at http://www.bork.embl.de/pal2nal/.
References
Heymann, J. B. & Engel, A. Aquaporins: phylogeny, structure, and physiology of water channels. N. Physiol. Sci. 14, 187–193 (1999).
Abascal, F., Irisarri, I. & Zardoya, R. Diversity and evolution of membrane intrinsic proteins. Biochim. Biophys. Acta 1840, 1468–1481 (2014).
Mukhopadhyay, R., Bhattacharjee, H. & Rosen, B. P. Aquaglyceroporins: generalized metalloid channels. Biochim. Biophys. Acta 1840, 1583–1591 (2014).
Finn, R. N. & Cerdà, J. Evolution and functional diversity of aquaporins. Biol. Bull. 229, 6–23 (2015).
Finn, R. N. & Cerdà, J. Aquaporin. In Encyclopedia of Signaling Molecules. 2nd Edition (ed Choi, S.) 1–18 (Springer, New York, 2018).
Ho, J. D. et al. Crystal structure of human aquaporin 4 at 1.8 angstrom and its mechanism of conductance. Proc. Natl Acad. Sci. USA 106, 7437–7442 (2009).
Murata, K. et al. Structural determinants of water permeation through aquaporin-1. Nature 407, 599–605 (2000).
Sui, H. X., Han, B. G., Lee, J. K., Walian, P. & Jap, B. K. Structural basis of water-specific transport through the AQP1 water channel. Nature 414, 872–878 (2001).
Savage, D. F., Egea, P. F., Robles-Colmenares, Y., O’Connell, J. D. & Stroud, R. M. Architecture and selectivity in aquaporins: 2.5 angstrom X-ray structure of aquaporin Z. PLoS Biol. 1, 334–340 (2003).
Fu, F. et al. Structure of a glycerol-conducting channel and the basis for its selectivity. Science 290, 481–486 (2000).
Savage, D. F., O’Connell, J. D. III, Miercke, L. J. W., Finer-Moore, J. & Stroud, R. M. Structural context shapes the aquaporin selectivity filter. Proc. Natl Acad. Sci. USA 107, 17164–17169 (2010).
de Maré, S. W., Venskutonyte, R., Eltschkner, S., de Groot, B. L. & Lindkvist-Petersson, K. Structural basis for glycerol efflux and selectivity of human aquaporin 7. Structure 28, 1–8 (2020).
Zardoya, R., Ding, X. D., Kitagawa, Y. & Chrispeels, M. J. Origin of plant glycerol transporters by horizontal gene transfer and functional recruitment. Proc. Natl Acad. Sci. USA 99, 14893–14896 (2002).
Finn, R. N., Chauvigné, F., Stavang, J. A., Belles, X. & Cerdà, J. Insect glycerol transporters evolved by functional co-option and gene replacement. Nat. Commun. 6, 7814 (2015).
Yilmaz, O. et al. Unravelling the complex duplication history of deuterostome glycerol transporters. Cells 9, 1663 (2020).
Danielson, J. A. H. & Johanson, U. Unexpected complexity of the aquaporin gene family in the moss Physcomitrella patens. BMC Plant Biol. 8, 45 (2008).
Pommerrenig, B. et al. Functional evolution of nodulin 26-like intrinsic proteins: from bacterial arsenic detoxification to plant nutrient transport. N. Phytol. 225, 1383–1396 (2020).
Van Ekert, E. et al. Molecular and functional characterization of Bemisia tabaci aquaporins reveals the water channel diversity of hemipteran insects. Insect Biochem. Mol. Biol. 77, 39–51 (2016).
Giribet, G. & Edgecombe, G. D. The phylogeny and evolutionary history of arthropods. Curr. Biol. 29, R592–R602 (2019).
Holmes, S. P., Li, D., Ceraul, S. M. & Azad, A. F. An aquaporin-like protein from the ovaries and gut of American dog tick (Acari: Ixodidae). J. Med. Entomol. 45, 68–74 (2008).
Ball, A., Campbell, E. M., Jacob, J., Hoppler, S. & Bowman, A. S. Identification, functional characterization and expression patterns of a water-specific aquaporin in the brown dog tick, Rhipicephalus sanguineus. Insect Biochem. Mol. Biol. 39, 105–112 (2009).
Campbell, E. M., Burdin, M., Hoppler, S. & Bowman, A. S. Role of an aquaporin in the sheep tick Ixodes ricinus: assessment as a potential control target. Int. J. Parasit. 40, 15–23 (2010).
Stavang, J. A. et al. Phylogenomic and functional analyses of salmon lice aquaporins uncover the molecular diversity of the superfamily in Arthropoda. BMC Genomics 16, 618 (2015).
Peng, J. et al. Transcriptomics-based identification of aquaporin diversity in the house dust mite Dermatophagoides farinae (Acariformes: Pyroglyphidae). J Insect Sci. 18, 11 (2018).
Farlora, R., Araya-Garay, J. & Gallardo-Escárate, C. Discovery of sex-related genes through high-throughput transcriptome sequencing from the salmon louse Caligus rogercresseyi. Mar. Genom. 15, 85–93 (2014).
Farlora, R., Valenzuela-Muñnoz, V., Chávez-Mardones, J. & Gallardo-Escárate, C. Aquaporin family genes exhibit developmentally-regulated and host-dependent transcription patterns in the sea louse Caligus rogercresseyi. Gene 585, 119–127 (2016).
Øvergård, A.-C. et al. Exocrine glands of Lepeophtheirus salmonis (Copepoda: Caligidae): Distribution, developmental appearance, and site of secretion. J. Morphol. 277, 1616–1630 (2016).
Huang, C. G., Lamitina, T., Agre, P. & Strange, K. Functional analysis of the aquaporin gene family in Caenorhabditis elegans. Am. J. Physiol. 292, C1867–C1873 (2007).
Grohme, M. A. et al. The aquaporin channel repertoire of the tardigrade Milnesium tardigradum. Bioinformatics Biol. Insights 2013, 7 (2013).
Finn, R. N., Chauvigné, F., Hlidberg, J. B., Cutler, C. P. & Cerdà, J. The lineage-specific evolution of aquaporin gene clusters facilitated tetrapod terrestrial adaptation. PLoS ONE 9, 11 (2014).
Kenny, N. J. et al. Ancestral whole-genome duplication in the marine chelicerate horseshoe crabs. Heredity 116, 190–199 (2016).
Schwager, E. E. et al. The house spider genome reveals an ancient whole-genome duplication during arachnid evolution. BMC Biol. 15, 62 (2017).
Song, C. et al. Polyploid organisms. Sci. China Life Sci. 55, 301–311 (2012).
Jeffery, N. W. The first genome size estimates for six species of krill (Malacostraca, Euphausiidae): large genomes at the north and south poles. Polar Biol. 35, 959–962 (2012).
Alfsnes, K., Leinaas, H. P. & Hessen, D. O. Genome size in arthropods; different roles of phylogeny, habitat and life history in insects and crustaceans. Ecol. Evol. 7, 5939–5947 (2017).
Colbourne, J. K. et al. The ecoresponsive genome of Daphnia pulex. Science 331, 555–561 (2011).
Lee, B.-Y. et al. The genome of the freshwater water flea Daphnia magna: a potential use for freshwater molecular ecotoxicology. Aquat. Toxicol. 210, 69–84 (2019).
Yuan, J. et al. Genome sequences of marine shrimp Exopalaemon carinicauda Holthuis provide insights into genome size evolution of Caridea. Mar. Drugs 15, 213 (2017).
Kafri, R., Springer, M. & Pilpel, Y. Genetic redundancy: new tricks for old genes. Cell 136, 389–392 (2009).
Li, Z. et al. Multiple large-scale gene and genome duplications during the evolution of hexapods. Proc. Natl Acad. Sci. USA 115, 4713–4718 (2018).
Cerdà, J., Zapater, C., Chauvigné, F. & Finn, R. N. Water homeostasis in the fish oocyte: New insights into the role and molecular regulation of a teleost-specific aquaporin. Fish. Physiol. Biochem. 39, 19–27 (2013).
Zapater, C., Chauvigné, F., Tingaud-Sequeira, A., Finn, R. N. & Cerdà, J. Primary oocyte transcriptional activation of aqp1ab by the nuclear progestin receptor determines the pelagic egg phenotype of marine teleosts. Dev. Biol. 377, 345–362 (2013).
Cerdà, J., Chauvigné, F. & Finn, R. N. The physiological role and regulation of aquaporins in teleost germ cells. In Adv. Exp. Med. Biol. vol 969 (ed Yang, B.) 149–171 (Springer, Dordrecht, 2017).
Deen, P. M. T. et al. Requirement of human renal water channel aquaporin-2 for vasopressin-dependent concentration of urine. Science 264, 92–95 (1994).
Chauvigné, F. et al. The vertebrate Aqp14 water channel is a neuropeptide-regulated polytransporter. Commun. Biol. 2, 462 (2019).
Nesverova, V. & Törnroth-Horsefield, S. Phosphorylation-dependent regulation of mammalian aquaporins. Cells 8, 82 (2019).
Hamre, L. A., Glover, K. A. & Nilsen, F. Establishment and characterisation of salmon louse (Lepeophtheirus salmonis (Kroyer 1837)) laboratory strains. Parasitol. Int. 58, 451–460 (2009).
Ronquist, F. & Huelsenbeck, J. P. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19, 1572–1574 (2003).
Katoh, K. & Toh, H. Recent developments in the MAFFT multiple sequence alignment program. Brief. Bioinformatics 9, 286–298 (2008).
Suyama, M., Torrents, D. & Bork, P. PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. 34, W609–W612 (2006).
Rambaut, A., Drummond, A. J., Xie, D., Baele, G. & Suchard, M. A. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst. Biol. 67, 901–904 (2018).
Han, M. V. & Zmasek, C. M. phyloXML: XML for evolutionary biology and comparative genomics. BMC Bioinformatics 10, 356 (2009).
Blom, N., Gammeltoft, S. & Brunak, S. Sequence and structure-based prediction of eukaryotic protein phosphorylation sites. J. Mol. Biol. 294, 1351–1362 (1999).
Zapater, C., Chauvigné, F., Norberg, B., Finn, R. N. & Cerdà, J. Dual neofunctionalization of a rapidly evolving Aquaporin-1 paralog resulted in constrained and relaxed traits controlling channel function during meiosis resumption in teleosts. Mol. Biol. Evol. 28, 3151–3169 (2011).
Chauvigné, F., Boj, M., Vilella, S., Finn, R. N. & Cerdà, J. Subcellular localization of selectively permeable aquaporins in the male germline of a marine teleost reveals spatial redistribution in activated spermatozoa. Biol. Reprod. 89, 1–17 (2013).
Chauvigné, F., Boj, M., Finn, R. N. & Cerdà, J. Mitochondrial aquaporin-8-mediated hydrogen peroxide transport is essential for teleost spermatozoon motility. Sci. Rep. 5, 7789 (2015).
Chauvigné, F., Lubzens, E. & Cerdà, J. Design and characterization of genetically engineered zebrafish aquaporin-3 mutants highly permeable to the cryoprotectant ethylene glycol. BMC Biotechnol. 11, 34 (2011).
Kamsteeg, E. J. & Deen, P. M. T. Detection of aquaporin-2 in the plasma membranes of oocytes: A novel isolation method with improved yield and purity. Biochem. Biophys. Res. Commun. 282, 683–690 (2001).
Acknowledgements
This work was supported by grants from the Norwegian Research Council (Grants no. 254872/E40, 204813/E40 and 294768/E40 to R.N.F., and 203516/O30 to F.N.) and the Spanish Ministry of Science and Innovation (MICINN) (Grant no. PID2019-103868RB-I00 to J.C). Participations of M.C. and J.A.S. were funded by the Norwegian Research Council (Grant nos 254872/E40 and 204813/E40, respectively). Participation of F.C. was funded by a “Ramon y Cajal” contract (RYC-2015-17103) from the Spanish MINECO.
Author information
Authors and Affiliations
Contributions
R.N.F., M.C. and J.C. designed the research; M.C, F.C., J.A.S. and R.N.F. performed the research; F.N. contributed with materials and the acquisition of animal samples; M.C., R.N.F. and J.C. analyzed the data and M.C., R.N.F. and J.C. wrote the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Catalán-García, M., Chauvigné, F., Stavang, J.A. et al. Lineage-level divergence of copepod glycerol transporters and the emergence of isoform-specific trafficking regulation. Commun Biol 4, 643 (2021). https://doi.org/10.1038/s42003-021-01921-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s42003-021-01921-9
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.