European sea bass genome and its variation provide insights into adaptation to euryhalinity and speciation

Tine, Mbaye; Kuhl, Heiner; Gagnaire, Pierre-Alexandre; Louro, Bruno; Desmarais, Erick; Martins, Rute S.T.; Hecht, Jochen; Knaust, Florian; Belkhir, Khalid; Klages, Sven; Dieterich, Roland; Stueber, Kurt; Piferrer, Francesc; Guinand, Bruno; Bierne, Nicolas; Volckaert, Filip A. M.; Bargelloni, Luca; Power, Deborah M.; Bonhomme, François; Canario, Adelino V. M.; Reinhardt, Richard

doi:10.1038/ncomms6770

Download PDF

Article
Open access
Published: 23 December 2014

European sea bass genome and its variation provide insights into adaptation to euryhalinity and speciation

Mbaye Tine^1,2^na1,
Heiner Kuhl²^na1,
Pierre-Alexandre Gagnaire^3,4^na1,
Bruno Louro⁵,
Erick Desmarais³,
Rute S.T. Martins⁵,
Jochen Hecht^2,6,
Florian Knaust²,
Khalid Belkhir³,
Sven Klages²,
Roland Dieterich¹,
Kurt Stueber¹,
Francesc Piferrer⁷,
Bruno Guinand³,
Nicolas Bierne^3,4,
Filip A. M. Volckaert⁸,
Luca Bargelloni⁹,
Deborah M. Power⁵,
François Bonhomme^3,4,
Adelino V. M. Canario⁵ &
…
Richard Reinhardt^1,2

Nature Communications volume 5, Article number: 5770 (2014) Cite this article

19k Accesses
293 Citations
14 Altmetric
Metrics details

Subjects

Abstract

The European sea bass (Dicentrarchus labrax) is a temperate-zone euryhaline teleost of prime importance for aquaculture and fisheries. This species is subdivided into two naturally hybridizing lineages, one inhabiting the north-eastern Atlantic Ocean and the other the Mediterranean and Black seas. Here we provide a high-quality chromosome-scale assembly of its genome that shows a high degree of synteny with the more highly derived teleosts. We find expansions of gene families specifically associated with ion and water regulation, highlighting adaptation to variation in salinity. We further generate a genome-wide variation map through RAD-sequencing of Atlantic and Mediterranean populations. We show that variation in local recombination rates strongly influences the genomic landscape of diversity within and differentiation between lineages. Comparing predictions of alternative demographic models to the joint allele-frequency spectrum indicates that genomic islands of differentiation between sea bass lineages were generated by varying rates of introgression across the genome following a period of geographical isolation.

Genome sequences reveal global dispersal routes and suggest convergent genetic adaptations in seahorse evolution

Article Open access 17 February 2021

Genome of the estuarine oyster provides insights into climate impact and adaptive plasticity

Article Open access 12 November 2021

Complex population structure of the Atlantic puffin revealed by whole genome analyses

Article Open access 29 July 2021

Introduction

The European sea bass (Dicentrarchus labrax) is a teleost fish found in the north-eastern Atlantic Ocean and throughout the Mediterranean and Black seas. This is an economically important species, and natural stocks are subject to intensive exploitation by professional and sport fisheries, thus raising future conservation and management issues. It is ranked fourth in European aquaculture production and, although undergoing domestication for the last three decades, selective breeding programmes are still in their infancy.

The European sea bass inhabits coastal waters, where reproduction occurs; however, it can also enter brackish waters in estuarine areas and coastal lagoons, and occasionally rivers. Thus, the European sea bass is an euryhaline fish tolerating a wide range of salinities (0–60 psu)¹. Although several genes involved in the physiological response to variation in salinity have been identified², the genetic basis of its broad halotolerance remains unclear. Because adaptation to varying environments may also involve gene duplications³, obtaining the complete genome sequence of the European sea bass may help to identify the genetic basis for adaptation to euryhalinity.

Atlantic and Mediterranean sea bass populations represent two genetically distinct lineages that naturally hybridize in the Alboran Sea⁴. Describing genomic variation patterns that arise from the interaction between these two diverging lineages has important implications for understanding the evolutionary forces at play during the diversification of marine species. In addition, a detailed picture of genomic variation within and between lineages will provide useful information to orientate domestication and assist conservation management of sea bass populations⁵.

Here we produce a high-quality draft sequence of the European sea bass genome by combining high-throughput sequencing with genetic and physical maps^6,7,8. The current assembly spans 675 Mbp and contains ~86% of the contigs ordered and orientated along 24 chromosomes, representing one of the highest-quality fish genomes available. Most sea bass chromosomes are highly collinear with those of upper teleosts, confirming the evolutionary stability of fish genomes⁹. We annotate 26,719 genes and identify gene family expansions, some of which are promising candidates for euryhaline adaptation. In addition, we characterize genome-wide variation patterns by restriction-site-associated DNA (RAD)-sequencing ~2.5% of the genome in 100 individuals from the Mediterranean and Atlantic lineages, using three individuals of the congeneric D. punctatus as an outgroup. We find that genomic variation within and between sea bass lineages is influenced by large-scale variation in local recombination rates and by diversifying selection translating into variable rates of introgression across the genome following post-glacial secondary contact. These results provide new insights into the genetic correlates of speciation, as it occurs in a typical marine species with high levels of gene flow and a large effective population size.

Results

Genome sequencing and assembly

We sequenced the whole genome of a single meiogynogenetic male of the European sea bass to an average coverage depth of 30 × using a combination of whole-genome shotgun, mate pair and BAC end sequencing (Supplementary Table 1). The reads from three independent sequencing technologies (23.3 Gb) were assembled into a 675-Mb draft genome, with an N50 contig length of 53 kb and an N50 scaffold length of 5.1 Mb (Supplementary Table 2). Using a reference radiation hybrid map⁶, linkage maps^10,11 and collinearity with the genome of Gasterosteus aculeatus¹², we assigned 86% of contig sequences (575 Mb) to 24 chromosomal groups⁶ (Supplementary Table 2). Comparison between a previous assembly of three chromosomes⁸ and the current assembly resulting from whole-genome sequencing (WGS) revealed high similarities with locally improved scaffold orientation (Supplementary Fig. 1).

Genome annotation

A combination of ab initio gene prediction, homology search and transcript mapping resulted in 26,719 annotated genes (Supplementary Tables 3 and 4). The 18,253 bp of the whole mitogenome was assembled and annotated, and ranked among the largest in teleost fish (Supplementary Table 5).

Repetitive DNA sequences within the sea bass genome accounted for 21.47% of the assembly, with 3.87% of class I transposable elements and 4.19% of DNA transposons (Supplementary Table 6), in line with what has been found in other fish genomes^9,13. Similarly, the average GC content of the sea bass genome (40.4%) was comparable to that of other teleosts¹⁴. The distribution of GC content was rather homogeneous within chromosomes (Supplementary Fig. 2). We found a lower percentage of GC in noncoding regions (39.6%) compared with protein-coding regions (52.6%), and the third-codon positions GC content was 60.7%, suggesting the effect of selection for codon usage, or a stronger effect of biased gene conversion in coding regions relative to noncoding regions¹⁵. We found almost complete synteny and large blocks of collinearity between sea bass chromosomes and homologous chromosomes of the three most closely related teleost taxa that have a chromosome-scale assembly (G. aculeatus, Oreochromis niloticus and Tetraodon nigroviridis; Fig. 1 and Supplementary Table 7). This matches well with the position of sea bass relative to other derived teleosts on a phylogenetic tree reconstructed using 621 1:1 orthologous proteins from 20 sequenced fish genomes (Fig. 2). Our phylogenomic analysis also supports the recently resolved relationships among fully sequenced percomorph fishes, which group sea basses and sticklebacks in a common clade^16,17.

Figure 1: Collinear blocks showing the overall degree of synteny between the European sea bass (*D. labrax*) genome and seven other publicly available teleost genomes represented outside the sea bass chromosomal ring.

Figure 2: Phylogenetic tree based on 621 1:1 high-quality orthologous protein-coding genes from 20 sequenced fish genomes, showing the relationships between European sea bass (*D. labrax*) and other fish species (half of which belong to the Series Percomorpha).

Gene duplication and the evolution of euryhalinity

We evaluated the evolutionary consequences of gene expansion and differential duplicate gene loss on biological function pathways. Some pathways were enriched within certain chromosomes, including neuroactive ligand-receptor interaction (LG2), cell adhesion molecules (LG13 and LG14), endocytosis (LG19), DNA repair and nucleotide excision repair (LG20). Among the enriched pathways, we detected gene family expansions that may have played a role in adaptation to euryhalinity. These include claudins^18,19, aquaporins²⁰, arginine-vasotocin (AVT) receptors²¹, prolactin (PRL)²² and its receptor (PRLR). Although euryhalinity is an old innovation in aquatic vertebrates that has potentially favoured species diversification²³, the sea bass genome provides new insights into the genomic modifications at play in this key adaptation.

While mammalian genomes have 27 claudins²⁴, D. labrax has 61 claudin copies (Supplementary Fig. 3), thus exceeding the 54 copies found in the zebrafish (Danio rerio)²⁵, which are largely explained by the teleost-specific whole-genome duplication (TGD)²⁶. Chromosome blocks containing claudin genes are duplicated in LG13 (15 genes) and LG14 (14 genes) (Figs 1 and 3) and the two claudin clusters of 11 (LG13) and 10 (LG14) tandem genes exhibit conserved synteny to claudins 3 and 4 in human chromosome 7. Expansion of these clusters occurred before the TGD event²⁶, since the spotted gar Lepisosteus oculatus has both a single synteny block in LG22 (as in humans) and a tandem cluster of 10 claudins (Fig. 3). However, the coelacanth Latimeria chalumnae has a synteny block with only two claudin genes, indicating an expansion specific to the Actinopterygii lineage occurred²⁷. While synteny blocks 1–3 seem to be part of an ancestral chromosome, both Sarcopterygii and Actinopterygii have gained lineage-specific insertions (blocks 4M and 4F) between blocks 1 and 2 (Fig. 3), which appear to have driven claudin expansion both in mammals and teleosts. After the TGD, the teleost-specific duplicate, block 4F on LG14, was translocated between blocks 2 and 3. The LG13 duplicate, block 4F, was eliminated, while block 3 was translocated to a distant part of the chromosome, blocks 1 and 2 being maintained together.

**Figure 3: Comparison of claudin gene synteny between sea bass LG 13 and 14 and other vertebrate chromosomes including human.**

The sea bass and zebrafish genomes contain 18 members of the aquaporin family (Supplementary Fig. 4), representing the largest repertoire of functional aquaporins in vertebrates²⁰. Both sea bass and zebrafish share duplications of AQP0a-b, AQP1a-b, AQP10a-b and the putative loss of one of the two AQP4, AQP7 and AQP12 paralogs. However, the AQP5/1 and one of the AQP9 paralogs present in D. rerio are not found in the sea bass. Conversely, sea bass is the only teleost that retains four AQP8 copies as a result of TGD²⁸.

Teleost genomes contain a large diversity of AVT receptors²¹, with up to six in G. aculeatus, O. niloticus and O. latipes and seven in the sea bass. The latter included duplicated V1A and expansion of V2 from four to five copies, including a novel V2B-like receptor (Supplementary Fig. 5).

While the majority of teleosts have maintained single copies of PRL and PRL-like genes, sea bass and Takifugu rubripes²⁹ retained two PRL-like genes and Oryzias latipes retained two PRL genes (Supplementary Fig. 6). The PRL receptor gene (PRLR) is also duplicated in most fish species; however, an extra copy of PRLR-like homolog is present in sea bass, T. rubripes, G. aculeatus and in some cichlids (Supplementary Fig. 7).

The sea bass genome has the highest number of gene copies linked to ion and water regulation (94 genes) among fully sequenced teleosts. To test the hypothesis that this gene expansion is associated with the degree of euryhalinity in teleost fishes, a comparison was made to the copy number in the genomes of euryhaline (T. nigroviridis, O. latipes, O. niloticus and G. aculeatus) and stenohaline fishes (D. rerio, Gadus morhua and T. rubripes); however, no clear pattern was found (Fig. 2, Supplementary Table 8). However, the analysis of the protein-coding sequences of each gene family showed accelerated evolution of the sea bass PRLR copies one to three and of the novel vasotocin receptor V2B-L, suggesting that relaxed purifying selection or positive selection has occurred for these genes in the lineage leading to sea bass (Supplementary Tables 9–12). Other osmoregulation-related gene duplicates also displayed signatures of positive selection, although not exclusively in sea bass (for example, PRL-L2). Altogether, these results suggest that the lineage leading to sea bass has retained a large number of genes involved in ion and volume regulation since TGD, and this may have facilitated the evolution of some genes conferring high tolerance to rapid salinity changes.

Searches for recently duplicated genes identified six paralogs that are specific to sea bass, five of which exhibit signatures of positive selection at several amino-acid positions in at least one of the duplicates (Supplementary Table 13). For one of these genes, the nuclear receptor co-activator 5 (NCOA5), sex-differential expression has been reported in Nile tilapia during larval development³⁰. Because sex-determining loci in teleosts often involve recently duplicated genes³¹, NCOA5 represents an interesting candidate component of the genetic system for sex determination in sea bass.

Genome-wide patterns of polymorphism and recombination

A genetic variation map was produced from 100 wild-caught sea bass from the Atlantic Ocean and western Mediterranean Sea. We additionally analysed three spotted sea bass (D. punctatus) individuals as outgroup, in order to identify divergent sites between D. labrax and D. punctatus and to polarize single nucleotide polymorphisms (SNPs) within D. labrax (Supplementary Table 14). Genomic variation within and among individuals, lineages and species was examined at ~178,000 RAD tags³², providing an average marker density of one pair of RAD tags every 7.5 kb and a 2.5% genome coverage. A total of 234,148 SNPs were found (mean read depth 48 × per individual), revealing an average nucleotide diversity (π) of 2.52 × 10⁻³ in the Atlantic lineage, 2.60 × 10⁻³ in the Mediterranean lineage and 2.31 × 10⁻³ in the spotted sea bass. There was broad variation in nucleotide diversity within chromosomes, with a frequently observed fivefold reduced diversity in central chromosomal regions compared with the chromosomal extremities (Fig. 4a). Polymorphism was negatively correlated with divergence between D. labrax and the outgroup species D. punctatus (R²=0.085, P<10⁻¹⁵), indicating that the genomic landscape of relative divergence between species was slightly influenced by chromosomal variations in diversity within D. labrax (Fig. 4b). Recombination rates were markedly reduced in central chromosomal regions (Fig. 4c) and were positively correlated to nucleotide diversity (R²=0.586, P<10⁻¹⁵), while the correlation was negative between recombination and divergence (R²=0.078, P<10⁻¹⁵). Thus, the possible mutagenic role of recombination could not explain the patterns of nucleotide diversity within the sea bass genome, providing evidence for Hill–Robertson effects (including background selection and/or hitchhiking effects)^33,34. Consistent with the effects of selection on SNP variation at a local genomic scale, there was a lower mean expected nucleotide diversity in exonic (H_e=0.067) compared with intronic (H_e=0.075) and intergenic (H_e=0.073) regions. We also observed a diminishing average heterozygosity with decreasing distance to the nearest exon on a very local scale (<200 bp; Supplementary Fig. 8), reflecting either Hill–Robertson effects or the direct impact of purifying selection on evolutionarily constrained sites in both coding and cis-regulatory sequences.

**Figure 4: Distribution of population genetic parameters calculated in 150-kb windows across the different chromosomes of sea bass genome (x refers to LGx, and not to a sexual chromosome).**

Genomic landscape of differentiation between lineages

Genetic distinctiveness between the Atlantic and Mediterranean lineages of D. labrax was revealed by principal component analysis, in which both lineages appeared equally distant from the closely related spotted sea bass species (Fig. 5a). The genome-wide average genetic differentiation between lineages was low (F_ST=0.028) and consistent with previous estimates based on microsatellite markers^4,35,36. However, SNP-by-SNP F_ST estimates between lineages were heterogeneously distributed across the genome, with highly differentiated markers (some of which reached differential fixation) usually clustering within regions of several hundred Kb to >1 Mb. These genomic islands of differentiation were present in all chromosomes except for LG24 and tended to map disproportionately to central chromosomal regions (Fig. 4d). The local averaged F_ST was negatively correlated with the local recombination rate (R²=0.257, P<10⁻¹⁵) and nucleotide diversity (R²=0.207, P<10⁻¹⁵), as previously described in other species^37,38. However, a few highly differentiated regions were also found outside low-recombining and poorly polymorphic chromosomal regions (for example, LG13). Therefore, the heterogeneous genomic landscape of differentiation between lineages was only partially explained by the reduced diversity found in the regions of reduced recombination.

**Figure 5: Population structure and demographic history of the European sea bass, *D. labrax*.**

Demographic divergence history

To determine whether and when genomic differentiation patterns were influenced by gene flow, the past demography of European sea bass lineages was inferred using a composite likelihood approach³⁹. Seven alternative models of historical divergence were fitted to the joint allele-frequency spectrum of Atlantic and Mediterranean lineages (Fig. 5b), including scenarios of strict isolation, isolation-with-migration, ancient migration and secondary contact. The observed allele-frequency spectrum was not satisfactorily reproduced by standard demographic models that assume the gene flow parameter to be shared among loci (Supplementary Fig. 9). A custom secondary contact model with heterogeneous gene flow across the genome⁴⁰ (Fig. 5c) produced a significantly higher fit compared with all other alternative models (Fig. 5d). This analysis revealed several key features of the divergence history of sea bass lineages (Supplementary Table 15). Divergence accumulated during a period of isolation at least 20 times longer than the age of secondary contact. Genetic introgression following secondary contact was strongly asymmetric from the Atlantic to Mediterranean lineage, and occurred at highly variable rates across the genome. We estimated that ~35% of the genome did not freely introgress from one lineage to the other, which mirrors the proportion of the genome in which the windowed F_ST lies above the genome-wide average differentiation (Fig. 4d). Thus, the genomic islands of differentiation between sea bass lineages appear to have resulted from the erosion of divergence through differential introgression.

Discussion

The European sea bass genome assembly represents one of the first high-quality draft genomes available for an aquaculture fish species and will provide a valuable resource for future evolutionary analyses and genetic improvement. Compared with solely NGS-based genome assemblies, the sea bass genome assembly has benefited from low-coverage Sanger sequencing reads which, combined with data from two different NGS platforms, led to increased contig size. Scaffolding was also improved by combining mate pairs with BAC-end sequences, and a chromosome-scale assembly was obtained using available genetic linkage and radiation hybrid maps anchoring scaffolds to the 24 chromosomes comprising the sea bass karyotype. This genome assembly is a prerequisite to whole-genome or targeted resequencing, RNA-seq, chromatin immunoprecipitation-seq and Methyl-seq experiments, and will therefore catalyse genomic studies in sea bass. In addition, it will help to improve the assembly of newly sequenced genomes from related orders, since large regions of collinearity have been observed with the genome assemblies of other teleost fishes. The sea bass genome sequence may thus facilitate genomic studies in other economically important fishes.

Expansions of gene families related to ion and volume regulation were likely mediated through an elevated rate of duplicate gene retention after TGD, providing a genetic basis for adaptation to euryhalinity in the European sea bass and possibly other highly adaptable species. This finding questions the relative importance of whole-genome duplication events versus single-gene duplication in the evolution of euryhalinity in fish, which should motivate broader-scale comparative genomic and experimental studies in the future.

The European sea bass also provides an interesting model for understanding the evolutionary mechanisms involved in speciation. Our results imply genetically based reproductive barriers reducing gene flow between Atlantic and Mediterranean sea bass lineages. We reveal important aspects of the genetic architecture of differentiation between these two lineages, such as the disproportionate mapping of the genomic islands of differentiation within low-recombining regions. Evidence for differential gene flow after secondary contact implies that the genomic islands in sea bass are not simply an incidental consequence of reduced diversity in low-recombining regions⁴¹, but also the product of reduced gene flow in these regions. Furthermore, some genomic islands of high differentiation are also observed in regions of high diversity. Therefore, the genomic landscape of differentiation in sea bass likely results from the preferential erosion of divergence in the genomic regions with high recombination rates. The question of the timing of gene flow has been partly answered by our modelling approach, which dated the secondary contact to ca. 11,500 years BP, and estimated the divergence time to ca. 270,000 years BP (using a per-generation mutation rate of 10⁻⁸ per bp and a generation time of 5 years). This puts the onset of secondary gene flow to the last glacial retreat and strongly supports the role of distributional range shifts caused by Pleistocene glacial periods in promoting divergence. The true evolutionary history is admittedly more complex, since quaternary glacial oscillations may have resulted in intermittent gene flow during divergence. This may explain the contrast between the divergence captured by our analysis of the nuclear genome and the 2.8% divergence found in the nonrecombining mitochondrial genome, which may be involved in old cytonuclear incompatibilities that have remained protected from fixation for a much longer period than the recent history revealed by the coalescence of the rest of the genome. This new picture of the dual origin of the genetic variability in European sea bass brings important data to explain the long-standing observation of genetic discontinuities at the Almeria–Oran front in many marine species⁴². It reinforces the hypothesis of anciently diverged genetic backgrounds being trapped by the Atlantic/Mediterranean environmental boundary⁴³, and further supports the role of allopatric isolation as the main driver of marine speciation.

Methods

Sequencing strategy

All sequencing libraries were constructed from the genomic DNA of a single Adriatic sea bass individual (meiogynogenetic male 57)^44,45 (kindly provided by the late A. Libertini, CNR, Venice, Italy through J. B. Taggart, University of Stirling, Stirling, UK). The BAC library⁴⁴ was obtained from the German Resources Center for Genome Research (RZPD, Berlin, Germany). Plasmid libraries were constructed as previously described⁷. Template preparation was performed automatically at the Max Planck Institute for Molecular Genetics (MPIMG, Berlin, Germany). Purification was based on size-selective precipitation in polyethylene-glycol 6000/2-propanol mixtures⁴⁶. Template DNA was sequenced on ABI3730xl capillary sequencers and raw sequencing data were processed using the PHRED basecaller⁴⁷ and LUCY⁴⁸ for quality clipping and vector clipping.

WGS libraries that were submitted to pyrosequencing were constructed according to the manufacturers’ protocols (Roche) and sequenced on a ROCHE 454 FLX Titanium sequencer. Eight mate-pair libraries of ~20-kb insert size were constructed at 454 Life Sciences (Branford, USA) to support genome scaffolding. For each library, half of a picotiter plate was sequenced on the GS FLX Titanium system. A small-insert library (300 bp) for Illumina sequencing was constructed using the NEBNext DNA Sample Prep Reagent Set 1 (NEB, Ipswich, USA). For the two larger insert libraries (500 and 1,000 bp), only five PCR cycles were performed to reduce amplification bias effects. All libraries were sequenced with 100-bp paired-end runs on an Illumina GAIIx instrument.

Genome assembly

Before assembly, Illumina sequencing reads were end-clipped to keep the largest part of a read (>64 bp) in which no quality value below 11 occurred. Duplicate reads and residual sequencing adapters were removed. All, 454 reads from 20-kb mate-pair libraries were filtered for duplicates and chimeric mates by mapping them to older assembly versions of the D. labrax genome using gsMapper v2.3 (Roche) and confirming collinearity with G. aculeatus and O. latipes. Sanger reads and 454 single-end reads were directly used for assembly using Celera Assembler v6.1 (CA6.1)⁴⁹.

Several softwares for WGS assembly were tested, among which SOAPdenovo (for Illumina data), Newbler (Roche) and CA6.1. CA6.1 was the best-performing one, although the first assemblies needed editing because of inconsistencies in the long-range mate-pair data. These issues were solved by removing duplicates and screening for potential chimeric mate pairs by making use of the strong collinearity of sea bass with the stickleback¹². After WGS assembly, long-range continuity was improved by re-scaffolding the CA6.1 scaffolds using the BAMBUS scaffolder⁵⁰, including the 454 20-kb mate pairs and BAC ends. These scaffolds were mapped against the G. aculeatus and O. latipes genomes by Megablast. Scaffolds were grouped into sets that were assigned to distinct chromosomal groups in these species to reduce the complexity of chromosome assembly.

Finally, markers from the radiation hybrid map⁶, genetic linkage maps^10,11, BAC-end and 454 20-kb mate pairs were mapped on these scaffold groups and CONSED⁵¹ was applied to manually build chromosomal sized superscaffolds. Chromosome names conformed with the naming adopted in the radiation hybrid map. Sequences that could not be ordered into chromosomal groups were concatenated into a superscaffold of unordered genomic pieces (UN). UCSC genome browser of the sea bass genome is at http://seabass.mpipz.mpg.de.

Transcriptome sequencing and assembly

Paired-end RNA-sequencing was performed using sea bass offspring obtained by crossing wild parents at the Ifremer Station Expérimentale d’Aquaculture (Palavas-les-Flots, France). Four pools of three to four individual liver and intestine total RNA extracts were assembled to prepare four RNAseq libraries of distinct Mediterranean and Atlantic origin (Supplementary Table 4) at the CERBM (Université de Strasbourg, France). mRNA was purified from total RNA using poly-T oligo-attached magnetic beads and fragmented using divalent cations at 95°C during 5 min. After reverse transcription with random primers and ligation to adapters, enrichment was performed through 13 cycles of PCR amplification. PCR products were then purified with AMPure beads (Agencourt). Size selection of ~250–350 bp fragments was performed by electrophoresis on a 2% agarose gel. Libraries were sequenced on four lanes of an Illumina GAIIx using 2 × 100 pb paired-end reads. After library cleaning with Trimmomatic⁵², reads from each of the four lanes were mapped independently on the draft sequence of the sea bass genome with Tophat and then assembled with Cufflinks⁵³, and a reference transcriptome was built by merging results of the four lanes with Cuffmerge.

Genome annotation

The ab initio gene prediction was carried out with GENESCAN⁵⁴. For homology-based predictions, we first mapped known teleost proteins from Ensembl and GenBank databases on the sea bass genome using SPALN aligner v2.12 (ref. 55). Custom scripts were used to choose the best-scoring protein match in a cluster of matches defined by exact exon–exon matches in a first iteration and overlapping exons in a second iteration. Second, the CDS models from SPALN were combined with assembled RNAseq data by splitting the gtf files of all predictions and transcripts into overlapping exon–intron–exon fragments and merging them using Cuffmerge⁵³. This resulted in a high number of possible transcript models. We used Transdecoder ( http://transdecoder.sourceforge.net) to assign coding sequences to these models and used the length of the CDS as a score for each model. We weighted the scores of these different models based on their origin (highest rank: RNAseq only, lowest rank: SPALN only). Afterwards, the best-scoring transcript from a cluster of redundant CDS was selected as the reference gene model.

Reference genes resulting from our combined annotation were functionally annotated using Blast2GO v2.6.4 (ref. 56) and a custom-blast database consisting of vertebrate proteins from GenBank. In additional, the gene model proteins were submitted to InterProScan to identify shared signatures with proteins of known function. Functional annotation of RNAseq data was performed using blastx against Ensembl fish species proteomes and the NCBI nr database. Results were contrasted with annotation based on the direct homology-based method, and permitted annotation of new genes and correction of some gene predictions.

Whole-genome alignments

Reference genome sequences of seven teleost fishes (G. aculeatus, O. niloticus, T. nigroviridis, T. rubripes, O. latipes, G. morhua and D. rerio) were downloaded from the Ensembl website. The LAST tool was applied to produce fast and sensitive whole-genome alignments. We screened for best one-to-one alignments using custom scripts removing repetitive and suboptimal alignments. Blocks of shared collinearity were constructed using the BlockDisplaySatsuma script from the Satsuma v1.17 package⁵⁷. Blocks of collinearity were further combined allowing unaligned regions of up to 100 kb and visualized in the sea bass genome browser.

Phylogenetic reconstruction

Twenty publicly available genomes were downloaded and formatted as SPALN v2.12 databases. The Ensembl proteins from O. niloticus were used to predict genes in all species by homology using SPALN v2.12. Predicted proteins for each species were filtered to retain only the best-scoring hit to O. niloticus, showing nearly complete coverage and a minimum alignment of 60% of amino-acid residues to an O. niloticus protein. We only retained 621 1:1 orthologous proteins that were found in all 20 genomes. Multiple alignments computed for each protein using ClustalW2.1 were then concatenated and Prottest used to determine the best amino-acid substitution model (JTT I+G) and phyML used to build phylogenetic trees. Other phylogenetic reconstruction methods were also tested (Neighbor Joining, FastTreeMP, MrBayes) and in all trees the nodes were highly supported and only a single difference in tree topology appeared between NJ and the other methods. Visualization of the trees was carried out in TreeDyn.

Enrichment pathway analysis

Chromosome gene sets were tested for pathway enrichments in comparison with the whole-genome gene set. Sea bass genes were queried against KEGG orthology (KO) database using KOBAS⁵⁸. KO terms of the annotated genes within each sea bass chromosome were used to identify statistically enriched related pathways in chromosomal regions after applying the FDR correction.

Detection of gene synteny and collinearity

MCScanX’s algorithm was used to perform synteny and collinearity detection across the genome of different species, as well as within the sea bass genome to search for collinear blocks indicative of duplication events and gene family expansions. We used the fraction of collinear genes as a metric to identify collinear blocks. Circular genome representations were created using Circos⁵⁹.

Detection of duplicated genes

Sea bass-specific gene duplications were detected using a pipeline that interrogates the Ensembl database of orthology groups Compara, using genes from eight teleost genomes to build Compara gene trees (D. rerio, G. morhua, O. niloticus, T. rubripes, T. nigroviridis, X. maculatus, G. aculeatus and O. latipes). Putative duplicated genes were then analysed in a rigorous phylogenetic context to assess true orthology. Alignment of protein sequences of all putative orthologues and closely related paralogues in all vertebrates was carried out for each group of duplicates. Amino-acid sequences were aligned using MAFFT⁶⁰ under E-INS-I or L-INS-I options. Sequences with insufficient information were discarded. Aligned coding sequences were used to reconstruct a gene tree with PHYML 3.0 (ref. 61) using default options. The tree topology was analysed for compatibility with gene duplication occurring in the sea bass lineage. For confirmed duplicated genes, the amino-acid alignment was then transferred into Translator X⁶² to guide nucleotide sequence alignment and obtain aligned codon sequences. The tree topology was used as user tree option in the analysis of branch- and site-specific codon evolution, which was implemented in the Data Monkey web server ( http://www.datamonkey.org/). Branch Site REL⁶³, a branch-specific test for positive selection was used to assess the presence of significantly divergent branches in the gene tree. A mixed effects model of evolution⁶⁴ test was used to assess the presence of codons/positions under positive selection.

RAD sequencing, SNP discovery and genotyping

Wild sea bass individuals were sampled from both the Atlantic Ocean (n=50) and the western Mediterranean Sea (n=50), where two distinct lineages have been documented for D. labrax⁴ (Supplementary Table 9). Three individuals from the closely related spotted sea bass D. punctatus were sampled in a Tunisian lagoon and used as an outgroup. Genomic DNA was isolated using the DNeasy Blood and Tissue Kit (Qiagen) and digested with BamHI. Seven RAD libraries were constructed by multiplexing 15 uniquely barcoded individuals per library, following an adaptation of the original protocol³². Each library was subsequently sequenced on a separate lane of an Illumina HiSeq2000 instrument with 101-bp single-end reads.

Illumina reads were demultiplexed and quality-filtered using Stacks v1.07 (ref. 65). Cleaned individual reads were mapped to the reference genome using Bowtie v2.0 with the very-sensitive option⁶⁶, allowing at most three mismatches per alignment. We then called SNPs from the aligned reads using the Stacks v1.07 pipeline, with a minimum read depth of 5 × per individual per allele to infer individual genotypes. Only RAD loci that were present in at least 70% of the samples in each population were considered for nucleotide diversity analysis.

Population genetic analyses

Genomic patterns of nucleotide diversity and genetic differentiation were computed in Stacks v1.07 as the weighted average of π and F_ST in 150-kb windows⁶⁵. We used a custom script to calculate the mean proportion of fixed differences per bp (d_f) between D. labrax and D. punctatus in 150-kb windows. SNPs were annotated to exonic, intronic or intergenic regions using SNPdat⁶⁷, and the mean expected heterozygosity in different regions was calculated with VCFtools v0.1.11 (ref. 68).

Variable recombination rates along each chromosome (ρ=4N_er per kb) were estimated using a Bayesian reversible-jump MCMC scheme under the crossing-over model of interval in LDhat⁶⁹, with 10 million iterations and 5 million burn-ins. Population structure was examined using a principal component analysis in SNPRelate⁷⁰. The demographic history of Atlantic and Mediterranean sea bass populations was inferred from their joint site frequency spectrum (SFS) in δaδi v1.6.3 (ref. 39). We used 109,329 SNPs that were polymorphic in D. labrax but fixed in D. punctatus to determine the most parsimonious ancestral allele in D. labrax. The joint SFS was projected to 45 individuals in each population to avoid missing genotypes. We considered seven alternative models of historical divergence: Strict Isolation (SI), Isolation-with-Migration (IM), Ancient Migration (AM), Secondary Contact (SC) and their heterogeneous migration rates versions: IM2m, AM2m and SC2m (Supplementary Fig. 9).

Additional information

How to cite this article: Tine, M. et al. European sea bass genome and its variation provide insights into adaptation to euryhalinity and speciation. Nat. Commun. 5:5770 doi: 10.1038/ncomms6770 (2014).

Accession codes. Sequencing reads have been deposited in the GenBank/EMBL/DDBJ Sequence Read Archive under the accession codes CBXY010000001 to CBXY010037781 (for Contigs) and HG916827 to HG916851 (for chromosomal scale assembly). Mitochondrial genomes for D. labrax Mediterranean, D. labrax Atlantic and D. punctatus have been deposited in GenBank/EMBL/DDBJ under the accession codes KJ168065, KJ168064 and KJ168066, respectively.

Accession codes

Accessions

EMBL/GenBank/DDBJ

References

Pickett, G. D. & Pawson, M. G. Sea Bass. Biology, Exploitation and Conservation Vol. 12, Chapman & Hall (1994).
Boutet, I., Long, Ky, C. L. & Bonhomme, F. A transcriptomic approach of salinity response in the euryhaline teleost, Dicentrarchus labrax. Gene 379, 40–50 (2006).
Article CAS Google Scholar
Ohno, S. Evolution by Gene Duplication Springer (1970).
Lemaire, C., Versini, J. J. & Bonhomme, F. Maintenance of genetic differentiation across a transition zone in the sea: discordance between nuclear and cytoplasmic markers. J. Evol. Biol. 18, 70–80 (2005).
Article CAS Google Scholar
Allendorf, F. W., Hohenlohe, P. A. & Luikart, G. Genomics and the future of conservation genetics. Nat. Rev. Genet. 11, 697–709 (2010).
Article CAS Google Scholar
Guyon, R. et al. A radiation hybrid map of the European sea bass (Dicentrarchus labrax) based on 1581 markers: synteny analysis with model fish genomes. Genomics 96, 228–238 (2010).
Article CAS Google Scholar
Kuhl, H. et al. The European sea bass Dicentrarchus labrax genome puzzle: comparative BAC-mapping and low coverage shotgun sequencing. BMC Genomics 11, 68 (2010).
Article Google Scholar
Kuhl, H. et al. Directed sequencing and annotation of three Dicentrarchus labrax L. chromosomes by applying Sanger- and pyrosequencing technologies on pooled DNA of comparatively mapped BAC clones. Genomics 98, 202–212 (2011).
Article CAS Google Scholar
Schartl, M. et al. The genome of the platyfish, Xiphophorus maculatus, provides insights into evolutionary adaptation and several complex traits. Nat. Genet. 45, 567–572 (2013).
Article CAS Google Scholar
Chistiakov, D. A. et al. A combined AFLP and microsatellite linkage map and pilot comparative genomic analysis of European sea bass Dicentrarchus labrax L. Anim. Genet. 39, 623–634 (2008).
Article CAS Google Scholar
Chistiakov, D. A. et al. A microsatellite linkage map of the European sea bass Dicentrarchus labrax L. Genetics 170, 1821–1826 (2005).
Article CAS Google Scholar
Jones, F. C. et al. The genomic basis of adaptive evolution in threespine sticklebacks. Nature 484, 55–61 (2012).
Article CAS Google Scholar
Chen, S. et al. Whole-genome sequence of a flatfish provides insights into ZW sex chromosome evolution and adaptation to a benthic lifestyle. Nat. Genet. 46, 253–260 (2014).
Article CAS Google Scholar
Han, L. & Zhao, Z. Comparative analysis of CpG islands in four fish genomes. Comp. Funct. Genomics 2008, 565631 (2008).
Article Google Scholar
Duret, L. & Galtier, N. Biased gene conversion and the evolution of mammalian genomic landscapes. Annu. Rev. Genomics Hum. Genet. 10, 285–311 (2009).
Article CAS Google Scholar
Betancur-R, R. et al. The tree of life and a new classification of bony fishes. PLoS Curr. 5, (2013).
Near, T. J. et al. Phylogeny and tempo of diversification in the superradiation of spiny-rayed fishes. Proc. Natl Acad. Sci. USA 110, 12738–12743 (2013).
Article ADS CAS Google Scholar
Loh, Y. H., Christoffels, A., Brenner, S., Hunziker, W. & Venkatesh, B. Extensive expansion of the claudin gene family in the teleost fish, Fugu rubripes. Genome Res. 14, 1248–1257 (2004).
Article CAS Google Scholar
Tipsmark, C. K., Baltzegar, D. A., Ozden, O., Grubb, B. J. & Borski, R. J. Salinity regulates claudin mRNA and protein expression in the teleost gill. Am. J. Physiol. - Regul. Integr. Comp. Physiol. 294, R1004–R1014 (2008).
Article CAS Google Scholar
Tingaud-Sequeira, A. et al. The zebrafish genome encodes the largest vertebrate repertoire of functional aquaporins with dual paralogy and substrate specificities similar to mammals. BMC Evol. Biol. 10, 38 (2010).
Article Google Scholar
Ocampo Daza, D., Lewicka, M. & Larhammar, D. The oxytocin/vasopressin receptor family has at least five members in the gnathostome lineage, inclucing two distinct V2 subtypes. Gen. Comp. Endocrinol. 175, 135–143 (2012).
Article CAS Google Scholar
Manzon, L. A. The role of prolactin in fish osmoregulation: a review. Gen. Comp. Endocrinol. 125, 291–310 (2002).
Article CAS Google Scholar
Schultz, E. T. & McCormick, S. D. inEuryhaline Fishes eds McCormick Stephen D, Farrell A. P., Brauner C. J. Academic Press (2013).
Mineta, K. et al. Predicted expansion of the claudin multigene family. FEBS Lett. 585, 606–612 (2011).
Article CAS Google Scholar
Baltzegar, D. A., Reading, B. J., Brune, E. S. & Borski, R. J. Phylogenetic revision of the claudin gene family. Mar. Genomics 11, 17–26 (2013).
Article Google Scholar
Amores, A., Catchen, J., Ferrara, A., Fontenot, Q. & Postlethwait, J. H. Genome evolution and meiotic maps by massively parallel DNA sequencing: spotted gar, an outgroup for the teleost genome duplication. Genetics 188, 799–808 (2011).
Article CAS Google Scholar
Nikaido, M. et al. Coelacanth genomes reveal signatures for evolutionary transition from water to land. Genome Res. 23, 1740–1748 (2013).
Article CAS Google Scholar
Cerdà, J. & Finn, R. N. Piscine aquaporins: an overview of recent advances. J. Exp. Zool. 313A, 623–650 (2010).
Article Google Scholar
Wang, Y., Li, J., Yan Kwok, A. H., Ge, W. & Leung, F. C. A novel prolactin-like protein (PRL-L) gene in chickens and zebrafish: cloning and characterization of its tissue expression. Gen. Comp. Endocrinol. 166, 200–210 (2010).
Article CAS Google Scholar
Tao, W. et al. Characterization of gonadal transcriptomes from Nile tilapia (Oreochromis niloticus) reveals differentially expressed genes. PLoS ONE 8, e63604 (2013).
Article ADS CAS Google Scholar
Kikuchi, K. & Hamaguchi, S. Novel sex-determining genes in fish and sex chromosomeevolution. Dev. Dyn. 242, 339–353 (2013).
Article CAS Google Scholar
Baird, N. A. et al. Rapid SNP discovery and genetic mapping using sequenced RAD markers. PLoS ONE 3, e3376 (2008).
Article ADS Google Scholar
Charlesworth, B., Morgan, M. & Charlesworth, D. The effect of deleterious mutations on neutral molecular variation. Genetics 134, 1289–1303 (1993).
CAS PubMed PubMed Central Google Scholar
Maynard Smith, J. & Haigh, J. The hitch-hiking effect of a favourable gene. Genet. Res. 23, 23–35 (1974).
Article Google Scholar
Quéré, N. et al. Gene flow at major transitional areas in sea bass (Dicentrarchus labrax) and the possible emergence of a hybrid swarm. Ecol. Evol. 2, 3061–3078 (2012).
Article Google Scholar
Naciri, M., Lemaire, C., Borsa, P. & Bonhomme, F. Genetic study of the Atlantic/Mediterranean transition in sea bass (Dicentrarchus labrax). J. Hered. 90, 591–596 (1999).
Article Google Scholar
Roesti, M., Hendry, A. P., Salzburger, W. & Berner, D. Genome divergence during evolutionary diversification as revealed in replicate lake–stream stickleback population pairs. Mol. Ecol. 21, 2852–2862 (2012).
Article Google Scholar
Nachman, M. W. & Payseur, B. A. Recombination rate variation and speciation: theoretical predictions and empirical results from rabbits and mice. Philos. Trans. R. Soc. Biol. Sci. 367, 409–421 (2012).
Article Google Scholar
Gutenkunst, R. N., Hernandez, R. D., Williamson, S. H. & Bustamante, C. D. Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data. PLoS Genet. 5, e1000695 (2009).
Article Google Scholar
Roux, C., Tsagkogeorga, G., Bierne, N. & Galtier, N. Crossing the species barrier: genomic hotspots of introgression between two highly divergent Ciona intestinalis species. Mol. Biol. Evol. 30, 1574–1587 (2013).
Article CAS Google Scholar
Cruickshank, T. E. & Hahn, M. W. Reanalysis suggests that genomic islands of speciation are due to reduced diversity, not reduced gene flow. Mol. Ecol. 23, 3133–3157 (2014).
Article Google Scholar
Patarnello, T., Volckaert, F. A. M. J. & Castilho, R. Pillars of Hercules: is the Atlantic–Mediterranean transition a phylogeographical break? Mol. Ecol. 16, 4426–4444 (2007).
Article Google Scholar
Bierne, N., Welch, J., Loire, E., Bonhomme, F. & David, P. The coupling hypothesis: why genome scans may fail to map local adaptation genes. Mol. Ecol. 20, 2044–2072 (2011).
Article Google Scholar
Whitaker, H. A., McAndrew, B. J. & Taggart, J. B. Construction and characterization of a BAC library for the European sea bass Dicentrarchus labrax. Anim. Genet. 37, 526 (2006).
Article CAS Google Scholar
Francescon, A. et al. Assessment of homozygosity and fertility in meiotic gynogens of the European sea bass (Dicentrarchus labrax L.). Aquaculture 243, 93–102 (2005).
Article Google Scholar
Kuhl, H. et al. A Comparative BAC map for the gilthead sea bream (Sparus aurata L.). J. Biomed. Biotechnol. 2011, 1–7 (2011).
Article Google Scholar
Ewing, B. & Green, P. Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 8, 186–194 (1998).
Article CAS Google Scholar
Li, S. & Chou, H. H. LUCY2: an interactive DNA sequence quality trimming and vector removal tool. Bioinformatics 20, 2865–2866 (2004).
Article CAS Google Scholar
Miller, J. R. et al. Aggressive assembly of pyrosequencing reads with mates. Bioinformatics 24, 2818–2824 (2008).
Article CAS Google Scholar
Pop, M., Kosack, D. S. & Salzberg, S. L. Hierarchical scaffolding with Bambus. Genome Res. 14, 149–159 (2004).
Article CAS Google Scholar
Gordon, D. & Green, P. Consed: a graphical editor for next-generation sequencing. Bioinformatics 29, 2936–2937 (2013).
Article CAS Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Article CAS Google Scholar
Trapnell, C. et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 7, 562–578 (2012).
Article CAS Google Scholar
Burge, C. & Karlin, S. Prediction of complete gene structures in human genomic DNA. J. Mol. Biol. 268, 78–94 (1997).
Article CAS Google Scholar
Iwata, H. & Gotoh, O. Benchmarking spliced alignment programs including Spaln2, an extended version of Spaln that incorporates additional species-specific features. Nucleic Acids Res. 40, e161 (2012).
Article CAS Google Scholar
Conesa, A. et al. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21, 3674–3676 (2005).
Article CAS Google Scholar
Grabherr, M. G. et al. Genome-wide synteny through highly sensitive sequence alignment: Satsuma. Bioinformatics 26, 1145–1151 (2010).
Article CAS Google Scholar
Wu, J., Mao, X., Cai, T., Luo, J. & Wei, L. KOBAS server: a web-based platform for automated annotation and pathway identification. Nucleic Acids Res. 34, W720–W724 (2006).
Article CAS Google Scholar
Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome Res. 19, 1639–1645 (2009).
Article CAS Google Scholar
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Article CAS Google Scholar
Guindon, S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321 (2010).
Article CAS Google Scholar
Abascal, F., Zardoya, R. & Telford, M. J. TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations. Nucleic Acids Res. 38, (2010).
Kosakovsky, P. S. L. et al. A random effects branch-site model for detecting episodic diversifying selection. Mol. Biol. Evol. 28, 3033–3043 (2011).
Article Google Scholar
Murrell, B. et al. Detecting individual sites subject to episodic diversifying selection. PLoS Genet. 8, e1002764 (2012).
Article CAS Google Scholar
Catchen, J. M., Amores, A., Hohenlohe, P., Cresko, W. & Postlethwait, J. H. Stacks: building and genotyping loci de novo from short-read sequences. Genes Genomes Genet. 1, 171–182 (2011).
CAS Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Article CAS Google Scholar
Doran, A. G. & Creevey, C. J. Snpdat: easy and rapid annotation of results from de novo snp discovery projects for model and non-model organisms. BMC Bioinformatics 14, 45 (2013).
Article Google Scholar
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).
Article CAS Google Scholar
McVean, G. A. et al. The fine-scale structure of recombination rate variation in the human genome. Science 304, 581–584 (2004).
Article ADS CAS Google Scholar
Zheng, X. et al. A high-performance computing toolset for relatedness and principal component analysis of SNP data. Bioinformatics 28, 3326–3328 (2012).
Article CAS Google Scholar

Download references

Acknowledgements

This work was initiated during the MARINE GENOMICS EUROPE project EU-FP6 505403 and was supported by grants from the Max Planck Society (MPG for genome sequencing, assembly and annotation), LIFECYCLE EU-FP7 222719 and BMBF-01GS0805 to R.R., and the French ANR grants LABRAD-SEQ 11-PDOC-009-01 to P.-A.G. and REGULBASS 09-GENM-003 to B.G. M.T. received a fellowship from the MPG, H.K. was partly financed by LIFECYCLE EU-FP7 222719 and F.K. by the BMBF-01GS0805 grant. L.B. was supported by the grant Progetto 12/MI/2004 from the Veneto Region. R.S.T.M. and B.L. were in receipt of fellowships SFRH/BPD/66742/2009 and SFRH/BPD/89889/2012 from the Foundation for Science and Technology of Portugal.

Author information

Mbaye Tine, Heiner Kuhl and Pierre-Alexandre Gagnaire: These authors contributed equally to this work

Authors and Affiliations

Max Planck Genome-centre Cologne, Carl-von-Linné-Weg 10, D-50829 Köln, Germany,
Mbaye Tine, Roland Dieterich, Kurt Stueber & Richard Reinhardt
Max Planck Institute for Molecular Genetics, Ihnestrasse 63, D-14195 Berlin, Germany,
Mbaye Tine, Heiner Kuhl, Jochen Hecht, Florian Knaust, Sven Klages & Richard Reinhardt
Institut des Sciences de l'Evolution (UMR 5554), CNRS-UM2-IRD, Place Eugène Bataillon, Montpellier, F-34095, France
Pierre-Alexandre Gagnaire, Erick Desmarais, Khalid Belkhir, Bruno Guinand, Nicolas Bierne & François Bonhomme
Station Méditerranéenne de l’Environnement Littoral, Université Montpellier 2, 2 Rue des Chantiers, F-34200 Sète, France,
Pierre-Alexandre Gagnaire, Nicolas Bierne & François Bonhomme
CCMAR-Centre of Marine Sciences, University of Algarve, Building 7, Campus de Gambelas, 8005-139 Faro, Portugal,
Bruno Louro, Rute S.T. Martins, Deborah M. Power & Adelino V. M. Canario
BCRT, Charité-Universitätsmedizin Berlin, Augustenburger Platz 1, D-13353 Berlin, Germany,
Jochen Hecht
Institut de Ciències del Mar, Consejo Superior de Investigaciones Científicas (CSIC), Passeig Marítim, 37-49, Barcelona, 08003, Spain
Francesc Piferrer
Laboratory of Biodiversity and Evolutionary Genomics, University of Leuven, Charles Deberiotstraat 32, B-3000 Leuven, Belgium,
Filip A. M. Volckaert
Dipartimento di Biomedicina Comparata e Alimentazione, Università di Padova, Padova, I-35124, Italy
Luca Bargelloni

Authors

Mbaye Tine
View author publications
You can also search for this author in PubMed Google Scholar
Heiner Kuhl
View author publications
You can also search for this author in PubMed Google Scholar
Pierre-Alexandre Gagnaire
View author publications
You can also search for this author in PubMed Google Scholar
Bruno Louro
View author publications
You can also search for this author in PubMed Google Scholar
Erick Desmarais
View author publications
You can also search for this author in PubMed Google Scholar
Rute S.T. Martins
View author publications
You can also search for this author in PubMed Google Scholar
Jochen Hecht
View author publications
You can also search for this author in PubMed Google Scholar
Florian Knaust
View author publications
You can also search for this author in PubMed Google Scholar
Khalid Belkhir
View author publications
You can also search for this author in PubMed Google Scholar
Sven Klages
View author publications
You can also search for this author in PubMed Google Scholar
Roland Dieterich
View author publications
You can also search for this author in PubMed Google Scholar
Kurt Stueber
View author publications
You can also search for this author in PubMed Google Scholar
Francesc Piferrer
View author publications
You can also search for this author in PubMed Google Scholar
Bruno Guinand
View author publications
You can also search for this author in PubMed Google Scholar
Nicolas Bierne
View author publications
You can also search for this author in PubMed Google Scholar
Filip A. M. Volckaert
View author publications
You can also search for this author in PubMed Google Scholar
Luca Bargelloni
View author publications
You can also search for this author in PubMed Google Scholar
Deborah M. Power
View author publications
You can also search for this author in PubMed Google Scholar
François Bonhomme
View author publications
You can also search for this author in PubMed Google Scholar
Adelino V. M. Canario
View author publications
You can also search for this author in PubMed Google Scholar
Richard Reinhardt
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.T., H.K. and P.-A.G. contributed equally to this work. P.-A.G., M.T., H.K., A.V.M.C. wrote the manuscript. F.B., A.V.M.C., R.R. and F.A.M.V. initiated the project. R.R. organized financial support and genome sequencing. P.-A.G., M.T. and F.B. coordinated manuscript writing. F.A.M.V. and F.P. provided scientific input. H.K., J.H. and F.K. carried out genome sequencing. H.K. produced the genome assembly, alignments and browser content. E.D. and B.G. produced RNAseq data. Gene annotation was performed by H.K., M.T., E.D. and K.B. B.L., R.S.T.M., D.M.P. and A.V.M.C. carried out genome pathway enrichment, synteny and gene expansion analysis. E.D. and H.K. performed the phylogenetic analyses. L.B. performed sea bass-specific gene duplication analyses. P.-A.G., F.B. and N.B. produced and analysed RAD sequencing data and performed historical demographic analysis. S.K., K.S. and R.D. maintained software packages, IT infrastructures and organized sequence submission. R.R., F.B. and A.V.M.C. supported the project as senior authors.

Corresponding authors

Correspondence to François Bonhomme, Adelino V. M. Canario or Richard Reinhardt.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures 1-9, Supplementary Tables 1-15 (PDF 1064 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/

Reprints and permissions

About this article

Cite this article

Tine, M., Kuhl, H., Gagnaire, PA. et al. European sea bass genome and its variation provide insights into adaptation to euryhalinity and speciation. Nat Commun 5, 5770 (2014). https://doi.org/10.1038/ncomms6770

Download citation

Received: 15 May 2014
Accepted: 05 November 2014
Published: 23 December 2014
DOI: https://doi.org/10.1038/ncomms6770

This article is cited by

Transgenerational exposure to ocean acidification impacts the hepatic transcriptome of European sea bass (Dicentrarchus labrax)
- Pauline Auffret
- Arianna Servili
- David Mazurais
BMC Genomics (2023)
Circulating MicroRNAs Indicative of Sex and Stress in the European Seabass (Dicentrarchus labrax): Toward the Identification of New Biomarkers
- Camille Houdelet
- Eva Blondeau-Bidet
- Benjamin Geffroy
Marine Biotechnology (2023)
The extensive transgenerational transcriptomic effects of ocean acidification on the olfactory epithelium of a marine fish are associated with a better viral resistance
- Mishal Cohen-Rengifo
- Morgane Danion
- David Mazurais
BMC Genomics (2022)
Population genetics reveals divergent lineages and ongoing hybridization in a declining migratory fish species complex
- Quentin Rougemont
- Charles Perrier
- Sophie Launey
Heredity (2022)
A chromosome-level genome assembly of the jade perch (Scortum barcoo)
- Yishan Lu
- Ruihan Li
- Chao Bian
Scientific Data (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Genome sequencing and assembly

Genome annotation

Gene duplication and the evolution of euryhalinity

Genome-wide patterns of polymorphism and recombination

Genomic landscape of differentiation between lineages

Demographic divergence history

Discussion

Methods

Sequencing strategy

Genome assembly

Transcriptome sequencing and assembly

Genome annotation

Whole-genome alignments

Phylogenetic reconstruction

Enrichment pathway analysis

Detection of gene synteny and collinearity

Detection of duplicated genes

RAD sequencing, SNP discovery and genotyping

Population genetic analyses

Additional information

Accession codes

Accessions

EMBL/GenBank/DDBJ

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links