Population genomics of an endemic Mediterranean fish: differentiation by fine scale dispersal and adaptation

Carreras, Carlos; Ordóñez, Víctor; Zane, Lorenzo; Kruschel, Claudia; Nasto, Ina; Macpherson, Enrique; Pascual, Marta

doi:10.1038/srep43417

Download PDF

Article
Open access
Published: 06 March 2017

Population genomics of an endemic Mediterranean fish: differentiation by fine scale dispersal and adaptation

Carlos Carreras¹,
Víctor Ordóñez¹,
Lorenzo Zane^2,3,
Claudia Kruschel⁴,
Ina Nasto⁵,
Enrique Macpherson⁶^na1 &
…
Marta Pascual¹^na1

Scientific Reports volume 7, Article number: 43417 (2017) Cite this article

5466 Accesses
60 Citations
1 Altmetric
Metrics details

Subjects

Population genetics

Abstract

The assessment of the genetic structuring of biodiversity is crucial for management and conservation. For species with large effective population sizes a low number of markers may fail to identify population structure. A solution of this shortcoming can be high-throughput sequencing that allows genotyping thousands of markers on a genome-wide approach while facilitating the detection of genetic structuring shaped by selection. We used Genotyping-by-Sequencing (GBS) on 176 individuals of the endemic East Atlantic peacock wrasse (Symphodus tinca), from 6 locations in the Adriatic and Ionian seas. We obtained a total of 4,155 polymorphic SNPs and we observed two strong barriers to gene flow. The first one differentiated Tremiti Islands, in the northwest, from all the other locations while the second one separated east and south-west localities. Outlier SNPs potentially under positive selection and neutral SNPs both showed similar patterns of structuring, although finer scale differentiation was unveiled with outlier loci. Our results reflect the complexity of population genetic structure and demonstrate that both habitat fragmentation and positive selection are on play. This complexity should be considered in biodiversity assessments of different taxa, including non-model yet ecologically relevant organisms.

Whole genome sequencing reveals high differentiation, low levels of genetic diversity and short runs of homozygosity among Swedish wels catfish

Article Open access 07 May 2021

Axel Jensen, Mette Lillie, … Jacob Höglund

Complex population structure of the Atlantic puffin revealed by whole genome analyses

Article Open access 29 July 2021

Oliver Kersten, Bastiaan Star, … Sanne Boessenkool

Genomic signatures of drift and selection driven by predation and human pressure in an insular lizard

Article Open access 17 March 2021

Marta Bassitta, Richard P. Brown, … Cori Ramon

Introduction

The assessment of marine biodiversity, including genetic structuring, is one of the major goals of population management and conservation biology¹. This assessment should ideally be achieved by the combination of two alternative approaches based on the analysis of neutral and adaptive loci². The detection of barriers to dispersion is crucial in order to identify isolated units and to assess the degree of connectivity among populations. This detection is especially challenging for marine organisms, for which barriers to dispersal are less evident than those in the terrestrial environment and connectivity usually is due to larval stages³. Neutral genetic markers, such as microsatellites, have been extensively used for this purpose in the past decades⁴, but have lacked power to detect differentiation on several occasions due to recent divergence of populations, large population sizes, the limited number of markers used or homoplasy (e.g. refs 5,6). Another key process in evolutionary genetics is adaptation by natural selection that also drives population differentiation². Local environmental conditions would also favour genetic differentiation among populations, especially considering that generally large-sized populations are more likely affected by selection than by genetic drift⁷. Furthermore, analysing the role of adaptation and the genes involved in the species’ response is necessary to ascertain the vulnerability of key species and populations under environmental change scenarios⁸. Studies on adaptation on natural populations have generally focused on specific and well known regions of the genome, like the MHC associated to the immune system^9,10, heat-shock genes known to play a role in the stress response system^11,12 or the genotypic component of phenotypic variation disentangled through common-garden experiments^13,14. However, the combination of both neutral and selective markers to assess the distribution of genetic variability on non-model organisms is yet far to be common, especially in the marine realm, despite both types of markers provide complementary relevant information^2,15.

Perhaps one of the flagships of the genomics era, favoured by high-throughput sequencing technologies, is the possibility to easily obtain Single Nucleotide Polymorphisms (SNP) markers in ecological-model species without reference genomes^16,17,18. Although not exempt from problems (such as ascertainment bias), SNPs feature important advantages with respect to more ‘traditional’ markers such as microsatellites, including reproducibility between laboratories, high density of markers, and potential for annotation. One of the most important additional applications of SNPs developed by massive sequencing is the potential to study non-neutral signatures¹⁹, revealing adaptation processes when scanned along relevant environmental gradients²⁰ or through the detection of outliers²¹. However, even in organisms with a reference genome, a method of genome reduction is necessary for species with medium to large genomes to ensure sequence depth for SNP identification. This identification is currently achieved using Restriction site Associated DNA sequencing (RAD-seq) protocols, among which the Genotyping-by-Sequencing (GBS) approach²² provides a cost effective methodology for high density SNP discovery and genotyping. Genome reduction techniques facilitate genotyping thousands of genetic markers simultaneously, even in non-model species, resulting in a recent expansion of population genomic studies^{23,24,25,26,27}.

The CoCoNet European project (FP7 Actions) aims to establish Marine Protected Areas (MPAs) based on genetic data of key marine species and considered the Adriatic Sea as a Pilot Area²⁸. The East Atlantic peacock wrasse (Symphodus tinca, Linnaeus, 1758) has several biological features that make this non-model organism a good candidate to study genetic structuring caused by both isolation and local adaptation. This demersal fish is considered a key species due to its abundance and generalist diet (sea-urchins, ophiuroids, bivalves, shrimps and crabs), being an important prey of large predators²⁹, as well as constituting a common species in artisanal and spear fishing activities³⁰. Furthermore it has a very short Pelagic Larval Duration (PLD), lasting only 9–13 days³¹, its larvae are never found more than a few hundred metres from the shore³². Adults exhibit territorial behaviour³³ like most nest-building fishes³⁴, and thus are considered to disperse only during the larval stages. The species lives mainly in shallow rocky shores with a high abundance of arborescent algae to build the nests³³, which is common in the sampled localities of the Adriatic Sea Pilot Study³⁵. Considering all these biological characteristics it is expected that the East Atlantic peacock wrasse would have a very limited dispersion range generating strong genetic population differentiation. The genetic structuring and the degree of connectivity between populations of this species had been studied along the western Mediterranean using eight microsatellites and, despite low dispersion predictions, only major discontinuities generated genetic differentiation³. This unexpected result could be attributed to the mating and settlement behaviour of the species³³ or to the reduced number of analysed markers. High-throughput sequencing of genomic subsets targeted through restriction enzymes open the possibility of working on a genomic scale with non-model yet ecologically relevant species, like S.tinca²².

The aim of this study was to assess connectivity among present and future Marine Protected Areas (MPAs) within the southern Adriatic and northern Ionian Seas to identify the effect of different types of markers in determining genetic differentiation. More specifically, using Genotyping by sequencing we 1) genetically characterized 176 individuals of the East Atlantic peacock wrasse (Symphodus tinca) from 6 locations all of them within existing or planned MPAs and 2) identified putatively neutral loci and positively selected loci to determine changes in population genetic structure based on these two sets of markers. Finally, we discuss the implications of our results and the potential of this approach for the study of non-model organisms thus showing how these new genomic approaches can be applied to marine molecular evolutionary studies and the design of networks of MPAs.

Results

The 176 samples of Symphodus tinca analysed by GBS from the Adriatic and Ionian seas (Fig. 1, Table 1) were sampled in Karaburun Peninsula, Albania (KAP, n = 28), Island of Vir, Croatia (VIR, n = 35) and the Italian sites of Tremiti Islands (TRE, n = 22), Torre Guaceto (TOG, n = 32), Otranto (OTR, n = 29) and Porto Cesareo (POC, n = 30).

**Figure 1: Sampling locations of *Symphodus tinca* within the Adriatic and Ionian Seas.**

Table 1 Sampling information.

Full size table

General SNP calling and filtering

We obtained a total of 440.7 million high-quality reads, with a mean of 2.5 million reads per individual that resulted in a total of 231,884 paired tags for all samples. A mean of 383,502 reads per individual were correctly mapped against the previously defined paired tags used as reference (Table 1). A total of 51,221 putative SNPs were identified among our samples and 4,155 polymorphic SNPs retained over all samples after applying all filters (see methods section, Supplementary Data S1). The mean read depth per individual was 39.8 reads per locus (SD = 12.7 across individuals, SD = 23.4 across loci). The number of polymorphic SNPs was positively correlated to sample size (r = 0.89, P = 0.016). The number of alleles and the observed and expected heterozygosities were positively and negatively correlated to sample size respectively. These correlations became non-significant when we corrected these parameters by sample size and the total number of SNPs respectively (Table 1).

Population genomics

Most pairwise comparisons among sampling locations were significantly different using both F_ST-WC and F_ST-RH estimators (Table 2), with the exceptions of Porto Cesareo (POC) versus Torre Guaceto (TOG) and Vir (VIR) versus Karaburun (KAP). The two estimators were significantly correlated as assessed with a Mantel test (r = 0.99, p = 0.020). However, two additional pairwise comparisons were significant with F_ST-RH but not with F_ST-WC (Otranto-OTR versus POC and OTR versus TOG), probably due to F_ST-RH performing better at low or moderate levels of differentiation³⁶. With both estimators we could define three different units (Table 2). Tremiti (TRE) was the most differentiated location and the locations from the eastern shore of the Adriatic (VIR and KAP) were in all cases different from the south-western locations (TOG, POC and OTR).

Table 2 Pairwise genetic distances among locations of S. tinca within the Adriatic using all 4,155 polymorphic SNPs.

Full size table

The most likely number of populations identified by STRUCTURE was four (K = 4) as identified by ΔK (Supplementary Fig. S1). The probability of assignment of each individual to each of these groups (Fig. 2) revealed an overall differentiation of Tremiti from the eastern and the south-western locations but also differentiated two individuals from Vir. The MDS analysis including all individuals (Fig. 3A) clearly separated with the first axis all Tremiti individuals. Furthermore, five additional individuals were also separated from the remaining bulk of samples, one individual from Karaburun, the two from the Island of Vir already detected by STRUCTURE and two from Porto Cesareo. These five individuals had similar mean number of reads and SNP missingness than the other samples, so apparently its divergence was not a technical artefact. In order to clarify the structuring of the remaining samples we repeated the MDS analysis without the individuals from Tremiti and the five divergent samples. With this approach we found a much clearer separation of individuals sampled in eastern and south-western populations along the first axis although intermixing was observed for some populations (Fig. 3B). The assignment analyses showed that only around half of the individuals were self-assigned to the sampling locations with the exception of Tremiti, where almost all individuals were correctly self-assigned (Supplementary Table S2). However, we repeated the analysis under consideration of the three genetically different groups identified with F_ST values (Table 2), now constituting the putative populations (Tremiti, eastern locations and south-western locations), with the consequence that almost all individuals of all locations were correctly assigned to the corresponding population (Supplementary Table S2). All five divergent samples were assigned to the corresponding sampling locations and groups.

**Figure 2: Posterior probabilities of individual assignment to the most probable number of clusters (K = 4).**

**Figure 3: Multi Dimensional Scaling (MDS) plots of the 176 individuals of *Symphodus tinca* using all SNPs.**

Detection of outlier SNPs

From the 4,155 polymorphic SNPs found in our samples, 78 significant outlier SNPs were detected by ARLEQUIN after FDR correction (Supplementary Fig. S2), all of them potentially under positive selection (F_ST > 0.05). Without applying FDR correction 3,934 SNPs were assumed to be neutral as they were not significantly under selection, although neutrality cannot be directly proven. Finally, the remaining 143 SNPs were not classified in any of the former categories as yielded significant p-values but only before FDR correction. No outlier SNP was identified as to be under balancing selection but preliminary results, without filtering by a mean depth per genotype higher than 100X, identified 59 SNPs under balancing selection. Thus read depth has to be considered before assigning SNPs to any given category since the existence of paralog genes in the genome might erroneously identify SNPs as putatively being under balancing selection. BAYESCAN identified 19 statistically significant outlier SNPs, all of them potentially under positive selection, and all of them already detected by ARLEQUIN.

Pairwise F_ST-WC comparisons showed very different values when only outlier or neutral SNPs were used (Supplementary Table S1). As expected, the 19 outlier SNPs, detected by ARLEQUIN and BAYESCAN to be potentially under positive selection, showed higher F_ST-WC values than when using all the SNPs. Furthermore, all pairwise comparisons with only outlier loci were significant but one (Porto Cesareo versus Torre Guaceto). Neutral SNPs also showed significant structuring, but with lower F_ST-WC values than the whole set of SNPs, and one more non-significant value involving the locations across the Otranto channel: Karaburun Peninsula and Otranto (Supplementary Table S1, Fig. 1). Despite these differences, both sets of pairwise F_ST values were significantly correlated as assessed with a Mantel test (r = 0.927, P = 0.01). Regardless of the set of SNPs used similar population distribution was obtained with a PCoA analysis, which roughly reflected the geographical position of the sampled locations (Fig. 4). The first axis, explaining around 70% of the differentiation in all set of SNPs, clearly differentiated Tremiti from the other locations while the second axis, explaining roughly 20%, separated the south-western from the eastern locations (Fig. 4). Sampling locations were similarly grouped by the heatmaps based on their allele frequency distribution, although groupings were more clearly differentiated when using any of the two sets of outlier markers (Fig. 5). A MDS performed for all individuals using only neutral SNPs revealed that four of the five divergent individuals detected using the whole set of SNPs (those two from Vir and those two from Porto Cesareo) were also divergent for neutral SNPs (Supplementary Figure S3).

**Figure 4: Principal Coordinate Analysis (PCoA) of Symphodus tinca locations of the Adriatic Sea using F_ST pairwise genetic distances.**

Figure 5: Heatmap of the major allele frequency for each SNP in the six populations of *S. tinca.*

The 78 sequences with SNPs identified as outlier with either ARLEQUIN or BAYESCAN were blasted against the genome of the Nile tilapia (Oreochromis niloticus), and 12 of them yielded significant matches (Supplementary Table S3). Four were located within a known gene (minimum distance between the gene and the SNP equal to zero): three of them completely within an intron and one sequence (SNP S1_2262059, Supplementary Table S3) overlapped exonic and intronic regions of the neuronal pentraxin 2b gene (nptx2b). The outlier SNP of this sequence was a T/A located seven nucleotides upstream of the exon before a polypyrimidine tract. An A in this position, as described in the literature, seems to be involved in the branch point of the lariat formation during intron removal³⁷. Thus the T change could compromise the correct splicing of intron two of nptx2b. The function of this gene was related to the regulation of circadian rhythm³⁸ suggesting that adaptation of this SNP could be related to environmental factors. The frequency of the allele that allowed the lariat formation (A) apparently increased with latitude and was higher in the western part of the Adriatic, although the correlation had a low number of observations (Supplementary Fig. S4).

Discussion

The recent incoming of new genomic tools based on next-generation sequencing is revolutionising molecular ecology and evolutionary studies, especially applied to non-model organisms. They not only outperform traditional markers, but also open new research opportunities including the role of adaptation in population differentiation^24,27. In this study we used GBS to study the genetic structure of an ecologically relevant Mediterranean endemic fish, for which traditional markers had revealed a genetic structuring not consistent with the known biology of the species (e.g. low dispersal capabilities), thus showing how these new genomic approaches could change our vision of marine molecular ecology and evolution.

Using 8 microsatellites, Galarza et al.³ found significant genetic structuring in Symphodus tinca populations only along the Almeria-Oran Front, an oceanographic discontinuity in the west Mediterranean that has been reported to be a barrier to gene flow for a high number of species³⁹. However, no trace of genetic differentiation was found for S. tinca along other known fronts, like the Balearic Front, which isolates other fish species with much longer PLD and with an offshore larval distribution³. Even though we have not sampled the same populations, and thus a direct comparison is not possible, we have found significant genetic structure among populations separated by smaller distances and depths than in the previous study, suggesting that GBS provides more resolution when assessing genetic differentiation given that thousands of loci are being assessed. Likewise, recent studies on marine organisms have found deeper levels of genetic differentiation using genome-wide approaches rather than using traditional genetic markers e.g. ref. 25 and it has been proposed that thousands of markers at genome level are often required to distinguish among alternative scenarios, for instance when reconstructing invasion histories⁴⁰. In our study we have analysed up to 14 Mb of the genome considering the ~250 K paired tags detected among all samples. This length would roughly correspond to a 2% of the S. tinca genome⁴¹. Given that 94.5% of our SNPs would be considered neutral, our study also indicates that analysing thousands of curated SNPs is necessary to identify loci potentially under local adaptation in addition to identify connectivity patterns assessed from the neutral markers. Furthermore our results call for caution when SNPs are identified to be under balancing selection since a higher number of reads than expected would suggest the presence of paralogs.

Previous studies within the Adriatic Sea did not find relevant barriers to dispersal in many marine organisms, including fishes^42,43,44. However, some studies demonstrated a weak north-south differentiation^45,46,47. These findings agree with the hydrodynamic provinces suggested in the Mediterranean from Lagrangian simulations and network reconstruction identifying different regions in the Adriatic Sea according to different PLDs⁴⁸. Similarly, Lagrangian simulations assuming larval movement of Carcinus aestuarii support the oceanographic subdivision of the Adriatic Sea into three sub-basins matching currents from north to south⁴⁹. However, only weak genetic differentiation was observed in C. aestuarii mostly differentiating northern and southern locations along the western coast. According to these Lagrangian simulations, our sample from Tremiti could be considered to be representative of a central group of S. tinca for its geographical location, and thus it would explain its strong genetic differentiation from the south-western localities independent of the set of SNPs. A clear genetic break was also observed for Aphanius fasciatus using mtDNA in the eastern Adriatic coast, and at approximately the same latitude of Tremiti, that was attributed to a divergence process during the Pleistocene⁵⁰. However, our populations along the eastern Adriatic coast were not strongly differentiated thus suggesting that the subdivisions might vary comparing the east to the west and the marker used. Most interestingly two individuals found in Vir and two in Porto Cesareo were genetically differentiated and unlikely to belong to the populations sampled in the present study. One possible explanation could be that these individuals represent recent migrants from north-eastern populations or from the central Ionian Sea, respectively. These areas, not included in the present study, have been suggested to be differentiated according to Lagrangian simulations⁴⁸ and thus future sampling of additional locations in these northern and southern areas would help to clarify how many genetically differentiated groups are within the Adriatic and their connections. A recent study using biophysical modelling on Symphodus ocellatus, a species with similar larval characteristics as S. tinca, suggested a high larval retention and confined dispersal across a narrow geographic range, with occasional and weak connections across the two shores of the Adriatic³⁵. This suggested that high larval retention is in accordance with our results since even using only neutral SNPs we detected north-south dispersal limitations and an east-west barrier for S.tinca within the Adriatic Sea. A recent mtDNA and microsatellite study on the black scorpionfish (Scorpaena porcus) in the Adriatic Sea sampling similar localities found some east-west differentiation although was not fully supported by all pairwise comparisons²⁸. The differences with our study could be due to the different resolution of the molecular markers used or to real differences in connectivity mediated by a longer larval duration among others. Therefore, genome-wide SNP genotyping could provide much greater power than traditional markers to detect genetic differentiation and thus, to define barriers to the dispersion of studied organisms^51,52.

One of the most exciting advantages of the genomic approach, compared to traditional markers, possibly relies on the chance to identify traces of selection that shape population genetic differentiation². This advantage is especially promising considering that the role of selection has often been ignored in biodiversity assessments. The assessment of the structuring driven by selection is not unprecedented, as several studies have found significant genetic structure when analysing markers under selection, contrasting to a lack of structuring when using neutral markers^23,24,53. This apparent contradiction has been linked to the generally large effective population sizes of the study organisms and an associated minimal effect of genetic drift. Under large-population scenarios, allele frequencies of different populations are expected to change mainly by differential selection pressures, and even under high levels of migrants, the resulting gene flow would not be sufficient to dilute the genetic differentiation of these markers generated by selection²⁴. Our study detects differentiation using different sets of SNPs, outlier and neutral markers, although the signal greatly reduces for the latter. Thus, the resulting genetic structure reflects the effect of two different components, dispersion and selection, and both should be considered when defining units for management and conservation².

A complex picture of the genetic structure of Symphodus tinca within the Adriatic Sea emerges as a result of the conjunction of these two components. The dispersal potential of this species, likely determined by larval characteristics^31,34, limits its connectivity over long distances and across significant depths. The adaptation to local conditions may strengthen even more this differentiation and thus this study supports previous work hypothesizing that natural selection shapes marine populations at much smaller scales than expected (e.g. refs 23, 54 and 55). Some clues of the elements of selection defining this local adaptation could be found by comparing the sequences of the outlier SNPs to whole genomes. Unfortunately most of the outlier loci (85.2%) did not yield significant matches with the closest available sequenced genome represented by the Nile tilapia. Furthermore, the detection of outlier SNPs does not necessarily mean that these SNPs are located in the gene influenced by the selection. For instance most of our outlier SNPs were located on intergenic regions, probably reflecting linkage disequilibrium to a neighbouring candidate gene or regulatory region. Furthermore, our species lacks a reference genome, and thus the linkage between SNPs and neighbouring genes assumes synteny across species, which is something that remains to be tested. For all these reasons, our results should be interpreted with caution and only as general evidences of the spectrum of elements which could be affecting local adaptation. However, some evidence was obtained for SNP S1_2262059 located within a circadian gene, as it matched a regulatory position involved in intron splicing and its frequency varied with latitude. The information obtained through the comparison of the sequences containing the outlier SNPs with a related genome could be helpful when designing future studies, for instance Genome-Wide Association studies GWAs⁵⁶, or when targeting specific genes.

The somewhat different results found when using different sets of markers open the debate of what constitutes the correct set of markers to use when inferring both genetic structuring and connectivity. While F_ST values obtained by different sets of markers correlated, showing that the relative genetic distances among localities were maintained, the significance of the comparisons obtained from these sets were slightly different. The neutral set of markers showed moderate levels of connectivity between the Adriatic populations across the Otranto channel while being highly differentiated with the outlier loci. This may either suggest that population structure is driven by selection, or that outlier SNPs are simply loci that show higher differentiation because they were picked on the upper end of an F_ST distribution caused by genetic drift. If we assume that outlier SNPs are under selection, we then should include them to infer genetic structuring to delineate conservation and management units². Furthermore, outlier SNPs should ideally not be included to infer connectivity levels among populations, as even highly connected populations may show signals of genetic differentiation due to selection. Thus the inclusion of outlier SNPs may result in an underestimation of the levels of migration among populations⁵³. In our case-study, either considering neutral or outlier SNPs potentially under positive selection the differentiation at both sides of the Adriatic Sea and in a north-south axis seem clear. The major differences between the different types of SNPs exist across the Otranto channel, at the entrance of the Adriatic. Thus, although the individuals may physically cross through this narrow strait from East to West and thereby mix the populations at both sides of the Adriatic, as suggested by neutral SNPs, the outlier loci results indicate that local conditions may be different enough to prevent the genetic homogenisation of these populations through selection. A similar situation has been suggested across the Atlantic-Mediterranean transition, were temporal fluctuations suggests a complex balance of dispersal and selection⁵⁷. Consequently, the differences found in outlier loci should also be taken into account when defining management units for this species because the degree of genetic structuring is clearly deeper than suggested by only neutral markers. Particularly, managers should consider the populations at both sides of the Otranto channel as different units, regardless of the small geographic distance among them, because each area holds a particular set of locally adapted genes. These results are especially relevant when planning networks of Marine Protected Areas (MPAs) and provide insights about how these MPAs are connected considering that Karaburun Peninsula, Tremiti, Torre Guaceto and Porto Cesareo are extant MPAs and Otranto is planned to be a MPA. All south-western MPAs would form an MPA network that would be connected to eastern Adriatic MPAs, as shown by neutral SNPs, but genetically different due to positive selection, as shown by outlier SNPs (Fig. 4). Furthermore, these connections seem not to be symmetrical, as Torre Guaceto, Vir and Porto Cesareo export migrants, while Karaburun Peninsula and Otranto located at both sides of the Otranto channel mostly receive migrants. Considering this asymmetry, protection of source populations should be a priority for management and conservation plans considering their role as network builders. Finally Tremiti MPA would be independent to all of our other localities and could be considered to belong to a different cell of ecosystem functioning⁵⁸.

Overall, our study clearly indicates that GBS is a good approach for population genomic studies of non-model organisms and emphasizes the novelties it brings, particularly to the study of marine organisms. Population genomics studies, as inferred through neutral and outlier SNPs, increase the ability to identify genomic areas under selection which then enhances our knowledge on how dispersal and local adaption shape biodiversity structuring.

Methods

Sampling

Samples of Symphodus tinca were obtained from 6 different locations within the Adriatic and Ionian seas (Fig. 1, Table 1). Tissue samples or fin clips were taken from adults captured by fishermen using spear fishing. Samples were taken from June 2013 to February 2014 and stored in 96% Ethanol. The collection of fish samples was conducted in strict accordance with Spanish and European regulations. The study was found exempt from ethics approval by the ethics commission of the University of Barcelona since, according to article 3.1 of the European Union directive (2010/63/UE) from the 22/9/2010, no approval is needed for fish sacrification with the purpose of tissue or organ analyses. Furthermore, the study species Symphodus tinca is not listed in CITES.

Laboratory procedures

DNA was extracted from samples using the QIAamp^® DNA Mini Kit (QIAGEN) extraction kit following manufacturer’s instructions. DNA integrity was checked by gel electrophoresis, quantified by NanoDrop^® and 1.5–3 μg of DNA per sample was sent to the Cornell University Biotechnology Resource Centre (BRC) to perform GBS²². At the Cornell BRC Genomic Diversity Facilities, individual libraries were produced after digestion with EcoT22I and ligation of a barcode adaptor and a common adaptor with appropriated sticky ends. A total of 95 samples and a blank sample per plate were pooled and cleaned using the Qiagen PCR cleanup kit^® following manufacturer’s instructions. The two resulting 96plex libraries were then amplified by PCR using generic primers matching the adaptors and the following PCR conditions: 5 minutes at 72 °C, 30 seconds at 98 °C, 18 cycles of 10 seconds at 98 °C, 30 seconds at 65 °C and 30 seconds at 72 °C and a final extension of 5 minutes at 72 °C. The PCR was cleaned with the QIAquick PCR Purification kit^®, diluted and single-end sequenced in an Illumina HiSeq 2500 platform at BRC, by using one lane per plate and the HiSeq v4 reagents kit.

SNP calling

Raw sequences from Illumina were used for genotyping using the UNEAK pipeline⁵⁹ as implemented in Tassel vs 3.0⁶⁰. All data from different plates were analysed simultaneously using a common keyfile after removing the blank samples. All high-quality reads with a corresponding barcode were trimmed to 64 bp, excluding primers and barcodes, and all identical reads were merged as unique tags. Resulting tags were pairwise-aligned and all the possible pairs combined as networks. Only reciprocal tag pairs with a 1 bp mismatch were retained as potential SNPs after filtering with a certain tolerance error (set to 0.03). Reads from each individual were then mapped against the retained paired tags to extract the individual genotypes. VCF files were generated by applying a filter to a maximum number of 2 alleles per SNP. Additional filters were then applied using VCFtools vs 1.12⁶¹. We first filtered the individual genotypes by retaining only those with a minimum depth of 5X and a genotype quality (GQ) higher than 98. We also removed SNPs with a minimum allele frequency (MAF) lower than 0.01 and a missingness value higher than 30% (retaining SNPs present at >70% of the individuals). Finally, we removed SNPs with a mean depth per genotype higher than 100X to avoid possible paralogs since preliminary results without removing them yielded a high number of SNPs with large number of reads identified to be under balancing selection.

Population genomics

VCF files were converted to PLINK vs 1.9⁶² using VCFtools. Additionally were also converted to ARLEQUIN vs 3.5⁶³, GENETIX vs 4.05.2⁶⁴, STRUCTURE vs 2.3.4⁶⁵, BAYESCAN vs 2.1⁶⁶ and GeneClass2 vs 2.0⁶⁷ using the file converter PDGSpider vs 2.0.8.3⁶⁸. ARLEQUIN was used to check for departure from Hardy-Weinberg Equilibrium and all loci deviating in at least the 60% of the localities were removed from further analyses²⁵. ARLEQUIN was also used to calculate general diversity indices for each location and for computing F_ST-WC pairwise population values⁶⁹. Allelic richness was calculated using the software ADZE vs 1.0⁷⁰. Genetic differentiation was also assessed using the corrected³⁶ values of F_ST−RH⁷¹ with GENETIX. These two different F_ST measures were used as F_ST-WC is recommended for high values of differentiation and F_ST−RH for low or moderate values of differentiation³⁶. A FDR correction for multiple comparisons was applied to calculate the appropriate threshold of differentiation⁷². Population structuring was also evaluated using the programme STRUCTURE, which implements a Bayesian clustering method to identify the most likely number of genetically differentiated populations (K). We used the strategy and parameters described in the literature⁷³ and thus we carried out 10 runs per each value of K ranging from 1 to the number of localities plus two. We used the model of correlated allele frequencies and a burnin of 50,000 followed by 500,000 Markov Chains Monte Carlo. We estimated the ad hoc statistic ΔK in order to infer the most likely number of populations using STRUCTURE HARVESTER⁷⁴, The 10 runs of STRUCTURE for the most probable K were averaged using CLUMPP vs 1.1.2⁷⁵. A Multi-Dimensional Scaling (MDS) analysis was performed for all individuals using PLINK and the results were plotted using an Excel^® spreadsheet. We also tested if all individuals were correctly reassigned to their sampling locations by using GeneClass2⁶⁷ that implements the Bayesian approach described in the literature⁷⁶ and excludes the individual from their population during computation (leave-one-out procedure). Only the individuals with an assignation score higher that 95% were considered to be correctly assigned.

Detection of outlier SNPs

We identified outlier SNPs using two different programs, ARLEQUIN and BAYESCAN. ARLEQUIN uses coalescent simulations to create a null distribution of F-statistics and then generates P-values for each locus based on its distributions and observed heterozygosities across all loci²¹. We considered each location as a unit to implement a hierarchical island model in order to reduce false positives introduced due to population structure. We performed a total of 20,000 simulations, 10 simulated groups and 100 demes per group. This method detects outlier SNPs with high F_ST values, considered to be potentially under positive selection, and outlier SNPs with F_ST values close to zero, considered to be candidates for balancing selection. To reduce the error due to multiple comparisons we applied a FDR correction⁷² to identify statistically significant outlier SNPs. However, corrections for multiple pairwise comparisons dramatically increase the probability of type II error (β: e.g. assume neutrality of a SNP when it is really not neutral), an effect that becomes worse as many P-values are discarded⁷⁷. For this reason, we followed a conservative approach and we did not apply any correction to identify putatively neutral markers. Additionally, we identified outlier SNPs using BAYESCAN⁶⁶. This software uses a Bayesian approach to estimate population specific F_ST coefficients in contrast to a locus-specific F_ST coefficient shared by all the populations. When the locus-specific component is needed to explain the observed pattern of diversity, the software assumes departure of neutrality either due to positive selection or to balancing selection. We run 100,000 simulations and specified a prior odd of 10,000 in order to minimize false positives⁷⁸. We considered outlier SNPs those with a q-value below 0.05, which is the FDR analogue of the p-value.

We also calculated population differentiation using ARLEQUIN as described above but considering two subsets of SNPs a) outlier SNPs potentially under positive selection and b) neutral SNPs. Principal Coordinate Analyses (PCoA) were performed with GenAlEx vs 6.5⁷⁹ using the genetic distances obtained from ARLEQUIN for all the loci and these two subsets of SNPs. Additionally, we computed for each SNP and population the frequency of the major allele, considering all samples, and represented them using a heatmap and a hierarchical dendrogram as implemented in the R function ‘heatmap.2’ of the package ‘gplots’⁸⁰. This analysis was also done considering all SNPs and the two subsets above mentioned.

Finally, the 64 bp sequences containing all outlier SNPs potentially under selection were blasted against the genome of the Nile tilapia (Oreochromis niloticus), the only species with a reference genome that belongs to the same order (Perciformes) as our study species. We used the BLASTN search tool of the Ensembl website (www.ensembl.org). We set the sensitivity of the search tool to ‘Distant homologies’ in order to maximise the length of the matches considering that a certain level of divergence is expected given the phylogenetic distance between both species. We allowed a maximum E-value of 10⁻³ and considered only matches that included at least half of the 64 bp sequence of each SNP. Whenever a sequence yielded a match within a gene, the annotated function of this gene was searched at the UniProt database (www.uniprot.org). When a sequence yielded a match in an intergenic region, the closest gene was identified and also its function searched at the UniProt database.

Additional Information

How to cite this article: Carreras, C. et al. Population genomics of an endemic Mediterranean fish: differentiation by fine scale dispersal and adaptation. Sci. Rep. 7, 43417; doi: 10.1038/srep43417 (2017).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Moritz, C. Defining Evolutionarily Significant Units for Conservation. Trends Ecol. Evol. 9, 373–375 (1994).
Article CAS PubMed Google Scholar
Funk, W. C., McKay, J. K., Hohenlohe, P. A. & Allendorf, F. W. Harnessing genomics for delineating conservation units. Trends Ecol. Evol. 27, 489–496 (2012).
Article PubMed PubMed Central Google Scholar
Galarza, J. A. et al. The influence of oceanographic fronts and early-life-history traits on connectivity among littoral fish species. Proc. Natl. Acad. Sci. USA 106, 1473–1478 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Hellberg, M. E. In Annual Review of Ecology Evolution and Systematics Vol. 40 Annual Review of Ecology Evolution and Systematics 291–310 (Annual Reviews, 2009).
Waples, R. S. & Gaggiotti, O. What is a population? An empirical evaluation of some genetic methods for identifying the number of gene pools and their degree of connectivity. Mol. Ecol. 15, 1419–1439 (2006).
Article CAS PubMed Google Scholar
O’Reilly, P. T., Canino, M. F., Bailey, K. M. & Bentzen, P. Inverse relationship between FST and microsatellite polymorphism in the marine fish, walleye pollock (Theragra chalcogramma): implications for resolving weak population structure. Mol. Ecol. 13, 1799–1814 (2004).
Article PubMed CAS Google Scholar
Hauser, L. & Carvalho, G. R. Paradigm shifts in marine fisheries genetics: ugly hypotheses slain by beautiful facts. Fish. Fish. 9, 333–362 (2008).
Article Google Scholar
Palumbi, S. R., Barshis, D. J., Traylor-Knowles, N. & Bay, R. A. Mechanisms of reef coral resistance to future climate change. Science (New York, N.Y.) 344, 895–898 (2014).
Article ADS CAS Google Scholar
Bonneaud, C., Perez-Tris, J., Federici, P., Chastel, O. & Sorci, G. Major histocompatibility alleles associated with local resistance to malaria in a passerine. Evolution 60, 383–389 (2006).
Article CAS PubMed Google Scholar
Stiebens, V. A., Merino, S. E., Chain, F. J. J. & Eizaguirre, C. Evolution of MHC class I genes in the endangered loggerhead sea turtle (Caretta caretta) revealed by 454 amplicon sequencing. BMC Evol. Biol. 13 (2013).
Calabria, G. et al. Hsp70 protein levels and thermotolerance in Drosophila subobscura: a reassessment of the thermal co-adaptation hypothesis. J. Evol. Biol. 25, 691–700 (2012).
Article CAS PubMed Google Scholar
Hemmer-Hansen, J., Nielsen, E. E., Frydenberg, J. & Loeschcke, V. Adaptive divergence in a high gene flow environment: Hsc70 variation in the European flounder (Platichthys flesus L.). Heredity 99, 592–600 (2007).
Article CAS PubMed Google Scholar
Larsen, P. F. et al. Adaptive differences in gene expression in European flounder (Platichthys flesus). Mol. Ecol. 16, 4674–4683 (2007).
Article CAS PubMed Google Scholar
Harrald, M., Wright, P. J. & Neat, F. C. Substock variation in reproductive traits in North Sea cod (Gadus morhua). Can. J. Fish. Aquat. Sci. 67, 866–876 (2010).
Article Google Scholar
Stiebens, V. A. et al. Living on the edge: how philopatry maintains adaptive potential. P Roy Soc B-Biol Sci 280, 20130305 (2013).
Google Scholar
Everett, M. V. & Seeb, J. E. Detection and mapping of QTL for temperature tolerance and body size in Chinook salmon (Oncorhynchus tshawytscha) using genotyping by sequencing. Evol. Appl. 7, 480–492 (2014).
Article CAS PubMed PubMed Central Google Scholar
Schunter, C., Garza, J. C., Macpherson, E. & Pascual, M. SNP development from RNA-seq data in a nonmodel fish: how many individuals are needed for accurate allele frequency prediction? Molecular Ecology Resources 14, 157–165 (2014).
Article CAS PubMed Google Scholar
Helyar, S. J. et al. Application of SNPs for population genetics of nonmodel organisms: new opportunities and challenges. Molecular Ecology Resources 11, 123–136 (2011).
Article PubMed Google Scholar
Stapley, J. et al. Adaptation genomics: the next generation. Trends Ecol. Evol. 25, 705–712 (2010).
Article PubMed Google Scholar
Kapun, M., Fabian, D. K., Goudet, J. & Flatt, T. Genomic Evidence for Adaptive Inversion Clines in Drosophila melanogaster . Mol. Biol. Evol. 33, 1317–1336 (2016).
Article CAS PubMed Google Scholar
Excoffier, L., Hofer, T. & Foll, M. Detecting loci under selection in a hierarchically structured population. Heredity 103, 285–298 (2009).
Article CAS PubMed Google Scholar
Elshire, R. J. et al. A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species. Plos One 6 (2011).
Bradbury, I. R. et al. Parallel adaptive evolution of Atlantic cod on both sides of the Atlantic Ocean in response to temperature. P Roy Soc B-Biol Sci 277, 3725–3734 (2010).
Google Scholar
Lamichhaney, S. et al. Population-scale sequencing reveals genetic differentiation due to local adaptation in Atlantic herring. Proc. Natl. Acad. Sci. USA 109, 19345–19350 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Benestan, L. et al. RAD genotyping reveals fine-scale genetic structuring and provides powerful population assignment in a widely distributed marine species, the American lobster (Homarus americanus). Mol. Ecol. 24, 3299–3315 (2015).
Article PubMed Google Scholar
Ogden, R. et al. Sturgeon conservation genomics: SNP discovery and validation using RAD sequencing. Mol. Ecol. 22, 3112–3123 (2013).
Article CAS PubMed Google Scholar
Reitzel, A. M., Herrera, S., Layden, M. J., Martindale, M. Q. & Shank, T. M. Going where traditional markers have not gone before: utility of and promise for RAD sequencing in marine invertebrate phylogeography and population genomics. Mol. Ecol. 22, 2953–2970 (2013).
Article CAS PubMed PubMed Central Google Scholar
Boissin, E. et al. Contemporary genetic structure and post-glacial demographic history of the black scorpionfish, Scorpaena porcus, in the Mediterranean and the Black Seas. Molecular Ecology, doi: 10.1111/mec.13616 (2016).
MoralesNin, B. & Moranta, J. Life history and fishery of the common dentex (Dentex dentex) in Mallorca (Balearic Islands, western Mediterranean). Fisheries Research 30, 67–76 (1997).
Article Google Scholar
Coll, J., Linde, M., Garcia-Rubies, A., Riera, F. & Grau, A. M. Spear fishing in the Balearic Islands (west central Mediterranean): species affected and catch evolution during the period 1975-2001. Fisheries Research 70, 97–111 (2004).
Article Google Scholar
Raventos, N. & Macpherson, E. Planktonic larval duration and settlement marks on the otoliths of Mediterranean littoral fishes. Mar. Biol. 138, 1115–1120 (2001).
Article Google Scholar
Sabates, A., Zabala, M. & Garcia-Rubies, A. Larval fish communities in the Medes Islands Marine Reserve (North-west Mediterranean). J. Plankton Res. 25, 1035–1046 (2003).
Article Google Scholar
Luttbeg, B. & Warner, R. R. Reproductive decision-making by female peacock wrasses: flexible versus fixed behavioral rules in variable environments. Behavioral Ecology 10, 666–674 (1999).
Article Google Scholar
Macpherson, E. & Raventos, N. Relationship between pelagic larval duration and geographic distribution of Mediterranean littoral fishes. Marine Ecology Progress Series 327, 257–265 (2006).
Article ADS Google Scholar
Melia, P. et al. Looking for hotspots of marine metacommunity connectivity: a methodological framework. Scientific Reports 6, 23705 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Raufaste, N. & Bonhomme, F. Properties of bias and variance of two multiallelic estimators of F-ST. Theoretical Population Biology 57, 285–296 (2000).
Article CAS PubMed MATH Google Scholar
Black, D. L. In Annual Review of Biochemistry. Volume 72 Annual Review of Biochemistry (eds Charles C. Richardson ) 291–336 (2003).
Appelbaum, L. et al. Circadian and Homeostatic Regulation of Structural Synaptic Plasticity in Hypocretin Neurons. Neuron 68, 87–98 (2010).
Article CAS PubMed PubMed Central Google Scholar
Patarnello, T., Volckaert, F. & Castilho, R. Pillars of Hercules: is the Atlantic-Mediterranean transition a phylogeographical break? Mol. Ecol. 16, 4426–4444 (2007).
Article PubMed Google Scholar
Adrion, J. R. et al. Drosophila suzukii: The Genetic Footprint of a Recent, Worldwide Invasion. Mol. Biol. Evol. 31, 3148–3163 (2014).
Article CAS PubMed PubMed Central Google Scholar
Vinogradov, A. E. Genome size and GC-percent in vertebrates as determined by flow cytometry: The triangular relationship. Cytometry 31, 100–109 (1998).
Article CAS PubMed Google Scholar
Astolfi, L. et al. Mitochondrial variability of sand smelt Atherina boyeri populations from north Mediterranean coastal lagoons. Mar Ecol Prog Ser 297, 233–243 (2005).
Article ADS CAS Google Scholar
Maltagliati, F., Di Giuseppe, G., Barbieri, M., Castelli, A. & Dini, F. Phylogeography and genetic structure of the edible sea urchin Paracentrotus lividus (Echinodermata: Echinoidea) inferred from the mitochondrial cytochrome b gene. Biol. J. Linnean Soc. 100, 910–923 (2010).
Article Google Scholar
Garoia, F. et al. Microsatellite DNA variation reveals high gene flow and panmictic populations in the Adriatic shared stocks of the European squid and cuttlefish (Cephalopoda). Heredity 93, 166–174 (2004).
Article CAS PubMed Google Scholar
Bembo, D. G. et al. Allozymic and morphometric evidence for two stocks of the European anchovy Engraulis encrasicolus in Adriatic waters. Mar. Biol. 126, 529–538 (1996).
Article CAS Google Scholar
Papetti, C. et al. Single population and common natal origin for Adriatic Scomber scombrus stocks: evidence from an integrated approach. Ices Journal of Marine Science 70, 387–398 (2013).
Article Google Scholar
Garoia, F., Guarniero, I., Piccinetti, C. & Tinti, F. First microsatellite loci of red mullet (Mullus barbatus) and their application to genetic structure analysis of adriatic shared stock. Marine Biotechnology 6, 446–452 (2004).
Article CAS PubMed Google Scholar
Rossi, V., Ser-Giacomi, E., Lopez, C. & Hernandez-Garcia, E. Hydrodynamic provinces and oceanic connectivity from a transport network help designing marine reserves. Geophysical Research Letters 41, 2883–2891 (2014).
Article ADS Google Scholar
Schiavina, M., Marino, I. A. M., Zane, L. & Melia, P. Matching oceanography and genetics at the basin scale. Seascape connectivity of the Mediterranean shore crab in the Adriatic Sea. Mol. Ecol. 23, 5496–5507 (2014).
Article CAS PubMed Google Scholar
Buj, I. et al. Population genetic structure and demographic history of Aphanius fasciatus (Cyprinodontidae: Cyprinodontiformes) from hypersaline habitats in the eastern Adriatic. Scientia Marina 79, 399–408 (2015).
Article Google Scholar
Aykanat, T. et al. Low but significant genetic differentiation underlies biologically meaningful phenotypic divergence in a large Atlantic salmon population. Mol. Ecol. 24, 5158–5174 (2015).
Article PubMed Google Scholar
Bradbury, I. R. et al. Transatlantic secondary contact in Atlantic Salmon, comparing microsatellites, a single nucleotide polymorphism array and restriction-site associated DNA sequencing for the resolution of complex spatial structure. Mol. Ecol. 24, 5130–5144 (2015).
Article CAS PubMed Google Scholar
Milano, I. et al. Outlier SNP markers reveal fine-scale genetic structuring across European hake populations (Merluccius merluccius). Mol. Ecol. 23, 118–135 (2014).
Article PubMed Google Scholar
Gaggiotti, O. E. et al. Disentangling the effects of evolutionary, demographic, and environmental factors influencing genetic structure of natural populations: Atlantic Herring as a case study. Evolution 63, 2939–2951 (2009).
Article PubMed Google Scholar
Ruzzante, D. E. et al. Biocomplexity in a highly migratory pelagic marine fish, Atlantic herring. P Roy Soc B-Biol Sci 273, 1459–1464 (2006).
Google Scholar
Korte, A. & Farlow, A. The advantages and limitations of trait analysis with GWAS: a review. Plant Methods 9 (2013).
Pascual, M. et al. Temporal and spatial genetic differentiation in the crab Liocarcinus depurator across the Atlantic-Mediterranean transition. Scientific Reports 6, 29892 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Boero, F. The future of the Mediterranean Sea Ecosystem: towards a different tomorrow. Rend. Lincei.-Sci. Fis. Nat. 26, 3–12 (2015).
Article Google Scholar
Lu, F. et al. Switchgrass Genomic Diversity, Ploidy, and Evolution: Novel Insights from a Network-Based SNP Discovery Protocol. PLoS Genet. 9 (2013).
Glaubitz, J. C. et al. TASSEL-GBS: A High Capacity Genotyping by Sequencing Analysis Pipeline. Plos One 9 (2014).
Danecek, P. et al. The variant call format and VCFtools. Bioinf. 27, 2156–2158 (2011).
Article CAS Google Scholar
Purcell, S. et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. American Journal of Human Genetics 81, 559–575 (2007).
Article CAS PubMed PubMed Central Google Scholar
Excoffier, L. & Lischer, H. E. L. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Molecular Ecology Resources 10, 564–567 (2010).
Article PubMed Google Scholar
Belkhir, K., Borsa, P., Chikhi, L., Raufaste, N. & Bonhomme, F. GENETIX 4.05, logiciel sous Windows TM pour la génétique des populations. Vol. CNRS UMR 5171 (Université de Montpellier II, 1996–2004).
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000).
Article CAS PubMed PubMed Central Google Scholar
Foll, M. & Gaggiotti, O. A genome-scan method to identify selected loci appropriate for both dominant and codominant markers: a Bayesian perspective. Genetics 180, 977–993 (2008).
Article PubMed PubMed Central Google Scholar
Piry, S. et al. GENECLASS2: A software for genetic assignment and first-generation migrant detection. J. Hered. 95, 536–539 (2004).
Article CAS PubMed Google Scholar
Lischer, H. E. L. & Excoffier, L. PGDSpider: an automated data conversion tool for connecting population genetics and genomics programs. Bioinf. 28, 298–299 (2012).
Article CAS Google Scholar
Weir, B. S. & Cockerham, C. C. Estimating F-statistics for the analysis of population structure. Evolution 38, 1358–1370 (1984).
CAS PubMed Google Scholar
Szpiech, Z. A., Jakobsson, M. & Rosenberg, N. A. ADZE: a rarefaction approach for counting alleles private to combinations of populations. Bioinf. 24, 2498–2504 (2008).
Article CAS Google Scholar
Robertson, A. & Hill, W. G. Deviations from Hardy-Weinberg proportions-sampling variances and use in estimation of inbreeding coeficients. Genetics 107, 703–718 (1984).
Article CAS PubMed PubMed Central Google Scholar
Narum, S. R. Beyond Bonferroni: Less conservative analyses for conservation genetics. Conserv. Genet. 7, 783–787 (2006).
Article CAS Google Scholar
Evanno, G., Regnaut, S. & Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol. Ecol. 14, 2611–2620 (2005).
Article CAS PubMed Google Scholar
Earl, D. A. & vonHoldt, B. M. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conservation Genetics Resources 4, 359–361 (2012).
Article Google Scholar
Jakobsson, M. & Rosenberg, N. A. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinf. 23, 1801–1806 (2007).
Article CAS Google Scholar
Rannala, B. & Mountain, J. L. Detecting immigration by using multilocus genotypes. Proc. Natl. Acad. Sci. USA 94, 9197–9201 (1997).
Article ADS CAS PubMed PubMed Central Google Scholar
Moran, M. D. Arguments for rejecting the sequential Bonferroni in ecological studies. Oikos 100, 403–405 (2003).
Article Google Scholar
Lotterhos, K. E. & Whitlock, M. C. Evaluation of demographic history and neutral parameterization on the performance of F-ST outlier tests. Mol. Ecol. 23, 2178–2192 (2014).
Article PubMed PubMed Central Google Scholar
Peakall, R. & Smouse, P. E. GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research-an update. Bioinf. 28, 2537–2539 (2012).
Article CAS Google Scholar
Warnes, G. et al. gplots: various R programming tools for plotting data. http://CRAN.R-project.org/package=gplots. (2016).
Wessel, P., Smith, W., Scharroo, R., Luis, J. & Wobbe, F. Generic Mapping Tools: Improved Version Released. EOS, Trans. AGU 64, 409–420 (2013).
Article ADS Google Scholar

Download references

Acknowledgements

This work was supported by project CTM2013-48163 from Ministerio de Economía y Competitividad and by the European FP7 CoCoNet project (Ocean 2011-4, grant agreement #287844). CC, EM and MP are part of the research groups 2014SGR-1364, 2014SGR-120 and 2014SGR-336 of the Generalitat de Catalunya. CC was supported by a grant of the Beatriu de Pinós program of the Generalitat de Catalunya. LZ was supported by the University of Padua grant CPDA148387/14. The authors would like to thank the professionals from Antheus srl (University of Salento) for sample collection in Italy, especially to Stanislao Bevilacqua and Giuseppe Guarnieri. We would also thank Simonetta Fraschetti and Tony Terlizzi (University of Salento) who helped in logistics and sample collection in Italy.

Author information

Enrique Macpherson and Marta Pascual: These authors jointly supervised this work.

Authors and Affiliations

Departament de Genètica, Microbiologia i Estadística and IRBio, Universitat de Barcelona, Av.Diagonal 643, Barcelona, 08028, Spain
Carlos Carreras, Víctor Ordóñez & Marta Pascual
Department of Biology, University of Padova, via G, Colombo 3, Padova, 35131, Italy
Lorenzo Zane
Consorzio Nazionale Interuniversitario per le Scienze del Mare, Piazzale Flaminio 9, Roma, 00196, Italy
Lorenzo Zane
University of Zadar, Ul. Mihovila Pavlinovica, Zadar, 23000, Croatia
Claudia Kruschel
Department of Biology, Faculty of Technical Sciences, Vlora University, Vlora, 9401, Albania
Ina Nasto
Centre d’Estudis Avançats de Blanes (CEAB-CSIC), Car. Acc. Cala St. Francesc 14, 17300 Blanes Girona, Spain ,
Enrique Macpherson

Authors

Carlos Carreras
View author publications
You can also search for this author in PubMed Google Scholar
Víctor Ordóñez
View author publications
You can also search for this author in PubMed Google Scholar
Lorenzo Zane
View author publications
You can also search for this author in PubMed Google Scholar
Claudia Kruschel
View author publications
You can also search for this author in PubMed Google Scholar
Ina Nasto
View author publications
You can also search for this author in PubMed Google Scholar
Enrique Macpherson
View author publications
You can also search for this author in PubMed Google Scholar
Marta Pascual
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.C., E.M. and M.P. conceived and designed the study, L.Z., C.K. and I.N. obtained the samples, C.C. and V.O. did the genetic analyses, C.C. analysed the data, C.C., V.O., E.M. and M.P. prepared the manuscript and all authors contributed to its final version.

Corresponding author

Correspondence to Carlos Carreras.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Tables and Figures (PDF 839 kb)

Supplementary Dataset 1 (ZIP 532 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Carreras, C., Ordóñez, V., Zane, L. et al. Population genomics of an endemic Mediterranean fish: differentiation by fine scale dispersal and adaptation. Sci Rep 7, 43417 (2017). https://doi.org/10.1038/srep43417

Download citation

Received: 06 September 2016
Accepted: 24 January 2017
Published: 06 March 2017
DOI: https://doi.org/10.1038/srep43417

This article is cited by

Exceptional population genomic homogeneity in the black brittle star Ophiocomina nigra (Ophiuroidea, Echinodermata) along the Atlantic-Mediterranean coast
- Carlos Leiva
- Laia Pérez-Sorribes
- Rocío Pérez-Portela
Scientific Reports (2023)
Large effective size as determinant of population persistence in Anostraca (Crustacea: Branchiopoda)
- Lucía Sainz-Escudero
- Marta Vila
- Mario García-París
Conservation Genetics (2023)
Individual-Based Models for Incorporating Landscape Processes in the Conservation and Management of Aquatic Systems
- Travis Seaborn
- Casey C. Day
- Ryan K. Simmons
Current Landscape Ecology Reports (2023)
Genomic basis for early-life mortality in sharpsnout seabream
- Héctor Torrado
- Cinta Pegueroles
- Marta Pascual
Scientific Reports (2022)
Genetic and particle modelling approaches to assessing population connectivity in a deep sea lobster
- Aimee L. van der Reis
- Craig R. Norrie
- Emma L. Carroll
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

General SNP calling and filtering

Population genomics

Detection of outlier SNPs

Discussion

Methods

Sampling

Laboratory procedures

SNP calling

Population genomics

Detection of outlier SNPs

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links