Gene pleiotropy constrains gene expression changes in fish adapted to different thermal conditions

Papakostas, Spiros; Vøllestad, L. Asbjørn; Bruneaux, Matthieu; Aykanat, Tutku; Vanoverbeke, Joost; Ning, Mei; Primmer, Craig R.; Leder, Erica H.

doi:10.1038/ncomms5071

Download PDF

Article
Open access
Published: 03 June 2014

Gene pleiotropy constrains gene expression changes in fish adapted to different thermal conditions

Spiros Papakostas¹,
L. Asbjørn Vøllestad²,
Matthieu Bruneaux¹,
Tutku Aykanat¹,
Joost Vanoverbeke³,
Mei Ning¹^nAff5,
Craig R. Primmer¹^na1 &
…
Erica H. Leder¹^na1

Nature Communications volume 5, Article number: 4071 (2014) Cite this article

6062 Accesses
64 Citations
34 Altmetric
Metrics details

Subjects

Abstract

Understanding the factors that shape the evolution of gene expression is a central goal in biology, but the molecular mechanisms behind this remain controversial. A related major goal is ascertaining how such factors may affect the adaptive potential of a species or population. Here we demonstrate that temperature-driven gene expression changes in fish adapted to differing thermal environments are constrained by the level of gene pleiotropy estimated by either the number of protein interactions or gene biological processes. Genes with low pleiotropy levels were the main drivers of both plastic and evolutionary global expression profile changes, while highly pleiotropic genes had limited expression response to temperature treatment. Our study provides critical insights into the molecular mechanisms by which natural populations can adapt to changing environments. In addition to having important implications for climate change adaptation, these results suggest that gene pleiotropy should be considered more carefully when interpreting expression profiling data.

Gene expression dynamics during rapid organismal diversification in African cichlid fishes

Article 23 November 2020

Detecting signatures of selection on gene expression

Article 12 May 2022

Rapid and biased evolution of canalization during adaptive divergence revealed by dominance in gene expression variability during Arctic charr early development

Article Open access 31 August 2023

Introduction

The significance of gene expression pattern modification as a component of the adaptation process has received considerable attention recently^1,2,3,4. Gene expression has been connected to many facets of evolution, either via adaptive changes or by plastic responses⁵, including population divergence in natural systems^6,7,8 and human populations¹, sexual dimorphism^9,10, sex chromosome evolution^10,11, speciation^12,13, as well as community and eco-evolutionary dynamics¹⁴. Despite this, several key questions regarding gene expression adaptation remain open³. First, whether or not gene expression changes tend to be neutral, or affected by natural selection, remains a point of contention^3,7,13,15. Recent efforts have focused on refining models of gene expression evolution to better assess gene expression adaptation^1,3,16, including application of a Q_ST−F_ST framework^7,17, but several key questions regarding gene expression adaptation remain unanswered³. For example, gene expression evolution is typically driven by regulatory mutations as revealed by studies of genome-wide adaptive gene expression variation^{1,2,3,4,18,19}; however, a mechanistic framework for understanding how gene expression evolution leads to modified biological function and phenotypic adaptation is still underdeveloped in evolutionary and molecular biology. One approach to address this problem is to view gene expression evolution in the context of the interactome, the complex network of all molecular interactions²⁰. Biological function is rarely the property of single gene products, but rather is carried out by proteins that cooperate with each other often at stoichiometric ratios. As a consequence, interacting proteins have expression profiles that tend to co-evolve²¹. What factors, if any, can influence gene expression evolution in the protein interaction network is, however, unknown.

The importance of pleiotropy, that is, when one locus affects multiple phenotypic characters, has been an active topic of discussion in evolutionary biology for many years and remains controversial^22,23. At the molecular level, pleiotropy and predictions of its consequences are generally built within the framework of Fisher’s geometric model, also summarized as the ‘cost of complexity’, which predicts that complexity associated with pleiotropy will constrain adaptive evolution^24,25,26. Gene pleiotropy has been found to positively correlate with the number of biological processes or protein–protein interactions (PPI) in which a gene is involved, but not with the number of molecular functions²⁷. Pleiotropy at the molecular level is thus the result of single molecular functions involved in multiple biological processes through interactions with other gene products. Historically, this has been called type II pleiotropy with the alternative hypothesis being that molecular pleiotropy is conferred by multiple molecular functions of a gene product (type I pleiotropy)^26,27. Under this view, several studies have highlighted the importance of gene pleiotropy as a constraining factor in the rate of molecular evolution and in gene expression variation. For example, proteins with more interactions were found to have slower amino acid substitution rates in Saccharomyces^28,29,30, Caenorhabditis³⁰ and Drosophila³⁰. Further, they have been found to show lower interspecific gene expression divergence in Saccharomyces and Drosophila, as well as lower population-level gene expression variation³¹. Gene pleiotropy, as estimated by number of PPI, may thus have a role in gene expression evolution but more research is required to clarify this relationship. Key questions in particular that have yet to be addressed empirically are how gene pleiotropy affects gene expression evolution in varying environmental conditions and whether pleiotropy constrains adaptation at the gene expression level.

We study a metapopulation of European grayling (Thymallus thymallus), a spring-spawning, freshwater salmonid fish with high homing propensity that has undergone contemporary evolution of early life-history traits in response to temperature^32,33. Fish colonized Lesjaskogsvatnet (62°14′N 8°25′E), a lake in southern Norway, following an upstream watercourse manipulation in the late 1880s (≈25 generations) and subsequently established spawning sub-populations in tributaries with contrasting thermal characteristics, namely warmer versus colder relative to each other^33,34 (Fig. 1 and Supplementary Table 1). Earlier research has provided convincing evidence that rapid adaptation of early life-history traits to differing thermal conditions has occurred in just 25 generations^32,33, despite low effective population sizes, low levels of genetic divergence and variability at assumedly neutral molecular markers^33,35,36 (Supplementary Table 2). As such, regulatory evolution leading to differences in gene expression is a likely to be a molecular mechanism driving the observed adaptations⁴.

**Figure 1: Map of Lesjaskogsvatnet in Norway and the sampling locations of the sub-populations.**

Here we show that gene pleiotropy has a constraining effect on protein expression change in European grayling sub-populations adapted to different thermal environments. We first confirm that juvenile grayling development rates conform to those expected under the local adaptation hypothesis using a common garden experiment with temperatures resembling the alternative natal environments. Next, using samples collected from the same experiment, we employ high-resolution mass spectrometry and measure gene expression directly at the protein level from embryos of similar developmental stage from both temperature treatments. We describe the general effect of gene pleiotropy on protein expression profiles following adaptation rather than providing a list of specific candidate genes responsible for the adaptations. We find that the global proteomic thermal profiles include both plastic and evolutionary components, and that protein expression responses negatively correlate to the level of gene pleiotropy.

Results

Development supports the local thermal adaptation hypothesis

Grayling development rates in the common garden experiment were faster in the temperature most resembling the natal environment. Time to hatch was consistently longer in the cold temperature treatment. However, within a given temperature treatment, ‘local’ grayling, that is, grayling whose thermal origin was most similar to the temperature treatment, hatched significantly earlier than ‘non-local’ grayling (Fig. 2). In the 6 °C treatment, 50% of cold thermal origin embryos were estimated to hatch on average 8.2 degree-days before 50% of warm thermal origin embryos had hatched (Binomial logistic regression: P=0.025, cold: n=380, degree-day_50%=191.3, s.e.=2.53; warm: n=549, degree-day_50%=199.5, s.e.=2.54). In the 10 °C treatment, 50% of warm origin embryos hatched on average 13.8 degree-days earlier than cold origin grayling (Binomial logistic regression: P=0.0012, cold: n=234, degree-day_50%=180.5, s.e.=2.44; warm: n=215, degree-day_50%=166.7, s.e.=2.76) (Fig. 2 and Supplementary Table 3). As rapid development has been shown to provide fitness benefits in salmonid fishes³⁷, these results represent an archetypal case supporting the local adaptation hypothesis³⁸.

**Figure 2: Development rates of grayling juveniles in the common garden experiment.**

Expression profiles have plastic and evolutionary components

We tested the effect of temperature treatment, that is, 6 °C versus 10 °C, on the global protein expression profiles, and of thermal origin, that is, cold versus warm sub-populations. For this, we summarized the expression values of the 408 quantified proteins remaining after filtering (790 proteins were identified in total) along the first component (PC1) of a principal component analysis (PCA) and analysed the PC1 values with a general linear mixed model (GLMM). We found a highly significant effect of temperature treatment on protein expression profiles and a significant effect of thermal origin but no interaction between them (Fig. 3 and Supplementary Fig. 1). In other words, we observed a strong plastic component, that is, expression change between different temperature treatments in samples of the same thermal origin, as well as an evolutionary component, that is, expression difference between samples of differing thermal origin in the same temperature treatment, with each component having an independent effect on the grayling protein expression profiles.

**Figure 3: Global proteomic profiles of grayling juveniles raised in the common garden experiment.**

The expression profiles were generally very consistent between the replicate sub-populations of different thermal origin in both temperature treatments. For instance, hierarchical clustering of normalized protein expression levels grouped cold and warm sub-populations separately when reared in the natal-resembling temperature (Supplementary Fig. 2). The congruity of expression profile between the replicate populations, combined with the very recent divergence time (25 generations) of the sub-populations indicate that adaptation to the natal environment (either cold or warm streams) may have played a significant role in the evolution of their expression profiles. To investigate this further, we compared the level of protein expression divergence between cold and warm thermal origin sub-populations to the level of genetic divergence at assumedly neutral microsatellite markers using a Q_ST−F_ST framework¹⁷ with a view to determining whether the null hypothesis of neutral evolution could be rejected. The mean Q_ST between sub-populations was 0.086 and 0.076 for the cold and warm temperature treatments, respectively, which is four to five times higher than the mean F_ST that was 0.017. Further, 139 (cold temperature treatment: 34.1%) and 124 (warm temperature treatment: 30.4%) of the 408 proteins analysed had Q_ST values larger than the upper bound of the F_ST 95% confidence interval, which is highly significantly more than expected by chance (χ²-test: cold temperature treatment: P<1.00E−16, χ²=145.0, df=1, n=408; warm temperature treatment: P<1.00E−16, χ²=122.2, df=1, n=408). These findings thus suggest that the observed differences in expression levels between cold and warm origin sub-populations, that is, the evolutionary component of protein expression profiles, also include a strong adaptive component.

Gene pleiotropy constrains protein expression responses

We then tested the effect of gene pleiotropy on the plastic and evolutionary components of the protein expression profiles using a GLMM that also incorporated gene pleiotropy as an independent variable. For gene pleiotropy, we used the two alternative estimators, Gene Ontology (GO) and PPI, assigned to the human orthologues of the 408 proteins quantified in our experiment. The three GO categories, biological process (GO-BP), molecular function (GO-MF) and cellular component (GO-CC), were tested independently. We found a highly significant effect of the interaction between gene pleiotropy and thermal origin, and the interaction between gene pleiotropy and temperature treatment when gene pleiotropy was estimated as either PPI or GO-BP but not with GO-MF or GO-CC (Fig. 4 and Supplementary Figs 3 and 4).

**Figure 4: Constraining effect of gene pleiotropy on plastic and evolutionary protein expression responses.**

Genes with low pleiotropy drive expression profile changes

Based on the above result, we sought to confirm the expected restrictive effect of gene pleiotropy on the plastic and evolutionary responses to temperature treatment observed in Fig. 3. For this, we separated the proteins into two groups, high and low pleiotropy, using the median values of PPI and GO-BP, and repeated the GLMM analysis of the PC1 values for each case. We found that genes with low pleiotropy were the main drivers of protein expression profile change due to thermal origin and temperature treatment (Fig. 5 and Supplementary Fig. 5).

**Figure 5: Global proteomic profiles of genes with differing pleiotropy levels in grayling from the common garden experiment.**

No influence of potential biases in GO and PPI estimation

We investigated whether our findings could be driven by biases (a) in the number of GO annotations from proliferating semantically similar terms that do not represent distinct GO-BP, GO-CC and GO-MF³⁹ or (b) in the PPI data set from the presence of few well-studied proteins for which many interactions have been reported^40,41. For this, we repeated the analyses after summarizing the GO annotations assigned to each protein by grouping terms of similar meaning using different similarity thresholds^42,43 and performed bootstrap analysis on all data sets. We found that our observations remained unchanged and highly significant after GO summarization and had very strong bootstrap support, thus suggesting that it is unlikely our results are biased by these factors (Supplementary Data 1 and 2). We further employed a spectral counting approach to obtain rough estimates of protein expression levels (Supplementary Methods) and found that the effect of gene pleiotropy remains very significant after expression level is taken into account (Supplementary Fig. 6). This result further suggests that our findings are not due to any confounding factor linked to the expression level of the studied proteins⁴⁴.

Similar results using predicted PPI proxies in zebrafish

To further validate our findings on PPI in the case of teleost fish, we used the results produced by an in silico method that predicts conserved PPI or interologs, that is, orthologous pairs of interacting proteins in different species^45,46. Using zebrafish (Danio rerio) PPI proxies predicted with high confidence (>0.9) in the Funcoup database⁴⁷, we found that the constraining effect of gene pleiotropy on both the plastic and evolutionary responses in grayling remained strong and significant (Fig. 6, Supplementary Fig. 7 and Supplementary Data 2).

**Figure 6: The constraining effect of gene pleiotropy using predicted protein interactions.**

Upstream regulators link expression to phenotypic evolution

To elucidate the biological causes and further explore the biological meaning of our observations, we searched for upstream regulators such as transcription factors that were predicted to have driven the observed gene expression changes⁴⁸. We identified seven significantly activated or inhibited transcription factors in total, including hsf1 and hsf2, and myc and mycn, out of 73 occurrences of transcription factors known to regulate the genes observed in our data (Supplementary Table 4).

Discussion

We demonstrate empirically for the first time that gene pleiotropy constrains both plastic and evolutionary, presumably adaptive, components of gene expression change. We also reveal that genes with low levels of pleiotropy are of particular significance during the initial phases of evolution. Strong plastic and significant evolutionary components in the global protein expression profiles (Fig. 3) give support at the molecular level for local adaptation of the grayling sub-populations, and furthermore, provide an ideal opportunity for elucidating the relative importance of plastic and adaptive responses during rapid fisherian evolution. The resulting significant effect of the interaction between either GO-BP or PPI and thermal origin and temperature treatment (Fig. 4 and Supplementary Data 1 and 2) indicates an effect of gene pleiotropy on both plastic (interaction with temperature treatment) and evolutionary (interaction with thermal origin) protein expression responses. Differences in protein expression between experimental groups, that is, the extent of expression change, were also decreasing with increasing pleiotropic level for both plastic and evolutionary responses (Fig. 4). These findings are consistent with Fisher’s prediction that pleiotropy restricts adaptation^24,25, because trade-offs between changes in expression that favour one process but harm others are more likely to be in highly pleiotropic genes. These findings are also in concordance with the observation that genes with many genetic interactions confer robustness to environmental and stochastic change⁴⁹. By comparison, genes with low pleiotropy had a stronger effect on both plastic and evolutionary responses (Fig. 5), which indicates that genes with a low level of pleiotropy may play a particularly important role during the early phases of rapid evolution. At the interspecific level, Lemos et al.³¹ observed a negative association between number of PPI and levels of gene expression variation in two Drosophila species. As such, the role of gene pleiotropy as described here may span a large range of evolutionary time scales.

Our results on gene pleiotropy are in concordance with the current view that molecular-level pleiotropy is generally the result of a given molecular function being involved in multiple biological processes (type II pleiotropy)^26,27. GO-MF had no significant effect on either plastic or evolutionary protein expression response (Supplementary Figs 3 and 4 and Supplementary Data 1 and 2), which is contrary to what would be expected with type I pleiotropy. In the type II view of molecular-level pleiotropy, more biological processes have been further suggested to distribute to more cellular components²⁷. In line with this finding, we observed a weakly significant association between GO-CC and both plastic and evolutionary gene expression responses (Supplementary Figs 3 and 4 and Supplementary Data 1 and 2).

By ensuring that the GO and PPI data sets yielded unbiased and meaningful results in European grayling, we also provide a general strategy for the use of this kind of information in non-model species. GO-BP annotations are well suited for cross-species use⁵⁰, and correcting GO annotations for semantic similarity can help reduce biases in gene pleiotropy estimation coming from genes annotated with multiple terms for the same biological process³⁹. This kind of annotation bias has earlier been a criticism of studies linking PPI and rate of evolution^40,41. We tested our data for many semantic similarity thresholds and annotation sets (human, zebrafish, whole UniProt) to take this factor fully into account (Supplementary Data 1). By bootstrapping, we further tested the sensitivity of both the GO and PPI data sets to the influence of a small number of proteins with many annotations. High bootstrap values in almost every case support our conclusions (Supplementary Data 1 and 2). Finally, use of predicted protein interactions can help overcome the lack of PPI information in species outside a few genomic models^47,51. Another approach would be the use of PPI information from well-defined interactomes as gene pleiotropy estimates are based on PPI number, which seems to be an evolutionarily conserved metric in interologous networks⁵². We employed both strategies by counting PPIs either as high-quality experimentally observed annotations assigned to human orthologues or as predicted for zebrafish (Supplementary Fig. 7 and Supplementary Data 2). To avoid incorrect orthologue identification, we used a rather conservative E-value threshold for blastp (≤3.00E−18). All results corroborated that PPI, a proxy for gene pleiotropy, constrain both plastic and evolutionary gene expression responses (Figs 4 and 6, Supplementary Figs 3 and 4 and Supplementary Data 2).

Associating the observed gene expression profiles with upstream regulators links our observations at the molecular level with a phenotypic trait, larval growth, which has been shown earlier to undergo temperature-driven adaptive evolution in this study system^32,33. Myc-target genes are evolutionarily conserved from teleost fishes to mammals and have functional roles for the control of growth during embryonic development via cell proliferation and differentiation⁵³. hsf1 is known as the master regulator of the heat-shock response in vertebrates and hsf2 modulates its activity^54,55. Heat-shock regulators are an integral part of the heat-shock response, which has been used to evaluate the acclimation ability and thermal tolerance of species in light of climate change⁵⁵. Many factors can influence protein turnover to regulate protein expression levels inside the cells^56,57 but the transcription factor analysis described here provide insights into the larger biological role of the studied proteins.

Given our findings, we suggest that the level of pleiotropy of a given gene should have a much more central role when interpreting the biological meaning of gene expression data. Not all proteins seem to have the same capacity to change their expression level as our study shows that proteins with fewer interactions (also more peripheral in the interactome) or involved in fewer biological processes have greater plastic and evolutionary responses to temperature. Accordingly, those low-pleiotropic proteins were driving the observed differences in the global expression profiles. Limitations surrounding gene expression interpretations based on fold-change cut-offs and the corresponding tests have been described^58,59, but a method that takes into account gene pleiotropy could prove valuable for interpretation of gene expression profiling studies in the future. For example, pleiotropic level could be used to weigh the significance of expression-level change, with higher weight being given to changes in genes with higher pleiotropy.

Further, we anticipate our findings to serve as a starting point to answer many exciting new questions posed by evolutionary and molecular biologists; for example, are the constraints imposed by gene pleiotropy different at the proteome and transcriptome levels⁶⁰? This question is important given the discordance between changes in protein and mRNA expression levels^61,62. At the proteome level, we have shown that gene pleiotropy constrains both plastic and evolutionary gene expression responses. At the transcriptome level, previous results have revealed that protein interactions constrain gene expression variation and gene expression divergence between species³¹. The degree of overlap between these findings remains unclear. Pleiotropic constraints may also play a role in various aspects of gene expression control^61,63; for instance, they may influence translation and transcription rates that in turn have been found to be good predictors of protein expression levels⁶¹. Furthermore, follow-up studies to clarify associations between expression changes in specific proteins and their effects on fitness-related traits would also be worthwhile. Other intriguing questions include: how are pleiotropic constraints affected by the modularity of biological networks²⁰ or by tissue specificity⁴⁴? What is the relative importance of gene pleiotropy for plastic and evolutionary or adaptive responses over longer evolutionary periods? Answering such questions will help further elucidate the role of gene expression regulation in evolution and allow for a better understanding of the molecular basis of adaptation.

Methods

Sample collection and common garden experiment

Full details are available in Thomassen et al.³³ and Haugen and Vøllestad⁶⁴. Animal sampling and experimentation were performed in compliance with the recommendations of the Norwegian Animal Research Authority (permission ID 2008/7368.5). Briefly, samples were obtained from grayling spawning sites from four sub-populations in Lesjaskogsvatnet (Fig. 1). The mean summer, June–July, temperatures in the four streams investigated here differ strongly, with the two small and warm streams, Steinbekken and Sandbekken, being approximately 1–1.5 °C warmer than the large and cold streams, Hyrjon and Valåe (Sandbekken 8.44±0.52 °C (n=4 years); Steinbekken 8.81±0.60 °C (n=4); Hyrjon 7.40±0.94 °C (n=8); Valåe 7.28±0.69 °C (n=7)). This results in a large temperature-sum difference among streams during egg and larvae development. Developing embryos in the cold streams experience more time at or below 6 °C compared with the warm streams where embryos are subjected to temperatures of 9 °C or higher for longer periods (Supplementary Table 1). In June 2007, adult grayling were captured at the four spawning sites using fyke nets and stripped of gametes, which were subsequently transported on ice to the Veterinary Institute of Norway, Oslo (5 h drive). Gametes were stripped on 12 June 2007 for the warm sites and on 23 June 2007 for the cold sites. This time difference exemplifies the difference in the thermal environments of the streams (time taken to reach the minimum water temperature for spawning). Samples were the product of artificially fertilized gametes of multiple individuals per sub-population (Steinbekken 20♀, 16♂; Sandbekken 17♀, 11♂; Hyrjon 4♀, 3♂; Valåe 20♀, 24♂). Fertilized eggs were placed in porous containers suspended in the large treatment tanks as described previously^33,64. Each of the tanks contained two replicate containers from each sub-population. The water temperatures of the cold and warm temperature treatments across the experiment were 5.83±0.43 °C (n=764) and 10.02±0.28 °C (n=440), respectively, to represent lower and upper temperatures experienced by developing grayling larvae in nature³³ (Supplementary Table 1). Each day post fertilization, approximately five individuals (eggs or larvae) were randomly sampled from each sub-population in each temperature treatment. Samples were visually inspected and whether an individual had hatched or not was recorded. Samples used for the proteomics experiment were immediately frozen on dry ice and transferred to −80 °C within ~30 min for storage. Grayling embryos selected for protein extraction were of similar developmental stage as estimated based on the number of degree-days (temperature in °C × number of days elapsed since fertilization) in relation to the average number of degree-days to 50% hatching in the sub-population—temperature treatment (Supplementary Table 3). The effect of thermal origin on the number of degree-days to 50% hatching was tested by performing a binomial logistic regression using a generalized linear model. The generalized linear model was performed with a logit link function and with the count of the hatched and not-hatched individuals per sampling day as the dependent variable, and the number of degree-days and thermal origin as independent variables. Degree-day values for a 50% probability of hatching for each experimental group and s.e. were estimated using the dose.p function from the MASS library in R⁶⁵.

Measuring protein expression levels

Details about the protein extraction, peptide-level iTRAQ labelling, fractionation by isotope coded affinity tag procedure, liquid chromatography–tandem mass spectrometry and protein quantification parameters can be found in the Supplementary Methods. The mass spectrometry files (*.RAW), the processed peaklists (*.mgf), the search databases (*.fasta) and the results of the ProteinPilot searches (*.xml) for both the validation and the actual experiment have been deposited in the ProteomeXchange consortium (http://proteomecentral.proteomexchange.org) via the PRIDE partner repository⁶⁶ with the data set identifier PXD000368.

Three embryos per sub-population per common garden temperature treatment, 24 samples (that is, 3 embryos × 4 sub-populations × 2 temperature treatments), were labelled by iTRAQ, fractionated by strong cation exchange chromatography and combined accordingly to minimize batch effects (Supplementary Table 5). Protein identification and quantification were done using the ProteinPilot v.4 programme (Applied Biosystems). As a search database, we used all Atlantic salmon (Salmo salar) submitted in the UniProt database (www.uniprot.org) as of March 2012 (13,035 amino acid sequences). Maximum false discovery rate correction for protein identification was performed using a decoy database as implemented in ProteinPilot and was set to 5%, whereas minimum confidence for peptide identification was 95%. A collection of 248 sequences of common-contaminant proteins, provided by Applied Biosystems, was also included in the search database. Contaminant and decoy hits were filtered and samples were divided into four groups of six samples each, namely the grayling embryos from each of the cold/warm thermal origin with each of the 6 °C/10 °C temperature treatments. Proteins were also filtered for missing values so that each group had at least three valid ratios. Ratios were then log₂-transformed and then loess-normalized across biological replicates using the median values as a reference set. For these transformations, we used DanteR, an R package for the analysis of proteomic data (updated edition of DAnTE⁶⁷). The accuracy of the quantification method was evaluated using a six-protein mix provided with the iTRAQ kit (Supplementary Methods, Supplementary Fig. 8 and Supplementary Data 3).

Statistical analyses

R scripts for all statistical analyses are available in Supplementary Data 4. All GLMM analyses verified the assumptions required for linear modelling between dependent variables and continuous predictors, namely normal distribution of the residuals, and the absence of strong heteroscedasticity (Supplementary Figs 9 and 10).

To test the effect of thermal origin (cold versus warm sub-populations), temperature treatment (6 °C versus 10 °C) and their interaction on the protein expression profiles, we used the coordinates of normalized expression ratios (Supplementary Data 5) along the first component (PC1) of a PCA. We used PCA because it inherently focuses on summarizing the differences in expression between thermal origins and temperature treatments, without the need to standardize for the direction of change in expression for each protein in every sample. Next, a GLMM analysis was performed with the PC1 coordinates as a dependent variable. Thermal origin, temperature treatment and their interaction were used as independent, categorical, fixed factors and sub-population as a random factor (Supplementary Methods). Hierarchical clustering was performed directly on the log₂-transformed and loess-normalized data using Euclidean distances. Gene expression data were visualized as heatmaps.

Q_ST−F_ST comparisons were made to assess the role of divergent selection in protein expression divergence betweeen sub-populations. F_ST was calculated using Weir and Cockerham’s θ (theta) and Q_ST was calculated using the formula σ²_GB/ (σ²_GB+2σ²_GW), where σ²_GB is the among sub-population and σ²_GW is the average within sub-population component of protein expression variance in each temperature treatment⁶⁸. Variance components for every protein were estimated using a restricted maximum likelihood approach, with sub-population as a random factor. By repeating this procedure for all of the proteins in both temperature treatments (N=408+408, because of the two temperature treatments), we obtained the Q_ST distributions⁶⁸. F_ST was estimated using data for 19 microsatellites genotyped at 38–44 individuals per sub-population (data from Junge et al.³⁵ where the sub-populations are labelled STE07: Steinbekken, SAN07: Sandbekken, SHYR08: Hyrjon and VAL07: Valåe). Weir and Cockerham’s F_ST can produce slightly negative estimates, thus any negative values of F_ST were adjusted to zero; hence, F_ST had the same range as Q_ST. The F_ST sampling distribution was estimated from 1,000 resamples with the hierfstat R package⁶⁹, using similar sample size to that used in Q_ST estimations (n=3 per population). The upper 95% confidence interval of the F_ST sampling distribution was set as the threshold for the neutral divergence expectation to quantify the outlier Q_ST estimates. χ²-test was used to test whether the number of outliers significantly exceeded what may be expected by chance alone. The common garden rearing of fish ensures environmental effects do not inflate the among-sub-population variance, which is a major source of bias in Q_ST estimation¹⁷. Other factors, for example, maternal, epistatic and dominance effects, may bias Q_ST estimation, but usually downwards, by upwardly biasing additive genetic variance⁶⁸, thereby making the test of divergent selection applied here a conservative one. Similarly, mutations might bias Q_ST estimates downward through increased dominance variance¹⁷; however, due to the low divergence times, mutations are not expected to play a large role in the evolution of this system.

To test the effect of gene pleiotropy on protein expression responses, we first studied the mean standardized expression of grouped proteins. A GLMM was applied using all 408 quantified proteins with loess-normalized and log₂-transformed values (Supplementary Data 5). Standardized protein expression level was the dependent variable, and the independent variables were as follows: thermal origin (categorical), temperature treatment (categorical), gene pleiotropy (continuous) and their interactions (fixed effects). Sub-population and individual were used as random factors. Sub-population was nested within thermal origin and individuals were nested within sub-population. Raw gene pleiotropy value distributions were initially strongly skewed and therefore log₂-transformed. To fulfil the assumptions of GLMM of the dependent variable (reduce kurtosis) and reduce the complexity of the GLMM, proteins were grouped based on level of pleiotropy. This was done by ordering the proteins according to their pleiotropy level (the ordering varied depending on what measure of gene pleiotropy, PPI or GO-BP, was used each time), with random order within each level of pleiotropy and subsequently dividing the ordered proteins into groups of ten. GLMM analysis was repeated for group sizes ranging from 2 to 30 and group size=10 represented a good compromise between minimizing the kurtosis while retaining the groups as small as possible to have more data points in the analyses (Supplementary Fig. 11). Bin size had little effect on the observed P-values and the results were also highly significant if no binning was used (Supplementary Data 1 and 2). For each group of proteins, we calculated the average value for log₂(pleiotropy+1) and the average standardized protein expression level for each individual and used these data in the GLMM. In this way we avoid side effects of unequal group sizes, which would arise if we had simply grouped by the pleiotropy level. For comparison, we also did the analysis without grouping the proteins. As the analysis of the effect of gene pleiotropy focused on the extent of expression change, and not the direction (increase or decrease in expression), we standardized protein expression values by removing differences in the direction of the expression change between proteins as follows: we used the experimental group with the largest differential response in the PCA as a reference (Valåe at 6 °C, Fig. 3) and whenever the median expression level of a protein in this reference population (median over three replicate individuals) was below the overall median expression level (median over 24 individuals), we flipped the individual expression levels around the overall median expression level for that protein. By using the median expression level within one reference sub-population and flipping the sign of expression level for all individuals in the entire data set when needed, we conserve the independence of the values between the 24 individuals needed for statistical inference, but eliminate between-protein differences in the direction of the expression change.

In a second approach, we tested the effect of gene pleiotropy on protein expression responses using the mean standardized expression of the 30% most variable proteins per group of ten proteins (Supplementary Methods). With this approach we excluded the contribution of proteins in our data set that are less likely to be linked to temperature responses, that is, those that had low levels of expression variation, and therefore we increased the power to detect differences between thermal origins and temperature treatments. In a third approach, we tested the effect of gene pleiotropy on protein expression responses by performing multidimensional scaling analysis (Supplementary Methods). We used this approach as an alternative to reduce the influence of non-responsive proteins within each level of gene pleiotropy and increase the power of our analysis. This is because the scores from the multidimensional scaling analysis are mainly determined by the subset of proteins that show strong differential protein expression response. Therefore, this approach focuses more on those proteins.

Estimating gene pleiotropy

Gene pleiotropy was estimated either as the amount of known PPI or GO terms²⁷. GO terms were retrieved separately for GO-BP, GO-CC and GO-MF. For each of these measures we performed an independent analysis. PPI counts and GO annotations were retrieved from human (Homo sapiens) orthologues of the detected salmonid proteins. Orthologues were identified by sequence similarity with blastp using the human reference proteome in UniProt (www.uniprot.org, version 10 June 2013) as the search database and a minimum E-value of 3.00E−18 (Supplementary Data 6 and 7). The more comprehensive annotation for H. sapiens genes often makes them preferred even over zebrafish for the transfer of annotations to fish species; however, we also conducted an analysis based on zebrafish data, which is currently among the best annotated fish species. GO annotations for the human genes were downloaded from the GO website (www.geneontology.org, version 6 June 2013). PPI were retrieved from the Ingenuity Pathway Analysis (IPA 2013) platform and included only experimentally observed direct interactions in the BIND, BIOGRID, Cognia, DIP, INTACT, MINT and MIPS databases, as well as Ingenuity expert findings as of 17 June 2013.

Annotation bias, particularly in human GO annotations, can occur because the discovery of new processes or functions is less frequent than the proliferation of semantically similar GO terms³⁹. For this reason, we also summarized semantically similar GO terms and re-analysed the data to determine whether redundancy has a marked effect on our findings. Redundancy was treated with REVIGO using SimRel as a semantic similarity measure⁴². We examined three different levels of allowed similarity, namely large (allowed similarity=0.9), medium (0.7), which is the default option in REVIGO, and small (0.5). We also ran all our analyses with three different GO databases (whole UniProt, H. sapiens and D. rerio) to estimate the information content of each term during the calculation of the semantic distances. This was done separately for each protein and for GO-BP, GO-MF and GO-CC (Supplementary Data 6).

Bootstrap test

To test the robustness of our GLMM analyses, we repeated all of the analyses on each of 2,000 bootstrap re-samples (except for standardized protein expression levels without grouping that was very computer-intensive for which we used 100 bootstrap re-samples). Each re-sample was generated by randomly sampling the 408 proteins from the original data set with replacement. The bootstrap values were then calculated as the proportions of re-sampled data sets for which the analyses yielded a significant P-value<0.05 (Supplementary Data 1 and 2).

PPI proxies for zebrafish

To corroborate the results obtained with experimentally observed PPI in human orthologues, we looked for predictors of PPI in zebrafish. We used Funcoup 2.0, which applies an optimized Bayesian framework to reconstruct global networks of protein functional coupling (FC) by integrating proteomic and genomic data from multiple species^47,51. Data files for human and zebrafish were retrieved from the Funcoup server and each final Bayesian score (FBS) was converted to a probability of FC (pfc) using pfc=1/(1+exp(−FBS—ln(P(FC)))) where P(FC)=0.001 (ref. 47). In all analyses using the Funcoup database, we considered only interactions with pfc>0.9 for the category of interest to ensure only high-quality interactions were included (Supplementary Methods and Supplementary Data 7).

The effect of genes with contrasting pleiotropy levels

To assess how proteins with lower or higher pleiotropy drive the differences in expression profiles between experimental groups, we conducted a post-hoc analysis as follows: we used the median value of PPI and GO-BP to separate the proteins into two groups with low or high PPI and GO-BP counts (PPI: median=51, low PPI group: 120 genes, high PPI group: 119 genes; GO-BP: median=10.5, low PPI group: 122 genes, high PPI group: 122 genes). PCA and hierarchical clustering were re-performed on those sub-sets of proteins with low or high pleiotropy, and the PC1 coordinates were then used in a GLMM with thermal origin, temperature treatment and their interaction as independent fixed factors, and sub-populations as random factors nested within thermal origin (Fig. 5, Supplementary Fig. 5 and Supplementary Table 6). For this analysis we used the 244 proteins with no missing values.

Upstream regulator analysis

To identify upstream regulators that explain the observed gene expression changes, we used the IPA platform. IPA examines every known upstream regulator for each gene in our data set and compares the observed direction of change in protein expression with that expected based on literature findings stored in the IPA knowledgebase. Significant activation or inhibition of an upstream regulator is predicted based on evidence from multiple target genes in the data set for which the direction of the observed expression change is consistent with that expected from the literature. To calculate the direction of expression change for each protein, we used the median values across all six biological replicates per thermal origin-temperature treatment combination. Z-score value specifies whether an upstream regulator has significantly more activated predictions than inhibited predictions (Z>2) or vice versa (Z<2). The bias metric estimates whether there are more up- than downregulated genes or vice versa for an upstream regulator in the data set (there should be enough evidence from both directions for reliable predictions). Overlap P-value is calculated using Fisher’s exact test and determines whether there is a statistically significant higher presence of genes controlled by an upstream regulator in the data set compared with a reference list of genes. In our case, we used the gene list of the Agilent zebrafish V2 microarray, which represents the genome of the teleost species D. rerio⁷⁰. To add specificity to our analyses, we considered only direct and experimentally observed molecular interactions. We only examined the most reliable predictions, as recommended by IPA Systems (P<0.001, |Z|>2, and bias|<0.25). Details about the employed algorithms are described elsewhere⁴⁸.

Additional information

How to cite this article: Papakostas, S. et al. Gene pleiotropy constrains gene expression changes in fish adapted to different thermal conditions. Nat. Commun. 5:4071 doi: 10.1038/ncomms5071 (2014).

References

Fraser, H. B. Gene expression drives local adaptation in humans. Genome Res. 23, 1089–1096 (2013).
Article CAS Google Scholar
Romero, I. G., Ruvinsky, I. & Gilad, Y. Comparative studies of gene expression and the evolution of gene regulation. Nat. Rev. Genet. 13, 505–516 (2012).
Article CAS Google Scholar
Fraser, H. B. Genome-wide approaches to the study of adaptive gene expression evolution: systematic studies of evolutionary adaptations involving gene expression will allow many fundamental questions in evolutionary biology to be addressed. Bioessays 33, 469–477 (2011).
Article CAS Google Scholar
Lopez-Maury, L., Marguerat, S. & Bahler, J. Tuning gene expression to changing environments: from rapid responses to evolutionary adaptation. Nat. Rev. Genet. 9, 583–593 (2008).
Article CAS Google Scholar
Stern, S., Dror, T., Stolovicki, E., Brenner, N. & Braun, E. Genome-wide transcriptional plasticity underlies cellular adaptation to novel challenge. Mol. Syst. Biol. 3, 106 (2007).
Article Google Scholar
Oleksiak, M. F., Churchill, G. A. & Crawford, D. L. Variation in gene expression within and among natural populations. Nat. Genet. 32, 261–266 (2002).
Article CAS Google Scholar
Whitehead, A. & Crawford, D. L. Neutral and adaptive variation in gene expression. Proc. Natl Acad. Sci. USA 103, 5425–5430 (2006).
Article CAS ADS Google Scholar
Papakostas, S., Vasemagi, A., Vaha, J.-P., Himberg, M. & Peil, L. Primmer CR. A proteomics approach reveals divergent molecular responses to salinity in populations of European whitefish (Coregonus lavaretus). Mol. Ecol. 21, 3516–3530 (2012).
Article CAS Google Scholar
Rinn, J. L. & Snyder, M. Sexual dimorphism in mammalian gene expression. Trends Genet. 21, 298–305 (2005).
Article CAS Google Scholar
Mank, J. E. Sex chromosomes and the evolution of sexual dimorphism: lessons from the genome. Am. Nat. 173, 141–150 (2009).
Article Google Scholar
Leder, E. H. et al. Female-biased expression on the X chromosome as a key step in sex chromosome evolution in threespine sticklebacks. Mol. Biol. Evol. 27, 1495–1503 (2010).
Article CAS Google Scholar
Brawand, D. et al. The evolution of gene expression levels in mammalian organs. Nature 478, 343–348 (2011).
Article CAS ADS Google Scholar
Khaitovich, P., Enard, W., Lachmann, M. & Paabo, S. Evolution of primate gene expression. Nat. Rev. Genet. 7, 693–702 (2006).
Article CAS Google Scholar
Becks, L., Ellner, S. P., Jones, L. E. & Hairston, N. G. The functional genomics of an eco-evolutionary feedback loop: linking gene expression, trait evolution, and community dynamics. Ecol. Lett. 15, 492–501 (2012).
Article Google Scholar
Gilad, Y., Oshlack, A. & Rifkin, S. A. Natural selection on gene expression. Trends Genet. 22, 456–461 (2006).
Article CAS Google Scholar
Rohlfs, R. V., Harrigan, P. & Nielsen, R. Modeling gene expression evolution with an extended Ornstein-Uhlenbeck process accounting for within-species variation. Mol. Biol. Evol. 31, 201–211 (2014).
Article CAS Google Scholar
Leinonen, T., McCairns, R. J. S., O'Hara, R. B. & Merila, J. QST-FST comparisons: evolutionary and ecological insights from genomic heterogeneity. Nat. Rev. Genet. 14, 179–190 (2013).
Article CAS Google Scholar
Prud'homme, B., Gompel, N. & Carroll, S. B. Emerging principles of regulatory evolution. Proc. Natl Acad. Sci. USA 104, 8605–8612 (2007).
Article CAS ADS Google Scholar
Wray, G. A. The evolutionary significance of cis-regulatory mutations. Nat. Rev. Genet. 8, 206–216 (2007).
Article CAS Google Scholar
Barabasi, A. L. & Oltvai, Z. N. Network biology: understanding the cell's functional organization. Nat. Rev. Genet. 5, 101–113 (2004).
Article CAS Google Scholar
Fraser, H. B., Hirsh, A. E., Wall, D. P. & Eisen, M. B. Coevolution of gene expression among interacting proteins. Proc. Natl Acad. Sci. USA 101, 9033–9038 (2004).
Article CAS ADS Google Scholar
Paaby, A. B. & Rockman, M. V. The many faces of pleiotropy. Trends Genet. 29, 66–73 (2013).
Article CAS Google Scholar
Stearns, F. W. Anecdotal, historical and critical commentaries on genetics. One hundred years of pleiotropy: a retrospective. Genetics 186, 767–773 (2010).
Article CAS Google Scholar
Fisher, R. A. The Genetical Theory of Natural Selection Oxford Clarendon Press (1930).
Orr, H. A. Adaptation and the cost of complexity. Evolution 54, 13–20 (2000).
Article CAS Google Scholar
Wagner, G. P. & Zhang, J. Z. The pleiotropic structure of the genotype-phenotype map: the evolvability of complex organisms. Nat. Rev. Genet. 12, 204–213 (2011).
Article CAS Google Scholar
He, X. L. & Zhang, J. Z. Toward a molecular understanding of pleiotropy. Genetics 173, 1885–1891 (2006).
Article CAS Google Scholar
Fraser, H. B., Hirsh, A. E., Steinmetz, L. M., Scharfe, C. & Feldman, M. W. Evolutionary rate in the protein interaction network. Science 296, 750–752 (2002).
Article CAS ADS Google Scholar
Fraser, H. B. Modularity and evolutionary constraint on proteins. Nat. Genet. 37, 351–352 (2005).
Article CAS Google Scholar
Hahn, M. W. & Kern, A. D. Comparative genomics of centrality and essentiality in three eukaryotic protein-interaction networks. Mol. Biol. Evol. 22, 803–806 (2005).
Article CAS Google Scholar
Lemos, B., Meiklejohn, C. D. & Hartl, D. L. Regulatory evolution across the protein interaction network. Nat. Genet. 36, 1059–1060 (2004).
Article CAS Google Scholar
Koskinen, M. T., Haugen, T. O. & Primmer, C. R. Contemporary fisherian life-history evolution in small salmonid populations. Nature 419, 826–830 (2002).
Article CAS ADS Google Scholar
Thomassen, G., Barson, N. J., Haugen, T. O. & Vøllestad, L. A. Contemporary divergence in early life history in grayling (Thymallus thymallus). BMC Evol. Biol. 11, 360 (2011).
Article Google Scholar
Kavanagh, K. D., Haugen, T. O., Gregersen, F., Jernvall, J. & Vøllestad, L. A. Contemporary temperature-driven divergence in a Nordic freshwater fish under conditions commonly thought to hinder adaptation. BMC Evol. Biol. 10, 350 (2010).
Article Google Scholar
Junge, C. et al. Strong gene flow and lack of stable population structure in the face of rapid adaptation to local temperature in a spring-spawning salmonid, the European grayling (Thymallus thymallus). Heredity 106, 460–471 (2011).
Article CAS Google Scholar
Barson, N. J., Haugen, T. O., Vøllestad, L. A. & Primmer, C. R. Contemporary isolation-by-distance, but not isolation-by-time, among demes of European grayling (Thymallus thymallus, Linnaeus) with recent common ancenstors. Evolution 63, 549–556 (2009).
Article Google Scholar
Jensen, L. F. et al. Local adaptation in brown trout early life-history traits: implications for climate change adaptability. Proc. R. Soc. B Biol. Sci. 275, 2859–2868 (2008).
Article Google Scholar
Kawecki, T. J. & Ebert, D. Conceptual issues in local adaptation. Ecol. Lett. 7, 1225–1241 (2004).
Article Google Scholar
Gillis, J. & Pavlidis, P. Assessing identity, redundancy and confounds in Gene Ontology annotations over time. Bioinformatics 29, 476–482 (2013).
Article CAS Google Scholar
Bloom, J. D. & Adami, C. Apparent dependence of protein evolutionary rate on number of interactions is linked to biases in protein-protein interactions data sets. BMC Evol. Biol. 3, 21 (2003).
Article Google Scholar
Pal, C., Papp, B. & Lercher, M. J. An integrated view of protein evolution. Nat. Rev. Genet. 7, 337–348 (2006).
Article CAS Google Scholar
Supek, F., Bosnjak, M., Skunca, N. & Smuc, T. REVIGO summarizes and visualizes long lists of Gene Ontology terms. PLoS ONE 6, e21800 (2011).
Article CAS ADS Google Scholar
Pesquita, C., Faria, D., Falcao, A. O., Lord, P. & Couto, F. M. Semantic similarity in biomedical ontologies. PLoS Comput. Biol. 5, e1000443 (2009).
Article ADS MathSciNet Google Scholar
Liao, B. Y. & Zhang, J. Z. Low rates of expression profile divergence in highly expressed genes and tissue-specific genes during mammalian evolution. Mol. Biol. Evol. 23, 1119–1128 (2006).
Article CAS Google Scholar
Walhout, A. J. M. et al. Protein interaction mapping in C. elegans using proteins involved in vulval development. Science 287, 116–122 (2000).
Article CAS ADS Google Scholar
Yu, H. Y. et al. Annotation transfer between genomes: protein-protein interologs and protein-DNA regulogs. Genome Res. 14, 1107–1118 (2004).
Article CAS Google Scholar
Alexeyenko, A. et al. Comparative interactomics with Funcoup 2.0. Nucleic Acids Res. 40, D821–D828 (2012).
Article CAS Google Scholar
Kramer, A., Green, J., Pollard, J. Jr & Tugendreich, S. Causal analysis approaches in Ingenuity Pathway Analysis (IPA). Bioinformatics 30, 523–530 (2013).
Article Google Scholar
Lehner, B. Genes confer similar robustness to environmental, stochastic, and genetic perturbations in yeast. PLoS ONE 5, e9035 (2010).
Article ADS Google Scholar
Primmer, C. R., Papakostas, S., Leder, E. H., Davis, M. J. & Ragan, M. A. Annotated genes and nonannotated genomes: cross-species use of Gene Ontology in ecology and evolution research. Mol. Ecol. 22, 3216–3241 (2013).
Article CAS Google Scholar
Alexeyenko, A. & Sonnhammer, E. L. L. Global networks of functional coupling in eukaryotes from comprehensive data integration. Genome Res. 19, 1107–1116 (2009).
Article CAS Google Scholar
Brown, K. V. & Jurisica, I. Unequal evolutionary conservation of human protein interactions in interologous networks. Genome Biol. 8, R95 (2007).
Article Google Scholar
Dang, C. V. C-myc target genes involved in cell growth, apoptosis, and metabolism. Mol. Cell Biol. 19, 1–11 (1999).
Article CAS Google Scholar
Akerfelt, M., Morimoto, R. I. & Sistonen, L. Heat shock factors: integrators of cell stress, development and lifespan. Nat. Rev. Mol. Cell Biol. 11, 545–555 (2010).
Article CAS Google Scholar
Tomanek, L. Variation in the heat shock response and its implication for predicting the effect of global climate change on species' biogeographical distribution ranges and metabolic costs. J. Exp. Biol. 213, 971–979 (2010).
Article CAS Google Scholar
Reed, S. I. Ratchets and clocks: the cell cycle, ubiquitylation and protein turnover. Nat. Rev. Mol. Cell. Biol. 4, 855–864 (2003).
Article CAS Google Scholar
Mizushima, N. & Klionsky, D. J. Protein turnover via autophagy: implications for metabolism. Annu. Rev. Nutr. 27, 19–40 (2007).
Article CAS Google Scholar
Dalman, M. R., Deeter, A., Nimishakavi, G. & Duan, Z. H. Fold change and P-value cutoffs significantly alter microarray interpretations. BMC Bioinformatics 13, S11 (2012).
Article Google Scholar
Jeffery, I. B., Higgins, D. G. & Culhane, A. C. Comparison and evaluation of methods for generating differentially expressed gene lists from microarray data. BMC Bioinformatics 7, 359 (2006).
Article Google Scholar
Koonin, E. V. & Wolf, Y. I. Constraints and plasticity in genome and molecular-phenome evolution. Nat. Rev. Genet. 11, 487–498 (2010).
Article CAS Google Scholar
Schwanhausser, B. et al. Global quantification of mammalian gene expression control. Nature 473, 337–342 (2011).
Article ADS Google Scholar
Vogel, C. & Marcotte, E. M. Insights into the regulation of protein abundance from proteomic and transcriptomic analyses. Nat. Rev. Genet. 13, 227–232 (2012).
Article CAS Google Scholar
Lu, R. et al. Systems-level dynamic analyses of fate change in murine embryonic stem cells. Nature 462, 358–362 (2009).
Article CAS ADS Google Scholar
Haugen, T. O. & Vøllestad, L. A. Population differences in early life-history traits in grayling. J. Evol. Biol. 13, 897–905 (2000).
Article Google Scholar
Venables, W. N., Ripley, B. & Venables, W. N. Modern Applied Statistics with S Springer (2002).
Vizcaino, J. A. et al. The Proteomics Identifications (PRIDE) database and associated tools: status in 2013. Nucleic Acids Res. 41, D1063–D1069 (2013).
Article CAS Google Scholar
Polpitiya, A. D. et al. DAnTE: a statistical tool for quantitative analysis of -omics data. Bioinformatics. 24, 1556–1558 (2008).
Article CAS Google Scholar
Whitlock, M. C. Neutral additive genetic variance in a metapopulation. Genet. Res. 74, 215–221 (1999).
Article CAS Google Scholar
Goudet, J. HIERFSTAT, a package for R to compute and test hierarchical F-statistics. Mol. Ecol. Notes 5, 184–186 (2005).
Article Google Scholar
Domazet-Loso, T. & Tautz, D. A phylogenetically based transcriptome age index mirrors ontogenetic divergence patterns. Nature 468, 815–818 (2010).
Article CAS ADS Google Scholar

Download references

Acknowledgements

This work has been financially supported by the Academy of Finland (grant numbers 258048, 272836 and 136464), the Biocenter Finland and the Research Council of Norway. We thank T. Haugen, G. Thomassen and N. Barson for fieldwork assistance and sampling the grayling embryos. The Turku Proteomics facility, L. Peil and R. Moulder are thanked for help with the proteomics experiment. We also thank the Finnish Centre for Scientific Computing for providing computational resources.

Author information

Mei Ning
Present address: Present address: Institute for Nutritional Sciences, Shanghai Institute of Biological Sciences (SIBS), 294 Taiyuan Road, Shanghai, China,
Craig R. Primmer and Erica H. Leder: These authors contributed equally to this work

Authors and Affiliations

Division of Genetics and Physiology, Department of Biology, University of Turku, Pharmacity, Itäinen Pitkäkatu 4, 20520 Turku, Finland,
Spiros Papakostas, Matthieu Bruneaux, Tutku Aykanat, Mei Ning, Craig R. Primmer & Erica H. Leder
Department of Biosciences, Centre for Ecological and Evolutionary Synthesis (CEES), University of Oslo, P.O. Box 1066 Blindern, Oslo, NO-0316, Norway
L. Asbjørn Vøllestad
Department of Biology, Laboratory of Aquatic Ecology, Evolution and Conservation, KU Leuven, Ch. Deberiotstraat 32,, Leuven, 3000, Belgium
Joost Vanoverbeke

Authors

Spiros Papakostas
View author publications
You can also search for this author in PubMed Google Scholar
L. Asbjørn Vøllestad
View author publications
You can also search for this author in PubMed Google Scholar
Matthieu Bruneaux
View author publications
You can also search for this author in PubMed Google Scholar
Tutku Aykanat
View author publications
You can also search for this author in PubMed Google Scholar
Joost Vanoverbeke
View author publications
You can also search for this author in PubMed Google Scholar
Mei Ning
View author publications
You can also search for this author in PubMed Google Scholar
Craig R. Primmer
View author publications
You can also search for this author in PubMed Google Scholar
Erica H. Leder
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.R.P., E.H.L., L.A.V. and S.P. conceived the study. L.A.V., E.H.L. and C.R.P. designed and conducted the common garden experiment. S.P., N.M., E.H.L. and C.R.P. designed and conducted the proteomics experiment. S.P., M.B., T.A. and J.V. analysed the data. S.P., C.R.P., M.B. and E.H.L. wrote the manuscript. All authors read and commented on the manuscript.

Corresponding author

Correspondence to Craig R. Primmer.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures 1-11, Supplementary Tables 1-6, Supplementary Methods and Supplementary References (PDF 2761 kb)

Supplementary Data 1

P-values and bootstrap support for the effect of GO counts on protein expression change (XLSX 48 kb)

Supplementary Data 2

P-values and bootstrap support for the effect of PPI number on protein expression change (XLSX 43 kb)

Supplementary Data 3

Results for protein identification and relative protein quantification of the 6-protein mix (XLSX 110 kb)

Supplementary Data 4

R scripts used to perform the analyses described in the article (ZIP 528 kb)

Supplementary Data 5

Identification and quantification details of the 408 studied grayling proteins (XLSX 410 kb)

Supplementary Data 6

GO counts for the 408 grayling proteins used in the study (XLSX 139 kb)

Supplementary Data 7

PPI number for the 408 grayling proteins used in the study (XLSX 68 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/

Reprints and permissions

About this article

Cite this article

Papakostas, S., Vøllestad, L., Bruneaux, M. et al. Gene pleiotropy constrains gene expression changes in fish adapted to different thermal conditions. Nat Commun 5, 4071 (2014). https://doi.org/10.1038/ncomms5071

Download citation

Received: 21 February 2014
Accepted: 08 May 2014
Published: 03 June 2014
DOI: https://doi.org/10.1038/ncomms5071

This article is cited by

Weak gene–gene interaction facilitates the evolution of gene expression plasticity
- Hao-Chih Kuo
- Cheng-Te Yao
- Chih-Ming Hung
BMC Biology (2023)
Selection drives convergent gene expression changes during transitions to co-sexuality in haploid sexual systems
- Guillaume G. Cossard
- Olivier Godfroy
- Susana M. Coelho
Nature Ecology & Evolution (2022)
Global systematic diversity, range distributions, conservation and taxonomic assessments of graylings (Teleostei: Salmonidae; Thymallus spp.)
- Steven J. Weiss
- Duarte V. Gonçalves
- Elsa Froufe
Organisms Diversity & Evolution (2021)
Gene expression changes associated with the evolutionary loss of a metabolic trait: lack of lipogenesis in parasitoids
- Mark Lammers
- Ken Kraaijeveld
- Jacintha Ellers
BMC Genomics (2019)
The genetic architecture of adaptation: convergence and pleiotropy in Heliconius wing pattern evolution
- Jake Morris
- Nicolas Navarro
- Kanchon K. Dasmahapatra
Heredity (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.