The proteome of human brain synapses is highly complex and is mutated in over 130 diseases. This complexity arose from two whole-genome duplications early in the vertebrate lineage. Zebrafish are used in modelling human diseases; however, their synapse proteome is uncharacterized, and whether the teleost-specific genome duplication (TSGD) influenced complexity is unknown. We report the characterization of the proteomes and ultrastructure of central synapses in zebrafish and analyse the importance of the TSGD. While the TSGD increases overall synapse proteome complexity, the postsynaptic density (PSD) proteome of zebrafish has lower complexity than that of mammals. A highly conserved set of ∼1,000 proteins is shared across vertebrates. PSD ultrastructural features are also conserved. Lineage-specific proteome differences indicate that vertebrate species evolved distinct synapse types and functions. The data sets are a resource for a wide range of studies and have important implications for the use of zebrafish in modelling human synaptic diseases.
Synapses are the hallmark of the central nervous system. Although synapses were originally considered to be simple connectors between neurons, they are now recognized to be highly sophisticated computational units built from proteomes containing in excess of 1,000 proteins that regulate the behavioural repertoire1,2,3,4. Proteomic studies revealed that genetic disruption of postsynaptic density (PSD) proteins results in over 130 human mental and neurological disorders5,6. These disorders are now known as synaptopathies7,8 and include complex genetic disorders such as intellectual disability, autism spectrum disorders and schizophrenia5,9,10.
Comparative proteomic and genomic approaches have been used to study the evolutionary origins of vertebrate postsynaptic complexity. The major classes of vertebrate synapse proteins evolved in unicellular eukaryotes, and these proteins were recruited into synapses in early invertebrates11. Subsequent whole-genome duplications (WGD) have played a major role in generating and shaping the complexity of vertebrate synapse proteomes. Two WGDs in early vertebrates resulted in a major expansion of the synapse proteome, which distinguishes vertebrate synapses from invertebrate synapses. These events, which occurred ∼550 million years ago12, generated ohnologues (paralogues arising from WGDs) that subsequently diversified, potentially contributing to the enhanced cognitive abilities and behavioural repertoire of vertebrates2. Importantly, there was another WGD ∼300 million years ago13 in the major clade of the fish lineage, known as the teleost-specific genome duplication (TSGD). Although the TSGD increased the number of protein-coding genes in zebrafish14 and other fish compared to mammals, it is unknown whether this influenced synapse proteome complexity. While the mammalian synaptic proteome has been characterized, the absence of similar data from teleosts limits our knowledge of the evolution of synapses in vertebrates and the roles of WGDs.
The freshwater fish Danio rerio (zebrafish) is a teleost that is now widely used in neuroscience for modelling human genetic brain disorders including those that disrupt synapse proteins15,16,17. Here we report the first characterization of the ultrastructure, proteome composition and evolution of zebrafish central nervous synapses. The proteome of synaptosomes and the PSD of zebrafish and mice were analysed in parallel and their complexity compared. Surprisingly, despite zebrafish having an extra WGD the PSD proteome was less complex when compared with mammals. We identify a core ‘vertebrate PSD’ (vPSD) that corresponds to the ancestral postsynaptic machinery common to all vertebrates and conserved ultrastructural features. We have also identified proteins that are only present in the mouse proteome, representing molecular innovations either acquired by mammals after divergence of the fish lineage or specifically lost from the fish lineage. We have made these data freely available in a database and web resource that includes links to a wide variety of related biomedical data sets (http://www.genes2cognition.org/publications/zebrafish-prot/).
Ultrastructure of zebrafish synapses
Before analysing the synapse proteome we performed an ultrastructural analysis of central synapses in zebrafish to address two questions: do zebrafish synapses contain PSDs (a prerequisite for their biochemical isolation), and, if so, do they show any morphological features that are conserved with mammals? Moreover, to our knowledge, the ultrastructure of zebrafish brain synapses has not been previously described, although studies in other bony fish species were reported several decades ago18,19,20,21,22. We therefore examined the four main regions of the zebrafish brain with transmission electron microscopy (Fig. 1, Supplementary Note 1, Supplementary Figs 1–5 and Supplementary Table 1). In olfactory bulb, telencephalon, midbrain and hindbrain (Fig. 1b) asymmetric synapses presented structures equivalent to mammalian PSDs (Fig. 1c). While PSDs were identified across the entire transverse sections of olfactory bulb and telencephalon, in midbrain and hindbrain these were restricted to the optic tectum and cerebellar corpus, respectively, which are layered, cortex-like structures located in the most dorsal part of the brain (Supplementary Figs 3 and 4). Asymmetric synapses from the olfactory bulb were morphologically similar to those found in mammals and other bony fish species20,21 (Fig. 1c,d and Supplementary Fig. 1). The zebrafish olfactory bulb presented the characteristic dendrodendritic synapses23 of this brain region, which contain synaptic vesicles on both sides of the synaptic cleft (Fig. 1d and Supplementary Fig. 1). Synapses in the telencephalon also showed the prototypical characteristics of mammalian synapses with PSDs present in spine-like structures (Fig. 1e and Supplementary Fig. 2). At the level of the optic tectum, PSDs were mainly present in the medial layers of this structure (Supplementary Fig. 3b, orange delimited area).
Presynaptic boutons in the optic tectum appeared to make synaptic contacts directly on dendritic shafts rather than on spines as suggested by the small diameter of postsynaptic elements and the presence of microtubule-like structures beneath the PSD (Fig. 1f and Supplementary Fig. 3).
Synapses in the cerebellar corpus showed distinct features and could be classified into different types. Although we observed synapses with flat PSDs and aligned pre- and postsynaptic membranes (Fig. 1g and Supplementary Fig. 5a), these were a minority (13%, Supplementary Fig. 5c). Most synapses presented highly curved PSDs and a presynaptic element surrounding the postsynaptic spine (Fig. 1h–i and Supplementary Figs 4 and 5b). In mammals, cerebellar glutamatergic synapses present a presynaptic element that partially surrounds the postsynaptic structure24; however, this feature is greatly enhanced in the zebrafish cerebellum. When the arc length of curved PSDs and their postsynaptic elements is measured, these PSDs can be divided into short and long PSDs (Supplementary Fig. 5d–i). Thus, at the zebrafish cerebellar corpus we could identify three PSD shapes: ‘flat’, ‘round-short’ and ‘round-long’ (Supplementary Fig. 5 and Supplementary Table 1). Finally, we compared PSD length and area across all four brain regions and found that flat PSDs and cerebellum round-short PSDs have similar sizes and areas (Supplementary Fig. 5j–l), while telencephalon and cerebellum round-long PSDs were larger, with the cerebellum PSDs being the largest. These studies show a diversity of synapse ultrastructure in zebrafish and characteristic features shared with mammalian synapses.
Proteomic profiling of zebrafish and mouse synapses
We purified synaptosomal (SYN) and PSD fractions in triplicate from mouse and zebrafish brains using identical protocols, generating equivalent yields and expected PSD enrichments (Supplementary Fig. 6). These data indicate that the performance of the biochemical methods was equivalent between species, although stochastic effects on small numbers of proteins cannot be fully ruled out. Quantitative mass spectrometry-based proteomic analysis (see Methods, Supplementary Note 2 and Supplementary Fig. 7) identified a total of 3,579 and 3,840 proteins in triplicate for mouse and zebrafish, respectively (Fig. 2a and Supplementary Data 1 and 2). In order to define which proteins are enriched/depleted in PSDs compared to their parent SYN fractions, we used a label-free quantification and statistical analysis of the mass spectrometry data. This identified proteins significantly enriched in SYN (depleted from PSD), which were removed from the final list of PSD components as potential purification contaminants (Fig. 2b–d and Supplementary Data 1 and 2). Thus, we document 3,223 and 2,128 proteins in mouse SYN and PSD fractions, respectively, and 3,640 and 1,758 proteins in the corresponding zebrafish structures (Fig. 2e,f). The greater number of zebrafish SYN proteins compared to mouse likely reflects the greater number of protein-coding genes in the zebrafish genome, which is supported by the finding that the proportion of SYN genes relative to genome size is 14% in both species. Surprisingly, the zebrafish PSD proteome was 17% smaller than mouse (P<1E−06, binomial test). This difference is still significant if the PSD is defined as the sum of zebrafish proteins exclusively found in the PSD or significantly enriched in it (P=0.0003, binomial test). The zebrafish PSD was only 48% of the SYN proteome compared to the 66% in mouse. 
Hence, despite the TSGD and the concomitant expansion of the zebrafish synapse proteome, the zebrafish PSD is of smaller size than that found in mammals.
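The proportions quoted above can be reproduced from the reported protein counts. The following is a minimal sketch in Python: the counts are taken from the text, whereas the binomial calculation uses one illustrative null hypothesis (that the zebrafish PSD makes up the same fraction of its SYN proteome as in mouse), which is an assumption and not necessarily the test the authors ran. The function and variable names are hypothetical.

```python
import math

# Protein counts reported in the text.
mouse_syn, mouse_psd = 3223, 2128
zf_syn, zf_psd = 3640, 1758


def pct(part, whole):
    """Return part as a percentage of whole."""
    return 100.0 * part / whole


psd_share_mouse = pct(mouse_psd, mouse_syn)              # PSD as % of mouse SYN
psd_share_zf = pct(zf_psd, zf_syn)                       # PSD as % of zebrafish SYN
psd_size_reduction = pct(mouse_psd - zf_psd, mouse_psd)  # how much smaller the zebrafish PSD is


def binom_lower_tail(k, n, p):
    """P(X <= k) for X ~ Binomial(n, p), with each term computed in log space."""
    log_p, log_q = math.log(p), math.log1p(-p)
    total = 0.0
    for i in range(k + 1):
        log_term = (
            math.lgamma(n + 1) - math.lgamma(i + 1) - math.lgamma(n - i + 1)
            + i * log_p + (n - i) * log_q
        )
        total += math.exp(log_term)
    return total


# Assumed null (illustrative only): zebrafish PSD/SYN fraction equals the mouse fraction.
p_null = mouse_psd / mouse_syn
p_value = binom_lower_tail(zf_psd, zf_syn, p_null)

print(f"PSD/SYN: mouse {psd_share_mouse:.0f}%, zebrafish {psd_share_zf:.0f}%")
print(f"Zebrafish PSD is {psd_size_reduction:.0f}% smaller than mouse PSD")
print(f"One-sided binomial tail under the assumed null: {p_value:.3g}")
```

Under this assumed null the tail probability is far below the reported threshold, in line with the direction of the published result, although the exact value depends on how the test is set up.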
Teleost genome duplication expanded synapse protein families
Many well-known families of synaptic proteins were found with an expanded number of ohnologues in the zebrafish. For example, zebrafish show twice as many ionotropic glutamate receptor subunits in the NMDA (N-methyl-D-aspartate) and AMPA (α-amino-3-hydroxy-5-methyl-4-isoxazole propionic acid) families and more scaffold proteins in the PSD95/Dlg family (six in zebrafish and four in mice, Supplementary Data 2). We therefore asked whether family expansion was a common feature among SYN and PSD proteins in zebrafish. Using the Ensembl Families classification we found that both zebrafish SYN (Fig. 3a) and PSD (Fig. 3b) proteomes contain protein families with a significantly higher number of components.
We next asked whether synapse genes were more likely to be retained after the TSGD than genes expressed elsewhere in the brain or in other tissues. We calculated the fraction of proteins belonging to the orthology types (zebrafish:mouse: 1:1, 1:many, many:1, many:many and unique to each species; Fig. 3c). The many:1 category is increased in synaptosomes and PSD protein families compared to the genome-wide ratio, indicating that synapse genes have been retained at higher frequencies after the TSGD than seen in the genome as a whole. To quantify these differences we determined the number of orthologues in each species and calculated the ratio of zebrafish:mouse orthologues (Fig. 3d). While most genes have a 1:1 ratio between species, a clear peak appears at 2:1, representing genes with double the number of orthologues in zebrafish, compared to a small peak at 1:2, representing genes duplicated in mouse. This ratio is also seen in other teleosts but not in the Spotted Gar (Lepisosteus oculatus), a fish whose lineage diverged before the teleost-specific WGD25 (Supplementary Fig. 8). Examples of the increased ratio of orthologues in key synaptic proteins among fish species but not in the Gar, and other vertebrate and invertebrate species, are shown in Fig. 3e. These data support the assumption that the trend for gene family expansion in zebrafish is a legacy of the TSGD rather than the result of a specific loss of genes in mammalian genomes. The distribution of orthologue ratios for SYN and PSD proteomes is statistically different to the whole-genome proteomes, and the PSD is even statistically different to the brain proteome. No statistical difference was seen between SYN and PSD proteomes (Fig. 3d). These results show that following the TSGD zebrafish synapse proteome-encoding genes, especially PSD ones, were more frequently retained as duplicates.
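The orthology-type classification and the zebrafish:mouse orthologue ratio described above can be illustrated with a short Python sketch. The group names and gene counts below are hypothetical toy values, not data from this study, and the function name is an assumption introduced for illustration.

```python
from collections import Counter

# Hypothetical toy input: orthology group -> (zebrafish gene count, mouse gene count).
groups = {
    "groupA": (1, 1),
    "groupB": (2, 1),
    "groupC": (2, 1),
    "groupD": (1, 2),
    "groupE": (3, 2),
    "groupF": (1, 0),
}


def orthology_type(n_zf, n_mm):
    """Classify a group using the zebrafish:mouse convention of the text."""
    if n_zf == 0:
        return "mouse-only"
    if n_mm == 0:
        return "zebrafish-only"
    zf = "1" if n_zf == 1 else "many"
    mm = "1" if n_mm == 1 else "many"
    return f"{zf}:{mm}"


# Tally how many groups fall into each orthology type.
type_counts = Counter(orthology_type(zf, mm) for zf, mm in groups.values())

# Zebrafish:mouse orthologue ratio, defined only when both species have a gene.
ratios = {name: zf / mm for name, (zf, mm) in groups.items() if zf and mm}

print(dict(type_counts))
print(ratios)
```

In this toy example a ratio of 2.0 corresponds to the 2:1 peak discussed in the text and a ratio of 0.5 to the 1:2 peak.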
Functional complexity of the synapse proteome
To understand the functional implications of the different complexity of zebrafish and mouse synapse proteomes, we examined protein diversity. We first considered high-level categories corresponding to protein cellular location and molecular function from the ingenuity pathway analysis (IPA) knowledgebase functional classification system: no significant differences were observed between the percentages of protein location or function, indicating a general conservation of the molecular characteristics of SYN and PSD proteomes between species (Fig. 4a,b and Supplementary Data 3). To corroborate this finding we compared enriched functional categories from two other classification systems: the Gene Ontology (GO-Slim) and the Panther Protein Class ontology. Most of the significantly enriched functional categories were found in both species (Fig. 4c and Supplementary Data 3), supporting the conclusion that the overall functionality of mammalian and teleost synapse proteomes is conserved.
Nevertheless, as we observed differences in the number of PSD proteins between species, we asked whether there might be differences in the number of protein families, using the Ensembl Protein Family annotation. Zebrafish showed fewer families in both SYN and PSD proteomes even when accounting for proteome size (Table 1 and Supplementary Data 4). To correct for possible genome annotation differences between species, we obtained Ensembl Protein Family IDs from mouse orthologues of zebrafish proteins and repeated the analysis, obtaining the same results (Table 1). The number of zebrafish PSD families was significantly lower than expected for zebrafish PSD proteins or mouse orthologues of zebrafish PSD proteins (Table 1). Thus, the lower PSD complexity in zebrafish results from fewer protein families.
Since protein domains contribute to functional complexity, we examined domain composition (number of unique protein domain types/protein, Supplementary Data 5) in synaptic proteins, mammalian and zebrafish synaptic brain proteomes26,27,28,29 and all mouse and zebrafish coding proteins (Supplementary Data 6). We did not find a statistically significant difference of domain complexity between species for any of the proteomes. However, within species we found that the SYN and PSD proteomes do have higher complexity than brain or genome (number of unique protein domain types/protein mouse: genome=1.46, brain=1.50, SYN=1.66, PSD=1.64; zebrafish: genome=1.46, brain=1.57, SYN=1.67, PSD=1.75), suggesting that SYN and PSD represent specialized proteomes with higher functional complexity compared to the brain or whole proteome. Cumulative distributions (Fig. 4d,e) show significant increase in unique domains per protein in SYN and PSD compared to brain and genome data sets.
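The domain-complexity metric used above (number of unique protein domain types per protein) can be sketched as follows. The protein names and domain annotations are hypothetical toy values, and counting proteins without annotated domains as zero is an assumption of this sketch rather than a statement of how the published values were computed.

```python
from statistics import mean

# Hypothetical toy annotation: protein -> list of domain hits (repeats allowed).
domains_by_protein = {
    "protA": ["PDZ", "PDZ", "PDZ", "SH3", "GK"],
    "protB": ["Kinase"],
    "protC": ["SH3", "SH3"],
    "protD": [],
}


def unique_domain_types(domain_hits):
    """Count distinct domain types, ignoring repeated copies of the same domain."""
    return len(set(domain_hits))


def domain_complexity(annotation):
    """Mean number of unique domain types per protein in a proteome."""
    return mean(unique_domain_types(hits) for hits in annotation.values())


complexity = domain_complexity(domains_by_protein)
print(f"Unique domain types per protein: {complexity:.2f}")
```

Applying the same function to genome, brain, SYN and PSD protein lists would give values comparable in kind to those reported in the text.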
Species specialization in the PSD
We next focussed our attention on identifying those biological functions that were specific to either zebrafish or mouse PSDs. The zebrafish-specific PSD (Zf-sPSD, 523 proteins) and mouse-specific PSD (Mm-sPSD, 745 proteins) proteomes were examined for enrichment of GO terms from biological process and Cellular Component categories. To avoid potentially misleading differences between species arising from the biochemical fractionation, for a protein to be considered species-specific it had to be absent from both the PSD and SYN proteomes in the reciprocal species. To account for the possibly less complete annotation of the zebrafish genome, the enrichment analysis with zebrafish proteins was done twice, first using Zf-sPSD proteins against the zebrafish genome and later using mouse orthologues of Zf-sPSD against the mouse genome. The final list of zebrafish-enriched terms corresponded to the sum of terms enriched in both analyses.
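The species-specificity criterion described above amounts to a set difference: a PSD protein is species-specific only if it is absent from both the PSD and SYN proteomes of the reciprocal species. The Python sketch below uses hypothetical toy identifiers and assumes, for simplicity, that proteins from both species have already been mapped to shared one-to-one orthologue identifiers, which is a simplification of real orthology relationships.

```python
# Hypothetical toy data: proteins named by a shared orthologue identifier.
zf_psd = {"g1", "g2", "g3", "g4"}
zf_syn = {"g1", "g2", "g3", "g4", "g5"}
mm_psd = {"g1", "g2", "g6", "g7"}
mm_syn = {"g1", "g2", "g3", "g6", "g7", "g8"}


def species_specific_psd(own_psd, other_psd, other_syn):
    """PSD proteins absent from both the PSD and SYN of the reciprocal species."""
    return own_psd - (other_psd | other_syn)


zf_spsd = species_specific_psd(zf_psd, mm_psd, mm_syn)
mm_spsd = species_specific_psd(mm_psd, zf_psd, zf_syn)

print(sorted(zf_spsd))
print(sorted(mm_spsd))
```

Note that "g3" is in the zebrafish PSD but is excluded from the zebrafish-specific set because it is detected in the mouse SYN fraction, which is the safeguard against fractionation differences described in the text.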
A large difference in the number of significantly enriched terms was found between the two species-specific proteomes: the Mm-sPSD presented 97 biological process and 66 cellular component-enriched terms, while only 17 and 8 were found in Zf-sPSD. Most (80%) Mm-sPSD proteins presented an orthologue in the zebrafish genome (Supplementary Note 3 and Supplementary Fig. 9), indicating that gene loss in zebrafish or gene gain in mouse is not the only factor driving PSD functional differences observed between species. Terms enriched in zebrafish were not obviously relevant to synaptic biology, whereas Mm-sPSD proteins were enriched in terms such as ‘postsynaptic density’, ‘synapse’ or ‘regulation of synapse structure or activity’ (Supplementary Data 7). This is consistent with specialized synaptic proteins found in mammalian synapses being absent from the Zf-sPSD. In addition to these proteins, most of the other mouse-enriched terms fall into a few functions embracing endocytosis, vesicle-mediated intracellular trafficking, protein localization to the plasma membrane and actin filament-based processes (Supplementary Data 7).
Among proteins involved in vesicle traffic and endocytosis, particularly noticeable was the differential presence at the mouse PSD of many proteins forming SNARE complexes. These included syntaxins, synaptobrevins (vamps) and ‘soluble N-ethylmaleimide-sensitive factor (NSF) attachment proteins’ (SNAP) as well as syntaxin-binding proteins (Sec1/Munc18) and synaptotagmins (Fig. 5a). Interestingly, syntaxins and syntaxin-interacting proteins enriched in the mouse PSD are involved in endocytic pathways, while those participating in presynaptic exocytosis were depleted from it (Fig. 5a), suggesting that the former are not biochemical contaminants. These included proteins with very well-established presynaptic functions, such as Snap25 or Vamp2. While these might be biochemical contaminants of the PSD preparation, several recent publications30,31 have given evidence for their participation in postsynaptic processes. Thus, their localization in the PSD cannot be excluded. The mouse PSD was also specifically enriched in other complexes involved in endocytosis, including constituents of the ‘endosomal-sorting complexes required for transport’ (ESCRT) and ‘homotypic fusion and vacuole protein sorting’ (HOPS), and other key proteins for vesicle-mediated protein transport (Fig. 5b,c). To further explore this observation, we looked for these proteins in the PSD from human5,32,33, rat34,35,36,37 and an independent and recently generated mouse PSD proteome38. In all species we found more representatives of all these protein complexes than in zebrafish, with the exception of ESCRT components in rat (Supplementary Data 8).
To further investigate the depletion of some mammalian proteins from the zebrafish PSD, we asked whether the orthologous genes encoding these proteins were present in the zebrafish genome and, if so, whether they were expressed at low levels. Of the 745 mouse PSD proteins absent from zebrafish synaptic proteomes (SYN+PSD), 80% have orthologues in the zebrafish genome (Supplementary Fig. 9). To examine the possibility that those might be expressed at low levels in the zebrafish brain, we examined the 84 proteins shown in Fig. 5 using RNA sequencing data and found that the expression of most of these genes (63/84=75%) is low (less than 10 transcripts per million (TPM)) or very low (less than 1 TPM; Fig. 5 and Supplementary Fig. 10). The percentage of lowly expressed genes in this protein-depleted group is greater than what we see for all SYN- and PSD-encoding genes, where the percentage of detectable genes with TPM<10 is 51% and 53%, respectively. This suggests that proteins depleted from the SYN and PSD of zebrafish show a corresponding low expression of encoding mRNA in the brain. Together, these findings indicate that both low levels of expression and absence of orthologues are mechanisms contributing to the depletion of synapse proteins from the zebrafish synapse.
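The expression thresholds used above (low: less than 10 TPM; very low: less than 1 TPM) can be expressed as a simple classifier. In the Python sketch below the gene names and TPM values are hypothetical toy data; only the two thresholds and the 63/84 proportion come from the text.

```python
# Hypothetical toy expression values in transcripts per million (TPM).
tpm_by_gene = {"geneA": 0.4, "geneB": 3.2, "geneC": 9.9, "geneD": 25.0}

LOW_TPM = 10.0       # threshold for "low" expression, from the text
VERY_LOW_TPM = 1.0   # threshold for "very low" expression, from the text


def expression_class(tpm):
    """Assign a gene to the expression classes used in the text."""
    if tpm < VERY_LOW_TPM:
        return "very low"
    if tpm < LOW_TPM:
        return "low"
    return "other"


classes = {gene: expression_class(tpm) for gene, tpm in tpm_by_gene.items()}
low_or_very_low = sum(c != "other" for c in classes.values())
toy_fraction = low_or_very_low / len(classes)

# Proportion reported in the text for the 84 proteins shown in Fig. 5.
reported_fraction = 63 / 84

print(classes)
print(f"Toy fraction below 10 TPM: {toy_fraction:.0%}")
print(f"Reported fraction: {reported_fraction:.0%}")
```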
To further test the hypothesis that mouse-specific PSD proteins added new functionalities to the PSD, we repeated the GO enrichment analysis with human5,32,33 and rat34,35,36,37 PSD proteins absent from the zebrafish synapse (Supplementary Data 7). For this extended analysis we also combined our mouse data with PSD proteins identified in other mouse studies38,39,40. We then looked for those terms significantly enriched in all species examined (Supplementary Data 7). We again found many enriched GO terms related to vesicle-mediated protein traffic, endocytosis and localization to the plasma membrane (Table 2). The rest of the enriched terms could be grouped into those related to actin filament organization, cation transport through the membrane and cell junctions involving the actin cytoskeleton (adherens junctions; Table 2). Finally, we looked for enriched KEGG pathways among PSD-specific proteins, as KEGG uses an annotation system different from that of GO. Again, we found that mammalian-specific PSD proteins are involved in endocytosis, regulation of actin cytoskeleton and adherens junctions among other pathways (Supplementary Data 7). Altogether, these analyses suggest that, compared with that of mammals, the zebrafish PSD has a reduced functional repertoire related to vesicle-mediated trafficking.
A conserved vertebrate synapse proteome
The comparison of vertebrate synapse proteomes from species separated from a common ancestor for over 400 million years provides an opportunity to identify the conserved elements within this highly complex structure, which will likely underpin the function of most vertebrates. We therefore sought to define the common set of vertebrate PSD proteins (vPSD) by identifying zebrafish PSD proteins with an orthologue in the mouse PSD. Accordingly, the vPSD consists of 1,101 proteins (Supplementary Data 9), including proteins from 12 major functional groups such as cytoskeletal proteins, ribosomal proteins, kinases, phosphatases, adenylate cyclase or small GTPases among others (Fig. 6a).
We next performed a set of analyses that show PSD protein sequences are remarkably conserved across vertebrates. First, SYN and PSD protein conservation was significantly higher than that of the average protein encoded in the genome in both species (median % of identity between zebrafish and mouse: for all zebrafish proteins, 49; SYN, 70; PSD, 72; Supplementary Fig. 11a; median % of protein identity between mouse and zebrafish for all mouse proteins, 49; SYN, 70; PSD, 70; Supplementary Fig. 11b). Second, we compared SYN and PSD protein conservation over ∼90 million years since humans and mice shared a common ancestor, and found higher identity in PSD compared to SYN proteins (Supplementary Fig. 11c) or in PSD-enriched proteins as compared to PSD-depleted ones (Supplementary Fig. 11d). Third, SYN and PSD were significantly more conserved than other proteins expressed in the brain (Supplementary Fig. 11e–h), which already exhibit high conservation41. Fourth, the vPSD showed a greater level of conservation than the entire PSD (Fig. 6b,c). This observation held true when the vPSD was obtained by comparing zebrafish PSD proteins with human5,32,33, mouse38,39,40 and rat34,35,36,37 PSD proteomes (median vPSD protein identity 66% and median mammalian-specific PSD protein identity 61.5%; significantly different, Mann–Whitney U-test, P<0.0001). Fifth, we asked whether the species-specific PSD (Zf-sPSD and Mm-sPSD) proteins also showed this high conservation, and found that their conservation was significantly lower, even when compared with whole-brain proteomes (Fig. 6b,c). To further validate these findings we took PSD proteins specific to each species and eliminated those found in the SYN fraction of the other species. Again, vPSD proteins showed higher percentages of protein identity than Zf-sPSD and Mm-sPSD (Fig. 6d). Thus, the vPSD is a highly conserved set of ∼1,000 proteins common to vertebrate species that shared an ancestor ∼450 million years ago.
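The Mann–Whitney U-test mentioned above compares two groups of identity percentages by their ranks. The Python sketch below computes only the U statistics, using average ranks for ties; it does not compute a P value, and the identity values are hypothetical toy numbers, not data from this study. The function names are introduced here for illustration.

```python
def average_ranks(values):
    """Rank values from 1, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def mann_whitney_u(sample_a, sample_b):
    """Return (U_a, U_b) computed from the rank sum of sample_a."""
    ranks = average_ranks(list(sample_a) + list(sample_b))
    n_a, n_b = len(sample_a), len(sample_b)
    rank_sum_a = sum(ranks[:n_a])
    u_a = rank_sum_a - n_a * (n_a + 1) / 2
    u_b = n_a * n_b - u_a
    return u_a, u_b


# Hypothetical toy identity percentages for two protein sets.
conserved = [70, 72, 66, 80, 75]
specific = [61, 55, 66, 58]
u_conserved, u_specific = mann_whitney_u(conserved, specific)

print(u_conserved, u_specific)
```

A large U for one group relative to the product of the two sample sizes indicates that its values tend to rank higher, which is the pattern the text reports for the vPSD relative to species-specific PSD proteins.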
Our study of zebrafish synapse proteomes has led to a number of new insights into the evolution of synapses. First, retention of duplicated synapse genes following the TSGD has generated an increase in molecular complexity in zebrafish. Second, despite this increase in proteome size, the PSD complexity was lower in zebrafish than in mammals. Third, the characterization of a conserved vPSD indicates that high molecular complexity is a core feature across bony fish, amphibians, reptiles, birds and mammals. Fourth, lineage-specific changes in proteins around this vPSD result in species-specific differences in synapse composition.
Our data show that vertebrate synapse proteomes have been shaped by multiple WGDs11,42 including the TSGD around 300 million years ago13 and two WGDs 150 million years earlier. Following a WGD, most duplicated genes accumulate deleterious mutations becoming pseudogenes43 so that only a small number of the originally duplicated genes are retained. For instance, the last update of the zebrafish genome identifies 3,440 gene pairs (ohnologues) remaining from the TSGD14, representing a retention of ∼25% of the novel duplicates. Nevertheless, when considering the number of paralogues found in zebrafish synaptic protein families we found a significant increase compared with mouse, indicating that after the TSGD many synaptic genes have been retained in the zebrafish genome. Indeed, we have shown that zebrafish genes expressed in synapses have been retained more frequently after the TSGD than other coding genes in the genome or other genes expressed in the brain, suggesting their functional importance. Consistent with this, studies reporting the types of genes retained in vertebrates after the two rounds of WGD show that among those more commonly retained are genes involved in synaptic function44. Our data support the idea that genes performing synaptic functions are retained at higher frequencies following successive rounds of WGD. This is consistent with the view that their sub- and/or neo-functionalization45 expanded synaptic molecular complexity and diversity, contributing to improved fitness.
Consistent with the conservation of the vPSD we found that many ultrastructural features of the postsynaptic density observed in mammals were found in zebrafish. Asymmetric synapses (containing PSDs) in olfactory bulb and telencephalon were particularly similar to those observed in mammals. However, some of the synapses identified in the optic tectum and cerebellum presented particular morphologies. These have also been reported in other bony fish species18,19,22, further supporting synaptic diversity across vertebrates. Particularly remarkable were asymmetric synapses from the optic tectum, which show a very clear PSD but not an equally obvious dendritic spine, as the presynaptic boutons contacted thin structures that might correspond with dendritic shafts. Similar observations have been made in a few other teleost species18,19. Yet, in the superior colliculus of mammals, the homologous brain region to the fish optic tectum, asymmetric synapses are mainly formed on dendritic spines46. Since previous studies in mice show that synapse proteins appearing after the two WGDs contributed to synaptic diversity42, future neuroanatomical studies could determine whether synapse proteins arising from the TSGD are allocated into different individual synapses in the zebrafish brain.
We unexpectedly found several lines of evidence that highlight the specialization of the synapse proteome. First, while studying the number of unique domains found per protein, we observed a strong increase in the PSD and synaptosome proteins (in both mouse and zebrafish) compared with whole-brain proteomes or all protein-coding genes in the genome. Second, in previous work we have reported that PSD proteins have been subjected to very high levels of sequence conservation during mammalian evolution5,39, and the present study indicates that this evolutionary constraint has occurred throughout vertebrate evolution, and not only for postsynaptic proteins, but for synaptic molecules overall. This high conservation suggests that the proteins are important for fitness, and consistent with this, disease-causing mutations have been documented in several hundred different genes encoding the human postsynaptic proteome. We suggest that the vPSD data set will be particularly valuable for future human genetic studies and behavioural genetic screens in zebrafish.
The identification of mammalian-specific PSD proteins, which were absent from the zebrafish synapse, opens new lines of investigation into the mammalian brain. No relevant differences were observed when comparing human and mouse postsynaptic proteomes39, suggesting that the presence of these proteins may be a key difference between mammals and fish. Our observations that there were fewer orthologues and low levels of mRNA expression suggest that gene loss and transcriptional regulatory changes are contributing mechanisms. In addition, there may be post-translational mechanisms such as protein stability. Although we cannot fully exclude the contribution of technical reasons, the observed enrichment in particular synaptic functions in the set of mammalian-specific PSD proteins suggests that there are biologically relevant differences between the zebrafish and mouse PSD. Particularly noticeable was the high number of SNARE complex components (Syntaxins, Vamps and SNAPs) and associated proteins (Sec1/Munc18s and Synaptotagmins). Importantly, only Syntaxins and Sec1/Munc18 proteins with a clear role in endocytosis47 were found in the mouse PSD, and several of these (Syntaxin 12 (ref. 48), SNAP23 (ref. 49) or SNAP47 (ref. 50)) play a role in AMPA receptor trafficking. In addition, HOPS and ESCRT complexes, which also participate in the endocytic machinery51,52, were found enriched among Mm-sPSD proteins and are also found in the PSD of other mammalian species. A recent study shows that some ESCRT components are at the mouse PSD, where they contribute to the regulation of synaptic plasticity and confer specific structural characteristics to the postsynaptic membrane53.
To extend and validate the analysis of Mm-sPSD we repeated the analysis with human and rat and confirmed that proteins incorporated into the PSD after the fish divergence added functionality related to vesicle-mediated protein traffic, protein location to the plasma membrane and actin filament organization as well as the regulation of cation transport and establishment of adherens junctions.
Our findings have several implications for the use of zebrafish as models of human brain disease. Zebrafish are used to model neurodegeneration15, depression54, autism17 and schizophrenia55 among others56 and for neuropharmacological16 and neurotoxicology57 research. These disorders and interventions directly and indirectly influence synapse protein structure and function. Therefore, it is potentially important to consider the following issues. The additional zebrafish-specific paralogues arising from the TSGD will increase redundancy and potentially mask phenotypes in mutations within that gene family. The additional paralogues may also have undergone species-specific neofunctionalization resulting in species-specific phenotypes. The species-specific differences in overall complexity alter many classes of proteins at multiple levels of signalling pathways, and therefore the postsynaptic signalling networks will have a different structure, potentially resulting in differential robustness and signalling capacity. It is also interesting to consider the finding that the vPSD is highly conserved, and perhaps this subset of the synapse proteome will be preferred when modelling human mutations. While these considerations may be important for studies aimed at modelling or treating human diseases, we also wish to highlight that these differences will be of interest in the study of fundamental synaptic physiology and behaviour of zebrafish. The demonstration of paralogue-specific behavioural functions in mice and conserved phenotypes in humans and mice2,3 illustrates that the synapse proteome complexity of zebrafish will be a major factor in their behavioural repertoire.
Synapse proteome data from mice and humans have been used in a wide range of applications. For example, the mammalian data have been used in many human genetic studies including those showing that schizophrenia is primarily a synaptic disorder where multiple susceptibility genes converge on the PSD10,58,59. Mouse proteome data have been used to show that the mRNAs that interact with the Fragile X Mental Retardation Protein predominantly target the PSD60. PSD proteome data were used to study the behavioural and physiological phenotypes controlled by synapses using the Mouse and Human phenotype ontologies5. Thus, we expect that the zebrafish synapse proteome data will be a valuable resource that can be exploited with many orthogonal data sets and technical approaches. All data and tables from this study are freely available through the Genes to Cognition database (http://www.genes2cognition.org/publications/zebrafish-prot/).
Mice (Mus musculus) and zebrafish (Danio rerio) were treated in accordance with British Home Office regulations (Animals (Scientific Procedures) Act 1986; Project Licence PPL80/2337 to Professor Seth Grant). Animal protocols were approved by the local ethical committee on animal experimentation at the Wellcome Trust Sanger Institute. Animals were housed in the Wellcome Trust Sanger Institute animal facility.
Zebrafish and mouse brain samples
We used whole-brain samples dissected from male and female adult D. rerio and 6–8-week-old mice of the 129 strain. After dissection, brain tissue was immediately frozen in liquid nitrogen and stored at −80 °C until used for extraction of synaptosomes and PSDs. Zebrafish were from the following strains: H Longfin, Tubingen Longfin, Tubingen, AB, WIK, LON and SAT. Before dissection, fish were killed by an overdose of the fish anaesthetic tricaine at 0.1% (w/v) and mice were killed by cervical dislocation. Three independent biological replicas were prepared for both zebrafish and mouse brain samples, and each contained 0.7–1 g of tissue. All mouse and zebrafish sample-processing steps were performed in parallel, and peptide fractions from all samples were analysed back-to-back by mass spectrometry within just over a week. Performance and sensitivity were monitored throughout. Sample size was established based on the standard in the field. No method was used to randomize animals between experimental groups, and investigators were not blinded to the species origin of each sample.
Electron microscopy was performed on brains from two adult specimens. In each case the following brain regions were studied: olfactory bulb, telencephalon, optic tectum and cerebellum. Zebrafish brains were dissected under cold primary fixative containing 2% paraformaldehyde and 2.5% glutaraldehyde in 0.1 M sodium cacodylate buffer at pH 7.42. Each brain was halved in mid-sagittal section and then separated in transverse sections into telencephalon (including olfactory bulb), optic tectum, cerebellum and medulla. These four compartments were fixed for the remainder of 2 h, rinsed and post-fixed in 1% osmium tetroxide for 1 h, mordanted with 1% tannic acid and dehydrated in an ethanol series, with en bloc staining in 2% uranyl acetate at the 30% stage. Following immersion in propylene oxide, the brain segments were embedded in TAAB 812 resin. Semi-thin sections (0.5 μm) were cut on a Leica UCT ultramicrotome and stained with toluidine blue on a microscope slide. Images were recorded on a Zeiss Axiovert CCD (charge-coupled device) camera and areas were selected for 50 nm ultrathin sectioning. Thin sections were collected on copper/palladium grids and contrasted with uranyl acetate and lead citrate before viewing on an FEI 120 kV Spirit Biotwin TEM and recording CCD images on an F4.15 Tietz camera. PSD lengths and areas were measured on electron microscopy images with the Fiji image-processing package61. Groups were compared through the median lengths and areas, and significant differences were assessed with the Kruskal–Wallis non-parametric test. All analyses were performed with the SPSS statistics software (IBM).
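The group comparison described above was run in SPSS. As an illustration only, the Kruskal–Wallis H statistic can be sketched in plain Python as follows. The function names are hypothetical, the measurements in the example are invented placeholders rather than data from this study, and the resulting H would still need to be compared against a chi-squared distribution with (number of groups − 1) degrees of freedom to obtain a P value.

```python
from itertools import chain


def average_ranks(values):
    """Return 1-based ranks, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def kruskal_wallis_h(groups):
    """Kruskal-Wallis H statistic with the standard correction for ties."""
    pooled = list(chain.from_iterable(groups))
    n = len(pooled)
    ranks = average_ranks(pooled)
    total = 0.0
    start = 0
    for group in groups:
        rank_sum = sum(ranks[start:start + len(group)])
        total += rank_sum ** 2 / len(group)
        start += len(group)
    h = 12.0 / (n * (n + 1)) * total - 3 * (n + 1)
    # Tie correction: 1 - sum(t^3 - t) / (n^3 - n) over groups of tied values.
    tie_sizes = {}
    for value in pooled:
        tie_sizes[value] = tie_sizes.get(value, 0) + 1
    correction = 1 - sum(t ** 3 - t for t in tie_sizes.values()) / (n ** 3 - n)
    if correction == 0:
        raise ValueError("all observations are identical")
    return h / correction


# Invented placeholder PSD lengths (nm) for three hypothetical regions.
example_h = kruskal_wallis_h([[210, 250, 230], [300, 280, 320], [190, 205, 215]])
```

Ranking the pooled observations once and then summing ranks per group mirrors the textbook definition of the test, which keeps the sketch easy to check by hand.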
Isolation and characterization of synaptosomes and PSDs
Mouse (M. musculus) and zebrafish (D. rerio) samples were fractionated in parallel, using previously reported methods39. Briefly, ∼1 g of whole-brain tissue was homogenized 9:1 (v:w) using a glass–teflon tissue grinder in a buffer containing Tris 50 mM, pH 7.4, 0.3 M sucrose, 5 mM EDTA and the protease inhibitors 1 mM phenylmethylsulphonyl fluoride (PMSF), 2 μM Aprotinin and 2 μM Leupeptin. The homogenate was centrifuged at 800g to pellet nuclei and cell debris; the resulting supernatant was then centrifuged at 16,000g and the pellet was resuspended 5:1 (v:w) in Tris 50 mM, pH 8.1, 5 mM EDTA, 1 mM PMSF, 2 μM Aprotinin and 2 μM Leupeptin, and chilled on ice for 45 min. Sucrose was added to a final 34% (w/w) concentration. A sucrose gradient was prepared with equal volumes of the following layers (bottom to top): sample, Tris 50 mM, pH 7.4, 0.85 M sucrose and Tris 50 mM, pH 7.4, 0.3 M sucrose. This gradient was then ultracentrifuged for 2 h at 60,000g and the interface between 34 and 28.5% sucrose was collected, diluted to 10% sucrose with Tris 50 mM, pH 7.4 and centrifuged again at 48,000g for 30 min. The pellet was resuspended in 1 ml of Tris 50 mM, pH 7.4 to generate the SYN fraction. A range of 5–10% of this solution was set apart and later used for proteomics profiling. To obtain the final PSD fraction, the remaining synaptosomal fraction was mixed with an equal volume of 3% Triton X-100 and chilled on ice for 30 min. The sample was finally layered on top of 10 ml of Tris 50 mM, pH 7.4, 0.85 M sucrose and centrifuged at 104,000g for 1 h to produce a pellet containing the PSD fraction, which was solubilized in Tris 50 mM, pH 7.4, 1% SDS. Enrichment of postsynaptic proteins in postsynaptic density fractions was assessed by immunoblotting using the postsynaptic marker protein PSD95 (antibody used: Affinity, ref. MA1-045).
Mass spectrometry-based proteomics
In-gel digestion was performed as reported previously5. Extracted peptides (six fractions per sample) were analysed using nanoLC-MS/MS on an LTQ-Orbitrap Velos (Thermo Fisher) hybrid mass spectrometer equipped with a nanospray source, coupled with an Ultimate 3000 Nano/Capillary LC System (Dionex). The system was controlled with Xcalibur 2.1 (Thermo Fisher) and DCMSLink 2.08 (Dionex). Peptides were desalted on-line using a micro-Precolumn cartridge (C18 Pepmap 100, LC Packings) and then separated using a 120 min reverse phase gradient (4–32% acetonitrile/0.1% formic acid) on an EASY-Spray column, 50 cm × 75 μm ID, PepMap C18, 2 μm particles, 100 Å pore size (Thermo). The LTQ-Orbitrap Velos was operated with a cycle of one MS (in the Orbitrap) acquired at a resolution of 60,000 at m/z 400, with the top 10 most abundant multiply charged (2+ and higher) ions in a given chromatographic window subjected to MS/MS fragmentation in the linear ion trap. An FTMS target value of 1e6 and an ion trap MSn target value of 5e3 were used, with the lock mass (445.120025) enabled. A maximum FTMS scan accumulation time of 150 ms and a maximum ion trap MSn scan accumulation time of 100 ms were used. Dynamic exclusion was enabled with a repeat duration of 45 s, an exclusion list of 500 and an exclusion duration of 30 s.
MS data were analysed using MaxQuant62 version 18.104.22.168. Data were searched against mouse (GRCm38.p3 (GCA_000001635.5)) or zebrafish GRCz10 (GCA_000002035.3) UniProt sequence databases (downloaded June 2015) using the following search parameters: trypsin with a maximum of two missed cleavages, 7 p.p.m. for MS mass tolerance, 0.5 Da for MS/MS mass tolerance, with acetyl (protein N-term) and oxidation (M) set as variable modifications and carbamidomethyl (C) as a fixed modification. A protein false discovery rate (FDR) of 0.01 and a peptide FDR of 0.01 were used for identification level cutoffs. Variance in protein abundance data was similar within species replicas and between species. In addition, for a protein to be included in the final set of SYN or PSD proteins it had to be identified with at least one unique peptide in each of the three SYN or PSD replicas. Label-free quantification (LFQ) was performed using MaxQuant LFQ intensities63, and statistical analysis was performed using Perseus64 as follows. The data set was filtered to remove proteins with fewer than two valid LFQ values in at least one group (PSD or SYN). LFQ intensities were log2-transformed and missing values were imputed using a downshifted normal distribution (width 0.3, downshift 1.8). Next, t-testing was performed with correction for multiple hypothesis testing using a permutation-based FDR of 0.05.
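The filtering, log2 transformation and imputation steps were carried out in Perseus. The following plain-Python sketch illustrates one plausible reading of them, under stated assumptions: a protein is kept when at least one group (PSD or SYN) has two or more valid values, zero or absent intensities are treated as missing, and missing values are drawn per sample from a normal distribution whose mean is shifted down by 1.8 standard deviations of the observed values and whose standard deviation is 0.3 times the observed one. All function names, sample labels and intensities are hypothetical, and the permutation-based FDR t-test is not reproduced here.

```python
import math
import random
import statistics


def is_valid(value):
    """An LFQ intensity counts as valid when it is present and positive."""
    return value is not None and value > 0


def filter_proteins(table, groups, min_valid=2):
    """Keep proteins with at least `min_valid` valid values in at least one group."""
    return {
        protein: row
        for protein, row in table.items()
        if any(sum(is_valid(row[s]) for s in samples) >= min_valid
               for samples in groups.values())
    }


def log2_transform(table):
    """Log2-transform valid intensities; mark everything else as missing (None)."""
    return {
        protein: {s: math.log2(v) if is_valid(v) else None for s, v in row.items()}
        for protein, row in table.items()
    }


def impute_downshifted(table, width=0.3, downshift=1.8, seed=0):
    """Fill missing log2 values per sample from N(mean - downshift*sd, (width*sd)^2).

    Assumes every sample column has at least two observed values.
    """
    rng = random.Random(seed)
    samples = sorted({s for row in table.values() for s in row})
    filled = {protein: dict(row) for protein, row in table.items()}
    for sample in samples:
        observed = [row[sample] for row in table.values() if row[sample] is not None]
        mean = statistics.mean(observed)
        sd = statistics.stdev(observed)
        for protein in filled:
            if filled[protein][sample] is None:
                filled[protein][sample] = rng.gauss(mean - downshift * sd, width * sd)
    return filled
```

Drawing imputed values from the low tail of each sample's distribution encodes the assumption that a missing LFQ value most likely reflects low abundance rather than a random dropout.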
Gene orthology relationships between zebrafish and mouse and percentage of protein sequence identity were taken from Ensembl database65 version 81, containing the last update of the zebrafish genome14. Statistical comparison of protein identities between different proteomic sets was performed using the Mann–Whitney U-test.
Functional classification of synaptic proteins
For the functional classification with high-level categories, Ensembl mouse identifiers for mouse proteins or orthologous mouse identifiers for zebrafish proteins were integrated with functional annotation from the Ingenuity knowledgebase66, using IPA (QIAGEN Redwood City, www.qiagen.com/ingenuity). Information on predicted cellular localization (cytoplasm, extracellular space, nucleus, plasma membrane and other) and IPA protein types (cytokine, enzyme, G-protein coupled receptor (GPCR), ion channel, kinase, peptidase, phosphatase, transcription regulator, translation regulator, transporter and other) was obtained. Counts, comparisons and plots of proteins within each species and category were conducted using R. To reproducibly call orthologous sequences between species for a large data set, the Ensembl BioMart database was queried using the Bioconductor package biomaRt67. All orthologues were obtained and counted in each species to determine orthology type: 1:1, 1:many, many:1, many:many or unique to a species (no orthologue known). For the analysis of all protein families, we used Ensembl identifiers of mouse, zebrafish and mouse orthologues of zebrafish proteins to retrieve Ensembl Protein Families from Ensembl database65 version 81, containing the last update of the zebrafish genome14. Families with an unknown function were not considered.
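The orthology calls themselves were made in R with biomaRt. As a language-neutral illustration of the counting step, the sketch below assigns an orthology type to each gene from a list of orthologue pairs. It is a simplification that looks only at each gene's immediate neighbours; the function name and the gene identifiers in the example are hypothetical placeholders, not Ensembl identifiers.

```python
from collections import Counter, defaultdict


def orthology_types(pairs, source_genes):
    """Classify each source gene as 1:1, 1:many, many:1, many:many or unique.

    `pairs` is an iterable of (source_gene, target_gene) orthologue pairs.
    """
    targets_of = defaultdict(set)
    sources_of = defaultdict(set)
    for source, target in pairs:
        targets_of[source].add(target)
        sources_of[target].add(source)

    result = {}
    for gene in source_genes:
        targets = targets_of.get(gene, set())
        if not targets:
            result[gene] = "unique"  # no orthologue known in the other species
            continue
        many_targets = len(targets) > 1
        many_sources = any(len(sources_of[t]) > 1 for t in targets)
        if many_targets and many_sources:
            result[gene] = "many:many"
        elif many_targets:
            result[gene] = "1:many"
        elif many_sources:
            result[gene] = "many:1"
        else:
            result[gene] = "1:1"
    return result


# Hypothetical mouse (m*) to zebrafish (z*) orthologue pairs.
example_pairs = [("m1", "z1"), ("m2", "z2a"), ("m2", "z2b"), ("m3", "z3"), ("m4", "z3")]
example_types = orthology_types(example_pairs, ["m1", "m2", "m3", "m4", "m5"])
example_counts = Counter(example_types.values())
```

Counting the resulting labels, as in `example_counts`, gives the per-species tally of orthology types; a TSGD-derived duplicate pair would appear as a 1:many relationship from the mouse side.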
Protein domain composition for all genes in the mouse and zebrafish genome data sets was obtained via BioMart; subsets corresponding to brain, SYN and PSD data sets were obtained from this single data set. The total counts for domains for each protein were determined, and the unique protein types were determined by removing duplicate domains within any single protein. The complexity of the proteome was calculated by comparing the cumulative frequency of unique domains per protein within a proteome. Distributions were compared using a two-tailed Kolmogorov–Smirnov test applied to cumulative frequency distributions. All statistical calculations were conducted in R.
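The domain analysis was performed in R. A minimal sketch of the two core steps, counting unique domains per protein and computing the two-sample Kolmogorov–Smirnov statistic D between the resulting distributions, might look as follows. The function names are hypothetical and the toy proteomes are invented; converting D into a P value is left to a statistics package.

```python
from bisect import bisect_right


def unique_domain_counts(domains_by_protein):
    """Number of distinct domain types per protein (duplicates within a protein removed)."""
    return [len(set(domains)) for domains in domains_by_protein.values()]


def ecdf(sorted_sample, x):
    """Empirical cumulative frequency of `sorted_sample` at x."""
    return bisect_right(sorted_sample, x) / len(sorted_sample)


def ks_statistic(sample_a, sample_b):
    """Two-sample Kolmogorov-Smirnov statistic: largest gap between the two ECDFs."""
    sorted_a, sorted_b = sorted(sample_a), sorted(sample_b)
    points = set(sorted_a) | set(sorted_b)
    return max(abs(ecdf(sorted_a, x) - ecdf(sorted_b, x)) for x in points)


# Invented toy proteomes: protein -> list of annotated domains.
toy_a = {"p1": ["PDZ", "PDZ", "SH3"], "p2": ["C2"], "p3": ["PDZ", "SH3", "GK"]}
toy_b = {"q1": ["C2"], "q2": ["C2", "C2"], "q3": ["SH3"]}
toy_d = ks_statistic(unique_domain_counts(toy_a), unique_domain_counts(toy_b))
```

Because both empirical distributions are step functions that change only at observed values, evaluating the gap at the pooled set of observed counts is sufficient to find the maximum.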
GO enrichment analysis
Zebrafish and mouse synaptic proteins were annotated for ‘Cellular Component’ and ‘Biological Process’ gene ontology68 terms using the Panther database and analysis tools69. Binomial statistics were used to compare GO term over-representation using the whole genome as the background set, and the Bonferroni correction was applied for multiple testing. To account for the lower level of GO annotations found in zebrafish, Zf-sPSD proteins were searched for enrichment against the zebrafish and mouse genomes. Terms found enriched in both species were not further considered. Equivalent enrichment analysis was also performed with categories from the KEGG pathway database70.
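The enrichment analysis was run with the Panther tools. To make the statistics concrete, the sketch below shows a one-sided binomial over-representation test followed by Bonferroni correction, under the assumption that the expected probability for a term is the fraction of background genes annotated with it. The function names, term labels and counts are hypothetical.

```python
import math


def binomial_upper_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p), summed in log space for numerical stability."""
    if not 0 < p < 1:
        raise ValueError("p must lie strictly between 0 and 1")
    total = 0.0
    for i in range(k, n + 1):
        log_pmf = (math.lgamma(n + 1) - math.lgamma(i + 1) - math.lgamma(n - i + 1)
                   + i * math.log(p) + (n - i) * math.log(1 - p))
        total += math.exp(log_pmf)
    return min(1.0, total)


def bonferroni_enrichment(query_hits, n_query, background_hits, n_background):
    """Bonferroni-corrected over-representation P value for each term.

    query_hits / background_hits map a term to the number of annotated genes
    in the query list and in the background genome, respectively.
    """
    n_tests = len(query_hits)
    corrected = {}
    for term, k in query_hits.items():
        expected_p = background_hits[term] / n_background
        raw = binomial_upper_tail(k, n_query, expected_p)
        corrected[term] = min(1.0, raw * n_tests)
    return corrected


# Hypothetical counts: 10 query genes tested against a background of 1,000 genes.
example = bonferroni_enrichment(
    {"term_A": 10, "term_B": 0}, 10, {"term_A": 500, "term_B": 500}, 1000
)
```

Multiplying each raw P value by the number of terms tested, and capping at 1, is the Bonferroni adjustment; it is conservative, which suits an analysis that tests many GO terms at once.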
Analysis of protein sequence identity
Percentage of protein sequence identity between mouse and zebrafish or mouse and human proteins was taken from Ensembl database65 version 81. Differences in protein sequence identity were assessed with the Mann–Whitney U-test.
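For readers unfamiliar with the test, the Mann–Whitney U statistic for two sets of sequence identities can be sketched as below. This is an illustration rather than the software used in the study: the function names are hypothetical, and the P value uses the large-sample normal approximation without tie or continuity correction, which is an assumption of this sketch.

```python
import math


def mann_whitney_u(x, y):
    """Return (U_x, U_y): counts of pairs in which x exceeds y, ties counted as 0.5."""
    u_x = 0.0
    for a in x:
        for b in y:
            if a > b:
                u_x += 1.0
            elif a == b:
                u_x += 0.5
    u_y = len(x) * len(y) - u_x
    return u_x, u_y


def two_sided_p_normal(u, n_x, n_y):
    """Two-sided P value from the normal approximation (no tie or continuity correction)."""
    mean = n_x * n_y / 2
    sd = math.sqrt(n_x * n_y * (n_x + n_y + 1) / 12)
    z = (u - mean) / sd
    return math.erfc(abs(z) / math.sqrt(2))
```

A call such as `mann_whitney_u(identities_a, identities_b)`, where both arguments are lists of percentage identities, returns the two U values; either can be passed to `two_sided_p_normal` because the approximation is symmetric about the mean.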
Four whole brains were removed and placed in RNAlater before RNA was isolated using the Qiagen RNeasy Plus Mini Kit; 150 bp paired-end Illumina sequencing was conducted at Barts and the London Genome Centre. Adapters were removed from the raw reads using Cutadapt. TopHat2 was used as a wrapper for the alignment programme Bowtie2 to map sequence reads to the reference genome (Danio_rerio.GRCz10.86, obtained via Ensembl); mapped reads were converted into counts using HTSeq and then to transcripts per million (TPM). For each gene, the average expression (mean TPM from four whole-brain biological replicates) was determined. Where multiple transcripts for a given gene are known, these were combined to give a single mean TPM per gene.
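The conversion from counts to TPM and the averaging across replicates can be illustrated with the short sketch below. It uses the standard TPM definition (counts divided by feature length in kilobases, then scaled so that each sample sums to one million); the feature lengths, function names and example values are assumptions of this sketch, since the text does not specify how lengths were derived.

```python
def counts_to_tpm(counts, lengths_bp):
    """Convert read counts to TPM for one sample.

    counts and lengths_bp map a gene identifier to its read count and its
    length in base pairs, respectively.
    """
    rates = {gene: counts[gene] / (lengths_bp[gene] / 1000) for gene in counts}
    total = sum(rates.values())
    return {gene: rate / total * 1e6 for gene, rate in rates.items()}


def mean_tpm(replicates):
    """Average TPM per gene across replicate samples (all must share the same genes)."""
    genes = replicates[0].keys()
    return {gene: sum(rep[gene] for rep in replicates) / len(replicates) for gene in genes}


# Hypothetical two-gene sample: equal read density gives equal TPM.
example_tpm = counts_to_tpm({"gene_a": 100, "gene_b": 300}, {"gene_a": 1000, "gene_b": 3000})
```

Normalizing by length before scaling is what distinguishes TPM from a simple counts-per-million value and makes expression comparable between genes of different lengths within a sample.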
We have constructed a freely available database and web resource that includes links to a wide variety of biomedical data sets: (http://www.genes2cognition.org/publications/zebrafish/).
Mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE71 partner repository with the data set identifier PXD005630.
How to cite this article: Bayés, À. et al. Evolution of complexity in the zebrafish synapse proteome. Nat. Commun. 8, 14613 doi: 10.1038/ncomms14613 (2017).
Bayés, A. & Grant, S. G. N. Neuroproteomics: understanding the molecular organization and complexity of the brain. Nat. Rev. Neurosci. 10, 635–646 (2009).
Nithianantharajah, J. et al. Synaptic scaffold evolution generated components of vertebrate cognitive complexity. Nat. Neurosci. 16, 16–24 (2013).
Ryan, T. J. et al. Evolution of GluN2A/B cytoplasmic domains diversified vertebrate synaptic plasticity and behavior. Nat. Neurosci. 16, 25–32 (2013).
Dieterich, D. C. & Kreutz, M. R. Proteomics of the synapse—a quantitative approach to neuronal plasticity. Mol. Cell Proteomics 15, 368–381 (2015).
Bayés, A. et al. Characterization of the proteome, diseases and evolution of the human postsynaptic density. Nat. Neurosci. 14, 19–21 (2011).
Bayés, A. et al. Human post-mortem synapse proteome integrity screening for proteomic studies of postsynaptic complexes. Mol. Brain 7, 88 (2014).
Brose, N., O'Connor, V. & Skehel, P. Synaptopathy: dysfunction of synaptic function? Biochem. Soc. Trans. 38, 443–444 (2010).
Grant, S. G. Synaptopathies: diseases of the synaptome. Curr. Opin. Neurobiol. 22, 522–529 (2012).
Grant, S. G., Marshall, M. C., Page, K. L., Cumiskey, M. A. & Armstrong, J. D. Synapse proteomics of multiprotein complexes: en route from genes to nervous system diseases. Hum. Mol. Genet. 14, R225–R234 (2005).
Kirov, G. et al. De novo CNV analysis implicates specific abnormalities of postsynaptic signalling complexes in the pathogenesis of schizophrenia. Mol. Psychiatry 17, 142–153 (2011).
Emes, R. D. & Grant, S. G. N. Evolution of synapse complexity and diversity. Annu. Rev. Neurosci. 35, 111–131 (2012).
Kasahara, M. The 2R hypothesis: an update. Curr. Opin. Immunol. 19, 547–552 (2007).
Hedges, S. B. The origin and evolution of model organisms. Nat. Rev. Genet. 3, 838–849 (2002).
Howe, K. et al. The zebrafish reference genome sequence and its relationship to the human genome. Nature 496, 498–503 (2013).
Xi, Y., Noble, S. & Ekker, M. Modeling neurodegeneration in zebrafish. Curr. Neurol. Neurosci. Rep. 11, 274–282 (2011).
Kalueff, A. V., Stewart, A. M. & Gerlai, R. Zebrafish as an emerging model for studying complex brain disorders. Trends Pharmacol. Sci. 35, 63–75 (2014).
Stewart, A. M. et al. Molecular psychiatry of zebrafish. Mol. Psychiatry 20, 2–17 (2015).
Laufer, M. & Vanegas, H. The optic tectum of a perciform teleost. II. Fine structure. J. Comp. Neurol. 154, 61–95 (1974).
Ito, H., Butler, A. B. & Ebbesson, S. O. An ultrastructural study of the normal synaptic organization of the optic tectum and the degenerating tectal afferents from retina, telencephalon, and contralateral tectum in a teleost, Holocentrus rufus. J. Comp. Neurol. 191, 639–659 (1980).
Oka, Y. Golgi, electron-microscopic and combined Golgi-electron-microscopic studies of the mitral cells in the goldfish olfactory bulb. Neuroscience 8, 723–742 (1983).
Ichikawa, M. Fine structure of the olfactory bulb in the goldfish, Carassius auratus. Brain Res. 115, 43–46 (1976).
Meek, J. & Nieuwenhuys, R. Palisade pattern of mormyrid Purkinje cells: a correlated light and electron microscopic study. J. Comp. Neurol. 306, 156–192 (1991).
Whitman, M. C. & Greer, C. A. Synaptic integration of adult-generated olfactory bulb granule cells: basal axodendritic centrifugal input precedes apical dendrodendritic local circuits. J. Neurosci. 27, 9951–9961 (2007).
Harris, K. M. & Stevens, J. K. Dendritic spines of rat cerebellar Purkinje cells: serial electron microscopy with reference to their biophysical characteristics. J. Neurosci. 8, 4455–4469 (1988).
Braasch, I. et al. The spotted gar genome illuminates vertebrate evolution and facilitates human-teleost comparisons. Nat. Genet. 48, 427–437 (2016).
Wang, H. et al. Characterization of the mouse brain proteome using global proteomic analysis complemented with cysteinyl-peptide enrichment. J. Proteome Res. 5, 361–369 (2006).
Kim, M.-S. et al. A draft map of the human proteome. Nature 509, 575–581 (2014).
Kelkar, D. S. et al. Annotation of the zebrafish genome through an integrated transcriptomic and proteomic analysis. Mol. Cell Proteomics 13, 3184–3198 (2014).
Nolte, H. et al. Global protein expression profiling of zebrafish organs based on in vivo incorporation of stable isotopes. J. Proteome Res. 13, 2162–2174 (2014).
Antonucci, F. et al. SNAP-25, a known presynaptic protein with emerging postsynaptic functions. Front. Synaptic Neurosci. 8, 7 (2016).
Hussain, S. & Davanger, S. Postsynaptic VAMP/synaptobrevin facilitates differential vesicle trafficking of GluA1 and GluA2 AMPA receptor subunits. PLoS ONE 10, e0140868 (2015).
Zhou, J. et al. Proteomic analysis of postsynaptic density in Alzheimer's disease. Clin. Chim. Acta 420, 62–68 (2013).
Focking, M. et al. Common proteomic changes in the hippocampus in schizophrenia and bipolar disorder and particular evidence for involvement of cornu ammonis regions 2 and 3. Arch. Gen. Psychiatry 68, 477–488 (2011).
Cheng, D. et al. Relative and absolute quantification of postsynaptic density proteome isolated from rat forebrain and cerebellum. Mol. Cell Proteomics 5, 1158–1170 (2006).
Li, K. et al. Organelle proteomics of rat synaptic proteins: correlation-profiling by isotope-coded affinity tagging in conjunction with liquid chromatography-tandem mass spectrometry to reveal post-synaptic density specific proteins. J. Proteome Res. 4, 725–733 (2005).
Peng, J. et al. Semiquantitative proteomic analysis of rat forebrain postsynaptic density fractions by mass spectrometry. J. Biol. Chem. 279, 21003–21011 (2004).
Han, X. et al. iTRAQ-based quantitative analysis of hippocampal postsynaptic density-associated proteins in a rat chronic mild stress model of depression. Neuroscience 298, 220–292 (2015).
Distler, U. et al. In-depth protein profiling of the postsynaptic density from mouse hippocampus using data-independent acquisition proteomics. Proteomics 14, 2607–2613 (2014).
Bayés, A. et al. Comparative study of human and mouse postsynaptic proteomes finds high compositional conservation and abundance differences for key synaptic proteins. PLoS ONE 7, e46683 (2012).
Trinidad, J. C., Thalhammer, A., Burlingame, A. L. & Schoepfer, R. Activity-dependent protein dynamics define interconnected cores of co-regulated postsynaptic proteins. Mol. Cell Proteomics 12, 29–41 (2012).
Wang, H. Y. et al. Rate of evolution in brain-expressed genes in humans and other primates. PLoS Biol. 5, e13 (2007).
Emes, R. D. et al. Evolutionary expansion and anatomical specialization of synapse proteome complexity. Nat. Neurosci. 11, 799–806 (2008).
Watterson, G. A. On the time for gene silencing at duplicate Loci. Genetics 105, 745–766 (1983).
Huminiecki, L. & Heldin, C. H. 2R and remodeling of vertebrate signal transduction engine. BMC Biol. 8, 146 (2010).
Force, A. et al. Preservation of duplicate genes by complementary, degenerative mutations. Genetics 151, 1531–1545 (1999).
Lund, R. D. Synaptic patterns of the superficial layers of the superior colliculus of the rat. J. Comp. Neurol. 135, 179–208 (1969).
Rizo, J. & Südhof, T. C. The membrane fusion enigma: SNAREs, Sec1/Munc18 proteins, and their accomplices—guilty as charged? Annu. Rev. Cell Dev. Biol. 28, 279–308 (2012).
Wang, Y. & Tang, B. L. SNAREs in neurons – beyond synaptic vesicle exocytosis (Review). Mol. Memb. Biol. 23, 377–384 (2009).
Suh, Y. H. et al. A neuronal role for SNAP-23 in postsynaptic glutamate receptor trafficking. Nat. Neurosci. 13, 338–343 (2010).
Jurado, S. et al. LTP requires a unique postsynaptic SNARE fusion machinery. Neuron 77, 542–558 (2013).
Balderhaar, H. J. K. & Ungermann, C. CORVET and HOPS tethering complexes - coordinators of endosome and lysosome fusion. J. Cell Sci. 126, 1307–1316 (2013).
Hurley, J. H. The ESCRT complexes. Crit. Rev. Biochem. Mol. Biol. 45, 463–487 (2010).
Chassefeyre, R. et al. Regulation of postsynaptic function by the dementia-related ESCRT-III subunit CHMP2B. J. Neurosci. 35, 3155–3173 (2015).
Fonseka, T. M., Wen, X.-Y., Foster, J. A. & Kennedy, S. H. Zebrafish models of major depressive disorders. J. Neurosci. Res. 94, 3–14 (2016).
Morris, J. A. Zebrafish: a model system to examine the neurodevelopmental basis of schizophrenia. Prog. Brain Res. 179, 97–106 (2009).
Cunliffe, V. T. Building a zebrafish toolkit for investigating the pathobiology of epilepsy and identifying new treatments for epileptic seizures. J. Neurosci. Methods 260, 91–95 (2016).
Nishimura, Y. et al. Zebrafish as a systems toxicology model for developmental neurotoxicity testing. Congenit. Anom. 55, 1–16 (2015).
Fernandez, E. et al. Targeted tandem affinity purification of PSD-95 recovers core postsynaptic complexes and schizophrenia susceptibility proteins. Mol. Syst. Biol. 5, 269 (2009).
Fromer, M. et al. De novo mutations in schizophrenia implicate synaptic networks. Nature 506, 179–184 (2014).
Darnell, J. C. et al. FMRP stalls ribosomal translocation on mRNAs linked to synaptic function and autism. Cell 146, 247–261 (2011).
Schindelin, J. et al. Fiji: an open-source platform for biological-image analysis. Nat. Methods 9, 676–682 (2012).
Cox, J. & Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26, 1367–1372 (2008).
Cox, J. et al. Accurate proteome-wide label-free quantification by delayed normalization and maximal peptide ratio extraction, termed MaxLFQ. Mol. Cell Proteomics 13, 2513–2526 (2014).
Tyanova, S. et al. The Perseus computational platform for comprehensive analysis of (prote)omics data. Nat. Methods 13, 731–740 (2016).
Cunningham, F. et al. Ensembl 2015. Nucleic Acids Res. 43, D662–D669 (2015).
Calvano, S. E. et al. A network-based analysis of systemic inflammation in humans. Nature 437, 1032–1037 (2005).
Durinck, S., Spellman, P. T., Birney, E. & Huber, W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat. Protoc. 4, 1184–1191 (2009).
Gene Ontology Consortium. Gene Ontology Consortium: going forward. Nucleic Acids Res. 43, D1049–D1056 (2015).
Mi, H., Poudel, S., Muruganujan, A., Casagrande, J. T. & Thomas, P. D. PANTHER version 10: expanded protein families and functions, and analysis tools. Nucleic Acids Res. 44, D336–D342 (2016).
Kanehisa, M., Sato, Y., Kawashima, M., Furumichi, M. & Tanabe, M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 44, D457–D462 (2016).
Vizcaíno, J. A. et al. 2016 update of the PRIDE database and its related tools. Nucleic Acids Res. 44, D447–D456 (2016).
We thank D.L. Stemple, K. Howe and K. Elsegood for technical support; L. Chakrabarti and K. Sharma YVR for isolation of zebrafish RNA; M.D. Croning and J. Menedez Montes for web development; R. Lujan (Universidad Castilla-La Mancha, Albacete, Spain) for critical review of electron microscopy data; I. Gich (IIB Sant Pau) for support in biostatistics; and D. Maizels for artwork. This study was supported by AB, Spanish grants ref. BFU2012-34398 and BFU2015-69717-P, Career Integration Grant, ref. 304111, Marie Curie Intra-European Fellowship, ref. 221540, Ramón y Cajal Fellowship, ref. RYC-2011-08391p; M.O.C. was supported by the Royal Society (R/144823-11-1); J.S.C. and S.G.N.G. were supported by the Wellcome Trust; R.D.E. was supported by the University of Nottingham Advanced Data Analysis Centre. A.I. was supported by an international PhD studentship from Consejo Nacional de Ciencia y Tecnologia (CONACYT) Mexico.
The authors declare no competing financial interests.
Supplementary Figures, Supplementary Table, Supplementary Notes and Supplementary References. (PDF 13832 kb)
Supplementary Data 1
Synaptosomal and postsynaptic density proteins identified in mouse. Sheet#1. List of all proteins identified in mouse synaptosomes (SYN) and postsynaptic densities (PSD). For each protein, data from each identified peptide are given. Sheet#2. Mass spectrometry quantitative data are provided for mouse proteins. Two label-free quantitative variables are provided: iBAQ (intensity-based absolute quantification) and LFQ (label-free quantification) intensity. Sheet#3. Final filtered list of mouse proteins in synaptosomes (SYN) and PSDs. For each protein the number and properties of its identified peptides are given. SYN components are defined as the sum of proteins in the following groups: SYN ONLY (marked with 'YES' in Column E) + EQUAL ABUNDACE (marked with 'YES' in Column G) + PSD Depleted (marked with 'YES' in Column H) + PSD Enriched (marked with 'YES' in Column I). PSD components are defined as the sum of proteins in the following groups: PSD ONLY (marked with 'YES' in Column F) + PSD Enriched (marked with 'YES' in Column I) + EQUAL ABUNDACE (marked with 'YES' in Column G). (XLSX 8660 kb)
Supplementary Data 2
Synaptosomal and postsynaptic density proteins identified in zebrafish. Sheet#1. List of all proteins identified in zebrafish synaptosomes (SYN) and postsynaptic densities (PSD). For each protein the number and properties of its identified peptides are given. Sheet#2. Mass spectrometry quantitative data are provided for zebrafish proteins. Two label-free quantitative variables are provided: iBAQ (intensity-based absolute quantification) and LFQ (label-free quantification) intensity. Sheet#3. Final filtered list of zebrafish proteins in synaptosomes (SYN) and PSDs. For each protein the number and properties of its identified peptides are given. SYN components are defined as the sum of proteins in the following groups: SYN ONLY (marked with 'YES' in Column E) + EQUAL ABUNDACE (marked with 'YES' in Column G) + PSD Depleted (marked with 'YES' in Column H) + PSD Enriched (marked with 'YES' in Column I). PSD components are defined as the sum of proteins in the following groups: PSD ONLY (marked with 'YES' in Column F) + PSD Enriched (marked with 'YES' in Column I) + EQUAL ABUNDACE (marked with 'YES' in Column G). (XLSX 9139 kb)
Supplementary Data 3
Functional analysis of mouse and zebrafish proteins. Sheet#1. Cell Location and Protein Type of mouse SYN and PSD proteins as described by the Ingenuity knowledgebase. Proteins found in synaptosomes (SYN) and/or PSDs are indicated. Sheet#2. Cell Location and Protein Type of zebrafish SYN and PSD proteins as described by the Ingenuity knowledgebase. Proteins found in synaptosomes (SYN) and/or PSDs are indicated. Sheet#3. Enriched GO-Slim terms for Molecular Function and Biological Process and Panther Protein Class categories from the mouse synaptic proteomes. Sheet#4. Enriched GO-Slim terms for Molecular Function and Biological Process and Panther Protein Class categories from the zebrafish synaptic proteomes. (XLSX 660 kb)
Supplementary Data 4
Mouse and zebrafish Ensembl protein families. Sheet#1. Ensembl gene ID, Ensembl Protein Family ID and Ensembl Family Description are given for all SYN and PSD mouse proteins. The count of proteins for each family is also given. Sheet#2. Ensembl ID, Ensembl Protein Family ID and Ensembl Family Description are given for all SYN and PSD zebrafish proteins as well as mouse orthologues to zebrafish proteins. The count of proteins for each family is also given. (XLSX 604 kb)
Supplementary Data 5
Mouse and zebrafish domain analysis. List of protein domains found among mouse and zebrafish SYN and PSD components. Also provided are protein domains found in proteins from mouse and zebrafish brain proteomes (M.Brain and Z.Brain) and all protein coding genes (M.Genome or Z.Genome). (XLSX 391 kb)
Supplementary Data 6
Supporting proteomic sets. Ensembl gene identifiers are given for proteins in the different supporting proteomics sets used in this study. (XLSX 884 kb)
Supplementary Data 7
Species-specific PSD proteins GO analysis. Sheet#1. Biological Function and Cellular component GO terms enriched among mouse-specific PSD proteins. Sheet#2. Biological Function and Cellular component GO terms enriched among zebrafish-specific PSD proteins. Sheet#3. Biological Function and Cellular component GO terms enriched among human, mouse and rat PSD proteins absent from the zebrafish synapse. Sheet#4. KEGG pathways enriched among human, mouse and rat PSD proteins absent from the zebrafish synapse. (XLSX 45 kb)
Supplementary Data 8
Proteins from SNARE, ESCRT and HOPS complexes in mammalian PSD. The presence of components of the following protein complexes: SNARE, ESCRT and HOPS, in the postsynaptic density (PSD) of zebrafish, human, mouse and rat, is indicated. For each species a 'YES' denotes that the protein was found and a blank cell denotes that the protein was not identified. The articles describing the human and rat PSD proteomes are indicated. An independently generated PSD proteome from mouse is also included. (XLSX 14 kb)
Supplementary Data 9
Mouse and zebrafish SYN/PSD gene orthologs. Sheet#1. List of SYN and PSD mouse proteins with an ortholog in the zebrafish genome. Orthology type is provided. Sheet#2. List of SYN and PSD zebrafish proteins with an ortholog in the mouse genome. Orthology type is provided. Sheet#3. Mouse SYN and PSD proteins with one or more orthologs in the corresponding zebrafish dataset. Sheet#4. Zebrafish SYN/PSD proteins with one or more orthologs identified in the corresponding mouse SYN/PSD datasets. (XLSX 388 kb)
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/