Protein–protein interaction network module changes associated with the vertebrate fin-to-limb transition

Fernando, Pasan C.; Mabee, Paula M.; Zeng, Erliang

doi:10.1038/s41598-023-50050-2

Download PDF

Article
Open access
Published: 18 December 2023

Protein–protein interaction network module changes associated with the vertebrate fin-to-limb transition

Scientific Reports volume 13, Article number: 22594 (2023) Cite this article

873 Accesses
1 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Evolutionary phenotypic transitions, such as the fin-to-limb transition in vertebrates, result from modifications in related proteins and their interactions, often in response to changing environment. Identifying these alterations in protein networks is crucial for a more comprehensive understanding of these transitions. However, previous research has not attempted to compare protein–protein interaction (PPI) networks associated with evolutionary transitions, and most experimental studies concentrate on a limited set of proteins. Therefore, the goal of this work was to develop a network-based platform for investigating the fin-to-limb transition using PPI networks. Quality-enhanced protein networks, constructed by integrating PPI networks with anatomy ontology data, were leveraged to compare protein modules for paired fins (pectoral fin and pelvic fin) of fishes (zebrafish) to those of the paired limbs (forelimb and hindlimb) of mammals (mouse). This also included prediction of novel protein candidates and their validation by enrichment and homology analyses. Hub proteins such as shh and bmp4, which are crucial for module stability, were identified, and their changing roles throughout the transition were examined. Proteins with preserved roles during the fin-to-limb transition were more likely to be hub proteins. This study also addressed hypotheses regarding the role of non-preserved proteins associated with the transition.

Co-expression network analysis identifies gonad- and embryo-associated protein modules in the sentinel species Gammarus fossarum

Article Open access 27 May 2019

Phylobone: a comprehensive database of bone extracellular matrix proteins in human and model organisms

Article Open access 15 August 2023

Proteomic analysis of eleven tissues in the Chinese giant salamander (Andrias davidianus)

Article Open access 11 November 2019

Introduction

Phenotypes, including fin development and limb development, emerge from the intricate interplay of numerous genes and proteins within complex biological pathways^1,2,3. Evolutionary shifts in phenotypes, spurred by environmental or other changes, entail modifications in protein interactions and their pathway associations. Most often, it is the intricate network of protein interactions, rather than the contribution of a single protein, that determines the resulting phenotype^1,4. Thus, the assembly of proteins and their interactions, i.e., modular protein structure¹, is vital for understanding the evolutionary mechanisms that drive phenotypic changes in the field of evolutionary biology. The analysis of protein modules has become common in bioinformatics, and the concept of modular evolution has emerged to explain the changes occurring in groups, rather than individual proteins, during the evolution of organisms^5,6,7. However, most studies have concentrated on smaller protein complexes, typically containing less than 20 proteins, which determine molecular functions^8,9,10. In contrast, phenotypes such as fins and limbs, are shaped by a multitude of proteins with diverse molecular functions belonging to various biological pathways. The majority of previous protein network studies centered on phenotypes have targeted human diseases^1,11. To our knowledge, there have been no studies of modules aimed at understanding evolutionary transitions in phenotypes. Given the profound anatomical changes associated with vertebrate evolution, such as the transformation from fins to limbs, understanding the bases for both the conservation and the changes are critical. This study uses the fin to limb transition in vertebrate evolution as a case study for the value of employing PPI networks in better understanding the molecular basis of evolutionary changes in phenotypes.

The fin-to-limb transition is an iconic anatomical change associated with the evolution of terrestrial vertebrates from aquatic fish-like ancestors^12,13. According to fossil record, the transformation of fishes into land vertebrates began in the Devonian, 365–408 million years ago¹³. This well-studied transformation is associated with numerous phenotypic changes beyond the shift from fin to limb¹⁴, such as modifications in the cranial and axial skeleton¹⁵. Homologies between the anatomical structures of land and aquatic vertebrates are evident from numerous shared characteristics. For instance, the pectoral fin endoskeleton of panderichthyid fish fossils shows substantial similarities to the limb skeletons of terrestrial tetrapods, such as the presence of a proximal humerus and two distal bones¹². This and other evidence support the concept that forelimbs and hindlimbs of tetrapods are homologous to the pectoral and pelvic fins of fishes, respectively.

Identifying the genetic changes associated with the fin-to-limb transition is a frequent subject of comparative study in evolutionary and developmental biology, with a long and continuing legacy of experimental endeavors^{16,17,18,19,20,21}. While many wet lab experiments have demonstrated the evolutionary importance of key genes such as shh^12,16, computational studies focusing on the fin-to-limb transition have been relatively few¹⁷. The emergence of large protein–protein interaction (PPI) networks offers a productive way forward to understand the specific changes in biological networks associated with phenotypes altered during such evolutionary transitions. As major anatomical changes involve many proteins and pathways, the application of PPI networks enables a systems biology perspective.

This research sought to isolate functional modules^6,22, i.e., a set of nodes that are highly connected internally and sparsely connected with external nodes¹ that correspond to fins and limbs. A variety of functional module detection algorithms exist⁵. For modules associated with complex phenotypes, which typically involve a large number of proteins, it can be advantageous to perform module detection using prior knowledge as a computational constraint^1,4,23. These methods begin with a set of proteins known to be associated with a given phenotype and expand the module based on the network structure. One of the simplest ways for isolating a functional module by expansion involves incorporating all immediate neighbors of the proteins associated with the known phenotype into the module¹. However, this method has been found to yield a high number of false positives¹. As a result, more accurate network-based candidate protein prediction algorithms, such as the Hishigaki method²⁴, are often used to predict new candidate proteins for module inclusion^1,4,24. The Hishigaki method mitigates the bias towards extensively studied functions and was used herein.

This study further sought to identify hub proteins in the functional modules, and to compare them between fins and limbs. Hub proteins typically have more interactions than others in the module^3,25,26 and function to maintain module stability; their removal is likely to disrupt module organization, and subsequently the biological function(s) or phenotype(s) that is regulated. Characterizing how hub proteins change over the course of evolution while maintaining developmental stability is a key question in evolutionary developmental biology. To date, a primary focus has been on identifying the proteins that are conserved throughout evolution and their organization within respective modules^8,10. It has been hypothesized that evolution navigates gradual modular changes, preserving fundamental structure, because dramatic alterations in protein interactions jeopardize organismal function⁷. Supporting this, conserved proteins are observed to play an important role in maintaining the stability of protein modules during evolution^7,8,10. The recruitment and the removal of other proteins and the rewiring of biological pathways are often held together by the conserved proteins. Module analysis allows identification of these crucial conserved proteins, which often also serve as hub proteins^7,8. While these may play a role in maintaining protein module structure, species-specific module proteins recruited or removed during evolution can also have essential roles contributing to evolutionary transitions²⁷. This study aimed to distinguish conserved and species-specific proteins in functional modules as a potential mechanism to better understand their roles in evolution.

Methods

The goal of this work was to compare PPI network modules responsible for paired fins with those of paired limbs to study the modular changes associated with the transition. Zebrafish and mouse were selected as model organisms to extract modules from their PPI network for paired fins and paired limbs, respectively. Ensuring the quality of PPI network data remains a significant challenge in research endeavors involving PPI networks^4,28. The PPI networks constructed using experimental methods, such as the high-throughput yeast two-hybrid assay, often contain a multitude of spurious interactions^4,28,29. This necessitated refining the raw PPI networks to bolster the precision of our current network analyses. In addressing this need, we used enhanced integrated networks developed in our previous research²⁸. These were formulated by integrating raw PPI data from the STRING database³⁰ with established experimental insights on protein-anatomy relationships, sourced from the Monarch Initiative repository³¹, which archives proteins annotated using Uberon anatomy ontology terms³². This integration improves the quality of raw PPI networks by filtering out spurious interactions, leveraging the highly accurate protein-anatomy relationships documented through trustworthy experimental knowledge²⁸. Our earlier research validated the robustness of these integrated networks, highlighting their superiority over raw PPI networks in predicting new anatomical protein candidates, such as those for fins and limbs²⁸. Consequently, we selected the most effective integrated networks for zebrafish and mouse, identified in our previous research²⁸, to carry out the module analyses for this study.

In this study, we constructed a network workflow for comparing paired fin modules to paired limb modules. The initial phase of this workflow entails the fusion of PPI networks from the STRING database with anatomical annotations sourced from the Monarch Initiative repository to generate integrated networks. This integration serves to enhance network quality, and the most effective networks were then chosen based on our prior findings²⁸. Then, the PPI modules pertaining to paired fins and limbs were extracted from integrated zebrafish and mouse networks, respectively, using the Hishigaki network prediction algorithm²⁴. This extraction process predicts new protein candidates for each fin and limb module based on originally annotated proteins. Subsequently, the accuracy of these newly projected protein candidates was verified through enrichment and homology analyses. The concluding phase involved comparing the zebrafish paired fin modules with the mouse paired limb modules, facilitating the examination of modular changes. This comparison provided insights into conserved versus module-specific proteins and their shifting significance during the evolutionary transition. These workflow steps are visually represented in Fig. 1 and the Python scripts used are available at: https://doi.org/10.5281/zenodo.4445583. It is noteworthy to mention that while the current workflow targets the paired fin-limb comparison, the provided scripts are adaptable for any anatomical entity defined in the Uberon ontology and for any network sourced from the STRING database.

Generation and selection of the integrated networks for module detection

During the integrated network generation step (Fig. 1), which was conducted during our previous work²⁸, anatomy-based protein networks were constructed by calculating the semantic similarity between anatomy ontology terms annotated to different proteins using four semantic similarity methods (Lin³³, Resnik³⁴, Schlicker³⁵, and Wang³⁶). Subsequently, these semantic networks were integrated with the PPI networks for zebrafish and mouse. During this process, the combined STRING PPI network score for each interaction was integrated with the equivalent semantic similarity scores of protein pairs from the anatomy-based semantic networks. This network integration substantially increased the candidate protein prediction accuracy for anatomical entities, thus ensuring these integrated networks are superior for module detection than raw PPI networks retrieved from the STRING^{22,23,28,37,38}. Furthermore, of the four integrated networks constructed by the four semantic similarity methods, the best-performing semantic networks for zebrafish and mouse were evaluated in our previous work. This assessment involved using the Hishigaki network-based candidate protein prediction method^24,28 coupled with leave-one-out cross-validation process²⁸. The two best-performing networks for zebrafish (Lin) and mouse (Schlicker) were used in this study to ensure the accuracy of module detections. Hereafter, these will be referred as “zebrafish integrated network” and “mouse integrated network”. Both Lin (Eq. 1) and Schlicker (Eq. 2) methods consider the Information Content (IC) of each term of the anatomy ontology, factoring in the number of protein annotations for each term.

$$sim_{L } \left( {t_{1} ,t_{2} } \right) = \mathop {\max }\limits_{{t \in S\left( {t_{1} ,t_{2} } \right)}} \left\{ {\frac{2IC\left( t \right)}{{IC\left( {t_{1} } \right) + IC\left( {t_{2} } \right)}}} \right\}$$

(1)

$$sim_{S } \left( {t_{1} ,t_{2} } \right) = \mathop {\max }\limits_{{t \in S\left( {t_{1} ,t_{2} } \right)}} \left\{ { \frac{2IC\left( t \right)}{{IC\left( {t_{1} } \right) + IC\left( {t_{2} } \right)}} \left( {1 + IC\left( t \right)} \right)} \right\}$$

(2)

In the above equations, t₁ and t₂ represent the ontology terms for which the similarity is being calculated, whereas S denotes the set of common ancestors for these two terms. The IC for a specific term t is represented by IC(t), and is calculated based on the number of proteins annotated to term t as illustrated below (Eqs. 3 and 4).

$$IC\left( t \right) = - log\left( {p\left( t \right)} \right)$$

(3)

$$P\left( t \right) = \frac{{Number\, of\, proteins \,associated \,with \,the \,term\, {\text{t}} + constant}}{Total\, number \,of \,proteins \,associated \,with \,the \,entire \,ontology}$$

(4)

Detection of network modules

For module detection, proteins with direct annotations to the pectoral fin (UBERON:0000151), forelimb (UBERON:0002102), pelvic fin (UBERON:0000152), and hindlimb (UBERON:0002103) were used as prior information, and their anatomical profiles were extracted from the Monarch Initiative repository (https://monarchinitiative.org/; 06/20/2018)³¹. In addition, proteins that were annotated to the parts (e.g., pectoral fin radial skeleton is a part of the pectoral fin) and developmental precursors (e.g., pectoral fin bud and forelimb bud) of the above entities were extracted using the Uberon anatomy ontology³² relationships. The proteins directly annotated to the anatomical entity of interest or annotated to a part or a developmental precursor of the entity are collectively referred to as “proteins with original annotations”.

The process of module identification begins with proteins bearing original annotations. The Hishigaki network-based candidate protein prediction method^24,28 is then used to identify potential additional proteins relevant to the anatomical entities of interest. The Hishigaki method employs a chi-square-based scoring algorithm to estimate the probability of a given protein having a particular function. This approach mitigates the bias towards extensively studied functions, where higher scores are often assigned due to a greater number of original annotations. Consequently, the Hishigaki method serves as an ideal candidate for extracting protein modules for anatomical entities, such as fin and limb, which may have a low number of original annotations. Initially, the performance of network-based candidate protein prediction for each targeted anatomical entity is evaluated through leave-one-out cross-validation²⁸, which facilitates the generation of ROC and precision-recall curves. Following this, a prediction precision threshold was used to discern new candidate proteins. A trial-and-error method was used to select the best precision threshold for each protein module, with the resultant module sizes taken into consideration. Modules for different precision thresholds were generated for pectoral fin-forelimb and pelvic fin-hindlimb entity pairs. The thresholds leading to the most comparable module sizes for aforementioned paired entities were selected. For example, a lower precision threshold was required for the pelvic fin due to the comparatively low number of original annotations.

Once the candidate proteins were predicted, respective modules for the pectoral fin and the pelvic fin were extracted from the zebrafish integrated network, and for the forelimb and hindlimb from the mouse network and visualized using Cytoscape software³⁹.

Validation of the predicted proteins

The validation process comprised three steps. Initially, the predicted proteins for the pectoral fin and pelvic fin modules in zebrafish were compared with the orthologous proteins in the forelimb and hindlimb modules in mouse and vice versa. This was to determine whether these proteins were annotated to a homologous anatomical entity. For example, if proteins predicted for the pectoral fin module appear in the forelimb module, this implies a certain degree of validation for those proteins based on homology.

The second step involved performing enrichment analyses to verify if the predicted proteins in each module share similar Biological Process terms from Gene Ontology (GO-BP) and Uberon annotations as the proteins with original annotations. The online functional enrichment analysis tool DAVID (https://david.ncifcrf.gov/) was used for gene/protein set enrichment analysis using GO-BP terms. DAVID uses Fisher’s exact test⁴⁰ for enrichment analyses. Despite the common use of GO for enrichment analysis, anatomy ontologies are rarely used. To perform enrichment analysis using the Uberon anatomy ontology and Fisher’s exact test, a dedicated Python program (Uberon enrichment analysis program) was developed and used. Ontology terms with p-values less than 0.05 were considered as enriched terms.

In the third step, the weighted degree distributions of the predicted proteins were compared to the weighted degree distributions of the proteins with original annotations in each module. If the predicted proteins exhibit a higher weighted degree distribution, it suggests they hold similar or a greater importance as the proteins with original annotations. The degree of a protein is usually used to rank proteins and identify hub proteins. However, in weighted network analysis like the integrated networks used here, weighted degree is preferred over simple degree as it takes into account different interaction weights (Eq. 5) instead of merely counting the number of interactions for a particular node²⁶.

$$Weighted\, degree = \mathop \sum \limits_{v \in n\left( u \right)} sim(v,u)$$

(5)

In Eq. 5, n(u) is the neighborhood of the protein of interest (u) and v iterates through all the neighbors of protein u. The interaction weight is denoted by sim(v,u), which represents the protein similarity score for the interaction between proteins v and u. The weighted degree of protein u is derived from the sum of all interaction weights between protein u and all its neighbors.

Comparison of the network modules

To identify the modular changes, the pectoral fin and pelvic fin modules of the zebrafish were compared with the forelimb and hindlimb modules of the mouse, respectively.

Because a whole genome duplication event occurred at the origin of actinopterygian fishes⁴¹, most mouse proteins have duplicated counterparts in zebrafish. To facilitate the module comparison, gene/protein ortholog mappings between mouse and zebrafish were retrieved from the Zebrafish Information Network⁴² (ZFIN; 06/26/2018; https://zfin.org/downloads). If a single mouse protein corresponded to multiple zebrafish orthologs within a zebrafish module, all zebrafish orthologs were retained. Through the module comparison, three categories of proteins were identified: conserved proteins (those common to two modules), zebrafish module-specific proteins, and mouse module-specific proteins. The weighted degree of each protein was calculated (Eq. 5) to identify the important hub proteins of each module, with the proteins then being ranked accordingly. The weighted degree of each zebrafish module protein was compared with the corresponding mouse ortholog to identify changes in importance during the transition. However, due to the differing sizes of the zebrafish and mouse modules, it was necessary to normalize each protein’s weighted degree by the total number of proteins in each module. Consequently, the normalized weighted degree distributions for conserved proteins, zebrafish module-specific proteins, and mouse module-specific proteins were compared for pectoral fin versus forelimb and pelvic fin versus hindlimb, thereby examining the relative importance of proteins within each group. Furthermore, Wilcoxon rank-sum tests were conducted on normalized weighted degree distributions between conserved and module-specific proteins for each fin and limb. This provided a statistical validation when comparing the importance of proteins in each group.

The fate of the zebrafish module-specific proteins in mouse was investigated by extracting mouse orthologs for the pectoral and pelvic fin module-specific proteins and performing enrichment analyses using Uberon and GO-BP terms. Similarly, the roles of the mouse module-specific proteins in zebrafish were investigated using zebrafish orthologs for the forelimb and hindlimb module-specific proteins. The DAVID online functional enrichment analysis tool was used to conduct gene/protein set enrichment analysis using GO-BP terms. Ontology terms with p-values less than 0.05 were considered as enriched terms.

Results

Detection of network modules

The zebrafish integrated network used to detect paired fin modules contained 17,394 proteins and 730,855 interactions, while the mouse integrated network used to identify paired limb modules encompassed 18,002 proteins and 613,671 interactions²⁸. A breakdown of the number of proteins originally annotated to each anatomical entity is provided in Supplementary Table S1. The total number of proteins for the pectoral fin (198) and the forelimb (267) showed closer similarity compared to the total number of proteins for the pelvic fin (15) and the hindlimb (777).

The ROC and precision-recall curves created during the evaluation of network-based candidate protein predictions for each anatomical entity were provided in Supplementary Figs. S1 and S2, respectively. The curves indicate high accuracy for network-based candidate protein predictions for all anatomical entities (the AUC values of ROC curves were higher than 0.85), excluding the pelvic fin. This validates the high accuracy of the network-based candidate protein predictions, primarily owing to the integration of high-quality enhanced protein networks from our previous research²⁸. The pelvic fin’s relatively lower performance might be attributed to a reduced number of original protein annotations. It is well established that prediction accuracy tends to improve when the size of the dataset or the number of protein annotations increases⁴³. Anatomical entities with fewer protein annotations might yield lower AUC values.

The statistics for the extracted protein modules are given in Supplementary Table S1. Proteins with original annotations that were lost during the module extraction are listed in Supplementary Table S2. A high precision threshold of 0.7 was used for candidate protein predictions for pectoral fin, forelimb, and hindlimb modules. For the pelvic fin module, the precision threshold was reduced to 0.05 in order to achieve a comparable number of proteins to those in the hindlimb module.

Visual representations of the final modules for the pectoral fin, pelvic fin, forelimb, and hindlimb (paired fin and limb modules) are depicted in Supplementary Figs. S3, S4, S5, and S6, respectively. Supplementary Files S1, S2, S3, and S4 contain the accompanying Cytoscape network files for these modules. Proteins from the paired fin and limb modules, ranked based on the weighted degree, are listed in Supplementary Files S5, S6, S7, and S8, respectively.

Validation of the predicted proteins

The lists of predicted proteins for paired fin and limb modules are provided in Supplementary Tables S3, S4, S5, and S6, respectively. From the 45 proteins predicted for the pectoral fin, 14 had mouse orthologs that were associated with the forelimb (9 direct annotations, 2 annotations specific to the parts or the developmental precursors, and 3 predicted). Out of the 605 proteins predicted for the pelvic fin, 78 had mouse orthologs related to the hindlimb (46 direct annotations, 20 annotations specific to the parts or the developmental precursors, and 12 predicted). From the 18 proteins predicted for the forelimb, 6 had zebrafish orthologs associated with the pectoral fin (2 direct annotations, 1 annotation solely to the parts or the developmental precursors, and 3 predicted). Finally, of 32 proteins predicted for the hindlimb, 12 had zebrafish orthologs connected to the pelvic fin (all 12 being predicted). These findings suggest that the orthologs of the predicted proteins are annotated to corresponding homologous anatomical entities, which adds a layer of validation to these predictions.

The shared enriched GO-BP terms among predicted proteins and proteins originally annotated for the paired fin and limb modules are enumerated in Supplementary Tables S7, S8, S9, and S10. Similarly, Supplementary Tables S11, S12, S13, and S14 outline the enriched Uberon terms common to the predicted proteins and proteins originally annotated for the paired fin and limb modules. Several fin/limb related GO-BP terms were commonly enriched across all modules, e.g., “pectoral fin development”, as were Uberon terms, such as “median fin fold”.

Boxplot comparisons (Fig. 2) show that across all paired fin and limb modules, the weighted degree distributions of predicted proteins exceeded those of proteins with original annotations. This pattern indicates that, as a collective, predicted proteins play central roles in module function.

Comparison of the network modules

Pectoral fin vs. forelimb modules

A comparison of the proteins between these homologous structures yielded 183 proteins to be specific to the pectoral fin module, 207 proteins that were unique to the forelimb module, and 37 proteins shared (conserved) between both (Supplementary Table S15; Fig. 3).

In the pectoral fin module, the hub protein with the highest weighted degree was shha (sonic hedgehog a; Supplementary File S5), a protein known for its crucial role in pectoral fin development¹⁶. Its ortholog, Shh, ranked fourth in the forelimb module (Supplementary Table S15) and is widely studied for its role in the development and morphogenesis of limbs across species⁴⁴, including mouse and humans. Notably, any disruptions in the sonic hedgehog signaling pathway in tetrapods correspond to losses, gains, or malformations of limbs⁴⁴. The significance of shh gene in morphological patterning of both paired fins and limbs has made it a focal point in studies concerning the transition from fin to limb¹².

The highest ranked protein in the forelimb module was bmp4 (bone morphogenetic protein 4; Supplementary File S7). The bmp4 plays a crucial role in the formation and morphogenesis of tetrapod limbs⁴⁵. Mutations within bmp4 can disrupt the bmp4 signaling pathway, resulting in limb abnormalities in limb and digit formation⁴⁵. The bmp4 also held a high position in the pectoral fin module, ranking second, and was predicted during the module detection phase (Supplementary Table S15).

Of the conserved proteins (Fig. 3), some important hub proteins in pectoral fin module, such as shha, bmp4, bmp2b, and bmp7a, have retained their importance in forelimb development. This is underscored by their high rankings based on the weighted degree in the forelimb module (Supplementary Table S15). Other pectoral fin proteins, such as sox9, have a higher rank within the forelimb, reflecting a more substantial role in limb development. Within the pectoral fin module, the proteins sox9a and sox9b were ranked 83rd and 104th, respectively, whereas in mouse, the corresponding ortholog sox9 was elevated to the 15th position (Supplementary Table S15). Sox9 is renowned for its involvement in limb digit patterning, a process attributed to its role in the bmp-sox9-wnt Turing network¹⁷. Given that digits emerged after the transition from fins to limbs^12,15, the involvement of sox9 in a digit patterning pathway could have amplified the interactions it had with other proteins in the forelimb module, thereby increasing its importance.

A boxplot illustrating the normalized weighted degree distributions for pectoral fin module-specific proteins, pectoral fin conserved proteins (those shared with the forelimb), forelimb conserved proteins (those shared with the pectoral fin), and forelimb module-specific proteins is presented in Fig. 4. The conserved proteins in both modules exhibit higher normalized weighted degree distributions in comparison to their respective module-specific proteins (p = 5.97e−13 for pectoral fin and p = 1.69e−6 for forelimb) when comparing conserved versus module-specific proteins based on Wilcoxon rank-sum tests. This indicates that as a group, the conserved proteins engage in more interactions within the module and play a central role in maintaining modular stability. From an evolutionary point of view, it seems that during the transition from pectoral fin to the forelimb, proteins with higher degrees in the pectoral fin module, such as shha and bmp4, were retained in the forelimb. Moreover, new forelimb module-specific proteins were integrated into the forelimb, likely surrounding these conserved proteins.

Pelvic fin versus hindlimb modules

Comparisons revealed 536 specific proteins in the pelvic fin module, and 601 specific proteins in the hindlimb module. 81 proteins were conserved between the two modules (Supplementary Table S16 and Fig. S7).

In the pelvic fin module, the protein with the highest rank was hsp90ab (predicted; Supplementary File S6). Although hsp90ab is a heat shock protein and its impact on pelvic fin development is not well-established, its suppression has been linked to developmental defects in zebrafish, particularly in eye development⁴⁶. Furthermore, the disruption of hsp90ab expression has been associated with caudal fin fold defects in zebrafish. This convergence of our computational findings and observed fin effects suggests that hsp90ab is a promising new candidate for pelvic fin development that may have a key role in the module stability.

The hub protein with the highest rank in the hindlimb module, trp53, is known to be associated with embryonic hindlimb development in mouse⁴⁷. Although trp53 also appears in the pelvic fin module (as a predicted protein), it held a lower rank (24th) based on the weighted degree (Supplementary Table S16).

Comparison of conserved proteins between pelvic fin and hindlimb modules (Supplementary Table S16 and Fig. S7) revealed several key proteins central to modular stability. For example, ctnnb1, a predicted module protein and ranked 4^th in the pelvic fin module, also held a high rank (3rd) in the hindlimb module. Ctnnb1 is essential in the β-catenin pathway, which is necessary for mouse hindlimb initiation⁴⁸. While there is as of yet no specific connection to zebrafish paired fins, ctnnb1 is recognized as a critical element in overall fish development⁴⁹.

According to comparison of normalized weighted degree distributions for pelvic fin module-specific proteins, pelvic fin conserved proteins, hindlimb conserved proteins, and hindlimb module-specific proteins (Supplementary Fig. S8), the conserved proteins in the hindlimb module demonstrated a higher normalized weighted degree distribution relative to the respective hindlimb module-specific proteins (p = 2.20e−16 based on the Wilcoxon rank-sum test). Meanwhile, the normalized weighted degree distribution for the pelvic fin module showed a moderate increase (p = 0.07, Wilcoxon rank-sum test). These findings underline the greater significance of conserved proteins in maintaining the stability of both the hindlimb and pelvic fin modules.

The fate of zebrafish paired fin module-specific proteins in mouse

A large number of proteins found in zebrafish fin modules (83% for pectoral fin: 183 proteins; 80% for pelvic fin: 536 proteins) were not featured in the mouse limb modules (Supplementary Files S5 and S6), implying these proteins were not retained in limb development. To understand the roles of these proteins in mouse, the enriched GO-BP and Uberon terms for their mouse orthologs were investigated (Supplementary Tables S17, S18, S19 and S20). Several unique anatomical entities and corresponding biological processes exclusive to tetrapods¹⁵ were enriched (Supplementary Tables S21 and S22), such as the involvement of highly ranked (7th) pectoral fin module-specific protein lef1 with palate development, trachea gland development, and neck-related phenotypes^50,51.

The role of mouse paired limb module-specific proteins in zebrafish

A majority of the limb module-specific proteins in mouse (85% for forelimb: 207 proteins; 90% for hindlimb: 601 proteins) did not appear in pectoral fin or pelvic fin modules in zebrafish (Supplementary Files S7 and S8). This suggests different roles for these proteins in zebrafish, prompting investigation into their enriched GO-BP and Uberon terms (Supplementary Tables S23, S24, S25 and S26). Some mouse limb module-specific proteins were enriched in the head region of the zebrafish, particularly in the jaw and post-hyoid pharyngeal arch skeleton (Supplementary Tables S27 and S28).

Discussion

The application of PPI network methods enabled a deeper biomolecular exploration of the phenotypic transition from fin to limb and revealed new insights into this transition. A primary goal for this work was to identify hub proteins in the functional modules, and to compare them between fins and limbs. This study aimed to distinguish conserved and species-specific proteins in functional modules to better understand their roles in evolution. The results indicate that conserved proteins are more likely to be hub proteins than module-specific proteins, as evidenced by the weighted degree comparisons between these two groups (Fig. 4). This reinforces the initial hypothesis that during the fin-to-limb transition, most hub proteins from fin modules were preserved in limbs, with limb-specific proteins recruited to support this conserved appendage core network.

Furthermore, the results of this study imply that many proteins specific to the zebrafish fin modules were not retained in limb development. Instead, some appear to have been evolutionarily repurposed in the development of anatomical structures that emerged during the aquatic-to-terrestrial transition in vertebrates, such as the lungs and neck. Several unique anatomical entities and corresponding biological processes exclusive to tetrapods¹⁵ were enriched (Supplementary Tables S21 and S22) for fin module-specific proteins. For example, the highly ranked pectoral fin module-specific protein lef1 is involved with palate development, trachea gland development, and neck-related phenotypes^50,51. The evolution of the neck in tetrapods was instrumental in supporting the head, a critical adaptation for survival on land⁵². Similarly, the pelvic fin module-specific protein, mapk1, is associated with neck-related phenotypes, such as thymus development and trachea formation⁵³. Additionally, it participates in lung development and other lung phenotypes⁵³. Like the neck, the evolution of lungs in tetrapods was a significant factor enabling them to breathe and flourish in terrestrial environments⁵⁴. Moreover, Lama5, a module-specific protein found in both pectoral fin and pelvic fin modules, is involved in mouse lung development⁵⁵, hair follicle development, and hair-related phenotypes⁵⁶, of which the latter anatomical features are unique to mammals⁵⁷. These examples suggest that many proteins that initially played roles in fin development were co-opted for the development of novel anatomical structures, a move that helped tetrapods adapt to and thrive in terrestrial environments.

In addition, this research suggests that the original function of some specific proteins in mouse limb modules may have been in the development of non-paired fin structures, such as gill rakers and the caudal fin, which were gradually lost during the evolution of tetrapods. Investigation into their different potential roles in zebrafish showed that some mouse limb module-specific proteins were enriched in the head region of the zebrafish, particularly in the jaw and post-hyoid pharyngeal arch skeleton (Supplementary Tables S27 and S28). The latter encompasses the gill chamber and contains anatomical parts such as gill rakers⁵⁸, which were lost during the evolution of tetrapods. For instance, fst, a crucial forelimb module-specific protein (Supplementary File S7), has a zebrafish ortholog (fsta) associated with splanchnocranium⁵⁹ and post-hyoid pharyngeal arch skeleton⁶⁰. Similarly, twist1, a module-specific protein for both forelimb and hindlimb, has two zebrafish orthologs (twist1a and twist1b) implicated in pharyngeal system development⁶¹. These enrichment analyses suggest that proteins initially associated with fish-specific structures, such as gill arches and the caudal fin, might have been co-opted for limb development as these structures were lost during the transition to tetrapods. These findings support Gegenbaur's theory, which proposes that pectoral and pelvic appendages in tetrapods originated from head branchial arches^62,63,64. Despite initially being refuted and overshadowed in favor of the competing fin-fold theory, Gegenbaur's theory has seen a resurgence in support from recent evolutionary development (evo-devo) studies^58,62. Together, these new findings and the generalized workflow developed here, sets the stage for further experimental exploration by evolutionary developmental biologists.

The network-based workflow used for this study presented several challenges and limitations which were mitigated using computational solutions. For instance, a single protein often plays a role in multiple phenotypes^3,4,28; hence, it is crucial to verify whether the modular structure and underlying protein interactions are attributable exclusively to their involvement with the corresponding fins and limbs, or if they are associated with other phenotypes. Moreover, the presence of incorrect interactions in PPI networks can compromise the prediction accuracy of the modules^4,28. To mitigate the confounding effects caused by other phenotypes and erroneous interactions, a network-based prediction model that has demonstrated a high degree of accuracy in our previous work²⁸ was used. This model has been rigorously tested under various experimental conditions—including different semantic similarity calculation methodologies, various ontology annotation types, and multiple evaluation techniques—for both mouse and zebrafish, and has consistently delivered accurate candidate protein prediction results²⁸. Another challenge faced during the study was that the module comparison depended on respective module sizes. For instance, the detection of the pelvic fin module presented challenges due to the low number of original proteins annotations, potentially because the pelvic fin bud arises late in development⁶⁵ after gene disruptions may have killed the larval zebrafish. To address this issue, the prediction threshold for the pelvic fin module has to be lowered to extract a sizable module comparable to the hindlimb.

Generally, PPI networks retrieved from databases such as STRING are directly used for module detection^3,4, but this work represents the inaugural application of integrated networks in addressing a biological problem. While the transition from fin to limb was the focal point of this study, this comparative integrative network-based approach could be applied to a range of other fields, such as human diseases, plant stress phenotypes, and more, paving the way for future directions in this line of research. In the future, this network comparison workflow could be applied, for example, to other significant evolutionary changes associated with aquatic-to-terrestrial vertebrate transition, such as changes in axial and cranial skeletons. Furthermore, a web-based application will be developed to easily compare the PPI network modules of any vertebrate anatomical change, which will enhance the usability of this workflow.

Conclusions

This work represents the inaugural application of integrated networks in addressing a biological problem that may be generalized to many types of problems in comparative biology. This work enabled the identification of hub proteins essential to the anatomical transition from paired fins to limbs and an assessment of the hypothesis that hub proteins are most likely to be conserved during the transition. This approach also offered insights into the fate of fin module-specific proteins in terrestrial vertebrates, alongside the roles of limb module-specific proteins in fishes. Finally, this study presents a generalized network-based computational workflow designed to perform protein network module comparisons that can be more broadly used in investigating other evolutionary phenotypic transitions.

Data availability

The network files and anatomy profiles used for candidate protein predictions are available at: https://doi.org/10.6084/m9.figshare.13589579.v1. The additional figures and tables are included in electronic supplementary files.

References

Cowen, L., Ideker, T., Raphael, B. J. & Sharan, R. Network propagation: A universal amplifier of genetic associations. Nat. Rev. Genet. 18, 551–562. https://doi.org/10.1038/nrg.2017.38 (2017).
Article PubMed CAS Google Scholar
Yamada, T. & Bork, P. Evolution of biomolecular networks—lessons from metabolic and protein interactions. Nat. Rev. Mol. Cell Biol. 10, 791–803. https://doi.org/10.1038/nrm2787 (2009).
Article PubMed CAS Google Scholar
Wimalagunasekara, S. S., Weeraman, J. W. J. K., Tirimanne, S. & Fernando, P. C. Protein-protein interaction (PPI) network analysis reveals important hub proteins and sub-network modules for root development in rice (Oryza sativa). J. Genet. Eng. Biotechnol. 21, 69. https://doi.org/10.1186/s43141-023-00515-8 (2023).
Article PubMed PubMed Central Google Scholar
Sharan, R., Ulitsky, I. & Shamir, R. Network-based prediction of protein function. Mol. Syst. Biol. 3, 88. https://doi.org/10.1038/msb4100129 (2007).
Article PubMed PubMed Central Google Scholar
Tripathi, S., Moutari, S., Dehmer, M. & Emmert-Streib, F. Comparison of module detection algorithms in protein networks and investigation of the biological meaning of predicted modules. BMC Bioinform. 17, 129. https://doi.org/10.1186/s12859-016-0979-8 (2016).
Article CAS Google Scholar
Tang, X. et al. A comparison of the functional modules identified from time course and static PPI network data. BMC Bioinform. 12, 339. https://doi.org/10.1186/1471-2105-12-339 (2011).
Article CAS Google Scholar
Vespignani, A. Evolution thinks modular. Nat. Genet. 35, 118–119. https://doi.org/10.1038/ng1003-118 (2003).
Article PubMed CAS Google Scholar
Wuchty, S., Oltvai, Z. N. & Barabasi, A. L. Evolutionary conservation of motif constituents in the yeast protein interaction network. Nat. Genet. 35, 176–179. https://doi.org/10.1038/ng1242 (2003).
Article PubMed CAS Google Scholar
Alhindi, T. et al. Protein interaction evolution from promiscuity to specificity with reduced flexibility in an increasingly complex network. Sci. Rep. 7, 44948. https://doi.org/10.1038/srep44948 (2017).
Article PubMed PubMed Central ADS CAS Google Scholar
Shui, Y. & Cho, Y.-R. Alignment of PPI networks using semantic similarity for conserved protein complex prediction. IEEE Trans. Nanobiosci. 15, 380–389. https://doi.org/10.1109/TNB.2016.2555802 (2016).
Article Google Scholar
Garcia Del Valle, E. P. et al. Disease networks and their contribution to disease understanding: A review of their evolution, techniques and data sources. J. Biomed. Inform. 94, 103206. https://doi.org/10.1016/j.jbi.2019.103206 (2019).
Article PubMed Google Scholar
Amaral, D. B. & Schneider, I. Fins into limbs: Recent insights from sarcopterygian fish. Genesis 56, e23052. https://doi.org/10.1002/dvg.23052 (2018).
Article Google Scholar
Clack, J. A. Gaining Ground: The Origin and Evolution of Tetrapods (Indiana University Press, 2012).
Google Scholar
Molnar, J. L., Hutchinson, J. R., Diogo, R., Clack, J. A. & Pierce, S. E. Evolution of forelimb musculoskeletal function across the fish-to-tetrapod transition. Sci. Adv. 7, eabd7457. https://doi.org/10.1126/sciadv.abd7457 (2021).
Article PubMed ADS Google Scholar
Shubin, N. Your Inner Fish: A Journey into the 3.5-Billion-Year History of the Human Body (Pantheon Books, 2008).
Letelier, J. et al. A conserved Shh cis-regulatory module highlights a common developmental origin of unpaired and paired fins. Nat. Genet. 50, 504–509. https://doi.org/10.1038/s41588-018-0080-5 (2018).
Article PubMed PubMed Central CAS Google Scholar
Onimaru, K., Marcon, L., Musy, M., Tanaka, M. & Sharpe, J. The fin-to-limb transition as the re-organization of a Turing pattern. Nat. Commun. 7, 11582. https://doi.org/10.1038/ncomms11582 (2016).
Article PubMed PubMed Central ADS CAS Google Scholar
Hawkins, M. B., Henke, K. & Harris, M. P. Latent developmental potential to form limb-like skeletal structures in zebrafish. Cell 184, 899–911. https://doi.org/10.1016/j.cell.2021.01.003 (2021).
Article PubMed CAS Google Scholar
Royle, S. R., Tabin, C. J. & Young, J. J. Limb positioning and initiation: An evolutionary context of pattern and formation. Dev. Dyn. https://doi.org/10.1002/dvdy.308 (2021).
Article PubMed PubMed Central Google Scholar
Woltering, J. M. et al. Sarcopterygian fin ontogeny elucidates the origin of hands with digits. Sci. Adv. 6, eabc3510. https://doi.org/10.1126/sciadv.abc3510 (2020).
Article PubMed PubMed Central ADS CAS Google Scholar
Fernando, P. C., Jackson, L. M., Zeng, E., Mabee, P. M. & Balhoff, J. P. A generic bioinformatics pipeline to integrate large-scale trait data with large phylogenies. In 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2235–2237 (IEEE, 2017).
Zeng, E., Ding, C., Mathee, K., Schneper, L. & Narasimhan, G. Gene function prediction and functional network: The role of gene ontology. In Data Mining: Foundations and Intelligent Paradigms Vol. 25 Intelligent Systems Reference Library (eds Holmes, D. E. & Jain, L. C.) Ch. 7, 123–162 (Springer, Berlin, Heidelberg, 2012).
Zeng, E., Yang, C., Li, T. & Narasimhan, G. On the effectiveness of constraints sets in clustering genes. In 2007 IEEE 7th International Symposium on BioInformatics and BioEngineering 79–86 (IEEE, 2007).
Hishigaki, H., Nakai, K., Ono, T., Tanigami, A. & Takagi, T. Assessment of prediction accuracy of protein function from protein–protein interaction data. Yeast 18, 523–531. https://doi.org/10.1002/yea.706 (2001).
Article PubMed CAS Google Scholar
Taylor, I. W. et al. Dynamic modularity in protein interaction networks predicts breast cancer outcome. Nat. Biotechnol. 27, 199–204. https://doi.org/10.1038/nbt.1522 (2009).
Article PubMed CAS Google Scholar
Tang, X., Wang, J., Zhong, J. & Pan, Y. Predicting essential proteins based on weighted degree centrality. IEEE/ACM Trans. Comput. Biol. Bioinform. 11, 407–418. https://doi.org/10.1109/TCBB.2013.2295318 (2014).
Article PubMed Google Scholar
Liang, Z., Xu, M., Teng, M. & Niu, L. Comparison of protein interaction networks reveals species conservation and divergence. BMC Bioinform. 7, 457. https://doi.org/10.1186/1471-2105-7-457 (2006).
Article CAS Google Scholar
Fernando, P. C., Mabee, P. M. & Zeng, E. Integration of anatomy ontology data with protein-protein interaction networks improves the candidate gene prediction accuracy for anatomical entities. BMC Bioinform. 21, 442. https://doi.org/10.1186/s12859-020-03773-2 (2020).
Article CAS Google Scholar
von Mering, C. et al. Comparative assessment of large-scale data sets of protein-protein interactions. Nature 417, 399–403 (2002).
Article ADS Google Scholar
Szklarczyk, D. et al. The STRING database in 2017: Quality-controlled protein-protein association networks, made broadly accessible. Nucleic Acids Res. 45, D362–D368. https://doi.org/10.1093/nar/gkw937 (2017).
Article PubMed CAS Google Scholar
Mungall, C. J. et al. The Monarch Initiative: An integrative data and analytic platform connecting phenotypes to genotypes across species. Nucleic Acids Res. 45, D712–D722. https://doi.org/10.1093/nar/gkw1128 (2017).
Article PubMed CAS Google Scholar
Mungall, C. J., Torniai, C., Gkoutos, G. V., Lewis, S. E. & Haendel, M. A. Uberon, an integrative multi-species anatomy ontology. Genome Biol. 13, R5. https://doi.org/10.1186/gb-2012-13-1-r5 (2012).
Article PubMed PubMed Central Google Scholar
Lin, D. An information-theoretic definition of similarity. In Proceedings of the 15th International Conference on Machine Learning (Morgan Kaufmann, 1998).
Resnik, P. Semantic similarity in a taxonomy: An Information-Based measure and its application to problems of ambiguity in natural language. J. Artif. Intell. Res. 11, 95–130 (1999).
Article Google Scholar
Schlicker, A., Domingues, F. S., Rahnenfuhrer, J. & Lengauer, T. A new measure for functional similarity of gene products based on Gene Ontology. BMC Bioinform. 7, 1–16. https://doi.org/10.1186/1471-2105-7-302 (2006).
Article Google Scholar
Wang, J. Z., Du, Z., Payattakool, R., Yu, P. S. & Chen, C. F. A new method to measure the semantic similarity of GO terms. Bioinformatics 23, 1274–1281. https://doi.org/10.1093/bioinformatics/btm087 (2007).
Article PubMed CAS Google Scholar
Zeng, E., Ding, C., Narasimhan, G. & Holbrook, S. R. Estimating support for protein-protein interaction data with applications to function prediction. In Computational Systems Bioinformatics: (Volume 7) 73–84 (World Scientific, 2008).
Wang, D., Ogihara, M., Zeng, E. & Li, T. Combining gene expression profiles and protein-protein interactions for identifying functional modules. In 2012 11th International Conference on Machine Learning and Applications 114–119 (IEEE, 2012).
Smoot, M. E., Ono, K., Ruscheinski, J., Wang, P. L. & Ideker, T. Cytoscape 2.8: New features for data integration and network visualization. Bioinformatics 27, 431–432. https://doi.org/10.1093/bioinformatics/btq675 (2011).
Article PubMed CAS Google Scholar
Routledge, R. Fisher's exact test. In Encyclopedia of Biostatistics Major Reference Works (eds Armitage, P. & Colton, T.) (2005).
Meyer, A. & Schartl, M. Gene and genome duplications in vertebrates: the one-to-four (-to-eight in fish) rule and the evolution of novel gene functions. Curr. Opin. Cell Biol. 11, 699–704. https://doi.org/10.1016/s0955-0674(99)00039-3 (1999).
Article PubMed CAS Google Scholar
Bradford, Y. et al. ZFIN: Enhancements and updates to the Zebrafish Model Organism Database. Nucleic Acids Res. 39, D822-829. https://doi.org/10.1093/nar/gkq1077 (2011).
Article PubMed CAS Google Scholar
Varoquaux, G. Cross-validation failure: Small sample sizes lead to large error bars. NeuroImage 180, 68–77. https://doi.org/10.1016/j.neuroimage.2017.06.061 (2018).
Article PubMed Google Scholar
Lopez-Rios, J. The many lives of SHH in limb development and evolution. Semin. Cell Dev. Biol. 49, 116–124. https://doi.org/10.1016/j.semcdb.2015.12.018 (2016).
Article PubMed CAS Google Scholar
Bakrania, P. et al. Mutations in BMP4 cause eye, brain, and digit developmental anomalies: overlap between the BMP4 and hedgehog signaling pathways. Am. J. Hum. Genet. 82, 304–319. https://doi.org/10.1016/j.ajhg.2007.09.023 (2008).
Article PubMed PubMed Central CAS Google Scholar
Yeyati, P. L., Bancewicz, R. M., Maule, J. & van Heyningen, V. Hsp90 selectively modulates phenotype in vertebrate development. PLoS Genet. 3, e43. https://doi.org/10.1371/journal.pgen.0030043 (2007).
Article PubMed PubMed Central CAS Google Scholar
Im, C. H. et al. Hindlimb muscles of 17.5–18.5 dpc mice double null for MyoD and Trp53 appear indistinguishable from muscles of mice null for either gene. FASEB J. 30, 1035–1032. https://doi.org/10.1096/fasebj.30.1_supplement.1035.2 (2016).
Article Google Scholar
Kawakami, Y. et al. Islet1-mediated activation of the β-catenin pathway is necessary for hindlimb initiation in mice. Development 138, 4465–4473. https://doi.org/10.1242/dev.065359 (2011).
Article PubMed PubMed Central CAS Google Scholar
Zhang, M., Zhang, J., Lin, S. C. & Meng, A. beta-Catenin 1 and beta-catenin 2 play similar and distinct roles in left-right asymmetric development of zebrafish embryos. Development 139, 2009–2019. https://doi.org/10.1242/dev.074435 (2012).
Article PubMed CAS Google Scholar
Duan, D. et al. Submucosal gland development in the airway is controlled by lymphoid enhancer binding factor 1 (LEF1). Development 126, 4441–4453 (1999).
Article PubMed CAS Google Scholar
Nawshad, A. & Hay, E. D. TGFbeta3 signaling activates transcription of the LEF1 gene to induce epithelial mesenchymal transformation during mouse palate development. J. Cell Biol. 163, 1291–1301. https://doi.org/10.1083/jcb.200306024 (2003).
Article PubMed PubMed Central CAS Google Scholar
Ericsson, R., Knight, R. & Johanson, Z. Evolution and development of the vertebrate neck. J. Anat. 222, 67–78. https://doi.org/10.1111/j.1469-7580.2012.01530.x (2013).
Article PubMed Google Scholar
Boucherat, O., Nadeau, V., Berube-Simard, F. A., Charron, J. & Jeannotte, L. Crucial requirement of ERK/MAPK signaling in respiratory tract development. Development 141, 3197–3211. https://doi.org/10.1242/dev.110254 (2014).
Article PubMed CAS Google Scholar
Tatsumi, N. et al. Molecular developmental mechanism in polypterid fish provides insight into the origin of vertebrate lungs. Sci. Rep. 6, 30580. https://doi.org/10.1038/srep30580 (2016).
Article PubMed PubMed Central ADS CAS Google Scholar
Nguyen, N. M., Miner, J. H., Pierce, R. A. & Senior, R. M. Laminin α5 is required for lobar septation and visceral pleural basement membrane formation in the developing mouse lung. Dev. Biol. 246, 231–244. https://doi.org/10.1006/dbio.2002.0658 (2002).
Article PubMed CAS Google Scholar
Gao, J. et al. Laminin-511 is an epithelial message promoting dermal papilla development and function during early hair morphogenesis. Genes Dev. 22, 2111–2124. https://doi.org/10.1101/gad.1689908 (2008).
Article PubMed PubMed Central CAS Google Scholar
Wu, D. D., Irwin, D. M. & Zhang, Y. P. Molecular evolution of the keratin associated protein gene family in mammals, role in the evolution of mammalian hair. BMC Evol. Biol. 8, 241. https://doi.org/10.1186/1471-2148-8-241 (2008).
Article PubMed PubMed Central CAS Google Scholar
Gillis, J. A., Modrell, M. S. & Baker, C. V. Developmental evidence for serial homology of the vertebrate jaw and gill arch skeleton. Nat. Commun. 4, 1436. https://doi.org/10.1038/ncomms2429 (2013).
Article PubMed ADS CAS Google Scholar
Dalcq, J. et al. RUNX3, EGR1 and SOX9B form a regulatory cascade required to modulate BMP-signaling during cranial cartilage development in zebrafish. PLoS One 7, e50140. https://doi.org/10.1371/journal.pone.0050140 (2012).
Article PubMed PubMed Central ADS CAS Google Scholar
Dal-Pra, S., Furthauer, M., Van-Celst, J., Thisse, B. & Thisse, C. Noggin1 and Follistatin-like2 function redundantly to Chordin to antagonize BMP activity. Dev. Biol. 298, 514–526. https://doi.org/10.1016/j.ydbio.2006.07.002 (2006).
Article PubMed CAS Google Scholar
Das, A. & Crump, J. G. Bmps and id2a act upstream of Twist1 to restrict ectomesenchyme potential of the cranial neural crest. PLoS Genet. 8, e1002710. https://doi.org/10.1371/journal.pgen.1002710 (2012).
Article PubMed PubMed Central CAS Google Scholar
Diogo, R. Cranial or postcranial—Dual origin of the pectoral appendage of vertebrates combining the fin-fold and gill-arch theories?. Dev. Dyn. 249, 1182–1200. https://doi.org/10.1002/dvdy.192 (2020).
Article PubMed Google Scholar
Gillis, J. A., Dahn, R. D. & Shubin, N. H. Shared developmental mechanisms pattern the vertebrate gill arch and paired fin skeletons. Proc. Natl. Acad. Sci. 106, 5720. https://doi.org/10.1073/pnas.0810959106 (2009).
Article PubMed PubMed Central ADS Google Scholar
Gegenbaur, C. Grundzüge der vergleichenden Anatomie. (W. Engelmann, 1870).
Grandel, H. & Schulte-Merker, S. The development of the paired fins in the zebrafish (Danio rerio). Mech. Dev. 79, 99–120. https://doi.org/10.1016/s0925-4773(98)00176-2 (1998).
Article PubMed CAS Google Scholar

Download references

Acknowledgements

The authors thank J. P. Balhoff for assisting with retrieval of gene-anatomical entity relationships from the Monarch Initiative repository. The authors thank T. J. Vision, D. Goodman, B. Wone, W. M. Dahdul, and L. M. Jackson for assistance and helpful comments.

Funding

This work was supported by NSF 1661529, NSF 1062542, NSF 0956049, and NSF EPSCoR IIA–1355423 grants. This work was also supported by the University of Iowa faculty startup fund and UI COD Seed Grant to EZ. The views expressed in this paper do not necessarily reflect those of the NSF.

Author information

Authors and Affiliations

Department of Plant Sciences, University of Colombo, Colombo, Sri Lanka
Pasan C. Fernando
Department of Biology, University of South Dakota, Vermillion, SD, USA
Paula M. Mabee
Departments of Preventive & Community Dentistry, College of Dentistry, University of Iowa, Iowa City, IA, USA
Erliang Zeng
Division of Biostatistics and Computational Biology, College of Dentistry, University of Iowa, Iowa City, IA, USA
Erliang Zeng
Departments of Biostatistics, College of Public Health, University of Iowa, Iowa City, IA, USA
Erliang Zeng
Departments of Biomedical Engineering, College of Engineering, University of Iowa, Iowa City, IA, USA
Erliang Zeng
National Ecological Observatory Network, Battelle, 1625 38th St. #100, Boulder, CO, 80301, USA
Paula M. Mabee

Authors

Pasan C. Fernando
View author publications
You can also search for this author in PubMed Google Scholar
Paula M. Mabee
View author publications
You can also search for this author in PubMed Google Scholar
Erliang Zeng
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors planned and designed the experiments. P.C.F. developed the Python scripts for the analysis and conducted the experiments under the supervision of P.M.M. and E.Z. All authors engaged in analyzing the data, interpreting the results, and reviewing the manuscript. Each author has given their approval for the final version of the manuscript.

Corresponding authors

Correspondence to Pasan C. Fernando or Erliang Zeng.

Ethics declarations

Competing interests

EZ is an editor of the Biological Modularity collection. The other authors declare that they have no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figures.

Supplementary Tables.

Supplementary Information 1.

Supplementary Information 2.

Supplementary Information 3.

Supplementary Information 4.

Supplementary Information 5.

Supplementary Information 6.

Supplementary Information 7.

Supplementary Information 8.

Supplementary Information 9.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fernando, P.C., Mabee, P.M. & Zeng, E. Protein–protein interaction network module changes associated with the vertebrate fin-to-limb transition. Sci Rep 13, 22594 (2023). https://doi.org/10.1038/s41598-023-50050-2

Download citation

Received: 01 June 2023
Accepted: 14 December 2023
Published: 18 December 2023
DOI: https://doi.org/10.1038/s41598-023-50050-2

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Methods

Generation and selection of the integrated networks for module detection

Detection of network modules

Validation of the predicted proteins

Comparison of the network modules

Results

Detection of network modules

Validation of the predicted proteins

Comparison of the network modules

Pectoral fin vs. forelimb modules

Pelvic fin versus hindlimb modules

The fate of zebrafish paired fin module-specific proteins in mouse

The role of mouse paired limb module-specific proteins in zebrafish

Discussion

Conclusions

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links