Abstract
Several highly effective Covid-19 vaccines are in emergency use, although more-infectious coronavirus strains, could delay the end of the pandemic even further. Because of this, it is highly desirable to develop fast antiviral drug treatments to accelerate the lasting immunity against the virus. From a theoretical perspective, computational approaches are useful tools for antiviral drug development based on the data analysis of gene expression, chemical structure, molecular pathway, and protein interaction mapping. This work studies the structural stability of virus–host interactome networks based on the graphical representation of virus–host protein interactions as vertices or nodes connected by commonly shared proteins. These graphical network visualization methods are analogous to those use in the design of artificial neural networks in neuromorphic computing. In standard protein-node-based network representation, virus–host interaction merges with virus–protein and host–protein networks, introducing redundant links associated with the internal virus and host networks. On the contrary, our approach provides a direct geometrical representation of viral infection structure and allows the effective and fast detection of the structural robustness of the virus–host network through proteins removal. This method was validated by applying it to H1N1 and HIV viruses, in which we were able to pinpoint the changes in the Interactome Network produced by known vaccines. The application of this method to the SARS-CoV-2 virus–host protein interactome implies that nonstructural proteins nsp4, nsp12, nsp16, the nuclear pore membrane glycoprotein NUP210, and ubiquitin specific peptidase USP54 play a crucial role in the viral infection, and their removal may provide an efficient therapy. This method may be extended to any new mutations or other viruses for which the Interactome Network is experimentally determined. Since time is of the essence, because of the impact of more-infectious strains on controlling the spread of the virus, this method may be a useful tool for novel antiviral therapies.
Similar content being viewed by others
Introduction
The current health emergency caused by the SARS-CoV-2 infection has prompted worldwide efforts to develop an antiviral treatment against Covid-19. The development of antiviral drugs requires an urgent, in-depth understanding of host–coronavirus protein–protein interactions. Many improvements in the SARS-CoV-2 interactome have been studied recently, but a disease treatment still remains elusive. Moreover, despite the genome sequences of the SARS-CoV-2 are quite similar to SARS-CoV-1 and MERS-CoV, and there are currently several vaccines in emergency use authorizations, there is no effective antiviral drug treatment yet. Current vaccines can be less effective against new variants of SARS-CoV-2 that could spread more quickly, develop more severe disease, or be capable of evading diagnostics. On the other hand, antiviral drugs can be easily administrated, possibly transported without a cold chain and at low cost. In fact, the US government invests $18.5 billion into vaccines but $8.2 billion in antiviral drugs development because it has not yet identified a highly effective drug to treat or prevent the Covid-19 infection (https://www.nytimes.com/2021/01/30/health/covid-drugs-antivirals.html).
Based on available virus–host protein Interactome Network we developed a computational approach for the fast detection of crucial protein–protein interactions. Our approach is largely inspired on the graphical representation of multi-dimensional neural networks being used in various implementations of neuromorphic computing. These maybe potential targets for disruption of the Interactome Network. When applied to the H1N1 and HIV viruses this analysis method is able to pinpoint a major part of the known proteins that are used in therapeutic treatments of this viruses1,2.
We have applied this method to the MERS-CoV, SARS-CoV-1 and SARS-CoV-2 to predict potential key proteins whose removal may produce important disruptions of the host–virus protein network. We demonstrate that targeting nonstructural proteins nsp4, nsp12, and nsp16 removal produces a significant structural change of the host–SARS-CoV-2 interactions. This implies that the inhibition of a small set of virus–host protein interactions profoundly impacts viral infection caused by the SARS-CoV-2 coronavirus. Furthermore, these studies also reveal that the SARS-CoV-2 infection extends across the entire human cell, unlike in the case of SARS-CoV-1 and MERS-CoV. Our approach is a useful, fast, predictive tool which helps accelerate understanding of new variant strain of SARS-CoV-2 providing new perspectives for antiviral treatment. Since COVID infections are very time-sensitive, it may be worthwhile to include this method in the therapeutic toolkit immediately, without a lengthy way for complete proof of its efficacy.
The modern computational/data science approach harnesses the symbiotic relationship between computational and biological systems, which has given rise to bioinspired or neuromorphic computing3,4, to developing innovative algorithms to solve complex problem. Bioinspired computation is a very promising approach including heuristic algorithms of evolution, optimization, and artificial intelligence, which allow understanding the complexity of biological systems. These methods are a very powerful predictive tool to tackle diverse problems from the analysis of uncorrelated data.
SARS-COV-2 description
The SARS-CoV-2 coronavirus is the most recent and transmissible member of the family Coronaviridae causing the Covid-19 disease. It has a lower death rate than previous highly pathogenic viruses MERS-CoV, and SARS-CoV-15,6,7,8. SARS-CoV-2 virus consist of four structural (S, N, M, E) and 16 nonstructural (nsp1–nsp16) proteins9,10,11,12. The spike glycoprotein (S), which lies on the particle's surface, allows to bind the cellular receptor13,14,15,16. While the membrane (M)13,14,17 is responsible for viral assembly, the envelope (E) protein is involved in release process of the genetic material13,14,18. Nonstructural proteins prevent the host immune response against the viral replication mediated by the nucleocapsid (N) protein13,14,19. Nonstructural proteins functions include inhibitory activities of the host immune response (nsp1)20,21; perturbation of host intracellular signaling (nsp2)22,23; polyprotein processing (nsp3)24,25,26,27, and (nsp5)28,29; viral replication and transcription (nsp4, nsp12)30,31,32,33; autophagosome (nsp6)34,35; RNA primase function (nsp7, nsp8, nsp9, nsp10, nsp11)11,33,36,37,38; helicase core domain that binds ATP (nsp13)39,40,41; mRNA capping (nsp14, nsp16)11; evasion of host cell dsRNA sensors (nsp15)9,42.
Protein network
The host–virus analysis based on protein–protein interactions (PPI) network is an efficient computational approach to elucidate genetic features, molecular interactions, viral infection mechanisms, and host response43,44,45. However, to achieve this goal, a suitable amount of data is required, for instance, molecular data, genomic, and protein sequence of the virus46. Unfortunately, there is a paucity of SARS-CoV-2 infection mechanism data47,48. In this context, our model can accelerate the discovery process by elucidating the importance of new protein clusters in the infection process and identify emergent features from uncorrelated data, improving antiviral treatment development. Furthermore, our approach is a useful tool to rapidly identify potential target proteins of new variant strains of SARS-CoV-2.
Previous works focused mainly on viral protein properties within the host–virus PPI network by studying the PPI network's structural properties, local connectivity distribution, and cluster formation43,44,45,46,48,49. We propose an alternative approach based on the complexity and tolerance of the virus/protein–host/protein interaction network. This approach builds on the assumption that viral infection can be graphically represented by a network, where each node represents a virus–host protein interaction, and the edges correspond to proteins involved in each of those interactions. In virus–host systems, host–host protein interactions are commonly affected by viral proteins since, frequently, proteins involved in a host–host interaction also take part in a virus–host interaction50,51,52. Thus, the protein–protein connection redundancy leads to a robust virus–host network, which is unaffected by the host/protein network variations53,54.
On the other hand, during the viral infection, pathogen proteins mutate, changing the structural and topological properties of the network50,55,56. To reduce the statistical bias, we simulate the viral evolution starting from a fully connected virus–host network, where proteins interact with each other only if they are in physical contact Therefore, the schematic representation of the virus–host interaction is not affected by the virus/protein network and host/protein network structures.”
During the viral infection, pathogen and cellular proteins compete for binding partners changing their protein–protein network structure50,51. For instance, mutations at the protein interfaces change the “protein electrostatics and structural properties”50. Viral infection evolution can be represented as a protein–protein interactions network where patterns of interactions encode complex biological processes56,57, and statistical methods can be used for drug target identification58.
The efficiency of virus transmission, replication and proliferation can be identified from the effect on the connectivity of this network caused by a virus–host protein removal. The idea is that identification of the main virus–host interactions is therefore the key for the development of antiviral drugs.
In network science many systems exhibit tolerance against errors. The ability to maintain interactions or communication, notwithstanding the structural changes caused by removing nodes, arises from redundant interconnections. Many networks in nature (or real networks) manifest high tolerance against local failures, but there are still target nodes whose removal causes a significant global impact on the network's structural properties59,60. Inspired by this feature we explore how removal of a particular virus–host protein interaction modifies the viral infection.
The method
We built a virus/protein–host/protein interaction network based on a public PPI database as follows, each PPI consists of one viral protein and one host protein; if two PPI share a common virus/protein or a host/protein, then they are connected. All the information of viral infection is encoded into the virus/protein–host/protein interaction network architecture. The removal of a specific protein produces a structural change in the network. We set the average connectivity (see Supplementary Information Section Fig. 9) as a proxy of the network tolerance against the deletion of proteins. We identify the target viral proteins as the proteins whose removal causes a significant variation of the network's average connectivity.
Graph theory
Virus/protein–host/protein interaction network are composed by a set of interactions \(I_{i} ,\;I_{j} , \ldots ,I_{k} ,\;I_{l}\), which in turn indicates the interaction between a virus protein and a host protein, as shown in Fig. 1A. Figure 1B Two interactions are connected if they share a common virus protein or a host protein, if they share both proteins then they are the same virus–host interaction. For instance, the interaction \(I_{i}\), and \(I_{j}\) share the E protein so they are connected (Fig. 1C). We construct a graph network with the connected interaction, and we obtain a hierarchical cluster decomposition of the interactions where each cluster is located a specific cellular compartment except the SARS-CoV-2 largest cluster which is formed by interaction with different cellular localization (Fig. 1D). When a single virus protein or host protein is removed all the links that contain it disappear, while if a single interaction is removed all its links are drop out.
Validation
A multiply connected random network is formed by clusters which are connected by multiple alternative connections. An important way to characterize such a network is given by the average connectivity of clusters as a function of the number of nodes (or interactions in our case). For a randomly fully connected network this graph has a slope of 1 and any deviation from this reflects peculiarities and biases of the interactions and/or connectivity which is an intrinsic feature of real networks61,62. Hepatitis and Ebola virus network architectures are composed of a few fully connected nodes and thus our methods are not applicable.
Application of this concepts to the well-known H1N1, and HIV virus (using the public PPI database http://viruses.string-db.org/ with a scoring > 0.7) reveals interesting systematics. The H1N1 (Fig. 2A) and HIV (Fig. 2B) virus/protein–host/protein interaction network (see Supplementary Information Section Fig. 2) displays few small clusters (empty circle marks) and one large cluster (black dot). Figure 2 shows the average connectivity of each cluster as a function of the number of PPI. The dashed line is a line with slope of 1 expected from a fully randomly interconnected network61,62. The small clusters are on this line, however the largest cluster (black dot, for each virus) are completely off the line. This indicates that the large cluster is weakly connected to the rest of the network and therefore elimination of a few connections may produce a major disruption of network connectivity.
In Fig. 2 we show the effect of the removal of specific proteins on the position of the largest cluster on this graph. This is a measure of the tolerance of the network against PPI removal by means of the average connectivity variation of the largest cluster. Figure 2 shows that removal of some proteins from the network moves the position of the largest cluster closer to the expected random network curve. Interestingly all the predicted target proteins (i.e., removed proteins producing a significant change in the largest cluster connectivity) for the H1N1 virus are already used in vaccines: HA (blue) attenuated vaccine strains, NA (cyan) conserved epitopes, M (red) ectodomain based vaccines, PB2 (orange) and NS1 (magenta) live attenuated vaccine strains1,63,64. Figure 2B shows the same for the HIV virus. Figure 2B displays the following target proteins: F (cyan); vif (blue); IL10 (magenta); CREB1 (orange); JUN (green); vpr (red). Inactivated antigen approach to influenza vaccination promotes an immune response against the viral surface glycoproteins HA and neuraminidase NA1,63,64, while live attenuated influenza vaccine targets M, PB2, and NS163,65,66,67,68. Our method predicted those proteins as key targets which disrupt the influenza network structure.
Our method identifies three key proteins that play a significant role in the efficacy of the vaccination against HIV-1, Vpr, vif, and CREB1. These play a crucial role in the development of therapeutic interventions as described in references2,69,70. Our method does not identify other protein vaccine candidates, such as multiple gp120 envelope proteins71, because the statistical weight associated with protein removal relies on the number of connections and the effective long-range correlation between virus–host interactions.
Our findings indicate that a study of the viral/protein–human/protein interaction in the H1N1 and HIV is able to identify important proteins used in vaccines. Considering that time is of essence, this calls for application of this method to other important viral systems although further studies will be needed as experimental data becomes available.
Geometrical descriptors of structural changes of the network like the largest cluster average connectivity and the number of PPI capture the impact of highly connected nodes. This statistical description is sensitive to the number of disconnected links, which accounts for effective long-range interactions. Therefore, we are not able to account for targets which depend on long range biological interactions. However, by evaluating different scoring methods and using many protein interactions, we decrease the statistical bias and discrepancies between predicted proteins based on biological mechanisms and statistical methods.
Host–coronavirus protein interaction network
Recently, a map of virus–host protein for MERS-CoV, SARS-CoV-1, and SARS-CoV-2 has been published44,45, including a high-score update and additional statistical updates of mass spectrometry and PPI scoring information44,45, reporting more than 300 high-confidence interactions for these coronaviruses. A protein-interaction-based network analysis based on this data set has not yet been carried out, to the best of our knowledge. Moreover, the networks tolerance against antiviral attack has not been reported. Within this context, we study the tolerance of MERS-CoV, SARS-CoV-1, and SARS-CoV-2 networks to removal of a single protein [see Supplementary Information (SI), Figs. 3–5]. We construct a protein-interaction-based network for each of these coronaviruses by identifying virus–host interactions as nodes interacting with each other by sharing a virus or a host protein (see Fig. 3). According to the information provided by Krogan’s lab group44,45, we use a high-confidence interactome to create host–coronaviruses networks for each of these viruses (it is important to emphasize that our analysis focuses on the architecture of the Interaction Network instead of the Proteins Network). Figure 3 shows that all networks display a hierarchical cluster structure; MERS-CoV and SARS-CoV-1 share a similar structure characterized by a largest cluster formed by two sub-clusters see Fig. 3. In contrast, the SARS-CoV-2 largest cluster is formed by four sub-clusters, which suggests that the evolution from MERS-CoV to SARS-CoV-2 is closely linked to the largest cluster topology.
By using the cellular localization analysis provided by Krogan’s lab group44,45, we apply a color-code [diffuse cytoplasm (red); endoplasmic reticulum (ER) (green), plasma membrane (PM); (blue); endosomes (cyan); Golgi (yellow); mitochondria (magenta); no information (black)] for the viral/protein localization, as shown in Fig. 3. In the MERS-CoV and SARS-CoV-1 network, the largest cluster appears mainly located in the diffuse cytoplasm. Meanwhile in SARS-CoV-2, the largest cluster spreads across the diffuse cytoplasm, endoplasmic reticulum, and plasma membrane. Therefore, our study reveals that the hallmark of the SARS-CoV-2 infection is a highly connected virus/protein–host/protein interaction network across the entire host cell, which may be the key to its efficient infection mechanism. These results indicate that the protein-interaction-based network exhibits a hierarchical cluster structure which is highly correlated with cellular localization. Viral/proteins interact with host/proteins while in direct physical contact. However, we find that virus–host interactions which apparently are biologically disconnected maybe linked indirectly throughout the virus/protein–host/protein interaction network. This maybe a very important feature of the SARS-CoV-2 hijacking of host cell regulation.
The virus/protein–host/protein interaction graph network is built on virus–host interactions mediating cellular functions and viral infection, which are connected if they share a common single virus/protein or host/protein. Figure 3 shows the MERS-CoV, SARS-CoV-1, and SARS-CoV-2 virus/protein–host/protein interaction network, where virus/protein–host/protein interactions are represented as square marks and the links between them indicate that two interactions share a common single virus/protein or host/protein. Affinity purification mass spectrometry and structural properties of protein–protein complexes allow the identification of these protein interactions. However, only a few of these interactions play a relevant biological role72,73,74. We compare several statistical methods to score the PPI and to create the virus/protein–host/protein interaction network (see Supplementary Information Figs. 7, 8).
To compare the infection mechanism of MERS-CoV, SARS-CoV-1, and SARS-CoV-2 we construct their virus/protein–host/protein interaction network. To quantify the effect of virus/proteins on the virus/protein–host/protein interaction, we investigate the variation of average connectivity by removing some of these elements. We use the change of the structural properties of the network as a proxy to simulate potential damage produced by antiviral activity.
Coronavirus–host protein network analysis
Figure 4A shows that the highly pathogenic coronaviruses MERS-CoV, SARS-CoV-1, and SARS-CoV-2 display an almost linear relationship between the number of PPI and the average connectivity per cluster. Since the average connectivity of these small clusters increases with the number of PPI there is a homogenous connectivity distribution rather highly connected virus–host interactions or hubs. However, the largest clusters display key virus–host interactions, regardless of the small number of connections they have, that when they are removed several virus–host subcluster are disconnected. For this reason, the largest cluster plays a more central role in the structure of the virus–host interaction network. The largest clusters have around 40–60 PPI for MERS-CoV and SARS-CoV-1, however, SARS-CoV-2 has twice as many, see Fig. 4A.
Due to the urgency of the ongoing worldwide health emergency caused by the Covid-19, we focus on the tolerance against protein removal of SARS-CoV-2; similar analysis for the MERS-CoV, and SARS-CoV-1 are addressed in the Supplementary Information (SI) Section, Figs. 3–5. Figure 3B shows that removal of most virus proteins does not affect the linear dependence of small clusters. However, removal of nsp4, nsp12, nsp13, and nsp16 proteins, reduce substantially the number of PPIs bringing the largest cluster closer to the linear dependence. The disruptive effect of antiviral drugs affects mostly the host/protein–virus/protein interaction rather than a single viral protein. Therefore, a more useful approach is to investigate the virus–host network tolerance against single virus–host interaction removal.
SARS-CoV-2 target protein interaction
The SARS-CoV-2 network retains its structural properties against protein removal except when nsp4, nsp12, nsp13, nsp16, are removed (see Fig. 4B and Supplementary Information (SI) section, Figs. 3–5). The comparison between MERS-CoV, SARS-CoV-1, and SARS-CoV-2 implies that common proteins are located in similar places and thus expected to perform the same functions. The exception is nsp13, which change its location from virus to virus thus expected to exhibit different functions and therefore is not included in our analysis44,45. Moreover, we found that the set of virus–host interactions causing a significant change in the virus–host network appears in a small cluster linking the nsp4, nsp12, and nsp16 proteins, which belong to different complexes (Fig. 5). This provides a target which produces significant changes in the virus–host network and thus is the optimal candidate for an antiviral drug attack.
Comparative scoring methods analysis
We use four protein–protein interaction scoring methods, high confidence, MIST, Saint, and K, as published on44,45. All these scoring methods reveal the importance of the SARS-CoV-2 largest cluster on the virus–human interaction. MIST and Saint scoring methods are based on experimental biological data ranking the confidence of the virus/protein–host–protein interactions between 0 and 1, where 1 indicates the maximum confidence value (we selected PPI with a confidence ≥ of 0.6). Combining the MIST and Saint scoring method with protein complexes information, Krogan’s lab group introduce a high confidence score44,45. Finally, the K score is defined as the average between the MIST and Saint score44,45. In this case, we also use a confidence score ≥ 0.6. We identify critical targets by removing a PPI that produce a significant variation on the SARS-CoV-2 largest cluster connectivity. As shown in Table 1, all the scoring methods identify the nsp4/NUP210 and nsp16/NUP210 interactions as critical targets. The nsp12/USP54 interaction is detected only by the High confidence scoring method since the nsp12 protein is not included in the MIST, Saint, and K dataset. This comparative analysis reveals that our method allows us to identify crucial PPI in good agreement with different scoring methods; thus, our approach is a robust predictive method (see Table 1).
Discussion
We developed a new, fast detection method of the key controlling virus/protein–host/protein interactions which identifies the SARS-CoV-2 infection mechanism. An analysis of the protein-interaction-based network provides a universal hierarchical cluster decomposition from three coronaviruses-host networks. This cluster decomposition identifies all virus–host interactions sharing a virus and host proteins and localizes them in different cellular compartments. We find that the SARS-CoV-2 largest cluster structure extends across the entire host cell, revealing an enhanced hijacking of the host cell regulation mechanism. The evolution of the largest clusters coincides with new coronavirus strains' suggesting that further mutational changes will occur within this structure. We verified the validity of our method by applying it to H1N1 and HIV viruses, in both cases we detect important target proteins which are used in antiviral drug development.
We simulate an inhibitory antiviral action on SARS-CoV-2 by removing a protein and virus/protein–host/protein interaction. This process shows that removal of the nonstructural protein nsp4, nsp12, nsp13, nsp16, produces a significant change in the structural properties of the virus–host network resembling the behavior of previous coronaviruses, MERS-CoV, and SARS-CoV-1. Furthermore, we find that this effect is supported exclusively by a small set of virus–host interactions linking nsp4, nsp12, and nsp16 proteins. Moreover, our protein-interaction-based network method does not depend on the scoring method for protein–protein interactions. Interestingly our cluster decomposition coincides with the previously reported clustering obtained from genetic analysis, of these three coronaviruses.
The SARS-CoV-2 infection spreads a viral interaction network across different intracellular compartments, which strongly suggests that this network's largest cluster encodes relevant mechanisms of viral infection. Our analysis highlights the significance of long-range proteins interactions resulting from emerging and collective behavior of the virus–host interactome, which may be inspired by the seeking of a new biological mechanism for viral infection. In a global health emergency, it is crucial to develop a fast viral mechanism characterization method. Our analysis based on the geometric properties and tolerance of a virus/protein–host/protein interaction network is a valuable tool for understanding the viral mechanism of new variant strains of SARS-CoV-2 and detecting target protein–protein interaction rapidly. Our insights may provide essential information for further antiviral drug development, uncover the role of nonstructural viral/proteins, and identify the importance of a small set of virus–host interactions. Furthermore, structural information of the virus/protein–host/protein interaction networks may help understand the spike protein function in the mutations of the SARS-CoV-2.
The conventional methods based on network analysis, map the virus–host interaction into a graph between the virus and human proteins where connections involve biological interactions. On the other hand, we use an interaction-based network to explore its structure and the impact caused by removing certain proteins. It is our hypothesis that this simulates the antiviral action and brings a new perspective regarding viral infection. We validate this hypothesis by applying it to the known effects of protein removal on the H1N1 and HIV viruses. This methodology applied to the SARS-CoV-2 infection, predict that removal NUP210 and USP54 host proteins, produce a major change in the connectivity of the SARS-CoV-2 interaction network. These are known, to play an important role in cancer therapy and autoimmune diseases75,76. Although, nonstructural proteins (nsp4, nsp12, nsp16) have not attracted attention as target proteins, our validated method predicts them to be important candidates for antiviral drug and vaccine development. According to available connectivity data the SARS Cov2 spike proteins are weekly connected to the whole network. Because of this removing them as proposed by our method, does not affect in a major way the network structure and connectivity. On the other hand, in the case of H1N1 where more connectivity data is available, we properly predict which proteins are targets77. The spike proteins, the glycoproteins HA, and neuraminidase proteins NA are located on the viral surface, and they are also the most variable viral proteins. Protein variability results in highly connected nodes because SARS-CoV-2 variants and mutations potentially change viral properties. The identification of spike protein is restricted by available information in this work. Notwithstanding the preceding, our method has been able to identify crucial target proteins for H1N1 and HIV viruses and revealed the role of non-structural proteins, as has been reported by other authors62,63,64,65,66,67,68.
Predicted targets proteins are validated by Live attenuated and Inactivated antigen vaccines methods. Similarly, standard antiviral therapy mainly uses inhibitors to prevent the spike and nucleocapsid proteins binding78,79. However, it has been reported that the non-structural proteins play a significant role in the virulence of the SARS-CoV-2 virus80 as our method predicts.
Molecular docking study of ivermectin, and remdesivir indicates spike, M, N, nsp14, and nsp16 as viable targets for drug treatment development of SARS-CoV-281,82, while Raltegravir and Maraviroc are potential candidates to inhibit nsp-16 protein83. Multi-targeting features of Diosmin display highest binding affinity and inhibitory action of several non-structural proteins (nsp3, nsp9, nsp12, nsp15)84. This confirmation of the role of non-structural proteins in treatment validates our target proteins predictions.
Finally, our approach of graphically visualizing the connectivity of interactome networks overlaps with the design and optimization of neural networks, and as such our method could pave a roadmap to combine neuromorphic computing and artificial intelligence to optimize the design of drug treatments of viral diseases.
Materials and methods
Dataset
We obtain the protein sequence similarity, high-confidence protein interaction, and localization of the virus protein from a public dataset and permission as reported by Krogan’s lab group44,45 (https://doi.org/10.1126/science.abe9403).
Code
All code used in the calculations of the network analysis is available from FT upon request. Supplementary Information contains all the raw data of the simulations.
Graph theory methods
A virus/protein–host/protein interaction network \(G(I,\;k)\) is a set of \(I_{1} ,\;I_{2} , \ldots ,\;I_{D}\) virus–host interactions with \(k_{1} ,\;k_{2} , \ldots ,\;k_{D}\) connections between them, as described in Fig. 1. Graph representation of this network consists of D-nodes where each node corresponds to a one virus–host interaction. If two interactions share a common virus/protein or a host/protein, then they are connected by a link. The number of connections of the i-th virus–host interaction \(I_{j}\) is \(k_{j}\). The graphs were constructed from binary matrices where entries correspond to all virus/protein–host/protein interactions. Undirected links indicate connected virus–host protein interactions. The adjacency matrix \(A_{ij}\), with \(i,\;j = 1,\;2, \ldots ,\;D\), encodes all these interactions, with \(A_{ij} = 1\) indicating that the i-th and j-th interaction are connected, and \(A_{ij} = 0\) otherwise, Fig. 1. Notice that the connection of the i-th interaction \(k_{i} = \sum\nolimits_{j = 1}^{D} {A_{ij} }\).
According to cluster decomposition, the adjacency matrix of full connected interactions can be expressed as sum of N cluster adjacency matrices
where \(a(L)_{ij}\) stands the adjacency matrix of the L-th cluster. To quantify the network tolerance against local failures due to the removal of a virus protein or single virus–host interaction, we introduce the average connectivity as a proxy. This way, the cluster average connectivity is
Here \(n_{L}\) is the number of PPIs of the L-th cluster. Notice that the removal of a virus protein implies that all its interactions are deleted. In this case we have a network composed by D-1 interactions with D-1 connections.
Network’s tolerance against the antiviral activity
We simulate the inhibitory action of antiviral drugs utilizing node removal. The features of the cluster decomposition, in particular, the properties of its larger cluster, change as virus proteins or virus–host interactions are deleted. We use the cluster average connectivity variation as a proxy to quantify the impact of virus proteins in the virus–host interaction networks, as detailed in the Supplementary Information (SI), Figs. 3–8.
References
González, M. E. The HIV-1 Vpr protein: A multifaceted target for therapeutic intervention. Int. J. Mol. Sci. 18(1), 126 (2017).
Ellebedy, A. H. & Webby, R. J. Influenza vaccines. Vaccine 27(Suppl 4), D65–D68 (2009).
del Valle, J., Ramírez, J. G., Rozenberg, M. J. & Schuller, I. K. Challenges in materials and devices for resistive-switching-based neuromorphic computing. J. Appl. Phys. 124, 211101 (2018).
Neumann, F. & Carsten, W. Bioinspired Computation in Combinatorial Optimization 1619–7127 (Springer, 2010).
Gordon, D. E. et al. A SARS-CoV-2 protein interaction map reveals targets for drug repurposing. Nature 583, 459–468 (2020).
Zhu, N. et al. A novel coronavirus from patients with pneumonia in China, 2019. N. Engl. J. Med. 382, 727–733 (2020).
Su, S. et al. Epidemiology, genetic recombination, and pathogenesis of coronaviruses. Trends Microbiol. 24, 490–502 (2016).
Liu, J. et al. A comparative overview of COVID-19, MERS and SARS: Review article. Int. J. Surg. 81, 1–8 (2020).
Krafcikova, P. et al. Structural analysis of the SARS-CoV-2 methyltransferase complex involved in RNA cap creation bound to sinefungin. Nat. Commun. 11, 3717 (2020).
Wu, A. et al. Genome composition and divergence of the novel coronavirus (2019-nCoV) originating in China. Cell Host Microbe 27, 325–328 (2020).
Snijder, E. J., Decroly, E. & Ziebuhr, J. The nonstructural proteins directing coronavirus RNA synthesis and processing. Adv. Virus Res. 96, 59–126 (2016).
Naqvi, A. et al. Insights into SARS-CoV-2 genome, structure, evolution, pathogenesis and therapies: Structural genomics approach. Biochim. Biophys. Acta Mol. Basis Dis. 1866(10), 165878 (2020).
Fani, M. et al. Comparison of the COVID-2019 (SARS-CoV-2) pathogenesis with SARS-CoV and MERS-CoV infections. Future Virol. https://doi.org/10.2217/fvl-2020-0050 (2020).
Chen, Y., Liu, Q. & Guo, D. Emerging coronaviruses: Genome structure, replication and pathogenesis. J. Med. Virol. 92(4), 418–423 (2020).
Hoffmann, M. et al. SARS-CoV-2 cell entry depends on ACE2 and TMPRSS2 and is blocked by a clinically proven protease inhibitor. Cell 181(2), 271-280.e8 (2020).
Huang, Y. et al. Structural and functional properties of SARS-CoV-2 spike protein: Potential antivirus drug development for COVID-19. Acta Pharmacol. Sin. 41, 1141–1149 (2020).
Voss, D. et al. Studies on membrane topology, N-glycosylation and functionality of SARS-CoV membrane protein. Virol. J. 6, 79. https://doi.org/10.1186/1743-422X-6-79 (2009).
Schoeman, D. & Fielding, B. C. Coronavirus envelope protein: Current knowledge. Virol. J. 16(1), 69 (2019).
Mu, J. et al. SARS-CoV-2-encoded nucleocapsid protein acts as a viral suppressor of RNA interference in cells. Sci. China Life Sci. 63(9), 1–4 (2020).
Jauregui, A. R., Savalia, D., Lowry, V. K., Farrell, C. M. & Wathelet, M. G. Identification of residues of SARS-CoV nsp1 that differentially affect inhibition of gene expression and antiviral signaling. PLoS One 8(4), e62416 (2013).
Almeida, M. S., Johnson, M. A., Herrmann, T., Geralt, M. & Wuthrich, K. Novel beta-barrel fold in the nuclear magnetic resonance structure of the replicase nonstructural protein 1 from the severe acute respiratory syndrome coronavirus. J. Virol. 81, 3151–3161 (2007).
Cornillez-Ty, C. T., Liao, L., Yates, J. R. 3rd., Kuhn, P. & Buchmeier, M. J. Severe acute respiratory syndrome coronavirus nonstructural protein 2 interacts with a host protein complex involved in mitochondrial biogenesis and intracellular signaling. J. Virol. 83(19), 10314–10318 (2009).
Harcourt, B. H. et al. Identification of severe acute respiratory syndrome coronavirus replicase products and characterization of papain-like protease activity. J. Virol. 78, 13600–13612 (2004).
Báez-Santos, Y. M., St John, S. E. & Mesecar, A. D. The SARS-coronavirus papain-like protease: Structure, function and inhibition by designed antiviral compounds. Antiviral Res. 115, 21–38 (2015).
Báez-Santos, Y. M., Mielech, A. M., Deng, X., Baker, S. & Mesecar, A. D. Catalytic function and substrate specificity of the papain-like protease domain of nsp3 from the Middle East respiratory syndrome coronavirus. J. Virol. 88(21), 12511–12527 (2014).
Neuvonen, M. & Ahola, T. Differential activities of cellular and viral macro domain proteins in binding of ADP-ribose metabolites. J. Mol. Biol. 385, 212–225 (2009).
Snijder, E. J. et al. Unique and conserved features of genome and proteome of SARS-coronavirus, an early split-off from the coronavirus group 2 lineage. J. Mol. Biol. 331, 991–1004 (2003).
Wu, C. et al. Analysis of therapeutic targets for SARS-CoV-2 and discovery of potential drugs by computational methods. Acta Pharm. Sin. B 10(5), 766–788 (2020).
Anand, K. et al. Structure of coronavirus main proteinase reveals combination of a chymotrypsin fold with an extra alpha-helical domain. EMBO J. 21, 3213–3224 (2002).
Manolaridis, I. et al. Structure of the C-terminal domain of nsp4 from feline coronavirus. Acta Crystallogr. D Biol. Crystallogr. 65, 839–846 (2009).
Sakai, Y. et al. Two-amino acids change in the nsp4 of SARS coronavirus abolishes viral replication. Virology 510, 165–174 (2017).
Joseph, J. S. et al. Crystal structure of nonstructural protein 10 from the severe acute respiratory syndrome coronavirus reveals a novel fold with two zinc-binding motifs. J. Virol. 80, 7894–7901 (2006).
Zanotto, P. M., Gibbs, M. J., Gould, E. A. & Holmes, E. C. A reevaluation of the higher taxonomy of viruses based on RNA polymerases. J. Virol. 70, 6083–6096 (1996).
Angelini, M. M., Akhlaghpour, M., Neuman, B. W. & Buchmeier, M. J. Severe acute respiratory syndrome coronavirus nonstructural proteins 3, 4, and 6 induce double-membrane vesicles. MBio 4(4), e00524-13 (2013).
Cottam, E. M., Whelband, M. C. & Wileman, T. Coronavirus NSP6 restricts autophagosome expansion. Autophagy 10(8), 1426–1441 (2014).
Kirchdoerfer, R. N. & Ward, A. B. Structure of the SARS-CoV nsp12 polymerase bound to nsp7 and nsp8 co-factors. Nat. Commun. 10(1), 2342 (2019).
Peti, W. et al. Structural genomics of the severe acute respiratory syndrome coronavirus: Nuclear magnetic resonance structure of the protein nsP7. J. Virol. 79, 12905–12913 (2005).
Zhai, Y. et al. Insights into SARS-CoV transcription and replication from the structure of the nsp7–nsp8 hexadecamer. Nat. Struct. Mol. Biol. 12, 980–986 (2005).
Jang, K. J. et al. A high ATP concentration enhances the cooperative translocation of the SARS coronavirus helicase nsP13 in the unwinding of duplex RNA. Sci. Rep. 10(1), 4481 (2020).
Nishikiori, M. et al. Crystal structure of the superfamily 1 helicase from tomato mosaic virus. J. Virol. 86, 7565–7576 (2012).
van Dinten, L. C., van Tol, H., Gorbalenya, A. E. & Snijder, E. J. The predicted metal-binding region of the arterivirus helicase protein is involved in subgenomic mRNA synthesis, genome replication, and virion biogenesis. J. Virol. 74, 5213–5223 (2000).
Deng, X. et al. Coronavirus nonstructural protein 15 mediates evasion of dsRNA sensors and limits apoptosis in macrophages. Proc. Natl. Acad. Sci. U.S.A. 114(21), E4251–E4260 (2017).
Yang, S., Fu, C., Lian, X., Dong, X. & Zhang, Z. Understanding human-virus protein–protein interactions using a human protein complex-based analysis framework. mSystems 4(2), e00303-18 (2019).
Gordon, D. et al. Comparative host–coronavirus protein interaction networks reveal pan-viral disease mechanisms. Science 370, eabe9403 (2020).
Gordon, D. E. et al. A SARS-CoV-2 protein interaction map reveals targets for drug repurposing. Nature 583, 459–468 (2020).
Guzzi, P. H., Mercatelli, D., Ceraolo, C. & Giorgi, F. M. Master regulator analysis of the SARS-CoV-2/human interactome. J. Clin. Med. 9(4), 982 (2020).
Ceraolo, C. & Giorgi, F. M. Genomic variance of the 2019-nCoV coronavirus. J. Med. Virol. 92, 522–528 (2020).
Woo, P. C. Y., Huang, Y., Lau, S. K. P. & Yuen, K.-Y. Coronavirus genomics and bioinformatics analysis. Viruses 2, 1804–1820 (2010).
Srinivasan, S. et al. Structural genomics of SARS-CoV-2 indicates evolutionary conserved functional regions of viral proteins. Viruses 12(4), 360 (2020).
Brito, A. F. & Pinney, J. W. Protein–protein interactions in virus–host systems. Front. Microbiol. 8, 1557 (2017).
Franzosa, E. A., Garamszegi, S. & Xia, Y. Toward a three-dimensional view of protein networks between species. Front. Microbiol. 3, 428 (2012).
Franzosa, E. A. & Xia, Y. Structural principles within the human–virus protein–protein interaction network. Proc. Natl. Acad. Sci. U.S.A. 108, 10538–10543 (2011).
Sánchez-García, R. J. Exploiting symmetry in network analysis. Commun. Phys. 3, 87 (2020).
Tononi, G., Sporns, O. & Edelman, G. M. Measures of degeneracy and redundancy in biological networks. Proc. Natl. Acad. Sci. Neurobiol. 96(6), 3257–3262 (1999).
Daugherty, M. D. & Malik, H. S. Rules of engagement: Molecular insights from host–virus arms races. Annu. Rev. Genet. 46, 677–700 (2012).
Gursoy, A., Keskin, O. & Nussinov, R. Topological properties of protein interaction networks from a structural perspective. Biochem. Soc. Trans. 36, 1398–1403 (2008).
Han, J. D. et al. Evidence for dynamically organized modularity in the yeast protein protein interaction network. Nature 430, 88–93 (2004).
Tripathi, S. et al. Meta- and orthogonal integration of influenza “OMICs” data defines a role for UBR4 in virus budding. Cell Host Microbe 18, 723–735 (2015).
Albert, R., Jeong, H. & Barabási, A.-L. Diameter of the World-Wide Web. Nature 401, 130–131 (1999).
Lawrence, S. & Giles, C. L. Accessibility of information on the web. Nature 400, 107 (1999).
Orsini, C. et al. Quantifying randomness in real networks. Nat. Commun. 6, 8627 (2015).
Dehmamy, N., Milanlouei, S. & Barabási, A. L. A structural transition in physical networks. Nature 563, 676–680 (2018).
Zhu, W., Wang, C. & Wang, B. Z. From variation of influenza viral proteins to vaccine development. Int. J. Mol. Sci. 18(7), 1554 (2017).
Wong, S. S. & Webby, R. J. Traditional and new influenza vaccines. Clin. Microbiol. Rev. 3, 476–492 (2013).
Murphy, B. R., Park, E. J., Gottlieb, P. & Subbarao, K. An influenza A live attenuated reassortant virus possessing three temperature-sensitive mutations in the PB2 polymerase gene rapidly loses temperature sensitivity following replication in hamsters. Vaccine 15(12–13), 13728 (1997).
Parkin, N. T., Chiu, P. & Coelingh, K. Genetically engineered live attenuated influenza A virus vaccine candidates. J Virol 71(4), 2772–2778 (1997).
Talon, J. et al. Influenza A and B viruses expressing altered NS1 proteins: A vaccine approach. Proc. Natl. Acad. Sci. 97(8), 4309–4314 (2000).
Watanabe, T., Watanabe, S., Ito, H., Kida, H. & Kawaoka, Y. Influenza A virus can undergo multiple cycles of replication without M2 ion channel activity. J. Virol. 75(12), 5656–5662 (2001).
Ogawa, K. et al. Mutational analysis of the human immunodeficiency virus Vpr open reading frame. J. Virol. 63, 4110–4114 (1989).
Nakaya, H. I. Hidden in plain sight: Uncovering the role of CREB1 in HIV-1 vaccine-induced immunity. Nat. Immunol. 22, 1199–1200 (2021).
Wolfe, L. S. et al. Development of a platform-based approach for the clinical production of HIV gp120 envelope glycoprotein vaccine candidates. Vaccine 39(29), 3852–3861 (2021).
Gromiha, M. M., Yugandhar, K. & Jemimah, S. Protein–protein interactions: Scoring schemes and binding affinity. Curr. Opin. Struct. Biol. 44, 31–38 (2017).
Teo, G. et al. SAINTq: Scoring protein–protein interactions in affinity purification-mass spectrometry experiments with fragment or peptide intensity data. Proteomics 16, 2238–2245 (2016).
Verschueren, E. et al. Scoring large-scale affinity purification mass spectrometry datasets with MiST. Curr. Protoc. Bioinform. 49, 8.19.1-8.19.16 (2015).
Miguela, V. & Lujambio, A. Uncovering the role of USP54 in cancer. Oncotarget 8(7), 10765–10766 (2017).
Nofrini, V. D., Giacomo, D. & Mecucci, C. Nucleoporin genes in human diseases. Eur. J. Hum. Genet. 24, 1388–1395 (2016).
Devenishm, L. P., Mhlanga, M. M. & Negishi, Y. Immune regulation in time, and space: The role of local- and long-range interactions in regulating immune responses. Front. Immunol. 12, 1715 (2021).
McKee, D. L., Sternberg, A., Stange, U., Laufer, S. & Naujokat, C. Candidate drugs against SARS-CoV-2 and COVID-19. Pharmacol. Res. 157, 104859 (2020).
Uqiao, Hu. et al. The study of antiviral drugs targeting SARS-CoV-2 nucleocapsid and spike proteins through large-scale compound repurposing. Heliyon 7(3), e06387 (2021).
Biswas, N. et al. Structural and drug screening analysis of the non-structural proteins of severe acute respiratory syndrome coronavirus 2 virus extracted from Indian coronavirus disease 2019 patients. Front. Genet. 12, 626642 (2021).
Rutwick Surya, U. & Praveen, N. A molecular docking study of SARS-CoV-2 main protease against phytochemicals of Boerhavia diffusa Linn. for novel COVID-19 drug discovery. Virus Dis. 32(1), 46–54 (2021).
Eweas, A. F., Alhossary, A. A. & Abdel-Moneim, A. S. Molecular docking reveals ivermectin and remdesivir as potential repurposed drugs against SARS-CoV-2. Front. Microbiol. 11, 592908 (2021).
Tazikeh-Lemeski, E. et al. Targeting SARS-COV-2 non-structural protein 16: A virtual drug repurposing study. J. Biomol. Struct. Dyn. 39(13), 4633–4646 (2021).
Kumar, S., Sharma, P. P., Upadhyay, C., Kempaiah, P. & Rathi, B. Multi-targeting approach for nsp3, nsp9, nsp12 and nsp15 proteins of SARS-CoV-2 by Diosmin as illustrated by molecular docking and molecular dynamics simulation methodologies. Methods 195, 44–56 (2021).
Acknowledgements
We thank Dr. Mehdi Bouhaddou and Prof. Nevan Krogan from UCSF for access to their MERS, SARS-CoV-1, and SARS-CoV-2 protein maps, and Dr. Alex Fraño from UCSD for useful discussions on Neuromorphic computing and the connectivity of the neurons. The collaborative UCSD-UCH research is a result of numerous discussions regarding interconnectivity in complex systems such as in Neuromorphic Computing. F. Torres and M. Kiwi acknowledge financial support from Grant Nos. FA9550-161-0122, FA9550-18-1-0438, Fondo Nacional de Investigaciones Científicas y Tecnológicas (FONDECYT, Chile) under grants 1211902, and CEDENNA, through Financiamiento Basal para Centros Científicos y Tecnológicos de Excelencia AFB180001. IKS holds the Center for Memory and Recording Research Chair in the UCSD Center for Memory and Recording Research Center (CMRR).
Author information
Authors and Affiliations
Contributions
Inspiration for these studies came from multiple discussions regarding network science in a variety of systems, most notably in the context of the connectivity of neurons in animal brains and its implementation in artificial neuromorphic computing. All authors participated in discussion of the results and in writing multiple iterations of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Torres, F., Kiwi, M. & Schuller, I.K. The impact of the suppression of highly connected protein interactions on the corona virus infection. Sci Rep 12, 9188 (2022). https://doi.org/10.1038/s41598-022-13373-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-022-13373-0
This article is cited by
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.