Abstract
COVID-19, caused by SARS-CoV-2, is a primarily pulmonary disease that can affect several organs, directly or indirectly. To date, there are many questions about the different pathological mechanisms. Here, we generate an approach to identify the cellular-level tropism of SARS-CoV-2 using human proteomics, virus-host interactions, and enrichment analysis. Through a network-based approach, the molecular context was visualized and analyzed. This procedure was also performed for SARS-CoV-1. We obtained proteomes and interactomes from 145 different cells corresponding to 57 different tissues. We discarded the cells without the proteins known for interacting with the virus, such as ACE2 or TMPRSS2. Of the remaining cells, a gradient of susceptibility to infection was observed. In addition, we identified proteins associated with the coagulation cascade that can be directly or indirectly affected by viral proteins. As a whole we identified 55 cells that could be potentially controlled by the virus, with different susceptibilities, mainly being pneumocytes, heart, kidney, liver, or small intestine cells. These results help to explain the molecular context and provide elements for possible treatments in the current situation. This strategy may be useful for other viruses, especially those with limited reported PPI, such as a new virus.
Similar content being viewed by others
Introduction
Understanding the molecular interactions of SARS-CoV-2 with its host (cell) is necessary to identify the various impacts of the infection at the organismal level. Sequestration of a host protein by a SARS-CoV-2 protein, either to inhibit or to promote its function, can modify a given molecular process. Numerous protein–protein interactions (PPIs) have been reported by Gordon DE et al.1, Li et al.2, and Stukalov et al.3. They represent an overview of the PPI networks and molecular processes under the context of the cell line from which those authors obtained results (HEK293T, embryonic kidney and A549 human lung (carcinoma)), as well as some extrapolations. Using data from these investigators, simulated data, associations with interactomes of other viruses or experimental data, several groups have generated general molecular potential interactions between viral proteins and human proteins. Examples are Sarbecovirus-human protein–protein interaction network4; network-based model for defining the molecular aspects of pathogenic phenotypes in HCoV infections5; overview of coronavirus-host interactions6; cellular infection profile7; uncover structural and functional modules8 or an organotropism beyond the respiratory tract9.
Just because a virus can enter a cell does not mean that there is an enabling environment for its replication; a coordinated cellular sequestration is necessary. We analyzed how 29 SARS-CoV-2 proteins interact with human proteins under their cellular context. We identified possible alterations of protein–protein interactions (PPI) by the infection within each cell, by searching for cells that have input mechanisms, already reported10,11,12,13,14,15,16,17,18 and for proteomes and interactomes within each cell associated with viral proteins. This resulted in identifying a group of cells with a certain degree of susceptibility to be controlled by SARS-CoV-2 (see Fig. 1). At the same time, we compared our findings related to SARS-CoV-2 with those we investigated for SARS-CoV-1.
Results
Input tropism
The first step to identify cell tropism was to search for cells with potential entry-related proteins. We consulted two databases to cross-check the proteome and transcriptome data. First, from the RNA Consensus Tissue Gene Data we looked for those with reported expression of genes involved in cell entry: protein S-binding proteins (ACE2, NRP1, or BSG) and proteases (TMPRSS2, CTSL, CTSB, or FURIN). We selected 57 (out 62) tissues because they met the criterion of expressing at least 3 of the 7 genes we were looking for. Twenty threes (out 69) tissues (such as the olfactory region) had no information on cell types (Fig. 1b). Second, from the Normal Tissue Data, we found 45 tissues with at least 3 of the seven proteins of interest. Six cell types including those at bronchus, nasopharynx and oral mucosa did not have a transcriptome report at the tissue level (see Fig. 1b). Both data sets shared 39 tissues, including 69 different cell types (see Figs. 1b and 2a). For SARS-CoV-1 we considered the presence of four proteins (ACE2, BSG, TMPRSS2 and CTSL) to identify relevant tissues.
Of note, in 16 out of 69 (23%) cell types from 10 of the 39 tissues identified in the analysis for SARS-CoV-2, we didn’t find reports of ACE2 or TMPRSS2 protein, nor of their mRNA: adrenal gland (glandular cells), bone marrow (hematopoietic cells), cerebellum (granular layer cells, molecular layer cells and Purkinje cells), cerebral cortex (endothelial cells, glial cells, neuronal cells and neuropil), endometrium (glandular cells 1 and 2), germinal and non-germinal center cells of lymph nodes, parathyroid gland (glandular cells), skeletal muscle (myocytes), smooth muscle (smooth muscle cells) and spleen (red pulp cells), see Fig. 2b. Notably, bone marrow do not express ACE2 or TMPRSS2, and several tissues have values below the threshold (NX < 1) of ACE2, such as lung (0.8). Nonetheless, no cell type was removed from the analysis, since they expressed at least three of the necessary proteins to enter the cell. In contrast, for SARS-CoV-1, we removed 14 cell types with neither ACE2 nor TMPRSS2 (nor in mRNA); BSG and CTSL only had expression data from mRNA (see Fig. 2c). These results show that ACE2 and TMPRSS2 are the least common in the 69 cell types of the seven input proteins.
PPI tropism
To investigate differences in the PPI networks that may occur in infected cells, we compared the size of the interactomes inferred in each cell type. The cells showing the highest number of interactions with SARS-CoV-2 proteins were located in the kidney (tubule cells and glomerulus cells), small intestine (glandular cells), and lung (pneumocytes), with a maximum of 970 proteins out of the 1345 reported. The least connected cells were located in the bone marrow (hematopoietic cells), colon (peripheral nerve/ganglion), placenta (decidual cells), and ovary (follicular cells), see Supplementary Table S1. For SARS-CoV-1, the cells with the fewer interactions were the same as in SARS-CoV-2, and the cells with the most interactions were lung (pneumocytes) and glandular cells (thyroid gland, duodenum, salivary gland, small intestine, gall bladder, colon, and pancreas), with 114 proteins (out of 151) interacting with the 26 viral proteins (see Supplementary Table S2).
When clustering the cell types by their expression value, several proteins were highly expressed in a group of cells, see Supplementary Fig. S1. Comparing the unsupervised hierarchical clustering (Supplementary Fig. S2), we found that mainly these cell types were glandular. A similar pattern was observed for SARS-CoV-1 (Supplementary Fig. S1c).
This finding of a higher expression value in glandular cells than in pneumocytes, the main target cells, brought out the hypothesis that the susceptibility of a cell could depend not only on the expression profile but also on their connectivity to the viral proteins. Thus, we characterized the internal dynamics of the cells. We identified their connectivity according to their PPI, by assessing the Average Degree of each network (which ranges from 4.561 to 7.391). We found that the most connected cellular networks were indeed pneumocytes, and also glandular cells (endometrium 1), squamous epithelial cells (esophagus), non-germinal center cells (lymph node and tonsil tissues), and cells in the red pulp (spleen), see Supplementary Table S1. Networks visualized at the V-H PPI level highlighted emblematic cells, for example, the lung (pneumocytes and macrophages) as the most affected tissue (Fig. 3), and small intestine (glandular cells) as the tissue with the highest expression of ACE2 (Fig. S3a). Considering the expression in each cell and the total connectivity of each protein, our findings indicate that the human proteins with the most interconnections to viral proteins (V-H) were ATP1A1, ATP2A2, HACD3, XPO1 and ATP5F1B. The most linked proteins SARS-CoV-2 to human were M, nsp7b (orf7b) and orf3a, while the most connected within human cells (H–H) were LARP7 and GOLGA2. See in Supplementary Table S3 the connectivity of all proteins. Figure 3e,f and Fig S3c, show the networks for SARS-CoV-1, where UBC was the protein with the most connections, with a similar expression pattern as for SARS-CoV-2. See in Supplementary Table S4 the connectivity of each protein in general (not limited to a cell type) in SARS-CoV-1.
We generated a gene set enrichment analysis (GSEA) for each proteome of each cell type to identify specific processes potentially altered by PPI. Unsupervised clustering (binary distance and simple linkage method) allowed the identification of a “gradient” of cells, where molecular processes were altered from lower to higher degree. Figure 4 shows the KEGG (Kyoto Encyclopedia of Genes and Genomes) signaling pathways in SARS-CoV-2 and SARS-CoV-1. In both heatmaps, we identified an effect of proteome size (Supplementary Table S1 y S2), thus 14 cell types with a small proteome were removed, reducing to 55 the initial 69 cell types. For SARS-CoV-1, lung macrophages and tonsil’s squamous epithelial cells were also identified as not significantly altering the molecular environment and were therefore also removed for subsequent analyses. Hence, for SARS-CoV-2, 55 cell types remained and for SARS-CoV-1, 45. Supplementary Figure S4 shows the behavior of the remaining SARS-CoV-2 cells in terms of molecular functions and biological processes. With these results, we identified that salivary gland (glandular cells), esophagus (squamous epithelial cells), testis (Leydig cells), lung (pneumocytes) and small intestine (glandular cells) behave as a group (identifiable in molecular functions, dark blue branch, where they share a group of unaltered functions). Supplementary Figure S5 shows the effect of this reduction on SARS-CoV-1.
Supplementary Figure S6 shows the 50 most discriminating results, and the profiles of the gradients shown in the previous results can be identified. For SARS-CoV-2, one signaling pathway that appears relevant is "ribosome biogenesis in eukaryotes" (Supplementary Fig. S6b). Given its relevance, we generated subnetworks of ribosomes and their biogenesis. The ribosome network showed high protein expression value in the small intestine (glandular cells); moderately-high in the lung (macrophages); moderate in the kidney (tubule cells), and moderately-low in the lung (pneumocytes) (see Fig. 5a,b, and Supplementary Fig. S7a and b). We identified that pneumocytes and glandular cells of the small intestine behave similarly in some Expression Values, as is the case of XPO1, which extensively connects with SARS-CoV2 proteins (see Fig. 5c,d, and Supplementary Fig. S7c and d). We found that XPO1, CSNK2B, XRN2, DKC1, NOP56, NVL, RBM28, NAT10, TBL3, and MPHOSPH10 interact directly with virus proteins; and other 20 proteins do so indirectly (Fig. 6). These results suggest that SARS-CoV-2 hijacks and appears to promote ribosome synthesis, which would potentially enhance their replication capability.
In addition, we found that coagulation cascade proteins such as FGA, FGB, PROS1, PLAU, PLAT, PROCR, and CPB2 may also interact directly with virus proteins (Figs. 4a, 6b).
Finally, from the centroids of 55 SARS-CoV-2 cell types (45 for SARS-CoV-1), we identified 8 clusters separated into 4 quadrants (Fig. 7). According to the distribution of both the Expression value and the PPI, we identified the importance of associating the potential interactions with the level of protein expression. Most of the cells of the most cited tissues in the literature, such as pneumocytes, kidney (tubule cells and glomerulus cells), testis (seminiferous duct cells), heart muscle (myocytes) and liver, are found in quadrants 1 and 4 (high PPI) (see Supplementary Tables S1 and S2).
Discussion
In this work, we analyzed human PPI and proteome, GSEA information, and reported virus-host protein interactions to identify cells and tissues with the putative necessary environment to sustain infection by SARS-CoV-2. This analysis identified a series of tissues and cell types that could potentially be a target for viral infection. Several studies have been carried out to determine the breadth of SARS-CoV-2 cell tropism outside the respiratory tract (e.g., lungs and trachea). Still, many have evaluated the permissiveness of various cell lines, which is hard to interpret. Autopsy analyses have shown the presence of SARS-CoV-2 antigens, by immunodetection, in the trachea, small intestine, kidneys, pancreas, and vascular tissues of the brain and heart20,21. Our work identified possible targets for infection, in addition to cells of the respiratory tract, the small intestine and pancreas. Other studies have pointed to the potential infection of these tissues. Interestingly, Stillfried et al. using pathological specimens, have identified SARS-CoV-2 by RT-PCR in different tissues, including small intestine22.
It was initially difficult for us to identify a set of cells characteristically susceptible to SARS-CoV-2 infection. The amount of ACE2 in the tissues has not shown a correlation with infection or with TMPRSS2 in any of the reported organs15,16,18. Therefore, we decided to explore all 7 proteins already known to participate in viral entry, such as NRP110, that could explain the tropism to different organs. This strategy did not show a better correlation with the organs known to be affected by SARS-CoV-2. Thus, the next step was to explore the intracellular molecular context, to determine the interactome and signaling pathways.
A high expression profile was supposed to be linked to a greater susceptibility to infection, but as the overexpression is mainly linked to glandular cells, which was already reported23,24, other factors should be considered. It is plausible that a higher concentration of cellular proteins increases the possibility of an interaction with viral proteins, since competition for protein interaction could play an important role. This possibility should be explored but to date there are no reports on the susceptibility to infection of glandular cells, although hormonal disarranges have been found. Our results indicate a gradient of molecular interactions, and therefore a possible explanation of susceptibility to infection. An hypothesis of a gradient of susceptibility to infection is supported by the availability of entry proteins, the concordance of connectivity, and previous autopsy and clinical reports.
Several of the cells herein identified having a high potential connectivity with viral proteins were expected given the clinical and pathological observations. For example, part of the oral system25, digestive tract, or pneumocytes. Others like cerebral cortex, skin, appendix (glandular cells), endometrium, and tonsils, were not expected and should be considered for further investigation. Furthermore, the amplitude in tropism could contribute to the diversity of symptoms, severity26 or long27,28 COVID. It is also possible that in some cases, by having a mild capacity to infect certain cells (due to a low PPI connectivity), no symptoms occur; however, due to overactivation of the immune system and cell lysis, several autoantigens may be exposed and facilitate the progress from a mild or moderate disease to a long COVID-1929.
A relevant finding that merits further investigation in both the acute disease and the long COVID syndrome is the possible deregulation of ribosome biogenesis, due to its critical role in cell function and dysfunction in diseases such as cancer30. In addition to intracellular molecular alterations, such as ribosome biogenesis, already reported in SARS-CoV-1 by PPI31, we show a set of coagulation cascade proteins interacting directly and indirectly with SARS-CoV-2 proteins, which could promote their dysregulation. These findings, together with pro-thrombotic autoantibodies26 and other potential causes32, may account for explaining a coagulation-promoting environment affecting different organs33.
A relevant result for SARS-CoV-1 was that lung macrophages could be identified as cells in which the molecular environment may not be significantly altered, and therefore, cell sequestration would be inefficient. This agrees with previous research34,35, where it was identified that the viral particle could enter that cell but not replicate, which may also be occurring with several SARS-CoV-2 infected cells, such as tonsil’s squamous epithelial cells.
For SARS-CoV-1 we identified that pneumocytes were almost isolated from the rest of the cells (Fig. 7b). Only a few more cells have the same connectivity: endometrial glandular cells, esophagus squamous epithelial cells, and tonsil’s non-germinal center cells. In contrast, for SARS-CoV-2 there are a significant number of cells closely connected to pneumocytes (Fig. 7a), which may reflect the multi-tropism that it has shown. Another point of contrast is the connectivity shown by both virus types, higher in SARS-CoV-1; although this may be due to properties of the networks, it may also reflect a greater capacity of SARS-CoV-1 to take control of the target cells, which would possibly account for its greater lethality.
This approach sheds light on several SARS-CoV-2 target cells and could be very useful in studying the new variants of interest. For example, some mutations in the spike protein (S) confer the virus a better binding to ACE236, without modifying its tropism. However, in the case of Omicron, its tropism for pneumocytes seems to have been altered. Further research is warranted to assess how the PPI have been modified.
In our work we did not consider using directed networks, given their still limited coverage. Although it would give us valuable information about “indispensable” nodes37 directly associated with the virus, the loss of information would limit the results, especially in limited virus-host PPIs. With our strategy we obtained the highly connected nodes (hubs) (supplementary tables S3 and S4).
In conclusion, we identified the connectivity gradient of a set of 55 cell types whose molecular environment may be altered by SARS-CoV-2. This approach allowed us to detect their potential susceptibilty to be controlled by the virus. As expected, pneumocytes were amongst them and, noteworthy, also glandular cells. We also identified several proteins of the coagulation cascade that may be directly and indirectly affected by viral proteins, which could help explain the coagulation disorders associated with this infection. Other possible targets of infection we identified were cells from the small intestine, colon, stomach, and pancreas; heart, kidney, salivary glands, appendix, and tonsils; testicles, prostate, and endometrium among others, not necessarily expected. We have also identified a profile of molecular alterations in the form of a gradient of infection. This model is still perfectible, but it represents an approach to determine the cell tropism and cell-affecting mechanisms of SARS-CoV-2 and other viruses, such as SARS-CoV-1. We expect this model to contribute to the fight against current and future pandemics by identifying both virus and host proteins essential for infection, regardless limited PPI virus-host information.
Methods
PPIs
The PPIs between SARS-CoV-2 and human proteins were collected from IMEX38, and those of SARS-CoV-1, from Biogrid39. Because the present strategy addresses both limited and numerous cases of virus-host PPI, the confidence value was not considered. On the other hand, the human interactome was generated from the compilation of information from several primary databases39,40,41,42,43,44,45. Only PPIs that were reported in at least two studies were considered.
Proteomes and transcriptomes
From the Normal tissue and RNA Consensus Tissue Gene databases, deposited in the Protein Atlas (http://www.proteinatlas.org)46, proteomes corresponding to each cell of each tissue and transcriptomes of each tissue were generated. Only those tissue genes/proteins with transcriptomic and proteomic evidence were considered (see Fig. 1). Proteins with an uncertain reliability score and genes with normalized expression (NX) values less than one were eliminated. For normal tissue data (cells), values of 1, 2, 3 and 4 were assigned (provided there was a value ≥ 1 in RNA consensus tissue gene data), for Not detected, Low, Medium, and High, respectively for expression value (level). For RNA consensus tissue gene data, the assigned NX values were used. The expression value and NX values of each cell were plotted by unsupervised hierarchical clustering in R47 using the gplots library48.
Entrance tropism
To identify which cells and tissues have the components to allow viral entry, we assessed the presence of NRP110, ACE211,12,13,14,15,16,17,18, BSG11, TMPRSS211,12,13,14, CTSL11,13, CTSB11,13 and FURIN11 proteins, or evidence of their gene expression. Those with a combination of protein S-binding proteins (ACE2, NRP1 or BSG) and proteases necessary for protein S cleavage and membrane fusion (TMPRSS2, CTSL, CTSB or FURIN) were selected. We considered only those tissues with information at both levels (proteome and transcriptome).
Tropism at PPI level
At this point, only the proteome (cell) data were processed. The interactomes (Virus-Human (V-H) and Human–Human (H–H)) and proteomes of each cell were used. The interactomes of each cell of each tissue were generated considering the interactions of SARS-CoV-2 with human proteins (V-H), a second level of interaction (H–H), and the interactions between them. The networks were visualized with Cytoscape49, associating the Expression value and the level of interaction associated with the connectivity of each protein. Enrichment analysis was generated with Enrichr50 to identify the alterations of each cell, using the enrichR library of R to automate the procedures. Heatmaps were performed with gplots.
Finally, centroids of each cell were identified by k-means, using the PPIs and the Proteome Expression value of each cell. Next, cell clusters susceptible to SARS-CoV-2 infection were identified according to its connectivity and autopsy reports.
All the methods were performed in accordance with relevant guidelines and regulations.
Limitations
The choice not to consider a confidence value of virus-host PPIs might generate some uncertainty, but some PPIs with low confidence values may actually represent host-protein interactions. We are aware that a stricter choice of PPIs could modify our conclusions; however, the objective of our strategy was to present a tool to obtain quick answers in an initial context of little information.
Several cells that have been reported to have clinical significance were not considered, due to a lack of information about them at the transcriptome or proteome level. The lack of information on certain cell types and of availability of proteome or transcriptome information meant that not all expected results were obtained, as well as some other hypothetical ones. Cells with a small proteome, such as ovarian (follicular cells) or placental (decidual cells) were also discarded. Whether this is due to a truly small proteome or a technical inability to identify various proteins in these cells remains to be determined.
Data availability
The datasets analyzed during the current study are available in the IMEx repository, https://www.ebi.ac.uk/intact/imex/main.xhtml;jsessionid=49BA95180925C722BEE62ADE77D78873?conversationContext=1; DIP repository, https://dip.doe-mbi.ucla.edu/dip/File.cgi?FN=2016/imex/dip_imex20161230CT2016.xml; IntAct (and Mint) repository, https://www.ebi.ac.uk/intact/search?query=9606&interactorSpeciesFilter=Homo%20sapiens; BioGRID repository, https://downloads.thebiogrid.org/Download/BioGRID/Release-Archive/BIOGRID-4.4.209/BIOGRID-ALL-4.4.209.mitab.zip; HPRD repository, http://hprd.org/download; MatrixDB repository, matrixdb.univ-lyon1.fr/download/matrixdb_FULL.tab.gz; Reactome repository, https://reactome.org/download/current/interactors/reactome.homo_sapiens.interactions.psi-mitab.txt; and HPA repository, https://www.proteinatlas.org/download/normal_tissue.tsv.zip.
References
Gordon, D. E. et al. A SARS-CoV-2-human protein-protein interaction map reveals drug targets and potential drug-repurposing. BioRxiv https://doi.org/10.1101/2020.03.22.002386 (2020).
Li, J. et al. Virus-host interactome and proteomic survey reveal potential virulence factors influencing SARS-CoV-2 pathogenesis. Med (N Y) 2, 99-112.e7 (2021).
Stukalov, A. et al. Multilevel proteomics reveals host perturbations by SARS-CoV-2 and SARS-CoV. Nature 594, 246–252 (2021).
Khorsand, B., Savadi, A. & Naghibzadeh, M. SARS-CoV-2-human protein-protein interaction network. Inform. Med. Unlocked 20, 100413 (2020).
Messina, F. et al. COVID-19: Viral-host interactome analyzed by network based-approach model to study pathogenesis of SARS-CoV-2 infection. J. Transl. Med. 18, 233 (2020).
Perrin-Cocon, L. et al. The current landscape of coronavirus-host protein-protein interactions. J. Transl. Med. 18, 319 (2020).
Bojkova, D. et al. Proteomics of SARS-CoV-2-infected host cells reveals therapy targets. Nature 583, 469–472 (2020).
Kumar, N., Mishra, B., Mehmood, A., Athar, M. & Mukhtar, M. S. Integrative network biology framework elucidates molecular mechanisms of SARS-CoV-2 pathogenesis. SSRN https://doi.org/10.2139/ssrn.3581857 (2020).
Puelles, V. G. et al. Multiorgan and renal tropism of SARS-CoV-2. N. Engl. J. Med. 383, 590–592 (2020).
Cantuti-Castelvetri, L. et al. Neuropilin-1 facilitates SARS-CoV-2 cell entry and infectivity. Science 370, 856–860 (2020).
Singh, M., Bansal, V. & Feschotte, C. A single-cell RNA expression map of human coronavirus entry factors. Cell Rep. 32, 108175 (2020).
Bojkova, D. et al. SARS-CoV-2 and SARS-CoV differ in their cell tropism and drug sensitivity profiles. BioRxiv https://doi.org/10.1101/2020.04.03.024257 (2020).
Sungnak, W. et al. SARS-CoV-2 entry factors are highly expressed in nasal epithelial cells together with innate immune genes. Nat. Med. 26, 681–687 (2020).
Yang, L. et al. A human pluripotent stem cell-based platform to study SARS-CoV-2 tropism and model virus infection in human cells and organoids. Cell Stem Cell 27, 125-136.e7 (2020).
Li, M.-Y., Li, L., Zhang, Y. & Wang, X.-S. Expression of the SARS-CoV-2 cell receptor gene ACE2 in a wide variety of human tissues. Infect. Dis. Poverty 9, 45 (2020).
Xu, H. et al. High expression of ACE2 receptor of 2019-nCoV on the epithelial cells of oral mucosa. Int. J. Oral Sci. 12, 8 (2020).
Zou, X. et al. Single-cell RNA-seq data analysis on the receptor ACE2 expression reveals the potential risk of different human organs vulnerable to 2019-nCoV infection. Front. Med. 14, 185–192 (2020).
Hikmet, F. et al. The protein expression profile of ACE2 in human tissues. Mol. Syst. Biol. 16, e9610 (2020).
Kanehisa, M. & Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000).
Bian, X.-W., COVID-19 Pathology Team. Autopsy of COVID-19 patients in China. Natl. Sci. Rev. 7, 1414–1418 (2020).
Liu, J. et al. SARS-CoV-2 cell tropism and multiorgan infection. Cell Discov. 7, 17 (2021).
von Stillfried, S. et al. SARS-CoV-2 RNA screening in routine pathology specimens. Microb. Biotechnol. https://doi.org/10.1111/1751-7915.13828 (2021).
Pontén, F. et al. A global view of protein expression in human cells, tissues, and organs. Mol. Syst. Biol. 5, 337 (2009).
Gry, M., Oksvold, P., Pontén, F. & Uhlén, M. Tissue-specific protein expression in human cells. Tissues Organs. https://doi.org/10.4172/JPB.1000153 (2010).
Huang, N. et al. Integrated single-cell atlases reveal an oral SARS-CoV-2 infection and transmission axis. MedRxiv https://doi.org/10.1101/2020.10.26.20219089 (2020).
Zuo, Y. et al. Prothrombotic autoantibodies in serum from patients hospitalized with COVID-19. Sci. Transl. Med. 12, 3876 (2020).
Wang, J. Y., Zhang, W., Roehrl, M. W., Roehrl, V. B. & Roehrl, M. H. An autoantigen profile of human A549 lung cells reveals viral and host etiologic molecular attributes of autoimmunity in COVID-19. J. Autoimmun. 120, 102644 (2021).
Wallukat, G. et al. Functional autoantibodies against G-protein coupled receptors in patients with persistent long-COVID-19 symptoms. J. Transl. Autoimmun. 4, 100100 (2021).
Boscolo-Rizzo, P. et al. Sequelae in adults at 12 months after mild-to-moderate coronavirus disease 2019 (COVID-19). Int. Forum Allergy Rhinol. https://doi.org/10.1002/alr.22832 (2021).
Pelletier, J., Thomas, G. & Volarević, S. Ribosome biogenesis in cancer: New players and therapeutic avenues. Nat. Rev. Cancer 18, 51–63 (2018).
Chen, J.-Y. et al. Interaction between SARS-CoV helicase and a multifunctional cellular protein (Ddx5) revealed by yeast and mammalian cell two-hybrid systems. Arch. Virol 154, 507–512 (2009).
Wang, J., Pendurthi, U. R., Yi, G. & Rao, L. V. M. SARS-CoV-2 infection induces the activation of tissue factor-mediated coagulation by activation of acid sphingomyelinase. Blood https://doi.org/10.1182/blood.2021010685 (2021).
Vinayagam, S. & Sattu, K. SARS-CoV-2 and coagulation disorders in different organs. Life Sci. 260, 118431 (2020).
Peiris, J. S. M. & Cheung, C. Y. The macrophage in the pathogenesis of severe acute respiratory syndrome coronavirus infection. Hong Kong Med. J. 15(Suppl 6), 21–25 (2009).
Yilla, M. et al. SARS-coronavirus replication in human peripheral monocytes/macrophages. Virus Res. 107, 93–101 (2005).
Goher, S. S., Ali, F. & Amin, M. The delta variant mutations in the receptor binding domain of SARS-CoV-2 show enhanced electrostatic interactions with the ACE2. Med. Drug Discov. https://doi.org/10.1016/j.medidd.2021.100114 (2021).
A, V. et al. Controllability analysis of the directed human protein interaction network identifies disease genes and drug targets. Proc. Natl. Acad. Sci. U.S.A. 113, 2 (2016).
Perfetto, L. et al. The IMEx coronavirus interactome: An evolving map of Coronaviridae-host molecular interactions. Database 3, 96 (2020).
Oughtred, R. et al. The BioGRID interaction database: 2019 update. Nucleic Acids Res. 47, D529–D541 (2019).
Sprinzak, E., Cokus, S. J., Yeates, T. O., Eisenberg, D. & Pellegrini, M. Detecting coordinated regulation of multi-protein complexes using logic analysis of gene expression. BMC Syst. Biol. 3, 115 (2009).
Orchard, S. et al. The MIntAct project–IntAct as a common curation platform for 11 molecular interaction databases. Nucleic Acids Res. 42, D358-363 (2014).
Keshava Prasad, T. S. et al. Human protein reference database–2009 update. Nucleic Acids Res. 37, D767-772 (2009).
Clerc, O. et al. MatrixDB: Integration of new data with a focus on glycosaminoglycan interactions. Nucleic Acids Res. 47, D376–D381 (2019).
Licata, L. et al. MINT, the molecular interaction database: 2012 update. Nucleic Acids Res. 40, D857-861 (2012).
Jassal, B. et al. The reactome pathway knowledgebase. Nucleic Acids Res. 48, D498–D503 (2020).
Thul, P. J. et al. A subcellular map of the human proteome. Science 356, 3321 (2017).
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing Vienna, Austria https://www.r-project.org/.
Warnes GR et al. gplots: Various R Programming Tools for Plotting Data version 3.1.1 from CRAN. https://rdrr.io/cran/gplots/ (2012).
Shannon, P. et al. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Kuleshov, M. V. et al. Enrichr: A comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res. 44, W90-97 (2016).
Author information
Authors and Affiliations
Contributions
Conceptualization: D.O.-B.; methodology: D.O.-B.; software: D.O.-B.; validation: D.O.-B. and S.Z.; formal analysis: D.O.-B., S.Z., M.A.M.-C., and R.B.; investigation: D.O.-B., S.Z., M.A.M.-C., and R.B.; data curation: D.O.-B.; writing—original draft preparation: D.O.-B., S.Z., M.A.M.-C., and R.B.; writing—review and editing: D.O.-B., S.Z., M.A.M.-C., and R.B.; visualization: D.O.-B.; supervision: S.Z. and R.B.; project administration: R.B. All authors reviewed the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Ortega-Bernal, D., Zarate, S., Martinez-Cárdenas, M. et al. An approach to cellular tropism of SARS-CoV-2 through protein–protein interaction and enrichment analysis. Sci Rep 12, 9399 (2022). https://doi.org/10.1038/s41598-022-13625-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-022-13625-z
This article is cited by
-
Liver injury in COVID-19: an insight into pathobiology and roles of risk factors
Virology Journal (2024)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.