Host nasopharyngeal transcriptome dataset of a SARS-CoV-2 positive Italian cohort

Salvati, Annamaria; Ferravante, Carlo; Lamberti, Jessica; Rocco, Teresa; Alexandrova, Elena; D’Agostino, Ylenia; Sorokin, Maksim; Efimov, Victor; Buzdin, Anton; Strianese, Oriana; Nassa, Giovanni; Tarallo, Roberta; Weisz, Alessandro; Rizzo, Francesca; Giurato, Giorgio

doi:10.1038/s41597-023-02289-7

Download PDF

Data Descriptor
Open access
Published: 14 June 2023

Host nasopharyngeal transcriptome dataset of a SARS-CoV-2 positive Italian cohort

Scientific Data volume 10, Article number: 379 (2023) Cite this article

1008 Accesses
23 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 07 September 2023

This article has been updated

Abstract

The ongoing COVID-19 pandemic caused by SARS-CoV-2 has affected millions of people worldwide and has significant implications for public health. Host transcriptomics profiling provides comprehensive understanding of how the virus interacts with host cells and how the host responds to the virus. COVID-19 disease alters the host transcriptome, affecting cellular pathways and key molecular functions. To contribute to the global effort to understand the virus’s effect on host cell transcriptome, we have generated a dataset from nasopharyngeal swabs of 35 individuals infected with SARS-CoV-2 from the Campania region in Italy during the three outbreaks, with different clinical conditions. This dataset will help to elucidate the complex interactions among genes and can be useful in the development of effective therapeutic pathways.

Comparative transcriptome analyses reveal genes associated with SARS-CoV-2 infection of human lung epithelial cells

Article Open access 10 August 2021

Dual RNA-Seq analysis of SARS-CoV-2 correlates specific human transcriptional response pathways directly to viral expression

Article Open access 25 January 2022

Reduced subgenomic RNA expression is a molecular indicator of asymptomatic SARS-CoV-2 infection

Article Open access 22 September 2021

Background & Summary

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), responsible for coronavirus disease 19 (COVID-19), has emerged in December 2019 when the first case was reported in Wuhan, China. Soon after, it has rapidly spread to other countries worldwide, becoming pandemic with more than 4 million fatalities and 230 million cases registered¹.

There are different factors that make it difficult to contain the spread of COVID-19. These include the high mutation rate of the virus, the challenge of diagnosing asymptomatic or mildly symptomatic individuals and the capability of the virus to be transmitted during the pre-symptomatic phase².

After transmission processes through respiratory droplets, aerosol or surface contamination, follows the incubation period that could led to a plethora of symptoms such as fever, cough, shortness of breath, loss of taste and smell, diarrhea and nausea³. Nevertheless, a notable proportion of individuals with pre-existing conditions, such as asthma, diabetes, cardiovascular disease and other chronic illnesses experienced severe complications such as pneumonia affections or acute respiratory syndrome⁴. Some respiratory failures in severe SARS-CoV-2 infection have been found to be associated with the activation of immune response and pro-inflammatory mechanisms by chemokines and cytokine release, which may be caused by a “cytokine storm syndrome”⁵. In addition to pre-existing clinical conditions, other factors, such as age, sex, and ethnicity can also impact the clinical presentation of infected patients⁶.

As the vast range of disease susceptibility and outcomes observed in individuals infected with SARS-CoV-2 may be attributed to gene expression modulation resulting from virus-host cell interactions, several studies have been performed to investigate the biological effects of virus infection on the host transcriptome profile⁷. SARS-CoV-2 enters into the host cell by direct attachment to multiple receptors on the cell membrane or through membrane fusion within the endosome after endocytosis leading to further factors in human gene expression modulation⁸. SARS-CoV-2 primarily enters host cells through the angiotensin converting enzyme-2 (ACE2) located on the surface of different cell types. This interaction activates the renin-angiotensin pathway, which may increase the risk of severe COVID-19 symptoms in affected individuals⁹. Hence, upon detection of infection, human cells activate mechanisms to counteract viral replication which involves significant reprogramming of their own transcriptome¹⁰. Despite the worldwide spread, the host immune response against SARS-CoV-2 infection remains poorly characterized. Identifying transcriptome differences can be valuable for the determination of the cellular pathways that are modulated by the virus in infected cells.

Here, our objective is to provide a comprehensive transcriptomic dataset of a cohort of SARS-CoV-2 positive Italian individuals. This dataset will allow the scientific community to study the impact of virus infection on the transcriptome of mucosa cells. To this aim, RNA extracted from 35 nasopharyngeal swabs of COVID-19 patients enrolled in the Campania region was subjected to total RNA sequencing and subsequent bioinformatics analysis (Fig. 1).

Patients were selected according to age, sex, sampling time and clinical manifestation of the disease (Fig. 2a and Supplementary File 1). Our sampling also covers the timing of the three different waves of SARS-CoV-2 infections in Italy, ranging from the pandemic declaration in March 2020 to spring 2021¹¹. In detail, 15 cases belong to the 1st period (March-May 2020), 13 to the 2nd period (September - November 2020) and 7 to the 3rd period (January - February 2021) (Fig. 2a).

Interestingly, by total RNA approach and deep sequencing conditions, detailed in the Methods section, our bioinformatics analyses have detected also reads aligned on the SARS-CoV-2 genome. In this way, we are also able to observe the distribution of virus variants peculiar to different pandemic waves, which may contribute to host response variability analysis (Fig. 2a and Supplementary File 1).

The transcriptome dataset here proposed, can provide valuable insights into the biological impact of SARS-CoV-2 infection on the modulation of host gene expression. By analyzing this dataset and integrating it with others, researchers could identify key protein-coding and non-coding genes involved in pathways affected by the virus’s entrance. This could help in the development of new therapies and diagnostic tools.

Moreover, this dataset includes several clinical factors which can be used to study the relationship between these factors and the host’s gene expression changes induced by SARS-Cov-2 infection.

Additionally, the clade assignment provides an opportunity to investigate the potential differences in transcriptome profiles between different viral strains. This can help in understanding the pathogenesis of the disease and the potential differences in virulence and transmissibility among different SARS-CoV-2 variants. Overall, the transcriptome dataset from the Italian cohort of these patients is a valuable resource for researchers to be integrated with other datasets and identify potential therapeutic targets and diagnostic biomarkers.

Methods

Cohort and clinical samples

The cohort analyzed in our study includes 35 patients from Campania region hospitals in Italy, selected after confirmed SARS-CoV-2 infection by PCR testing in the nasopharyngeal swab. The study was approved by the Campania Sud Ethics Committee (approval code 206/2021) and was conducted according to the guidelines of the Declaration of Helsinki. The individuals consented to their data being published under an open license.

RNA samples collection and quality controls

RNA purity was determined by using NanoDrop spectrophotometer ND-2000 (Thermo Fischer Scientific) through the evaluation of the absorbance ratio A260/A280. Total RNAs concentration was measured using RNA HS kit on a Qubit fluorimeter (ThermoFisher Scientific) while their integrity was evaluated by TapeStation 2200 (Agilent). The total RNA for each sample was reverse transcribed to cDNA using Random Hexamer (Tetro cDNA Synthesis Kit, Bioline, Memphis, Tennessee). RT-qPCRs were carried out by SensiFAST SYBR Lo-ROX kit (Bioline), according to the manufacturer’s instructions, targeting the viral N gene by following primers:

Forward primers- GGGGAACTTCTCCTGCTAGAAT

Reverse primers-CAGACATTTTGCTCTCAAGCTG

RNA Seq library preparation and sequencing

100 ng of each RNA sample was used for sequencing library preparation using the TruSeq Stranded Total RNA Sample Prep kit protocol (Illumina Inc., San Diego, CA, USA). Then, 33 libraries were equimolarly pooled, diluted to a final concentration of 1.2 nMol and sequenced on NovaSeq 6000 (Illumina Inc) in a paired-end mode (2 × 100 base pairs). The 2 remaining libraries were diluted to a final concentration of 1.7 pMol and sequenced on NextSeq 500 (Illumina Inc) in a paired-end mode (2 × 75 base pairs).

Bioinformatics and functional annotation analyses

After sequencing, the demultiplexing step was performed using Illumina bcl2fastq software, in order to generate FASTQ files for the subsequent analysis. Raw FASTQ files were quality checked using FastQC (v0.11.8)¹² and trimmed with Cutadapt software (v4.2)¹³ using the following parameters:–minimum-length/-m and–quality-cutoff/-q options set as 20 and 25 respectively. The read alignment on the human reference genome (GRCh38/hg38) was performed using STAR software (v 2.7.4a)¹⁴ with default parameters and gene quantification was obtained with FeatureCounts v2.0.0¹⁵. The counts were then imported in R (version 3.6.3) and differentially expressed genes were identified using R package DESeq 2 v1.26.0¹⁶. Differential expression was reported as |fold change| ≥ 1.5 along with associated adjusted p-value (false discovery rate (FDR)) ≤ 0.05, computed according to Benjamini–Hochberg¹⁷, as described in Salvati et al.¹⁸. Functional analysis has been performed using IPA (Ingenuity Pathway Analysis). Only functions and pathways showing a p-value ≤ 0.05 have been considered. In addition, for the identification of SARS-CoV-2 virus strains, the reads that did not map on the human reference genome, were aligned on the SARS-CoV-2 genome (primary assembly MN908947.3) using Burrows–Wheeler Aligner (BWA) software (v0.7.17)¹⁹. Nucleotide variants were identified using Freebayes v.1.0.2²⁰ and clade assignment was performed using Nextclade (https://clades.nextstrain.org/).

Data Records

Complete RNA-seq data were deposited in the ArrayExpress database (https://www.ebi.ac.uk/biostudies/arrayexpress) under the accession number E-MTAB-13028²¹. Other data, such as list of differentially expressed genes, raw gene counts, R code used for differential expression analysis and list of canonical pathways are accessible through Figshare platform²².

Technical Validation

To verify the quality and robustness of the data presented here, cycle threshold (Ct) values were also reported (Fig. 3a) based on primers amplification of conserved viral genome regions by Reverse transcription-quantitative polymerase chain reaction (RT-qPCR) to rapidly track viral copies numbers in RNA extracted from swabs of all 35 patients²³ (Fig. 3a and Supplementary file 2). In general, these specimens harbor high Ct values corresponding to low virus load and vice versa. Several studies aimed to detect the relationship between Ct and infectious virus activity but could be an imperfect measurement associated with laboratory-dependent procedures²⁴. In this context, allowed a rapid method to confirm COVID-19-infected patients, highlighting the uniform distribution of high and low Ct values in each period (Fig. 3a). To evaluate the quality of the RNA-Seq data produced, the coverage along gene body was assessed and reported in Fig. 3b.

The same RNAs were then used for total RNA library preparation and sequencing, as described in the Methods section. The sequencing step produced 3,494,055,062 reads, with an average of 99,830,145 reads per sample (range 26,853,188–197,633,680). After trimming of low-quality and adapter related fragments, a total of 36,847,520 reads were removed from the dataset, corresponding to 1,052,786 reads per sample (range 45,944–2,592,362). The high-quality reads are then mapped to the human reference genome resulting in 73.11% of read mapping per sample.

Since the major clinical feature of COVID-19 is a hyperinflammatory state, characterized by high expression of cytokines and chemokines, as further technical validation of our dataset we investigated if these pathways were affected. This feature was analysed performing a differential gene expression analysis comparing patients of the first and the third period (Fig. 2b), due to the variations in COVID-19 restrictions implemented during the three distinct periods and the circulation of different viruses’ variants. The analysis highlighted 1,345 up- and 2,396 down-regulated transcripts (Figshare File 1 and File 2²²). Functional analysis, performed with Ingenuity pathway analysis (IPA), confirmed the strong impact of COVID-19 infection in the modulation of target transcripts in host cells (Fig. 2c) and revealed their involvement in several biological mechanisms associated with immune cell response and coronavirus pathogenesis pathway^7,25 (Figshare File 4 and File 5²²).

Concerning differentially expressed genes, as expected, the majority of them are protein-coding (2,729 genes), while, interestingly a huge part (1,044) belong to the biotype class of non-coding RNA (ncRNA) (Fig. 2d). This aspect suggests how these data could be integrated with other datasets or other information to provide a detailed picture of SARS-CoV-2 infection, taking in consideration the known key role of ncRNAs into the mechanisms driving severe COVID-19²⁶.

Analysis with covariates, such as age, sex and diseases status resulted in 250 differentially expressed genes. The observed result may be attributed to the limited number of patients when stratified based on these characteristics. In fact, the limitation of this particular dataset could be the small sample size, which may limit the statistical power of the analysis. However, the strength of the dataset lies in the accuracy and variety of patient features reported, which could provide valuable insight into the impact of COVID-19 on gene expression mechanisms.

The integration of this dataset with similar ones, along with a rigorous data validation process, could help to increase the statistical power and reliability of the findings.

Code availability

The R code used to perform differential expression analysis is available in FigShare File 3²².

Change history

07 September 2023
A Correction to this paper has been published: https://doi.org/10.1038/s41597-023-02504-5

References

Dong, E., Du, H. & Gardner, L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect Dis 20(5), 533–534 (2020).
Article CAS PubMed PubMed Central Google Scholar
Luo, T., Cao, Z., Wang, Y., Zeng, D. & Zhang, Q. Role of Asymptomatic COVID-19 Cases in Viral Transmission: Findings From a Hierarchical Community Contact Network Model. IEEE Trans Autom Sci Eng 19(2), 576–585 (2021).
Article PubMed Google Scholar
Gómez, S. A. et al. Binding of SARS-CoV-2 to Cell Receptors: A Tale of Molecular Evolution. Chembiochem 22(4), 724–732 (2021).
Article PubMed Google Scholar
Ejaz, H. et al. COVID-19 and comorbidities: Deleterious impact on infected patients. J Infect Public Health 13(12), 1833–1839 (2020).
Article PubMed PubMed Central Google Scholar
Montazersaheb, S. et al. COVID-19 infection: an overview on cytokine storm and related interventions. Virol J 19(1), 92 (2022).
Article CAS PubMed PubMed Central Google Scholar
St Sauver, J. L. et al. Factors Associated With Severe COVID-19 Infection Among Persons of Different Ages Living in a Defined Midwestern US Population. Mayo Clin Proc 96(10), 2528–2539 (2021).
Article Google Scholar
Chakraborty, C., Sharma, A. R., Bhattacharya, M., Zayed, H., Lee, S. S. Understanding Gene Expression and Transcriptome Profiling of COVID-19: An Initiative Towards the Mapping of Protective Immunity Genes Against SARS-CoV-2 Infection. Front Immunol. 2021 Dec 15;12:724936. https://doi.org/10.3389/fimmu.2021.724936. PMID: 34975833; PMCID: PMC8714830.
Zhang, Q. et al. Molecular mechanism of interaction between SARS-CoV-2 and host cells and interventional therapy. Sig Transduct Target Ther 6, 233, https://doi.org/10.1038/s41392-021-00653-w (2021).
Article CAS Google Scholar
Beyerstedt, S., Casaro, E. B. & Rangel, É. B. COVID-19: angiotensin-converting enzyme 2 (ACE2) expression and tissue susceptibility to SARS-CoV-2 infection. Eur J Clin Microbiol Infect Dis 40(5), 905–919 (2021).
Article CAS PubMed PubMed Central Google Scholar
Jabeen, A., Ahmad, N. & Raza, K. Global Gene Expression and Docking Profiling of COVID-19 Infection. Front Genet 13, 870836 (2022).
Article CAS PubMed PubMed Central Google Scholar
Vinceti, M., Filippini, T., Rothman, K. J., Di Federico, S. & Orsini, N. SARS-CoV-2 infection incidence during the first and second COVID-19 waves in Italy. Environ Res 197, 111097 (2021).
Article CAS PubMed PubMed Central Google Scholar
Andrews, S. FastQC: A Quality Control Tool for High Throughput Sequence Data. http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (2010).
Kechin, A., Boyarskikh, U., Kel, A. & Filipenko, M. cutPrimers: A New Tool for Accurate Cutting of Primers from Reads of Targeted Next Generation Sequencing. J Comput Biol 11, 1138–1143 (2017).
Article Google Scholar
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29(1), 15–21 (2013).
Article CAS PubMed Google Scholar
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30(7), 923–30 (2014).
Article CAS PubMed Google Scholar
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq 2. Genome Biol 15, 550 (2014).
Article PubMed PubMed Central Google Scholar
Benjamini, Y. et al. Controlling the false discovery rate in behavior genetics research. Behav Brain Res 125(1-2), 279–84 (2021).
Article Google Scholar
Salvati, A. et al. The Histone Methyltransferase DOT1L Is a Functional Component of Estrogen Receptor Alpha Signaling in Ovarian Cancer Cells. Cancers 11(11), 1720 (2019).
Article CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25(14), 1754–60 (2009).
Article CAS PubMed PubMed Central Google Scholar
Garrison, E. & Marth, G. Haplotype-based variant detection from short-read sequencing. arXiv preprint arXiv:12073907 (2012).
Salvati, A. Host nasopharyngeal transcriptome dataset of a SARS-CoV-2 positive Italian cohort. ArrayExpress https://identifiers.org/arrayexpress:E-MTAB-13028 (2023).
Salvati, A. et al. Host nasopharyngeal transcriptome dataset of a SARS-CoV-2 positive Italian cohort. figshare. https://doi.org/10.6084/m9.figshare.23056541.v3 (2023).
D’Agostino, Y. et al. Rapid and sensitive detection of SARS-CoV-2 variants in nasopharyngeal swabs and wastewaters. Diagn Microbiol Infect Dis 102(4), 115632 (2022).
Article PubMed PubMed Central Google Scholar
Engelmann, I. et al. Preanalytical Issues and Cycle Threshold Values in SARS-CoV-2 Real-Time RT-PCR Testing: Should Test Results Include These? ACS Omega 6(10), 6528–6536.
Gusev, E. et al. SARS-CoV-2-Specific Immune Response and the Pathogenesis of COVID-19. Int J Mol Sci 23(3), 1716 (2022).
Article CAS PubMed PubMed Central Google Scholar
Plowman, T. & Lagos, D. Non-Coding RNAs in COVID-19: Emerging Insights and Current Questions. Noncoding RNA 7(3), 54 (2021).
CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

Work supported by Regione Campania (grants: ‘Monitoring the spread and genomic variability of the Covid 19 virus in Campania using NGS technology’, POR Campania FESR 2014/2020, CUP: B14I20001980006, and ‘GENOMAeSALUTE’, POR Campania FESR 2014/2020, azione 1.5; CUP: B41C17000080007) to FR and AW and Ministry of Science and Higher Education of the Russian Federation within the framework of state support for the creation and development of World-Class Research Centers ‘Digital biodesign and personalized healthcare’ No 075-15-2022-304 to AB (and MS). EA is a fellow of Fondazione U. Veronesi. JL is PhD student of the Research Doctorates in ‘Veterinary Sciences’ of the University of Napoli ‘Federico II’ and ‘Molecular and Translational Oncology and Innovative Medical-Surgical Technologies’ of the University of Catanzaro ‘Magna Graecia’. AS, YD and CF are residents of the Postgraduate School in Clinical Pathology and Clinical Biochemistry of the University of Salerno. We thanks Prof. Ivan Gentile, Dr. Nicola Schiano Moriello, PO Malattie Infettive - Prof. Michele Portella, Dr. Michele Cennamo, PO Patologia Clinica - A.O.U. “Federico II”, Napoli, Dr. Paolo Sorrentino, Dr. Carmine Sanseverino, Unità Fegato, UO Malattie infettive – Dr. Maria Landi, Dr. Maria Grazia Foti, Servizio di Microbiologia e Virologia - A.O.R.N. “San Giuseppe Moscati”, Avellino, Prof. Paolo Maggi, U.O.C. Malattie Infettive e Tropicali – Dr. Maddalena Schioppa, UOSD Genetica e Biologia Molecolare - A.O.R.N. “S. Anna e S. Sebastiano”, Caserta and Prof. Pasquale Pagliano, UO Clinica Infettivologica, Prof. Gianluigi Franci, Programma nella Diagnostica Avanzata delle Resistenze Microbiche, Dr. Emilia Vaccaro, U.O.S.D. NAT e Biologia Molecolare - A.O.U. “San Giovanni di Dio e Ruggi d’Aragona”, Salerno for providing RNAs and information on samples analysed in this study.

Author information

These authors contributed equally: Annamaria Salvati, Carlo Ferravante.

Authors and Affiliations

Molecular Pathology and Medical Genomics Program, Division of Oncology, AOU ‘S. Giovanni di Dio e Ruggi 14 d’Aragona’, Università di Salerno, Salerno, 84131, Italy
Annamaria Salvati, Carlo Ferravante, Teresa Rocco, Ylenia D’Agostino, Giovanni Nassa, Roberta Tarallo, Alessandro Weisz, Francesca Rizzo & Giorgio Giurato
Laboratory of Molecular Medicine and Genomics, Department of Medicine, Surgery and Dentistry ‘Scuola Medica Salernitana’, University of Salerno, Baronissi (Sa), 84081, Italy
Annamaria Salvati, Carlo Ferravante, Jessica Lamberti, Teresa Rocco, Elena Alexandrova, Ylenia D’Agostino, Giovanni Nassa, Roberta Tarallo, Alessandro Weisz, Francesca Rizzo & Giorgio Giurato
Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701, Russia
Maksim Sorokin, Victor Efimov & Anton Buzdin
OmicsWay Corp, Walnut, USA
Maksim Sorokin
Oncobox Ltd., Moscow, Russia
Maksim Sorokin & Victor Efimov
World-Class Research Center ‘Digital biodesign and personalized healthcare’, Sechenov First Moscow State Medical University, Moscow, Russia
Victor Efimov & Anton Buzdin
Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Moscow, 117997, Russia
Anton Buzdin
Genome Research Center for Health, Campus of Medicine, University of Salerno, Baronissi (Sa), 84081, Italy
Oriana Strianese, Giovanni Nassa, Roberta Tarallo, Alessandro Weisz, Francesca Rizzo & Giorgio Giurato

Authors

Annamaria Salvati
View author publications
You can also search for this author in PubMed Google Scholar
Carlo Ferravante
View author publications
You can also search for this author in PubMed Google Scholar
Jessica Lamberti
View author publications
You can also search for this author in PubMed Google Scholar
Teresa Rocco
View author publications
You can also search for this author in PubMed Google Scholar
Elena Alexandrova
View author publications
You can also search for this author in PubMed Google Scholar
Ylenia D’Agostino
View author publications
You can also search for this author in PubMed Google Scholar
Maksim Sorokin
View author publications
You can also search for this author in PubMed Google Scholar
Victor Efimov
View author publications
You can also search for this author in PubMed Google Scholar
Anton Buzdin
View author publications
You can also search for this author in PubMed Google Scholar
Oriana Strianese
View author publications
You can also search for this author in PubMed Google Scholar
Giovanni Nassa
View author publications
You can also search for this author in PubMed Google Scholar
Roberta Tarallo
View author publications
You can also search for this author in PubMed Google Scholar
Alessandro Weisz
View author publications
You can also search for this author in PubMed Google Scholar
Francesca Rizzo
View author publications
You can also search for this author in PubMed Google Scholar
Giorgio Giurato
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors participated in conception and design of the study. Study concept and design: A.W., F.R. and G.G. Sample preparation and sequencing: E.A., J.L., T.R. and Y.D.A. Bioinformatics analysis: C.F., V.M.C., M.S., V.E. and A.B. Statistical analysis and interpretation of the data: A.S., A.W., F.R., O.S. and G.G. Writing of the manuscript: A.S., A.W., F.R., G.G., G.N. and R.T.

Corresponding authors

Correspondence to Francesca Rizzo or Giorgio Giurato.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Table S2

Supplementary Table S1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Salvati, A., Ferravante, C., Lamberti, J. et al. Host nasopharyngeal transcriptome dataset of a SARS-CoV-2 positive Italian cohort. Sci Data 10, 379 (2023). https://doi.org/10.1038/s41597-023-02289-7

Download citation

Received: 06 March 2023
Accepted: 05 June 2023
Published: 14 June 2023
DOI: https://doi.org/10.1038/s41597-023-02289-7