The death risk of COVID-19 caused by SARS-CoV-2 seems strikingly decreased due to extensive vaccination and attenuated pathogenicity of SARS-CoV-2 variants [1,2,3], however, we still face big challenges in monitoring the long term trend and warning the death risk of COVID-19 because the epidemiological situation is continuously evolving as the pandemic has entered the third year [4]. We previously presented efficient algorithms [5, 6] to quantify the importance score of both coding RNA molecules (mRNAs) and noncoding RNA molecules (long noncoding RNAs and microRNAs) based on sequences. Given that SARS-CoV-2 is a single-stranded positive-sense RNA virus, here we used the GIC (Gene Importance Calculator) algorithm to compute the importance score (GIC score) of 1,019,300 SARS-CoV-2 complete genome RNA sequences collected between December 2019 and now (May 2022) from the NCBI SARS-CoV-2 database (https://www.ncbi.nlm.nih.gov/sars-cov-2/). The GIC algorithm was trained with a logistic regression model based on nucleotide triplet features and secondary structure info derived from RNA sequences to evaluate the importance of RNAs. Subsequently, to quantify the death risk of COVID-19, we obtained the reported counts of cases and deaths from WHO Coronavirus (COVID-19) Dashboard (https://covid19.who.int/info). And then we estimated the monthly death risk by calculating the death-to-case ratio for every month. For exploring the association between genome RNA sequences of SARS-CoV-2 strains and their death risk, we calculated the correlation of GIC score of virus RNA sequences and death risk of COVID-19 with Spearman’s correlation analysis. Interestingly, we found that the monthly death risk of COVID-19 is highly related with the importance score of the genome RNA sequences of SARS-CoV-2 strains (Fig. 1). Moreover, February 2021 shows to be an inflection point for their relation. Before February 2021, the death risk of COVID-19 has highly negative correlation with the GIC score (Rho = −0.86, p-value = 8.20E−5; the left part of the dotted line in Fig. 1a), whereas their relation clearly turns into positive correlation (Rho = 0.75, p-value = 1.18E−3; the right part of the dotted line in Fig. 1a) and the death risk decreased obviously after February 2021. Strikingly, we revealed that it is just before February 2021 the number of countries beginning with the first vaccine increased sharply (44 countries on December, 2020 and 34 countries on January, 2021) according to the WHO data (Fig. 1b), suggesting that vaccination generated a great effect on both the genome and the severity of the virus. Meanwhile, the number of types of SARS-CoV-2 lineages and sublineages significantly decreased after the prevalence of vaccination, that is, since January 2021 (Fig. 1c), while virus still could evolve into new variants and produce further threat to public health, like Omicron variant (BA.1, BA.2, BA.3, and BA.4). The above findings suggest that the virus would produce serious effects on health again once current vaccines fail to protect against new variants in the future. Finally, we developed an online tool CoVIS (https://www.cuilab.cn/cov2) for timely calculating GIC score and thus monitoring the tendency of death risk of COVID-19.
Data availability
Data are available to the journal and the publisher upon request.
References
Halfmann PJ, Iida S, Iwatsuki-Horimoto K, Maemura T, Kiso M, Scheaffer SM, et al. SARS-CoV-2 Omicron virus causes attenuated disease in mice and hamsters. Nature. 2022;603:687–92.
Shuai H, Chan JF-W, Hu B, Chai Y, Yuen TT-T, Yin F, et al. Attenuated replication and pathogenicity of SARS-CoV-2 B.1.1.529 Omicron. Nature. 2022;603:693–9.
Suzuki R, Yamasoba D, Kimura I, Wang L, Kishimoto M, Ito J, et al. Attenuated fusogenicity and pathogenicity of SARS-CoV-2 Omicron variant. Nature. 2022;603:700–5.
Koelle K, Martin MA, Antia R, Lopman B, Dean NE. The changing epidemiology of SARS-CoV-2. Science. 2022;375:1116–21.
Song F, Cui C, Gao L, Cui Q. miES: predicting the essentiality of miRNAs with machine learning and sequence features. Bioinforma. 2019;35:1053–4.
Zeng P, Chen J, Meng Y, Zhou Y, Yang J, Cui Q. Defining essentiality score of protein-coding genes and long noncoding RNAs. Front. Genet. 2018;9:380.
Acknowledgements
This work has been supported by the grant from the Natural Science Foundation of China (62025102).
Author information
Authors and Affiliations
Contributions
C.C. and Q.C. collected the data and performed the analysis. Q.C. and C.C. prepared and revised the paper. QC conceived the project.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Cui, C., Cui, Q. Importance score of SARS-CoV-2 genome predicts the death risk of COVID-19. Cell Death Discov. 8, 303 (2022). https://doi.org/10.1038/s41420-022-01100-7
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41420-022-01100-7