Relating SARS-CoV-2 variants using cellular automata imaging

Souza, Luryane F.; Rocha Filho, Tarcísio M.; Moret, Marcelo A.

doi:10.1038/s41598-022-14404-6

Download PDF

Article
Open access
Published: 18 June 2022

Relating SARS-CoV-2 variants using cellular automata imaging

Luryane F. Souza^1,2,
Tarcísio M. Rocha Filho³ &
Marcelo A. Moret^2,4

Scientific Reports volume 12, Article number: 10297 (2022) Cite this article

1447 Accesses
5 Citations
2 Altmetric
Metrics details

Subjects

Abstract

We classify the main variants of the SARS-CoV-2 virus representing a given biological sequence coded as a symbolic digital sequence and by its evolution by a cellular automata with a properly chosen rule. The spike protein, common to all variants of the SARS-CoV-2 virus, is then by the picture of the cellular automaton evolution yielding a visible representation of important features of the protein. We use information theory Hamming distance between different stages of the evolution of the cellular automaton for seven variants relative to the original Wuhan/China virus. We show that our approach allows to classify and group variants with common ancestors and same mutations. Although being a simpler method, it can be used as an alternative for building phylogenetic trees.

Neutral evolution test of the spike protein of SARS-CoV-2 and its implications in the binding to ACE2

Article Open access 22 September 2021

Homology between SARS CoV-2 and human proteins

Article Open access 25 August 2021

Exposing structural variations in SARS-CoV-2 evolution

Article Open access 11 November 2021

Introduction

The disruption caused during the last two years by the COVID-19 pandemic is hard to be underestimated, from more than five million deaths and 270 million cases world-wide, according to official sources ¹, to economic disruption in most countries². In December 31, 2019 the first case was reported in the city of Wuhan, China, and in January 9, 2020, the World Health Organization (WHO) informed that Chinese scientists reported that the disease was caused by a new coronavirus. In February 11, 2020, in order to not associate the disease with any locality or groups of people the new coronavirus was named SARS-CoV-2 and the disease it caused COVID-19. In March 11 of that same year the WHO declared the outbreak a pandemic³.

The SARS-CoV-2 virus is part of the same virus family as the SARS-CoV and MERS-CoV viruses, the Sarbecovirus subgroup of the subdivision of the Betacoronavirus genera, which were responsible for epidemics in China (2003) and Saudi Arabia (2012)⁴. The last decade or so witnessed important developments in genome sequencing techniques, with a major increase in information gathering (data) on DNA, RNA, and protein sequences, as exemplified by the amount of data in databases such as GenBank⁵ and UniProt⁶ (for a more thorough account on genomic databases see^7,8). Genomic information on animals, plants, and significant disease-causing viruses and bacteria are now easily available to researchers worldwide. Even before COVID-19 was declared a pandemic researchers in China determined the genomic sequencing of the virus⁹. Genomic sequencing is a crucial for designing vaccines, identify variants, determine the virus family and to drugs development^10,11,12. The SARS-CoV-2 is a single-stranded RNA virus, with a genome size of 30 Kb, and four structural proteins: Nucleocapsid (N), Matrix (M), Envelope (E) and the Spike (S)^4,10. The latter is responsible for recognizing and allowing the virus to enter the cell, possibly the main reason why this protein has been widely studied. Mutations in the SARS-CoV-2 viruses result in new variants with mutations in the spike protein increasing replication within cells, and an increased transmissibility⁸.

A protein can be depicted as a primary structure formed by a sequence of long strings of characters containing all information: structure, function, hydrophobicity and different motifs. Several researchers have studied how to extract different properties, e. g. hydrophobicity^13,14,15,16, fractality^17,18, geometric and thermodynamic aspects^19,20,21. Cellular Automata have been widely used to model complex systems with simple, easy-to-understand rules²², and in recent years many papers were devoted study protein related problems using this approach. Sleit and Mdain²³ proposed a protein folding model based on cellular automata, with straightforward evolutionary rules based on the hydrophobicity of amino acids. Other works dedicated to the same problem include^24,25,26. Cellular Automata Image (CAI) analysis²⁷ is a powerful tool to classify protein structure^28,29,30 and virus taxonomy³¹. These images can contain important information on the modeled system, for example, CAI allows to differentiate similar systems with respect to those significantly different. The identification of functions, structures, location, and common ancestry of a protein sequence can be performed by a comparison with other know proteins in databases, using alignment, similarity, and homology techniques³². In the present paper we propose a protein comparison approach using a cellular automaton image and the information theoretic Hamming metric for the distance between such images, as a measure of similarity and difference, applied to the spike protein. The distance is measured with respect to the S protein in the initial virus strain as first detected in Wuhan, and for the following Variants Of Concern (VOCs) with mutations of the Spike protein: Alpha (first identified in the United Kingdom), Beta (South Africa), Gamma (Brazil), Delta (India), and the more recent Omicron (South Africa), B.1.1.28, and P2 (Brazil). Our goal is to explicitly obtain the evolutionary relationships between these SARS-CoV-2 variants.

The cellular automata image approach for protein classification and the Hamming distance are presented in “Methods” section. Our results are presented and discussed in “Results and discussion” section, and we close the paper with some concluding remarks in “Concluding remarks” section.

Methods

Cellular automata are discrete dynamical systems with simple local evolution rules and, despite this, can show complex behavior²². The rules take into account the state of neighboring cells, analogous to protein structure since physicochemical characteristics of neighboring amino acids influence the folding or function of the protein. The cellular automata considered here has four components: a grid, the set of states, the neighborhood of each state, and the local transition rule. Several possibilities were proposed for encoding the sequence of the 20 types of amino acids in a protein: an 8-digit code for each amino acid³³, or codes reflecting physicochemical characteristics and degeneracy, based on rules of similarity and complementarity: based on molecule recognition and information theory, with a 5-digit code for each amino acid³⁴, or by representing the amino acid sequences using the hydrophobicity index of each amino acid²⁸. The latter in the present work as it allows to better describe the evolutionary relationships between SARS-CoV-2 variants, resulting in smaller distantes for variants with the same mutations and those that emerged in the same period throughout the pandemic. It also groups together variants that share a mutation in the amino acid N501Y. Coronaviruses that cause MERS, SARS and COVID-19 diseases are all closely related, and it is natural to expect that the same coding scheme will be a good representation of the SARS-CoV-2 proteins based in the same molecules. This is reinforced by the discussion in³⁵ (see particularly Figure 3 of this paper) that shows that the Spike proteins of these viruses have very similar characteristics. Different binary codes were used to distinguish SARS-CoV viruses from other coronaviruses, such as the one used by Xiao et al.³⁴, which is a simpler code and does not take into account physicochemical amino acids.

Table 1 shows the coding of Ref.²⁸ that will be used throughout the rest of this work. The Spike protein sequence has 1273 amino acids, and each one is coded as a 5 digit sequence, and thence $N=6365$ cells with 0 and 1 as possible state, and composing the first line of the cellular automata (initial condition). The state of the i-th cell at step t is notated as $x_i^t=0,1$, i, $i=1,\ldots ,N$. The neighborhood of the cell at position i is composed by the three cells at positions $i-1$, i and $i+1$, resulting in $2^3=8$ different states for the neighborhood. We also use periodic boundary conditions. For each possible configuration of the neighborhood, the middle cell can assume two possible states, and thus the number of possible evolution rules is given by $2^8=256$³⁶. As discussed in³⁶ and ³¹, the most appropriate evolution for the celular atomaton rule for SARS-CoV virus classification and for distinguishing them from other viruses, is Wolfram’s 184 and depicted in Fig. 1. This rule yield as a typical feature of SARS-CoV viruses a V pattern pattern in the cellular automaton image (see below).

Table 1 Coding for each of the 20 possible amino acids²⁸.

Full size table

In order to implement a numeric metric to distinguish different images, we consider here the information theoretic Hamming distance $D_H$, which is commonly used for the distance between sequences of same length and is a simple measure the number of different positions/errors with all required mathematical properties³⁷. Here the sequences considered are the states of the automata at the same stept. In this case the distance can be written as:

$$\begin{aligned} D_H(t)= \frac{1}{N} \sum _{i=1}^{N}\Vert x_i^t-\overline{x}_i^t\Vert , \end{aligned}$$

(1)

with N the size of the grid, $x_i^t$ the state of the cell at step t for the S protein in the initial Wuhan strain and $\overline{x}_i^t$ the Spike protein of the given variant.

Ethics declarations

No human samples/human data were used in the present work.

Results and discussion

The cellular automaton for the SARS-CoV-2 spike protein using available genomic data data for Alpha, Beta, Gamma, Delta, B.1.1.28, P2 variant and the original strain are available at⁵ and³⁸ for Omicron, represented with the coding in Table 1, and evolved according to the rule in Fig. 1 over 1000 time steps. Deletions in the protein sequence were represented by the code 00000 and insertions by introducing the deletion code in the other proteins at the corresponding position. Figure 2 shows the resulting image representing the evolution of the automaton for each considered variant, where the V shaped patterns characteristic of SARS-CoV viruses³¹ are clearly visible. Figure 3 shows the time evolution of the Hamming distance $D_H$ for each variant with respect to the original Wuhan strain. For the initial steps the distance has small values, as expect for variants of the same virus, and increases with t up to an asymptotic constant value after approximately $t=400$ steps. The small number of mutations, if compared to the number of amino-acids in the protein and measured by the small Hamming distance at $t=0$, is amplified by the evolution of the cellular automata and results in quite different asymptotic values of $D_H$, after an irregular transient of roughly 200 time steps. This allows us to classify the cellular automata as Wolfram Class IV, with an intermediate behavior between Classes II (periodical) and III (chaotic). Although the Omicron variant presents more mutations (and therefore a higher value of $D_H$) than other known VOCs, with 33 amino acid changes in the spike protein³⁹, its distance plot remains close to the variants sharing the N501Y mutation (see Table 2 for the characteristic mutations of each variant). This large number of modifications seems to be linked to an increased transmissibility and possibly smaller efficiency of curent vaccines⁴⁰.

Table 2 shows the different mutations present in each main variant of the SARS-CoV-2 virus. We then see from Fig. 3 that the present approach groups the variants carrying the N501Y mutation, the sense that final stationary Hamming distance between these variants and the original are more closer and with higher values. The Gamma and P2 variants are also closer as they have the same clade B.1.1.28 (note that the distance for P2 and B.1.1.28 are practically the same in the Figure), while the Delta variant, which carries the P681R mutation unfamiliar to the other variants, is the one with smallest distance. We believe that the present approach is a straightforward way to measure evolutionary distances between SARS-CoV-2 variants, much simpler that other techniques as in^41,42 were a normalized Laplacian pyramid is employed to measure pairwise similarities in cellular automata image wavelet images in order to build phylogenetic trees.

In order to show that the present approach properly relates the variants, we computed the phylogenetic tree from the the neighbor-joining method with alignment⁴³, which calculates evolutionary distances between species. Figure 4 shows the results for the main known variants of SARS-CoV-2. We see that the variants Gamma, P2, and B.1.1.28 are in the same clade in the tree, while in Fig. 3 these same variants have closer stationary distances. Our results for the Hamming distance fo Delta, Gamma, P2, and B.1.1.28 variants shows that they are closer to the protein initially found in the Wuhan strain, as expected as they are in the same clade in Fig. 4. The same occurs for Alpha and Beta variants, which are in the same clade and have close stationary Hamming distances, while in both approaches the Omicron variant is clearly separated from the other variants. On the other hand, variants B.1.1.28 and P2 have the same stationary Hamming distance, as they have very similar mutations (see Table 2) while P2 is more close to Gamma in the phylogenetic tree.

Table 2 Mutations the Spike protein of the SARS-CoV-2 variants from³⁹.

Full size table

The variants with smallest values of $D_H$ are those with the smallest number of mutations in Table 2: Delta, B.1.1.28 and P2, which are also the variants without the N501Y mutation. Despite the differences in the images of each variant, resulting from different mutations, the celular automaton rule also results in the V-shaped pattern for SARS-CoV-2 type coronaviruses. This V pattern is characteristic of SARS-CoV-like coronaviruses as discussed in length in Refs.^31,36. Despite the fact that the SARS-CoV-2 virus is different from SARS-CoV, they share this pattern from their common ancestors. During the COVID-19 pandemic many mutations occurred in the virus sequence, but without a functional change in the Spike protein, although some of these mutations may bring some advantages. However, since different sequencesi perform the same function, mutations in proteins are degenerate, a behavior fundamental for natural selection to occur. Without degeneracy, there is no genetic variability, and this hinders natural selection from acting⁴⁴.

Concluding remarks

The approach presented here allows to cluster variants with common ancestors by using a cellular automaton and the asymptotic Hamming distance for the resulting images for each variant, as shown in Fig. 2, and is a more straightforward and simpler evolutionary classification of those variants, than other approaches such as alignment technique, similarity analysis and image processing. Iti particularly discerns the deviation of Omicron with respect to other variants, preserving the V shaped pattern characteristic of the SARS-CoV viruses, despite having the largest number of mutations among known variants, and grouping variants with the N501Y mutation. Furthermore, after just three iterations of the automaton for the protein in the Wuhan strain, the amino acid at position 501 changed from N to Y. This rapid convergence suggest an alternative explanation for the emergence of Alpha, Beta, and Gamma on three continents simultaneously, an evolutionary convergence. We also note that without degeneration, mutations could lead to unfavorable structures for the virus, making it easier to control its spread⁴⁴. Cellular automata are a simple tool to extract meaningful information from proteins sequences, with a very low computational cost. We hope that the present work will contribute as an useful tool to build protein phylogenetic trees.

Data availability

Datasets used during the current study are the sequences of the Spike proteins of the virus initially found in Wuhan [YP_009724390.1] and its variants Alpha [QWP89177.1], Beta [UAL50115.1], Gamma [QXF22923.1 ], Delta [QXP08802.1], Omicron [UGO97992.1], B.1.1.28 [QQK84800.1] and P2 [QXF22396.1] which are available at https://www.ncbi.nlm.nih.gov/genbank/.

References

John Hopkins University. John Hopkins Coronavirus Resource Center (2021). Available online: https://coronavirus.jhu.edu/map.html. Accessed 4 Sept 2021.
Tooze, A. Shutdown—How Covid Shook the World’s Economy (Penguin Random House, 2021).
Google Scholar
World Health Organization. WHO timeline-COVID-19. https://www.who.int/emergencies/diseases/novel-coronavirus-2019/interactive-timeline?gclid=CjwKCAiA7dKMBhBCEiwAO_crFAhknuq4kc_PZRW1qx3v_bMHTvAmmEewQ2vyKtZ47HyUy7DLGlZxoCkC4QAvD_BwE#event-115 (2020). Accessed 17 Nov 2021.
Machhi, J. et al. The natural history, pathobiology, and clinical manifestations of SARS-CoV-2 infections. J. Neuroimmune Pharmacol. 15, 359–386. https://doi.org/10.1007/s11481-020-09944-5 (2020).
Article PubMed PubMed Central Google Scholar
GenBank. National Center for Biotechnology Information (2021).
UniProt. The Universal Protein Resource (2021).
Chen, C., Huang, H. & Wu, C. H. Protein bioinformatics databases and resources. Methods Mol. Biol. 1558, 3–39. https://doi.org/10.1007/978-1-4939-6783-4_1 (2017).
Article CAS PubMed PubMed Central Google Scholar
NIH—National Library of Medicine. NCBI SARs-CoV-2 Resources (2021).
Wu, F. et al. A new coronavirus associated with human respiratory disease in China. Nature 579, 1–8. https://doi.org/10.1038/s41586-020-2008-3 (2020).
Article CAS Google Scholar
Khan, M. T. et al. Structures of SARS-CoV-2 RNA-binding proteins and therapeutic targets. Intervirology 64, 1–14. https://doi.org/10.1159/000513686 (2021).
Article CAS Google Scholar
Chou, K. C., Wei, D. Q. & Zhong, W. Z. Binding mechanism of coronavirus main proteinase with ligands and its implication to drug design against SARS. Biochem. Biophys. Res. Commun. 308, 148–151. https://doi.org/10.1016/S0006-291X(03)01342-1 (2003).
Article CAS PubMed PubMed Central Google Scholar
Chou, K. C., Wei, D. Q., Du, Q. S., Sirois, S. & Zhong, W. Z. Progress in computational approach to drug development against SARS. Curr. Med. Chem. 13, 3263–3670. https://doi.org/10.2174/092986706778773077 (2006).
Article CAS PubMed Google Scholar
Moret, M. A. & Zebende, G. F. Amino acid hydrophobicity and accessible surface area. Phys. Rev. E Stat. Nonlinear Soft Matter Phys. 75, 011920. https://doi.org/10.1103/PhysRevE.75.011920 (2007).
Article ADS CAS Google Scholar
Phillips, J. C. Scaling and self-organized criticality in proteins I. Proc. Natl. Acad. Sci. 106, 3107–3112. https://doi.org/10.1073/pnas.0811262106 (2009).
Article ADS PubMed PubMed Central Google Scholar
Phillips, J. C. Synchronized attachment and the Darwinian evolution of coronaviruses CoV-1 and CoV-2. Physica A Stat. Mech. Appl. 581, 126202. https://doi.org/10.1016/j.physa.2021.126202 (2021).
Article MathSciNet CAS Google Scholar
Li, S., Cai, C., Gong, J., Liu, X. & Li, H. A fast protein binding site comparison algorithm for proteome-wide protein function prediction and drug repurposing. Proteins Struct. Funct. Bioinform. 89, 1541–1556. https://doi.org/10.1002/prot.26176 (2021).
Article CAS Google Scholar
Moret, M. A., Miranda, J. G. V., Nogueira, E., Santana, M. C. & Zebende, G. F. Self-similarity and protein chains. Phys. Rev. E Stat. Nonlinear Soft Matter Phys. 71, 012901. https://doi.org/10.1103/PhysRevE.71.012901 (2005).
Article CAS Google Scholar
Moret, M. A., Santana, M. C., Nogueira, E. & Zebende, G. F. Protein chain packing and percolation threshold. Physica A Stat. Mech. Appl. 361, 250–254 (2006).
Article ADS CAS Google Scholar
Moret, M. A. Self-organized critical model for protein folding. Physica A Stat. Mech. Appl. 390, 3055–3059. https://doi.org/10.1016/j.physa.2011.04.008 (2011).
Article ADS CAS Google Scholar
Xu, X. L., Shi, J. X., Wang, J. & Li, W. Long-range correlation and critical fluctuations in coevolution networks of protein sequences. Physica A Stat. Mech. Appl. 562, 125339. https://doi.org/10.1016/j.physa.2020.125339 (2021).
Article CAS Google Scholar
Nelson, E. D. & Onuchic, J. N. Proposed mechanism for stability of proteins to evolutionary mutations. Proc. Natl. Acad. Sci. 95, 10682–10686. https://doi.org/10.1073/pnas.95.18.10682 (1998).
Article ADS CAS PubMed PubMed Central Google Scholar
Toffoli, T. & Margolus, N. Cellular Automata Machines: A New Environment for Modeling (MIT Press in Scientific Computation, 1987).
Book Google Scholar
Sleit, A. & Madain, A. Protein folding in the two-dimensional hydrophobic polar model based on cellular automata and local rules. Int. J. Comput. Netw. Inf. Secur. 16, 48 (2016).
Google Scholar
Varela, D. & Santos, J. Protein folding modeling with neural cellular automata using Rosetta. In GECCO ’16 Companion: Proceedings of the 2016 on Genetic and Evolutionary Computation Conference Companion, GECCO ’16 Companion, 1307–1312 (Association for Computing Machinery, 2016).
Varela, D. & Santos, J. Protein folding modeling with neural cellular automata using the Face-Centered Cubic model (2017). Published in IWINAC 19 June 2017.
Varela, D. & Santos, J. Automatically obtaining a cellular automaton scheme for modeling protein folding using the FCC model. Nat. Comput.https://doi.org/10.1007/s11047-018-9705-y (2019).
Article MathSciNet Google Scholar
Wolfram, S. Cellular automata as models of complexity. Nature 311, 419–424 (1984).
Article ADS Google Scholar
Xiao, X. & Chou, K. Digital coding of amino acids based on hydrophobic index. Protein Pept. Lett. 14, 871–5 (2007).
Article CAS Google Scholar
Xiao, X., Wang, P. & Chou, K. C. Predicting protein structural classes with pseudo amino acid composition: An approach using geometric moments of cellular automaton image. J. Theor. Biol. 254, 691–6. https://doi.org/10.1016/j.jtbi.2008.06.016 (2008).
Article ADS MathSciNet CAS PubMed MATH Google Scholar
Kavianpour, H. & Vasighi, M. Structural classification of proteins using texture descriptors extracted from the cellular automata image. Amino Acids 49, 261–271. https://doi.org/10.1007/s00726-016-2354-5 (2017).
Article CAS PubMed Google Scholar
Wang, M. et al. A new nucleotide-composition based fingerprint of SARS-CoV with visualization analysis. Med. Chem.https://doi.org/10.2174/1573406053402505 (2005).
Article PubMed Google Scholar
Gabler, F. et al. Protein sequence analysis using the MPI bioinformatics toolkit. Curr. Protoc. Bioinform. 72, e108. https://doi.org/10.1002/cpbi.108 (2020).
Article CAS Google Scholar
Ghosh, S. & Chaudhuri, P. P. Cellular automata model for proteomics and its application in cancer immunotherapy. In Cellular Automata. ACRI 2018. Lecture Notes in Computer Science, 3–15 (Springer International Publishing, 2018).
Xiao, X., Shao, S., Ding, Y. & Chen, X. Digital coding for amino acid based on cellular automata. In 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No. 04CH37583), vol. 5, 4593–4598, https://doi.org/10.1109/ICSMC.2004.1401256 (2004).
Phillips, J. C., Moret, M. A., Zebende, G. F. & Chow, C. C. Phase transitions may explain why SARS-CoV-2 spreads so fast and why new variants are spreading faster. Physica A 598, 127318. https://doi.org/10.1016/j.physa.2022.127318 (2022).
Article CAS PubMed PubMed Central Google Scholar
Xiao, X. et al. Using cellular automata to generate image representation for biological sequences. Amino Acids 28, 29–35. https://doi.org/10.1007/s00726-004-0154-9 (2005).
Article CAS PubMed PubMed Central Google Scholar
Hamming, R. W. Error detecting and error correcting codes. Bell Syst. Tech. J. 29, 147–160. https://doi.org/10.1002/j.1538-7305.1950.tb00463.x (1950).
Article MathSciNet MATH Google Scholar
Mullen, J. L. et al. Outbreak. Info (2021). Accessed 17 Dec 2021.
European Centre for Disease Prevention and Control. Implications of the emergence and spread of the SARS-CoV-2 b.1.1. 529 variant of concern (Omicron) for the EU/EEA. https://www.ecdc.europa.eu/en/publications-data/threat-assessment-brief-emergence-sars-cov-2-variant-b.1.1.529 (2021). Accessed 17 Dec 2021.
World Health Organization. Enhancing Readiness for Omicron (b.1.1.529): Technical brief and priority actions for member states. https://www.who.int/publications/m/item/enhancing-readiness-for-omicron-(b.1.1.529)-technical-brief-and-priority-actions-for-member-states (2021). Accessed 17 Dec 2021.
Wu, Z. C., Xiao, X. & Chou, K. C. 2D-MH: A web-server for generating graphic representation of protein sequences based on the physicochemical properties of their constituent amino acids. J. Theor. Biol. 267, 29–34. https://doi.org/10.1016/j.jtbi.2010.08.007 (2010).
Article ADS MathSciNet CAS PubMed MATH Google Scholar
Rahman, M. M., Biswas, B. A. & Bhuiyan, M. I. H. Protein similarity analysis by wavelet decomposition of cellular automata images. In 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE), 1–6 (IEEE, 2019).
Saitou, N. & Nei, M. The neighbor-joining method: A new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4, 406–442. https://doi.org/10.1093/oxfordjournals.molbev.a040454 (1987).
Article CAS PubMed Google Scholar
Edelman, G. M. & Gally, J. A. Degeneracy and complexity in biological systems. Proc. Natl. Acad. Sci. 98, 13763–13768. https://doi.org/10.1073/pnas.231499798 (2001).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Author information

Authors and Affiliations

CCET, Universidade Federal do Oeste da Bahia, Barreiras, 47808-021, Brazil
Luryane F. Souza
SENAI-CIMATEC, Salvador, 41650 -010, Brazil
Luryane F. Souza & Marcelo A. Moret
ICCMP & IF, Universidade de Brasília, Brasília, 70910-900, Brazil
Tarcísio M. Rocha Filho
DCET, UNEB, Salvador, Brazil
Marcelo A. Moret

Authors

Luryane F. Souza
View author publications
You can also search for this author in PubMed Google Scholar
Tarcísio M. Rocha Filho
View author publications
You can also search for this author in PubMed Google Scholar
Marcelo A. Moret
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.F.S. conducted the computer modelling, M.A.M. separated the protein sequence data from the variants,T.M.R.F. made the images of the automata. All analyzed the results and reviewed the manuscript.

Corresponding author

Correspondence to Luryane F. Souza.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Souza, L.F., Rocha Filho, T.M. & Moret, M.A. Relating SARS-CoV-2 variants using cellular automata imaging. Sci Rep 12, 10297 (2022). https://doi.org/10.1038/s41598-022-14404-6

Download citation

Received: 02 February 2022
Accepted: 07 June 2022
Published: 18 June 2022
DOI: https://doi.org/10.1038/s41598-022-14404-6

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.