Mutation trend of hemagglutinin of influenza A virus: a review from a computational mutation viewpoint

Abstract

Since 1999 we have developed two computational mutation approaches for analyzing protein primary structure; their methodology and implications were reviewed in 2002. The first approach calculates the predictable and unpredictable portions of amino-acid pairs in a protein, and the second calculates the amino-acid distribution rank in a protein. Both approaches provide quantitative measures that represent a protein, and we have used them to study a number of proteins with numerous mutations, such as p53 proteins. More recently, we have focussed our efforts on proteins that mutate frequently over time, such as the hemagglutinins of influenza A viruses. In this review we summarise our findings and their implications for hemagglutinin mutations, in combination with some newly available data. Our approaches throw light on the true nature of the genetic heterogeneity of influenza virus hemagglutinins; that is, the variability of the protein is highly relevant to its amino-acid construction. Using these approaches, we can monitor new mutations in influenza virus hemagglutinins and may be able to predict their future mutations.

Author information

Correspondence to Guang Wu.

Keywords

  • amino acid sequence
  • hemagglutinins
  • influenza A virus
  • mutation
  • probability
