Plasmodium cynomolgi genome sequences provide insight into Plasmodium vivax and the monkey malaria clade

Journal name:
Nature Genetics
Volume:
44,
Pages:
1051–1055
Year published:
DOI:
doi:10.1038/ng.2375
Received
Accepted
Published online

P. cynomolgi, a malaria-causing parasite of Asian Old World monkeys, is the sister taxon of P. vivax, the most prevalent malaria-causing species in humans outside of Africa. Because P. cynomolgi shares many phenotypic, biological and genetic characteristics with P. vivax, we generated draft genome sequences for three P. cynomolgi strains and performed genomic analysis comparing them with the P. vivax genome, as well as with the genome of a third previously sequenced simian parasite, Plasmodium knowlesi. Here, we show that genomes of the monkey malaria clade can be characterized by copy-number variants (CNVs) in multigene families involved in evasion of the human immune system and invasion of host erythrocytes. We identify genome-wide SNPs, microsatellites and CNVs in the P. cynomolgi genome, providing a map of genetic variation that can be used to map parasite traits and study parasite populations. The sequencing of the P. cynomolgi genome is a critical step in developing a model system for P. vivax research and in counteracting the neglect of P. vivax.

At a glance

Figures

  1. Architecture of the P. cynomolgi genome and associated genome-wide variation data.
    Figure 1: Architecture of the P. cynomolgi genome and associated genome-wide variation data.

    Data are shown for each of the 14 P. cynomolgi chromosomes. The six concentric rings, from outermost to innermost, represent (i) the location of 5,049 P. cynomolgi genes, excluding those on small contigs (cyan lines); (ii) genome features, including 14 centromeres (thick black lines), 43 telomeric sequence repeats (short red lines), 43 tRNA genes (red lines), 10 rRNAs (dark blue lines) and several gene family members, including 53 cyir (dark green lines), 8 rbp (brown lines), 13 sera (serine-rich antigen; pink lines), 25 trag (tryptophan-rich antigen; purple lines), 12 msp3 (merozoite surface protein 3; light gray lines), 13 msp7 (merozoite surface protein 7; gray lines), 25 rad (silver lines), 8 etramp (orange lines), 16 Pf-fam-b (light blue lines) and 7 Pv-fam-d (light green lines); (iii) plot of dS/dN for 4,605 orthologs depicting genome-wide polymorphism within P. cynomolgi strains B and Berok (black line) and divergence between P. cynomolgi strains B and Berok and P. vivax Salvador I (red line); a track above the plot indicates P. cynomolgi genes under positive selection (red) and purifying selection (blue), and a track below the plot indicates P. cynomolgiP. vivax orthologs under positive selection (red) and purifying selection (blue); (iv) heatmap indicating SNP density of 3 P. cynomolgi strains plotted per 10-kb windows: red, 0–83 SNPs per 10 kb (regions of lowest SNP density); blue, 84–166 SNPs per 10 kb; green, 167–250 SNPs per 10 kb; purple, 251–333 SNPs per 10 kb; orange, 334–416 SNPs per10 kb; yellow, 417–500 SNPs per10 kb (regions of highest SNP density); (v) log2 ratio plot of CNVs identified from a comparison of P. cynomolgi strains B and Berok; and (vi) map of 182 polymorphic intergenic microsatellites (MS, black dots). The figure was generated using Circos software (see URLs).

  2. Genome synteny between six species of Plasmodium parasite.
    Figure 2: Genome synteny between six species of Plasmodium parasite.

    Protein-coding genes of P. cynomolgi are shown aligned with those of five other Plasmodium genomes: two species belonging to the monkey malaria clade, P. vivax and P. knowlesi; two species of rodent malaria, P. berghei and P. chabaudi; and P. falciparum. Highly conserved protein-coding regions between the genomes are colored in order from red (5′ end of chromosome 1) to blue (3′ end of chromosome 14) with respect to genomic position of P. cynomolgi.

  3. Comparison of the genes of P. cynomolgi, P. vivax and P. knowlesi.
    Figure 3: Comparison of the genes of P. cynomolgi, P. vivax and P. knowlesi.

    The Venn ellipses represent the three genomes, with the total number of genes assigned to the chromosomes indicated under the species name. Cylinders depict orthologous and non-orthologous genes between the three genomes, with the number of genes in each indicated and represented graphically by cylinder relative width. In each cylinder, genes are divided into three categories whose thickness is represented by colored bands proportional to category percentage.

Accession codes

Primary accessions

DDBJ/GenBank/EMBL

Sequence Read Archive

Referenced accessions

NCBI Reference Sequence

References

  1. Mendis, K., Sina, B.J., Marchesini, P. & Carter, R. The neglected burden of Plasmodium vivax malaria. Am. J. Trop. Med. Hyg. 64, 97106 (2001).
  2. Mueller, I. et al. Key gaps in the knowledge of Plasmodium vivax, a neglected human malaria parasite. Lancet Infect. Dis. 9, 555566 (2009).
  3. Baird, J.K. Resistance to chloroquine unhinges vivax malaria therapeutics. Antimicrob. Agents Chemother. 55, 18271830 (2011).
  4. Rayner, J.C., Liu, W., Peeters, M., Sharp, P.M. & Hahn, B.H. A plethora of Plasmodium species in wild apes: a source of human infection? Trends Parasitol. 27, 222229 (2011).
  5. Cornejo, O.E. & Escalante, A.A. The origin and age of Plasmodium vivax. Trends Parasitol. 22, 558563 (2006).
  6. Escalante, A.A. et al. A monkey's tale: the origin of Plasmodium vivax as a human malaria parasite. Proc. Natl. Acad. Sci. USA 102, 19801985 (2005).
  7. Mu, J. et al. Host switch leads to emergence of Plasmodium vivax malaria in humans. Mol. Biol. Evol. 22, 16861693 (2005).
  8. Singh, B. et al. A large focus of naturally acquired Plasmodium knowlesi infections in human beings. Lancet 363, 10171024 (2004).
  9. Pain, A. et al. The genome of the simian and human malaria parasite Plasmodium knowlesi. Nature 455, 799803 (2008).
  10. Eyles, D.E., Coatney, G.R. & Getz, M.E. Vivax-type malaria parasite of macaques transmissible to man. Science 131, 18121813 (1960).
  11. Gibbs, R.A. et al. Evolutionary and biomedical insights from the rhesus macaque genome. Science 316, 222234 (2007).
  12. Carlton, J.M. et al. Comparative genomics of the neglected human malaria parasite Plasmodium vivax. Nature 455, 757763 (2008).
  13. Saxena, A.K., Wu, Y. & Garboczi, D.N. Plasmodium p25 and p28 surface proteins: potential transmission-blocking vaccines. Eukaryot. Cell 6, 12601265 (2007).
  14. Iyer, J., Gruner, A.C., Renia, L., Snounou, G. & Preiser, P.R. Invasion of host cells by malaria parasites: a tale of two protein families. Mol. Microbiol. 65, 231249 (2007).
  15. Okenu, D.M., Malhotra, P., Lalitha, P.V., Chitnis, C.E. & Chauhan, V.S. Cloning and sequence analysis of a gene encoding an erythrocyte binding protein from Plasmodium cynomolgi. Mol. Biochem. Parasitol. 89, 301306 (1997).
  16. Coatney, G.R., Collins, W.E., Warren, M. & Contacos, P.G. The Primate Malarias (US Department of Health, Education and Welfare, Washington, DC, 1971).
  17. Cunningham, D., Lawton, J., Jarra, W., Preiser, P. & Langhorne, J. The pir multigene family of Plasmodium: antigenic variation and beyond. Mol. Biochem. Parasitol. 170, 6573 (2010).
  18. al-Khedery, B., Barnwell, J.W. & Galinski, M.R. Antigenic variation in malaria: a 3′ genomic alteration associated with the expression of a P. knowlesi variant antigen. Mol. Cell 3, 131141 (1999).
  19. Krotoski, W.A. The hypnozoite and malarial relapse. Prog. Clin. Parasitol. 1, 119 (1989).
  20. Campbell, T.L., De Silva, E.K., Olszewski, K.L., Elemento, O. & Llinas, M. Identification and genome-wide prediction of DNA binding specificities for the ApiAP2 family of regulators from the malaria parasite. PLoS Pathog. 6, e1001165 (2010).
  21. Mu, J. et al. Genome-wide variation and identification of vaccine targets in the Plasmodium falciparum genome. Nat. Genet. 39, 126130 (2007).
  22. Volkman, S.K. et al. A genome-wide map of diversity in Plasmodium falciparum. Nat. Genet. 39, 113119 (2007).
  23. Xie, C. & Tammi, M.T. CNV-seq, a new method to detect copy number variation using high-throughput sequencing. BMC Bioinformatics 10, 80 (2009).
  24. Lee, K.S. et al. Plasmodium knowlesi: reservoir hosts and tracking the emergence in humans and macaques. PLoS Pathog. 7, e1002015 (2011).
  25. Nei, M. & Gojobori, T. Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol. Biol. Evol. 3, 418426 (1986).
  26. Doi, M. et al. Worldwide sequence conservation of transmission-blocking vaccine candidate Pvs230 in Plasmodium vivax. Vaccine 29, 43084315 (2011).
  27. Carlton, J.M., Sina, B.J. & Adams, J.H. Why is Plasmodium vivax a neglected tropical disease? PLoS Negl. Trop. Dis. 5, e1160 (2011).
  28. Ohtsubo, Y., Ikeda-Ohtsubo, W., Nagata, Y. & Tsuda, M. GenomeMatcher: a graphical user interface for DNA sequence comparison. BMC Bioinformatics 9, 376 (2008).
  29. Cantarel, B.L. et al. MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res. 18, 188196 (2008).
  30. Popendorf, K., Tsuyoshi, H., Osana, Y. & Sakakibara, Y. Murasaki: a fast, parallelizable algorithm to find anchors from multiple genomes. PLoS ONE 5, e12651 (2010).
  31. Thompson, J.D., Higgins, D.G. & Gibson, T.J. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22, 46734680 (1994).
  32. Felsenstein, J. PHYLIP, Phylogeny Inference Package, 3.6a3 edn (University of Washington, Seattle, 2005).
  33. Yang, Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 15861591 (2007).
  34. Ning, Z., Cox, A.J. & Mullikin, J.C. SSAHA: a fast search method for large DNA databases. Genome Res. 11, 17251729 (2001).
  35. Li, H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27, 29872993 (2011).
  36. Tamura, K., Dudley, J., Nei, M. & Kumar, S. MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol. Biol. Evol. 24, 15961599 (2007).
  37. Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 17541760 (2009).
  38. Edgar, R.C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 17921797 (2004).

Download references

Author information

  1. These authors jointly directed this work.

    • Jane M Carlton &
    • Kazuyuki Tanabe

Affiliations

  1. Laboratory of Malariology, Research Institute for Microbial Diseases, Osaka University, Suita, Japan.

    • Shin-Ichiro Tachibana,
    • Hajime Honma &
    • Kazuyuki Tanabe
  2. Department of Biology, Center for Genomics and Systems Biology, New York University, New York, New York, USA.

    • Steven A Sullivan,
    • Hyunjae R Kim,
    • Patrick L Sutton,
    • Rimma Shakhbatyan &
    • Jane M Carlton
  3. Laboratory of Tropical Medicine and Parasitology, Institute of International Education and Research, Dokkyo Medical University, Shimotsuga, Japan.

    • Satoru Kawai
  4. Genome Information Research Center, Research Institute for Microbial Diseases, Osaka University, Suita, Japan.

    • Shota Nakamura,
    • Naohisa Goto &
    • Teruo Yasunaga
  5. Department of Molecular Protozoology, Research Institute for Microbial Diseases, Osaka University, Suita, Japan.

    • Nobuko Arisue,
    • Nirianne M Q Palacpac,
    • Hajime Honma,
    • Masanori Yagi,
    • Takahiro Tougan,
    • Toshihiro Horii &
    • Kazuyuki Tanabe
  6. The Corporation for Production and Research of Laboratory Primates, Tsukuba, Japan.

    • Yuko Katakai
  7. Department of Protozoology, Institute of Tropical Medicine (NEKKEN) and Global COE (Centers of Excellence) Program, Nagasaki University, Nagasaki, Japan.

    • Osamu Kaneko
  8. Department of Molecular and Cellular Parasitology, Graduate School of Medicine, Juntendo University, Tokyo, Japan.

    • Toshihiro Mita
  9. Department of Biomedical Chemistry, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.

    • Kiyoshi Kita
  10. Tsukuba Primate Research Center, National Institute of Biomedical Innovation, Tsukuba, Japan.

    • Yasuhiro Yasutomi
  11. Center for Global Health, Centers for Disease Control and Prevention, Divison of Parasitic Diseases and Malaria, Atlanta, Georgia, USA.

    • John W Barnwell
  12. Center for Evolutionary Medicine and Informatics, The Biodesign Institute, Arizona State University, Tempe, Arizona, USA.

    • Ananias A Escalante
  13. Present address: Career-Path Promotion Unit for Young Life Scientists, Kyoto University, Kyoto, Japan.

    • Shin-Ichiro Tachibana

Contributions

K.T., J.M.C., A.A.E. and J.W.B. conceived and conducted the study. S.K., Y.K., Y.Y., S.-I.T. and J.W.B. provided P. cynomolgi material. S.N., N.G., T.Y. and H.R.K. constructed a computing system for data processing, and S.-I.T., H.H., P.L.S., S.A.S. and H.R.K. performed scaffolding of contigs and manual annotation of the predicted genes. S.N. performed sequence correction of supercontigs and gene prediction. S.-I.T., S.N., N.G., N.A., M.Y., O.K., K.T., H.R.K., R.S., S.A.S. and J.M.C. analyzed data. S.-I.T., N.M.Q.P., T.T., T.M., K.K., J.M.C., T.H., A.A.E., J.W.B. and K.T. wrote the manuscript.

Competing financial interests

The authors declare no competing financial interests.

Corresponding authors

Correspondence to:

Author details

Supplementary information

PDF files

  1. Supplementary Text and Figures (6M)

    Supplementary Tables 1, 2, 4, 6, 8 , 9, 13 and 14, Supplementary Figures 1–5 and Supplementary Note

Excel files

  1. Supplementary Table 3 (2M)

    List of orthologs between the genomes of P. cynomolgi, P. vivax and P. knowlesi

  2. Supplementary Table 5 (119K)

    List of multigene families in P. cynomolgi, P. vivax and P. knowlesi

  3. Supplementary Table 7 (164K)

    List of P. cynomolgi cyir genes, P. vivax vir genes, P. knowlesi kir genes, and P. knowlesi SICAvar genes and their homologs in P. cynomolgi, P. vivax and P. knowlesi

  4. Supplementary Table 10 (74K)

    List of polymorphic microsatellite loci identified between P. cynomolgi strains B and Berok.

  5. Supplementary Table 11 (356K)

    Ds-Dn within 4,605 orthologs of P. cynomolgi strains B and Berok.

  6. Supplementary Table 12 (385K)

    Ds-Dn between 4,605 orthologs of P. cynomolgi strains B and Berok, and P. vivax Salvador I.

Additional data