Potato (Solanum tuberosum L.) is the world’s most important non-grain food crop and is central to global food security. It is clonally propagated, highly heterozygous, autotetraploid, and suffers acute inbreeding depression. Here we use a homozygous doubled-monoploid potato clone to sequence and assemble 86% of the 844-megabase genome. We predict 39,031 protein-coding genes and present evidence for at least two genome duplication events indicative of a palaeopolyploid origin. As the first genome sequence of an asterid, the potato genome reveals 2,642 genes specific to this large angiosperm clade. We also sequenced a heterozygous diploid clone and show that gene presence/absence variants and other potentially deleterious mutations occur frequently and are a likely cause of inbreeding depression. Gene family expansion, tissue-specific expression and recruitment of genes to new pathways contributed to the evolution of tuber development. The potato genome sequence provides a platform for genetic improvement of this vital crop.
- Supplementary Information (20.2 MB)
The file contains Supplementary Text, Supplementary Figures 1-15 with legends and Supplementary Tables 1-18 and 20-34. This is a 500-page document and may take a short while to download. This file was replaced on 9 May 2014 as Supplementary Figure 12 was corrupted.
- Supplementary Tables (12.4 MB)
The zipped file contains two Supplementary Tables:
Supplementary Table 19A displaying FPKM levels in DM tissues.
Supplementary Table 19B displaying FPKM levels in RH tissues.
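For readers unfamiliar with the unit used in Supplementary Tables 19A and 19B, FPKM stands for fragments per kilobase of transcript per million mapped fragments. The sketch below shows the standard definition only; the function name and the example numbers are hypothetical, and the text here does not state the exact pipeline or normalization details used to produce the tables.

```python
def fpkm(fragments: int, gene_length_bp: int, total_mapped_fragments: int) -> float:
    """Return FPKM using the standard definition.

    FPKM = fragments * 1e9 / (gene_length_bp * total_mapped_fragments)

    The factor 1e9 combines the per-kilobase (1e3) and per-million (1e6)
    scalings. All argument names are illustrative, not taken from the paper.
    """
    if gene_length_bp <= 0 or total_mapped_fragments <= 0:
        raise ValueError("gene length and library size must be positive")
    if fragments < 0:
        raise ValueError("fragment count cannot be negative")
    return fragments * 1e9 / (gene_length_bp * total_mapped_fragments)


# Hypothetical example: 500 fragments on a 2,000 bp transcript
# in a library of 10 million mapped fragments.
example_value = fpkm(500, 2000, 10_000_000)
print(example_value)
```

Because the value is normalized by both transcript length and library size, FPKM levels for the same gene can be compared across tissues within one table, subject to the usual caveats about comparing libraries of different composition.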