Skip to main content

Thank you for visiting You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Deciphering the genome structure and paleohistory of Theobroma cacao


We sequenced and assembled the genome of Theobroma cacao, an economically important tropical fruit tree crop that is the source of chocolate. The assembly corresponds to 76% of the estimated genome size and contains almost all previously described genes, with 82% of them anchored on the 10 T. cacao chromosomes. Analysis of this sequence information highlighted specific expansion of some gene families during evolution, for example flavonoid-related genes. It also provides a major source of candidate genes for T. cacao disease resistance and quality improvement. Based on the inferred paleohistory of the T. cacao genome, we propose an evolutionary scenario whereby the ten T. cacao chromosomes were shaped from an ancestor through eleven chromosome fusions. The T. cacao genome can be considered as a simple living relic of higher plant evolution.

Author information



Rights and permissions

Creative Commons Attribution 3.0 License.

Reprints and Permissions

About this article

Cite this article

Chaparro, C., Murat, F., Carlson, J. et al. Deciphering the genome structure and paleohistory of Theobroma cacao. Nat Prec (2010).

Download citation


  • cocoa
  • cacao
  • genome
  • sequence
  • Theobroma cacao
  • Evolution


Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing