Genome sequencing and analysis of the model grass Brachypodium distachyon

doi:10.1038/nature08747

Article
Published: 11 February 2010

The International Brachypodium Initiative

Nature volume 463, pages 763–768 (2010)

Abstract

Three subfamilies of grasses, the Ehrhartoideae, Panicoideae and Pooideae, provide the bulk of human nutrition and are poised to become major sources of renewable energy. Here we describe the genome sequence of the wild grass Brachypodium distachyon (Brachypodium), which is, to our knowledge, the first member of the Pooideae subfamily to be sequenced. Comparison of the Brachypodium, rice and sorghum genomes shows a precise history of genome evolution across a broad diversity of the grasses, and establishes a template for analysis of the large genomes of economically important pooid grasses such as wheat. The high-quality genome sequence, coupled with ease of cultivation and transformation, small size and rapid life cycle, will help Brachypodium reach its potential as an important model system for developing new energy and food crops.

Main

Grasses provide the bulk of human nutrition, and highly productive grasses are promising sources of sustainable energy¹. The grass family (Poaceae) comprises over 600 genera and more than 10,000 species that dominate many ecological and agricultural systems^2,3. So far, genomic efforts have largely focused on two economically important grass subfamilies, the Ehrhartoideae (rice) and the Panicoideae (maize, sorghum, sugarcane and millets). The rice⁴ and sorghum⁵ genome sequences and a detailed physical map of maize⁶ showed extensive conservation of gene order^5,7 and both ancient and relatively recent polyploidization.

Most cool season cereal, forage and turf grasses belong to the Pooideae subfamily, which is also the largest grass subfamily. The genomes of many pooids are characterized by daunting size and complexity. For example, the bread wheat genome is approximately 17,000 megabases (Mb) and contains three independent genomes⁸. This has prohibited genome-scale comparisons spanning the three most economically important grass subfamilies.

Brachypodium, a member of the Pooideae subfamily, is a wild annual grass endemic to the Mediterranean and Middle East⁹ that has promise as a model system. This has led to the development of highly efficient transformation^10,11, germplasm collections^12,13,14, genetic markers¹⁴, a genetic linkage map¹⁵, bacterial artificial chromosome (BAC) libraries^16,17, physical maps¹⁸ (M.F., unpublished observations), mutant collections (http://brachypodium.pw.usda.gov, http://www.brachytag.org), microarrays and databases (http://www.brachybase.org, http://www.phytozome.net, http://www.modelcrop.org, http://mips.helmholtz-muenchen.de/plant/index.jsp) that are facilitating the use of Brachypodium by the research community. The genome sequence described here will allow Brachypodium to act as a powerful functional genomics resource for the grasses. It is also an important advance in grass structural genomics, permitting, for the first time, whole-genome comparisons between members of the three most economically important grass subfamilies.

Genome sequence assembly and annotation

The diploid inbred line Bd21 (ref. 19) was sequenced using whole-genome shotgun sequencing (Supplementary Table 1). The ten largest scaffolds contained 99.6% of all sequenced nucleotides (Supplementary Table 2). Comparison of these ten scaffolds with a genetic map (Supplementary Fig. 1) detected two false joins and created a further seven joins to produce five pseudomolecules that spanned 272 Mb (Supplementary Table 3), within the range measured by flow cytometry^20,21. The assembly was confirmed by cytogenetic analysis (Supplementary Fig. 2) and alignment with two physical maps and sequenced BACs (Supplementary Data). More than 98% of expressed sequence tags (ESTs) mapped to the sequence assembly, consistent with a near-complete genome (Supplementary Table 4 and Supplementary Fig. 3). Compared to other grasses, the Brachypodium genome is very compact, with retrotransposons concentrated at the centromeres and syntenic breakpoints (Fig. 1). DNA transposons and derivatives are broadly distributed and primarily associated with gene-rich regions.

Figure 1: **Chromosomal distribution of the main** ***Brachypodium*** **genome features.**

We analysed small RNA populations from inflorescence tissues with deep Illumina sequencing, and mapped them onto the genome sequence (Fig. 2a, Supplementary Fig. 4 and Supplementary Table 5). Small RNA reads were most dense in regions of high repeat density, similar to the distribution reported in Arabidopsis²². We identified 413 and 198 21- and 24-nucleotide phased short interfering RNA (siRNA) loci, respectively. Using the same algorithm, the only phased loci identified in Arabidopsis were five of the eight trans-acting siRNA loci, and none was 24-nucleotide phased. The biological functions of these clusters of Brachypodium phased siRNAs, which account for a significant number of small RNAs that map outside repeat regions, are not known at present.
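The idea of a phased locus can be illustrated with a minimal sketch. It is not the algorithm used in this study, which is not described in the text; it only assumes that reads from a phased locus start at positions sharing one register modulo the phase length (21 or 24 nucleotides). The function name `phasing_summary` and the example coordinates are hypothetical, and a real analysis would also handle both strands and apply a statistical test.

```python
from collections import Counter


def phasing_summary(starts, phase=21):
    """Return (best_register, fraction) for a list of read 5' start coordinates.

    best_register is the most common value of start % phase, and fraction is
    the share of reads that fall in that register. Illustrative only: single
    strand, no significance test.
    """
    if not starts:
        raise ValueError("at least one read start is required")
    registers = Counter(pos % phase for pos in starts)
    best_register, count = registers.most_common(1)[0]
    return best_register, count / len(starts)


# Hypothetical locus: five reads spaced 21 nt apart plus one out-of-phase read.
example_starts = [100, 121, 142, 163, 184, 110]
register, fraction = phasing_summary(example_starts, phase=21)
print(register, fraction)  # register 16 holds 5 of the 6 reads
```

A locus whose reads are spread evenly over all registers would give a fraction close to 1/phase, so values far above that are what a phasing score is designed to detect.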

Figure 2: **Transcript and gene identification and distribution among three grass subfamilies.**

A total of 25,532 protein-coding gene loci was predicted in the v1.0 annotation (Supplementary Information and Supplementary Table 6). This is in the same range as rice (RAP2, 28,236)²³ and sorghum (v1.4, 27,640)⁵, suggesting similar gene numbers across a broad diversity of grasses. Gene models were evaluated using ∼10.2 gigabases (Gb) of Illumina RNA-seq data (Supplementary Fig. 5)²⁴. Overall, 92.7% of predicted coding sequences (CDS) were supported by Illumina data (Fig. 2b), demonstrating the high accuracy of the Brachypodium gene predictions. These gene models are available from several databases (such as http://www.brachybase.org, http://www.phytozome.net, http://www.modelcrop.org and http://mips.org).

Between 77 and 84% of gene families (defined according to Supplementary Fig. 6) are shared among the three grass subfamilies represented by Brachypodium, rice and sorghum, reflecting a relatively recent common origin (Fig. 2c). Grass-specific genes include transmembrane receptor protein kinases, glycosyltransferases, peroxidases and P450 proteins (Supplementary Table 7B). The Pooideae-specific gene set contains only 265 gene families (Supplementary Table 7C) comprising 811 genes (1,400 including singletons). Genes enriched in grasses were significantly more likely to be contained in tandem arrays than random genes, demonstrating a prominent role for tandem gene expansion in the evolution of grass-specific genes (Supplementary Fig. 7 and Supplementary Table 8).
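A tandem array can be pictured as a run of same-family genes that are adjacent, or nearly adjacent, along a chromosome. The sketch below is a hedged illustration of that idea only; the definition actually used for the analysis is given in the Supplementary Information and is not reproduced here. The function name `tandem_arrays`, the `max_spacers` parameter and the example family labels are assumptions made for the example.

```python
def tandem_arrays(families, max_spacers=0):
    """Group same-family genes that lie close together along one chromosome.

    families: family label for each gene in chromosomal order (None = no family).
    max_spacers: number of unrelated genes tolerated between two array members
                 (an assumed, adjustable threshold).
    Returns a list of arrays; each array is a list of gene indices (length >= 2).
    """
    last_seen = {}  # family -> index of its most recent member
    open_array = {}  # family -> array currently being extended
    arrays = []
    for i, fam in enumerate(families):
        if fam is None:
            continue
        prev = last_seen.get(fam)
        if prev is not None and i - prev - 1 <= max_spacers:
            open_array[fam].append(i)
        else:
            open_array[fam] = [i]
            arrays.append(open_array[fam])
        last_seen[fam] = i
    return [a for a in arrays if len(a) >= 2]


# Hypothetical gene order on one chromosome.
order = ["kinase", "kinase", "P450", None, "P450", "kinase"]
print(tandem_arrays(order, max_spacers=1))
```

Comparing the fraction of grass-enriched genes that fall in such arrays with the fraction for randomly chosen genes is the kind of contrast the enrichment statement above refers to.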

To validate and improve the v1.0 gene models, we manually annotated 2,755 gene models from 97 diverse gene families (Supplementary Tables 9–11) relevant to bioenergy and food crop improvement. We annotated 866 genes involved in cell wall biosynthesis/modification and 948 transcription factors from 16 families²⁵. Only 13% of the gene models required modification and very few pseudogenes were identified, demonstrating the accuracy of the v1.0 annotation. Phylogenetic trees for 62 gene families were constructed using genes from rice, Arabidopsis, sorghum and poplar. In nearly all cases, Brachypodium genes had a similar distribution to rice and sorghum, demonstrating that Brachypodium is suitably generic for grass functional genomics research (Supplementary Figs 8 and 9). Analysis of the predicted secretome identified substantial differences in the distribution of cell wall metabolism genes between dicots and grasses (Supplementary Tables 12, 13 and Supplementary Fig. 10), consistent with their different cell walls²⁶. Signal peptide probability curves also suggested that start codons were accurately predicted (Supplementary Fig. 11).

Maintaining a small grass genome size

Exhaustive analysis of transposable elements (Supplementary Information and Supplementary Table 14) showed retrotransposon sequences comprise 21.4% of the genome, compared to 26% in rice, 54% in sorghum, and more than 80% in wheat²⁷. Thirteen retroelement sets were younger than 20,000 years, showing a recent activation compared to rice²⁸ (Supplementary Fig. 12), and a further 53 retroelement sets were less than 0.1 million years (Myr) old. A minimum of 17.4 Mb has been lost by long terminal repeat (LTR)–LTR recombination, demonstrating that retroelement expansion is countered by removal through recombination. In contrast, retroelements persist for very long periods of time in the closely related Triticeae²⁸.

DNA transposons comprise 4.77% of the Brachypodium genome, within the range found in other grass genomes^5,29. Transcriptome data and structural analysis suggest that many non-autonomous Mariner DTT and Harbinger elements recruit transposases from other families. Two CACTA DTC families (M and N) carried five non-element genes, and the Harbinger U family has amplified a NBS-LRR gene family (Supplementary Figs 13 and 14), adding it to the group of transposable elements implicated in gene mobility^30,31. Centromeric regions were characterized by low gene density, characteristic repeats and retroelement clusters (Supplementary Fig. 15). Other repeat classes are described in Supplementary Table 15. Conserved non-coding sequences are described in Supplementary Fig. 16.

Whole-genome comparison of three diverse grass genomes

The evolutionary relationships between Brachypodium, sorghum, rice and wheat were assessed by measuring the mean synonymous substitution rates (K_s) of orthologous gene pairs (Supplementary Information, Supplementary Fig. 17 and Supplementary Table 16), from which divergence times of Brachypodium from wheat 32–39 Myr ago, rice 40–53 Myr ago, and sorghum 45–60 Myr ago (Fig. 3a) were estimated. The K_s of orthologous gene pairs in the intragenomic Brachypodium duplications (Fig. 3b) suggests duplication 56–72 Myr ago, before the diversification of the grasses. This is consistent with previous evolutionary histories inferred from a small number of genes^3,32,33,34.
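The conversion from K_s to a divergence time can be sketched under a simple molecular clock, T = K_s / (2r), where r is the synonymous substitution rate per site per year and the factor of 2 accounts for substitutions accumulating along both lineages. The sketch below is an illustration, not the calculation used here: the default rate of 6.5 × 10⁻⁹ is a value often used for grasses and is an assumption, and the example K_s is hypothetical rather than taken from Supplementary Table 16.

```python
def divergence_time_myr(ks, rate_per_site_per_year=6.5e-9):
    """Convert mean synonymous divergence (Ks) to a divergence time in Myr.

    Assumes a strict molecular clock: T = Ks / (2 * r). The default rate is
    an assumed value, not one stated in the main text of the article.
    """
    if ks < 0:
        raise ValueError("Ks must be non-negative")
    if rate_per_site_per_year <= 0:
        raise ValueError("rate must be positive")
    years = ks / (2.0 * rate_per_site_per_year)
    return years / 1e6


# Hypothetical mean Ks between two grass lineages.
print(divergence_time_myr(0.52))  # roughly 40 Myr under the assumed rate
```

Because the estimate scales inversely with the assumed rate, the ranges quoted for each species pair are best read as reflecting uncertainty in both K_s and the clock calibration.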

Figure 3: ***Brachypodium*** **genome evolution and synteny between grass subfamilies.**

Paralogous relationships among Brachypodium chromosomes showed six major chromosomal duplications covering 92.1% of the genome (Fig. 3b), representing ancestral whole-genome duplication³⁵. Using the rice and sorghum genome sequences, genetic maps of barley³⁶ and Aegilops tauschii (the D genome donor of hexaploid wheat)³⁷, and bin-mapped wheat ESTs^38,39, 21,045 orthologous relationships between Brachypodium, rice, sorghum and Triticeae were identified (Supplementary Information). These identified 59 blocks of collinear genes covering 99.2% of the Brachypodium genome (Fig. 3c–e). The orthologous relationships are consistent with an evolutionary model that shaped five Brachypodium chromosomes from a five-chromosome ancestral genome by a 12-chromosome intermediate involving seven major chromosome fusions³⁹ (Supplementary Fig. 18). These collinear blocks of orthologous genes provide a robust and precise sequence framework for understanding grass genome evolution and aiding the assembly of sequences from other pooid grasses. We identified 14 major syntenic disruptions between Brachypodium and rice/sorghum that can be explained by nested insertions of entire chromosomes into centromeric regions (Fig. 4a, b)^2,37,40. Similar nested insertions in sorghum³⁷ and barley (Fig. 4c, d) were also identified. Centromeric repeats and peaks in retroelements at the junctions of chromosome insertions are footprints of these insertion events (Supplementary Fig. 15C and Fig. 1), as is higher gene density at the former distal regions of the inserted chromosomes (Fig. 1). Notably, the reduction in chromosome number in Brachypodium and wheat occurred independently because none of the chromosome fusions are shared by Brachypodium and the Triticeae³⁷ (Supplementary Fig. 18).
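How orthologous gene pairs are grouped into collinear blocks can be illustrated with a greedy sketch. It is a simplification and not the method used for this analysis, which is described in the Supplementary Information; dedicated synteny tools typically use dynamic programming instead. The function name `collinear_blocks`, the thresholds `max_gap` and `min_genes`, and the example coordinates are all hypothetical.

```python
def collinear_blocks(pairs, max_gap=5, min_genes=3):
    """Chain orthologue pairs into collinear blocks (greedy illustration).

    pairs: (pos_a, pos_b) gene-order indices on one chromosome of genome A
           and one chromosome of genome B.
    A pair extends the current block when it lies within max_gap genes of the
    previous pair in both genomes and keeps the block's orientation
    (increasing or decreasing order in genome B).
    """
    blocks, current, direction = [], [], 0
    for a, b in sorted(pairs):
        if current:
            prev_a, prev_b = current[-1]
            step = b - prev_b
            sign = (step > 0) - (step < 0)
            close = 0 < a - prev_a <= max_gap and 0 < abs(step) <= max_gap
            if close and direction in (0, sign):
                current.append((a, b))
                direction = sign
                continue
            if len(current) >= min_genes:
                blocks.append(current)
        current, direction = [(a, b)], 0
    if len(current) >= min_genes:
        blocks.append(current)
    return blocks


# Hypothetical anchors: one forward block, one inverted block, one stray pair.
anchors = [(1, 10), (2, 11), (3, 13), (4, 40), (5, 39), (6, 38), (20, 5)]
for block in collinear_blocks(anchors):
    print(block)
```

In this toy input the abrupt jump in genome B between the third and fourth anchors ends the first block, which is the kind of discontinuity that marks a syntenic breakpoint.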

Figure 4: **A recurring pattern of nested chromosome fusions in grasses.**

Comparisons of evolutionary rates between Brachypodium, sorghum, rice and Ae. tauschii demonstrated a substantially higher rate of genome change in Ae. tauschii (Supplementary Table 17). This may be due to retroelement activity that increases syntenic disruptions, as proposed for chromosome 5S later⁴¹. Among seven relatively large gene families, four were highly syntenic and two (NBS-LRR and F-box) were almost never found in syntenic order when compared to rice and sorghum (Supplementary Table 18), consistent with the rapid diversification of the NBS-LRR and F-box gene families⁴².

The short arm of chromosome 5 (Bd5S) has a gene density roughly half of the rest of the genome, high LTR retrotransposon density, the youngest intact Gypsy elements and the lowest solo LTR density. Thus, unlike the rest of the Brachypodium genome, Bd5S is gaining retrotransposons by replication and losing fewer by recombination. Syntenic regions of rice (Os4S) and sorghum (Sb6S) demonstrate maintenance of this high repeat content for ∼50–70 Myr (Supplementary Fig. 19)⁴³. Bd5S, Os4S and Sb6S also have the lowest proportion of collinear genes (Fig. 4a and Supplementary Fig. 19). We propose that the chromosome ancestral to Bd5S reached a tipping point in which high retrotransposon density had deleterious effects on genes.

Discussion

As the first genome sequence of a pooid grass, the Brachypodium genome aids genome analysis and gene identification in the large and complex genomes of wheat and barley, two other pooid grasses that are among the world’s most important crops. The very high quality of the Brachypodium genome sequence, in combination with those from two other grass subfamilies, enabled reconstruction of chromosome evolution across a broad diversity of grasses. This analysis contributes to our understanding of grass diversification by explaining how the varying chromosome numbers found in the major grass subfamilies derive from an ancestral set of five chromosomes by nested insertions of whole chromosomes into centromeres. The relatively small genome of Brachypodium contains many active retroelement families, but recombination between these keeps genome expansion in check. The short arm of chromosome 5 deviates from the rest of the genome by exhibiting a trend towards genome expansion through increased retroelement numbers and disruption of gene order more typical of the larger genomes of closely related grasses.

Grass crop improvement for sustainable fuel⁴⁴ and food⁴⁵ production requires a substantial increase in research in species such as Miscanthus, switchgrass, wheat and cool season forage grasses. These considerations have led to the rapid adoption of Brachypodium as an experimental system for grass research. The similarities in gene content and gene family structure between Brachypodium, rice and sorghum support the value of Brachypodium as a functional genomics model for all grasses. The Brachypodium genome sequence analysis reported here is therefore an important advance towards securing sustainable supplies of food, feed and fuel from new generations of grass crops.

Methods Summary

Genome sequencing and assembly

Sanger sequencing was used to generate paired-end reads from 3 kb, 8 kb, fosmid (35 kb) and BAC (100 kb) clones to generate 9.4× coverage (Supplementary Table 1). The final assembly of 83 scaffolds covers 271.9 Mb (Supplementary Table 3). Sequence scaffolds were aligned to a genetic map to create pseudomolecules covering each chromosome (Supplementary Figs 1 and 2).

Protein-coding gene annotation

Gene models were derived from weighted consensus prediction from several ab initio gene finders, optimal spliced alignments of ESTs and transcript assemblies, and protein homology. Illumina transcriptome sequence was aligned to predicted genome features to validate exons, splice sites and alternatively spliced transcripts.

Repeats analysis

The MIPS ANGELA pipeline was used to integrate analyses from expert groups. LTR-STRUCT and LTR-HARVEST⁴⁶ were used for de novo retroelement searches.

Accession codes

Primary accessions

DDBJ/GenBank/EMBL

GT758162–GT865804

Data deposits

The whole-genome shotgun sequence of Brachypodium distachyon has been deposited at DDBJ/EMBL/GenBank under the accession ADDN00000000. (The version described in this manuscript is the first version, accession ADDN01000000). EST sequences have been deposited with dbEST (accessions 67946317–68053959) and GenBank (accessions GT758162–GT865804). The short read archive accession for RNA-seq data is SRA010177.

References

1. Somerville, C. The billion-ton biofuels vision. Science 312, 1277 (2006)
2. Kellogg, E. A. Evolutionary history of the grasses. Plant Physiol. 125, 1198–1205 (2001)
3. Gaut, B. S. Evolutionary dynamics of grass genomes. New Phytol. 154, 15–28 (2002)
4. International Rice Genome Sequencing Project. The map-based sequence of the rice genome. Nature 436, 793–800 (2005)
5. Paterson, A. H. et al. The Sorghum bicolor genome and the diversification of grasses. Nature 457, 551–556 (2009)
6. Wei, F. et al. Physical and genetic structure of the maize genome reflects its complex evolutionary history. PLoS Genet. 3, e123 (2007)
7. Moore, G., Devos, K. M., Wang, Z. & Gale, M. D. Cereal genome evolution. Grasses, line up and form a circle. Curr. Biol. 5, 737–739 (1995)
8. Salamini, F., Ozkan, H., Brandolini, A., Schafer-Pregl, R. & Martin, W. Genetics and geography of wild cereal domestication in the near east. Nature Rev. Genet. 3, 429–441 (2002)
9. Draper, J. et al. Brachypodium distachyon. A new model system for functional genomics in grasses. Plant Physiol. 127, 1539–1555 (2001)
10. Vain, P. et al. Agrobacterium-mediated transformation of the temperate grass Brachypodium distachyon (genotype Bd21) for T-DNA insertional mutagenesis. Plant Biotechnol. J. 6, 236–245 (2008)
11. Vogel, J. & Hill, T. High-efficiency Agrobacterium-mediated transformation of Brachypodium distachyon inbred line Bd21–3. Plant Cell Rep. 27, 471–478 (2008)
12. Vogel, J. P., Garvin, D. F., Leong, O. M. & Hayden, D. M. Agrobacterium-mediated transformation and inbred line development in the model grass Brachypodium distachyon. Plant Cell Tissue Organ Cult. 84, 100179–100191 (2006)
13. Filiz, E. et al. Molecular, morphological and cytological analysis of diverse Brachypodium distachyon inbred lines. Genome 52, 876–890 (2009)
14. Vogel, J. P. et al. Development of SSR markers and analysis of diversity in Turkish populations of Brachypodium distachyon. BMC Plant Biol. 9, 88 (2009)
15. Garvin, D. F. et al. An SSR-based genetic linkage map of the model grass Brachypodium distachyon. Genome 53, 1–13 (2009)
16. Huo, N. et al. Construction and characterization of two BAC libraries from Brachypodium distachyon, a new model for grass genomics. Genome 49, 1099–1108 (2006)
17. Huo, N. et al. The nuclear genome of Brachypodium distachyon: analysis of BAC end sequences. Funct. Integr. Genomics 8, 135–147 (2008)
18. Gu, Y. Q. et al. A BAC-based physical map of Brachypodium distachyon and its comparative analysis with rice and wheat. BMC Genomics 10, 496 (2009)
19. Garvin, D. F. et al. Development of genetic and genomic research resources for Brachypodium distachyon, a new model system for grass crop research. Crop Sci. 48, S-69–S-84 (2008)
20. Bennett, M. D. & Leitch, I. J. Nuclear DNA amounts in angiosperms: progress, problems and prospects. Ann. Bot. (Lond.) 95, 45–90 (2005)
21. Vogel, J. P. et al. EST sequencing and phylogenetic analysis of the model grass Brachypodium distachyon. Theor. Appl. Genet. 113, 186–195 (2006)
22. Rajagopalan, R., Vaucheret, H., Trejo, J. & Bartel, D. P. A diverse and evolutionarily fluid set of microRNAs in Arabidopsis thaliana. Genes Dev. 20, 3407–3425 (2006)
23. Tanaka, T. et al. The rice annotation project database (RAP-DB): 2008 update. Nucleic Acids Res. 36, D1028–D1033 (2008)
24. Fox, S., Filichkin, S. & Mockler, T. Applications of ultra-high-throughput sequencing. Methods Mol. Biol. 553, 79–108 (2009)
25. Gray, J. et al. A recommendation for naming transcription factor proteins in the grasses. Plant Physiol. 149, 4–6 (2009)
26. Vogel, J. Unique aspects of the grass cell wall. Curr. Opin. Plant Biol. 11, 301–307 (2008)
27. Bennetzen, J. L. & Kellogg, E. A. Do plants have a one-way ticket to genomic obesity? Plant Cell 9, 1509–1514 (1997)
28. Wicker, T. & Keller, B. Genome-wide comparative analysis of copia retrotransposons in Triticeae, rice, and Arabidopsis reveals conserved ancient evolutionary lineages and distinct dynamics of individual copia families. Genome Res. 17, 1072–1081 (2007)
29. Wicker, T. et al. Analysis of intraspecies diversity in wheat and barley genomes identifies breakpoints of ancient haplotypes and provides insight into the structure of diploid and hexaploid triticeae gene pools. Plant Physiol. 149, 258–270 (2009)
30. Jiang, N., Bao, Z., Zhang, X., Eddy, S. R. & Wessler, S. R. Pack-MULE transposable elements mediate gene evolution in plants. Nature 431, 569–573 (2004)
31. Morgante, M. et al. Gene duplication and exon shuffling by helitron-like transposons generate intraspecies diversity in maize. Nature Genet. 37, 997–1002 (2005)
32. Grass Phylogeny Working Group. Phylogeny and subfamilial classification of the grasses (Poaceae). Ann. Mo. Bot. Gard. 88, 373–457 (2001)
33. Bossolini, E., Wicker, T., Knobel, P. A. & Keller, B. Comparison of orthologous loci from small grass genomes Brachypodium and rice: implications for wheat genomics and grass genome annotation. Plant J. 49, 704–717 (2007)
34. Charles, M. et al. Sixty million years in evolution of soft grain trait in grasses: emergence of the softness locus in the common ancestor of Pooideae and Ehrhartoideae, after their divergence from Panicoideae. Mol. Biol. Evol. 26, 1651–1661 (2009)
35. Paterson, A. H., Bowers, J. E. & Chapman, B. A. Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics. Proc. Natl Acad. Sci. USA 101, 9903–9908 (2004)
36. Stein, N. et al. A 1,000-loci transcript map of the barley genome: new anchoring points for integrative grass genomics. Theor. Appl. Genet. 114, 823–839 (2007)
37. Luo, M. C. et al. Genome comparisons reveal a dominant mechanism of chromosome number reduction in grasses and accelerated genome evolution in Triticeae. Proc. Natl Acad. Sci. USA 106, 15780–15785 (2009)
38. Qi, L. L. et al. A chromosome bin map of 16,000 expressed sequence tag loci and distribution of genes among the three genomes of polyploid wheat. Genetics 168, 701–712 (2004)
39. Salse, J. et al. Identification and characterization of shared duplications between rice and wheat provide new insight into grass genome evolution. Plant Cell 20, 11–24 (2008)
40. Srinivasachary, M. M., Gale, M. D. & Devos, K. M. Comparative analyses reveal high levels of conserved colinearity between the finger millet and rice genomes. Theor. Appl. Genet. 115, 489–499 (2007)
41. Vicient, C. M., Kalendar, R. & Schulman, A. H. Variability, recombination, and mosaic evolution of the barley BARE-1 retrotransposon. J. Mol. Evol. 61, 275–291 (2005)
42. Meyers, B. C., Kozik, A., Griego, A., Kuang, H. & Michelmore, R. W. Genome-wide analysis of NBS-LRR-encoding genes in Arabidopsis. Plant Cell 15, 809–834 (2003)
43. Ma, J. & Bennetzen, J. L. Rapid recent growth and divergence of rice nuclear genomes. Proc. Natl Acad. Sci. USA 101, 12404–12410 (2004)
44. U.S. Department of Energy Office of Science. Breaking the Biological Barriers to Cellulosic Ethanol: A Joint Research Agenda 〈http://genomicscience.energy.gov/biofuels/b2bworkshop.shtml〉 (2006)
45. Food and Agriculture Organization of the United Nations. World Agriculture: Towards 2030/2050 Interim Report 〈http://www.fao.org/ES/esd/AT2050web.pdf〉 (2006)
46. McCarthy, E. M. & McDonald, J. F. LTR_STRUC: a novel search and identification program for LTR retrotransposons. Bioinformatics 19, 362–367 (2003)

Acknowledgements

We acknowledge the contributions of the late M. Gale, who identified the importance of conserved gene order in grass genomes. This work was mainly supported by the US Department of Energy Joint Genome Institute Community Sequencing Program project with J.P.V., D.F.G., T.C.M. and M.W.B., a BBSRC grant to M.W.B., an EU Contract Agronomics grant to M.W.B. and K.F.X.M., and a GABI Barlex grant to K.F.X.M. Illumina transcriptome sequencing was supported by a DOE Plant Feedstock Genomics for Bioenergy grant and an Oregon State Agricultural Research Foundation grant to T.C.M.; small RNA research was supported by the DOE Plant Feedstock Genomics for Bioenergy grants to P.J.G. and T.C.M.; annotation was supported by a DOE Plant Feedstock Genomics for Bioenergy grant to J.P.V. A full list of support and acknowledgements is in the Supplementary Information.

Author Contributions See list of consortium authors below.

Author information

A list of participants and their affiliations appears at the end of the paper.

Authors and Affiliations

USDA-ARS Western Regional Research Center, Albany, California 94710, USA
John P. Vogel (Leader), Naxin Huo, Yong Q. Gu, Gerard R. Lazo, Olin D. Anderson, Jennifer N. Bragg, Debbie Laudencia-Chingcuanco, William Belknap, Ludmila Tyler, Jiajie Wu & James Thomson
USDA-ARS Plant Science Research Unit and University of Minnesota, St Paul, Minnesota 55108, USA
David F. Garvin
Oregon State University, Corvallis, Oregon 97331-4501, USA
Todd C. Mockler (Leader), Samuel E. Fox, Henry D. Priest, Sergei A. Filichkin, Scott A. Givan, Douglas W. Bryant, Jeff H. Chang, Noah Fahlgren, Christopher M. Sullivan, James C. Carrington, Elisabeth J. Chapman, Jeffrey A. Kimbrel & David F. Garvin
HudsonAlpha Institute, Huntsville, Alabama 35806, USA
Jeremy Schmutz (Leader) & Jane Grimwood
US DOE Joint Genome Institute, Walnut Creek, California 94598, USA
Dan Rokhsar, Kerrie Barry, Susan Lucas, Miranda Harmon-Smith, Kathleen Lail, Hope Tice, Erika Lindquist & Mei Wang
University of California Berkeley, Berkeley, California 94720, USA
Dan Rokhsar, Therese Mitros & Ludmila Tyler
John Innes Centre, Norwich NR4 7UJ, UK
Michael W. Bevan, Neil McKenzie, Jonathan Wright, Melanie Febrer, Mary E. Byrne, Sean Walsh & Janet Higgins
University of California Davis, Davis, California 95616, USA
Frank M. You, Ming-Cheng Luo, Jan Dvorak, Jiajie Wu, Laura E. Bartley, Peijian Cao, Ki-Hong Jung, Manoj K. Sharma, Miguel Vega-Sanchez & Pamela Ronald
University of Silesia, 40-032 Katowice, Poland
Dominika Idziak & Robert Hasterok
Iowa State University, Ames, Iowa 50011, USA
Haiyan Wu, Wei Wu, An-Ping Hsia & Patrick S. Schnable
Washington State University, Pullman, Washington 99163, USA
Anantharaman Kalyanaraman
University of Florida, Gainesville, Florida 32611, USA
Brad Barbazuk
Rutgers University, Piscataway, New Jersey 08855-0759, USA
Todd P. Michael, Remy Bruggmann & Joachim Messing
University of Massachusetts, Amherst, Massachusetts 01003-9292, USA
Samuel P. Hazen & Shan Chen
Horticulture Department, USDA-ARS Vegetable Crops Research Unit, University of Wisconsin, Madison, Wisconsin 53706, USA
Yiqun Weng
Helmholtz Zentrum München, D-85764 Neuherberg, Germany
Georg Haberer, Manuel Spannagl, Klaus Mayer (Leader) & Heidrun Gundlach
Technical University München, 80333 München, Germany
Thomas Rattei
Cornell University, Ithaca, New York 14853, USA
Sang-Jik Lee & Jocelyn K. C. Rose
Boyce Thompson Institute for Plant Research, Ithaca, New York 14853-1801, USA
Lukas A. Mueller, Thomas L. York, Pinghua Li & Thomas Brutnell
University of Zurich, 8008 Zurich, Switzerland
Thomas Wicker (Leader) & Jan P. Buchmann
MTT Agrifood Research and University of Helsinki, FIN-00014 Helsinki, Finland
Jaakko Tanskanen & Alan H. Schulman (Leader)
Federal University of Pelotas, Pelotas, 96001-970, RS, Brazil
Antonio Costa de Oliveira & Luciano da C. Maia
Michigan State University, East Lansing, Michigan 48824, USA
Ning Jiang
China Agricultural University, Beijing 10094, China
Haiyan Wu, Patrick S. Schnable, Jinsheng Lai, Yu Cui, Shuhong Ouyang, Qixin Sun & Zhiyong Liu
Purdue University, West Lafayette, Indiana 47907, USA
Liucun Zhu & Jianxin Ma
The University of Texas, Arlington, Arlington, Texas 76019, USA
Cheng Sun & Ellen Pritham
Institut National de la Recherche Agronomique UMR 1095, 63100 Clermont-Ferrand, France
Jerome Salse (Leader), Florent Murat, Michael Abrouk & Elisabeth J. Chapman
University of California San Diego, La Jolla, California 92093, USA
Elisabeth J. Chapman
National Centre for Genome Resources, Santa Fe, New Mexico 87505, USA
Greg D. May
University of Delaware, Newark, Delaware 19716, USA
Jixian Zhai, Matthias Ganssmann, Sai Guna Ranjan Gurazada, Marcelo German, Blake C. Meyers & Pamela J. Green (Leader)
Joint Bioenergy Institute, Emeryville, California 94720, USA
Henrik V. Scheller, Laura E. Bartley, Peijian Cao, Ki-Hong Jung, Manoj K. Sharma, Miguel Vega-Sanchez & Pamela Ronald
University of Copenhagen, Frederiksberg DK-1871, Denmark
Jesper Harholt & Peter Ulvskov
USDA-ARS Appalachian Fruit Research Station, Kearneysville, West Virginia 25430, USA
Christopher D. Dardick
VIB Department of Plant Systems Biology, VIB and Department of Plant Biotechnology and Genetics, Ghent University, Technologiepark 927, 9052 Gent, Belgium
Stefanie De Bodt, Wim Verelst & Dirk Inzé
Institut de Biologie Moléculaire des Plantes du CNRS, Strasbourg 67084, France
Maren Heese & Arp Schnittger
BioEnergy Science Center and Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831-6422, USA
Xiaohan Yang, Udaya C. Kalluri & Gerald A. Tuskan
University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
Zhihua Hua & Richard D. Vierstra
The Ohio State University, Columbus, Ohio 43210, USA
Alper Yilmaz & Erich Grotewold
Institut Jean-Pierre Bourgin, UMR1318, Institut National de la Recherche Agronomique, 78026 Versailles cedex, France
Richard Sibout, Kian Hematy, Gregory Mouille & Herman Höfte
Université de Picardie, Amiens 80039, France
Jérome Pelloux
Plant Gene Expression Center, University of California Berkeley, Albany, California 94710, USA
Devin O’Connor, James Schnable, Scott Rowe & Frank Harmon
Illinois State University and DOE Great Lakes Bioenergy Research Center, Normal, Illinois 61790, USA
Cynthia L. Cass & John C. Sedbrook
Sabanci University, Istanbul 34956, Turkey
Turgay Unver & Hikmet Budak
Unité de Recherche en Génomique Végétale: URGV (INRA-CNRS-UEVE), Evry 91057, France
Harry Belcram, Mathieu Charles & Boulos Chalhoub
USDA-ARS/Donald Danforth Plant Science Center, St Louis, Missouri 63130, USA
Ivan Baxter

Consortia

The International Brachypodium Initiative

Principal investigators
- John P. Vogel
- David F. Garvin
- Todd C. Mockler
- Jeremy Schmutz
- Dan Rokhsar
- Michael W. Bevan
DNA sequencing and assembly
- Kerrie Barry
- Susan Lucas
- Miranda Harmon-Smith
- Kathleen Lail
- Hope Tice
- Jeremy Schmutz (Leader)
- Jane Grimwood
- Neil McKenzie
- Michael W. Bevan
Pseudomolecule assembly and BAC end sequencing
- Naxin Huo
- Yong Q. Gu
- Gerard R. Lazo
- Olin D. Anderson
- John P. Vogel (Leader)
- Frank M. You
- Ming-Cheng Luo
- Jan Dvorak
- Jonathan Wright
- Melanie Febrer
- Michael W. Bevan
- Dominika Idziak
- Robert Hasterok
- David F. Garvin
Transcriptome sequencing and analysis
- Erika Lindquist
- Mei Wang
- Samuel E. Fox
- Henry D. Priest
- Sergei A. Filichkin
- Scott A. Givan
- Douglas W. Bryant
- Jeff H. Chang
- Todd C. Mockler (Leader)
- Haiyan Wu
- Wei Wu
- An-Ping Hsia
- Patrick S. Schnable
- Anantharaman Kalyanaraman
- Brad Barbazuk
- Todd P. Michael
- Samuel P. Hazen
- Jennifer N. Bragg
- Debbie Laudencia-Chingcuanco
- John P. Vogel
- David F. Garvin
- Yiqun Weng
- Neil McKenzie
- Michael W. Bevan
Gene analysis and annotation
- Georg Haberer
- Manuel Spannagl
- Klaus Mayer (Leader)
- Thomas Rattei
- Therese Mitros
- Dan Rokhsar
- Sang-Jik Lee
- Jocelyn K. C. Rose
- Lukas A. Mueller
- Thomas L. York
Repeats analysis
- Thomas Wicker (Leader)
- Jan P. Buchmann
- Jaakko Tanskanen
- Alan H. Schulman (Leader)
- Heidrun Gundlach
- Jonathan Wright
- Michael Bevan
- Antonio Costa de Oliveira
- Luciano da C. Maia
- William Belknap
- Yong Q. Gu
- Ning Jiang
- Jinsheng Lai
- Liucun Zhu
- Jianxin Ma
- Cheng Sun
- Ellen Pritham
Comparative genomics
- Jerome Salse (Leader)
- Florent Murat
- Michael Abrouk
- Georg Haberer
- Manuel Spannagl
- Klaus Mayer
- Remy Bruggmann
- Joachim Messing
- Frank M. You
- Ming-Cheng Luo
- Jan Dvorak
Small RNA analysis
- Noah Fahlgren
- Samuel E. Fox
- Christopher M. Sullivan
- Todd C. Mockler
- James C. Carrington
- Elisabeth J. Chapman
- Greg D. May
- Jixian Zhai
- Matthias Ganssmann
- Sai Guna Ranjan Gurazada
- Marcelo German
- Blake C. Meyers
- Pamela J. Green (Leader)
Manual annotation and gene family analysis
- Jennifer N. Bragg
- Ludmila Tyler
- Jiajie Wu
- Yong Q. Gu
- Gerard R. Lazo
- Debbie Laudencia-Chingcuanco
- James Thomson
- John P. Vogel (Leader)
- Samuel P. Hazen
- Shan Chen
- Henrik V. Scheller
- Jesper Harholt
- Peter Ulvskov
- Samuel E. Fox
- Sergei A. Filichkin
- Noah Fahlgren
- Jeffrey A. Kimbrel
- Jeff H. Chang
- Christopher M. Sullivan
- Elisabeth J. Chapman
- James C. Carrington
- Todd C. Mockler
- Laura E. Bartley
- Peijian Cao
- Ki-Hong Jung
- Manoj K Sharma
- Miguel Vega-Sanchez
- Pamela Ronald
- Christopher D. Dardick
- Stefanie De Bodt
- Wim Verelst
- Dirk Inzé
- Maren Heese
- Arp Schnittger
- Xiaohan Yang
- Udaya C. Kalluri
- Gerald A. Tuskan
- Zhihua Hua
- Richard D. Vierstra
- David F. Garvin
- Yu Cui
- Shuhong Ouyang
- Qixin Sun
- Zhiyong Liu
- Alper Yilmaz
- Erich Grotewold
- Richard Sibout
- Kian Hematy
- Gregory Mouille
- Herman Höfte
- Todd Michael
- Jérome Pelloux
- Devin O’Connor
- James Schnable
- Scott Rowe
- Frank Harmon
- Cynthia L. Cass
- John C. Sedbrook
- Mary E. Byrne
- Sean Walsh
- Janet Higgins
- Michael Bevan
- Pinghua Li
- Thomas Brutnell
- Turgay Unver
- Hikmet Budak
- Harry Belcram
- Mathieu Charles
- Boulos Chalhoub
- Ivan Baxter

Corresponding authors

Correspondence to John P. Vogel, David F. Garvin, Todd C. Mockler or Michael W. Bevan.

Supplementary information

Supplementary Information

This file contains Supplementary Information, Supplementary Tables S1-S18, Supplementary Figures S1-S19 with Legends, Supplementary Acknowledgments and Supplementary References. (PDF 2282 kb)

Supplementary Data

This file shows dot-plot alignments comparing the finished sequence of 23 randomly selected BAC clones (2,378,733 bp in total) with the whole-genome shotgun assembly. The alignments show only one mismatch in collinearity, demonstrating the accuracy of the final assemblies. (JPG 3992 kb)

About this article

Cite this article

The International Brachypodium Initiative. Genome sequencing and analysis of the model grass Brachypodium distachyon. Nature 463, 763–768 (2010). https://doi.org/10.1038/nature08747

Received: 29 August 2009
Accepted: 09 December 2009
Issue Date: 11 February 2010
DOI: https://doi.org/10.1038/nature08747
