Analysis of 1.9 Mb of contiguous sequence from chromosome 4 of Arabidopsis thaliana

The plant Arabidopsis thaliana (Arabidopsis) has become an important model species for the study of many aspects of plant biology (ref. 1). The relatively small size of the nuclear genome and the availability of extensive physical maps of the five chromosomes (refs 2–4) provide a feasible basis for initiating sequencing of the five chromosomes. The YAC (yeast artificial chromosome)-based physical map of chromosome 4 was used to construct a sequence-ready map of cosmid and BAC (bacterial artificial chromosome) clones covering a 1.9-megabase (Mb) contiguous region (ref. 5), and the sequence of this region is reported here. Analysis of the sequence revealed an average gene density of one gene every 4.8 kilobases (kb), and 54% of the predicted genes had significant similarity to known genes. Other interesting features were found, such as the sequence of a disease-resistance gene locus, the distribution of retroelements, the frequent occurrence of clustered gene families, and the sequence of several classes of genes not previously encountered in plants.
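As a rough illustration of what the quoted gene density implies, the short sketch below divides the region length by the average spacing and applies the 54% figure. It uses only the rounded numbers given in the abstract, so the results are back-of-the-envelope estimates (roughly 400 genes), not the gene counts reported in the paper; the function and variable names are hypothetical.

```python
# Back-of-the-envelope estimate from the rounded figures in the abstract.
# These are NOT the paper's reported counts; names here are illustrative only.

def implied_gene_count(region_kb: float, kb_per_gene: float) -> float:
    """Number of genes implied by a region length and an average gene spacing."""
    return region_kb / kb_per_gene

region_kb = 1.9 * 1000      # 1.9 Mb expressed in kb
kb_per_gene = 4.8           # one gene every 4.8 kb on average
similar_fraction = 0.54     # share of predicted genes similar to known genes

genes = implied_gene_count(region_kb, kb_per_gene)
similar = genes * similar_fraction

print(f"Implied number of predicted genes: about {genes:.0f}")
print(f"Implied number with similarity to known genes: about {similar:.0f}")
```

Because both inputs are rounded, the estimate should be read as an order-of-magnitude check on the abstract's figures rather than as a result.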

Figure 1: This map shows the positions of genes, predicted genes and other features.
Figure 2: The pie chart shows the proportion of predicted genes with assigned cellular roles in each of the functional categories described in Table 2.


  1. Meyerowitz, E. M. & Somerville, C. R. (eds) Arabidopsis (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1994).

  2. Schmidt, R. et al. Physical map and organization of Arabidopsis chromosome 4. Science 270, 480–483 (1995).

  3. Zachgo, E. A. et al. A physical map of chromosome 2 of Arabidopsis thaliana. Genome Res. 6, 19–25 (1996).

  4. Schmidt, R., Love, K., West, J., Lenehan, Z. & Dean, C. Description of 31 YAC contigs spanning the majority of Arabidopsis thaliana chromosome 5. Plant J. 11, 563–572 (1997).

  5. Bancroft, I. et al. A strategy involving the use of high-redundancy YAC subclone libraries facilitates the contiguous representation in cosmid and BAC clones of 1.7 Mb of the genome of Arabidopsis thaliana. Weeds World 4, 1–9 (1997).

  6. Sato, S. et al. Structural analysis of Arabidopsis thaliana chromosome 5. I. Sequence features of the 1.6 Mb regions covered by twenty physically assigned P1 clones. DNA Res. 4, 215–230 (1997).

  7. Pearson, W. R. & Lipman, D. J. Improved tools for biological sequence comparison. Proc. Natl Acad. Sci. USA 85, 2444–2448 (1988).

  8. Riley, M. Functions of gene products in E. coli. Microbiol. Rev. 57, 862–952 (1993).

  9. Mewes, H. W. et al. Overview of the yeast genome. Nature 387 (suppl.) 7–84 (1997).

  10. White, S. E., Habera, L. F. & Wessler, S. R. Retrotransposons in the flanking regions of normal plant genes: A role for copia-like elements in the evolution of gene structure and expression. Proc. Natl Acad. Sci. USA 91, 11792–11796 (1994).

  11. Konieczny, A., Voytas, D. F., Cummings, M. P. & Ausubel, F. M. A superfamily of Arabidopsis thaliana retrotransposons. Genetics 127, 801–809 (1991).

  12. SanMiguel, P. et al. Nested retrotransposons in the intergenic regions of the maize genome. Science 274, 765–768 (1996).

  13. Wessler, S. R., Bureau, T. E. & White, S. E. LTR-retrotransposons and MITEs: important players in the evolution of plant genomes. Curr. Opin. Genet. Dev. 5, 814–821 (1995).

  14. Parker, J. E. et al. The Arabidopsis downy mildew resistance gene RPP5 shares similarity to the Toll and interleukin-1 receptors with N and L6. Plant Cell 9, 879–894 (1997).

  15. Pear, J. R., Kawagoe, Y., Schreckengost, W. E., Delmer, D. P. & Stalker, D. M. Higher plants contain homologues of the bacterial celA genes encoding the catalytic subunit of cellulose synthase. Proc. Natl Acad. Sci. USA 93, 12637–12642 (1996).

  16. Back, K. & Chappell, J. Cloning and bacterial expression of a sesquiterpene cyclase from Hyoscyamus muticus and its molecular comparison to related terpene cyclases. J. Biol. Chem. 270, 7375–7381 (1995).

  17. Gavin, K. A., Hidaka, M. & Stillman, B. Conserved initiator proteins in eukaryotes. Science 270, 1667–1671 (1995).

  18. Fishel, R. et al. The human mutator gene homologue MSH2 and its association with hereditary nonpolyposis colon cancer. Cell 75, 1027–1038 (1993).

  19. Marcus, G. A., Silverman, N., Berger, S. L., Horiuchi, J. & Guarente, L. Functional similarity and physical association between GCN5 and ADA2: putative transcriptional adaptors. EMBO J. 13, 4807–4815 (1994).

  20. Neuwald, A. F. & Landsman, D. GCN5-related histone N-acetyltransferases belong to a diverse superfamily that includes the yeast SPT10 protein. Trends Biochem. Sci. 22, 154–155 (1997).

  21. Oppenheimer, D. G. et al. Essential role for a kinesin-like protein in Arabidopsis trichome morphogenesis. Proc. Natl Acad. Sci. USA 94, 6261–6266 (1997).

  22. Pongs, O. et al. Frequenin–a novel calcium-binding protein that modulates synaptic efficacy in the Drosophila nervous system. Neuron 11, 15–28 (1993).

  23. Friesen, H., Lunz, R., Doyle, S. & Segall, J. Mutation in the SPS1-encoded protein kinase of Saccharomyces cerevisiae leads to defects in transcription and morphology during spore formation. Genes Dev. 8, 2162–2175 (1994).

  24. Gabor Miklos, G. & Rubin, G. M. The role of the genome project in determining gene function: Insights from model organisms. Cell 86, 521–529 (1996).

  25. Wilson, R. et al. 2.2 Mb of contiguous nucleotide sequence from chromosome III of C. elegans. Nature 368, 32–38 (1994).

  26. Meyerowitz, E. M. Plants and the logic of development. Genetics 145, 5–9 (1997).

  27. Bent, E., Johnson, S. & Bancroft, I. BAC representation of two low-copy regions of the genome of Arabidopsis thaliana. Plant J. (in the press).

  28. Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).

  29. Hebsgaard, S. M. et al. Splice site prediction in Arabidopsis thaliana pre-mRNA by combining local and global sequence information. Nucleic Acids Res. 24, 3439–3452 (1996).


This work was initiated and sponsored by the European Commission, DG-XII Life Sciences. Additional support from the BBSRC Plant and Animal Genome Analysis Programme, GREG (Groupe de Recherche et d'Etude des Genomes), BioResearch Ireland, and Plan Nacional de Investigacion Cientifica y Technica is gratefully acknowledged.

Author information

Correspondence to M. Bevan.
