The genome sequence and structure of rice chromosome 1

Sasaki, Takuji; Matsumoto, Takashi; Yamamoto, Kimiko; Sakata, Katsumi; Baba, Tomoya; Katayose, Yuichi; Wu, Jianzhong; Niimura, Yoshihito; Cheng, Zhukuan; Nagamura, Yoshiaki; Antonio, Baltazar A.; Kanamori, Hiroyuki; Hosokawa, Satomi; Masukawa, Masatoshi; Arikawa, Koji; Chiden, Yoshino; Hayashi, Mika; Okamoto, Masako; Ando, Tsuyu; Aoki, Hiroyoshi; Arita, Kohei; Hamada, Masao; Harada, Chizuko; Hijishita, Saori; Honda, Mikiko; Ichikawa, Yoko; Idonuma, Atsuko; Iijima, Masumi; Ikeda, Michiko; Ikeno, Maiko; Ito, Sachie; Ito, Tomoko; Ito, Yuichi; Ito, Yukiyo; Iwabuchi, Aki; Kamiya, Kozue; Karasawa, Wataru; Katagiri, Satoshi; Kikuta, Ari; Kobayashi, Noriko; Kono, Izumi; Machita, Kayo; Maehara, Tomoko; Mizuno, Hiroshi; Mizubayashi, Tatsumi; Mukai, Yoshiyuki; Nagasaki, Hideki; Nakashima, Marina; Nakama, Yuko; Nakamichi, Yumi; Nakamura, Mari; Namiki, Nobukazu; Negishi, Manami; Ohta, Isamu; Ono, Nozomi; Saji, Shoko; Sakai, Kumiko; Shibata, Michie; Shimokawa, Takanori; Shomura, Ayahiko; Song, Jianyu; Takazaki, Yuka; Terasawa, Kimihiro; Tsuji, Kumiko; Waki, Kazunori; Yamagata, Harumi; Yamane, Hiroko; Yoshiki, Shoji; Yoshihara, Rie; Yukawa, Kazuko; Zhong, Huisun; Iwama, Hisakazu; Endo, Toshinori; Ito, Hidetaka; Hahn, Jang Ho; Kim, Ho-Il; Eun, Moo-Young; Yano, Masahiro; Jiang, Jiming; Gojobori, Takashi

doi:10.1038/nature01184

Letter
Open access
Published: 21 November 2002

Nature volume 420, pages 312–316 (2002)

Abstract

The rice species Oryza sativa is considered to be a model plant because of its small genome size, extensive genetic map, relative ease of transformation and synteny with other cereal crops^1,2,3,4. Here we report the essentially complete sequence of chromosome 1, the longest chromosome in the rice genome. We summarize characteristics of the chromosome structure and the biological insight gained from the sequence. The analysis of 43.3 megabases (Mb) of non-overlapping sequence reveals 6,756 protein coding genes, of which 3,161 show homology to proteins of Arabidopsis thaliana, another model plant. About 30% (2,073) of the genes have been functionally categorized. Rice chromosome 1 is (G + C)-rich, especially in its coding regions, and is characterized by several gene families that are dispersed or arranged in tandem repeats. Comparison with a draft sequence⁵ indicates the importance of a high-quality finished sequence.

Main

Rice has been studied extensively by molecular genetics and constitutes one of the best characterized crop plants with a fine genetic map of 3,267 markers (http://rgp.dna.affrc.go.jp/publicdata/geneticmap2000/index.html)¹, a yeast artificial chromosome (YAC) physical map with 80.8% coverage², sequences for about 10,000 unique expressed sequence tags (ESTs)³, and a transcriptional map indicating the placement of 6,591 unique ESTs². The Rice Genome Research Program (RGP) in Japan launched its rice genome sequencing project in 1998. It is a partner of the International Rice Genome Sequencing Project (IRGSP), which involves ten countries in Asia, North America, South America and Europe that are working towards the immediate release of high-quality sequence data to the public domain⁴. The draft sequences of the two main subspecies of rice, japonica and indica, have been reported^5,6. Both studies were based on whole-genome shotgun sequencing rather than on the clone-by-clone approach of the IRGSP. Although the release of the draft sequence is of immense scientific value, many challenges in rice genomics demand the availability of a complete, accurate, map-based rice genome sequence.

We determined the sequence of chromosome 1 from 390 overlapping phage (P1)-derived artificial chromosome (PAC) and bacterial artificial chromosome (BAC) clones and assembled it into nine contigs (Fig. 1). The longest contig is 14.4 Mb and spans positions 106.2 centimorgans (cM) to 157.1 cM on the molecular genetic map. Among the eight remaining gaps, gap 4, located at 73.4 cM, corresponds to a portion of the centromeric region and is estimated to be about 1,400 kilobases (kb) by the pachytene fluorescence in situ hybridization (FISH) method⁷. PAC/BAC clones adjacent to this gap contain copies of the rice centromere-specific sequence RCS2 (ref. 8). Two PAC clones, P0402A09 and P0020E09, are localized to the most distal ends of the short arm and the long arm, and their map positions have been verified by pachytene FISH using pAtT4 (ref. 9), a telomeric clone of Arabidopsis (Supplementary Fig. 1). This indicates that our physical map extends to within less than 50 kb of the telomeres. Integration of the PAC/BAC physical mapping with the results from fibre FISH gives a total length of 45.7 Mb for chromosome 1, corresponding to 181.8 cM on the genetic map, excluding the telomeres.
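The total physical and genetic lengths quoted above imply a chromosome-wide average ratio of physical to genetic distance. The following is a minimal sketch of that arithmetic in Python; the function and constant names are illustrative and not part of the original analysis.

```python
# Values quoted in the text for rice chromosome 1.
TOTAL_LENGTH_MB = 45.7     # physical length from PAC/BAC mapping plus fibre FISH
GENETIC_LENGTH_CM = 181.8  # genetic map length, excluding the telomeres


def kb_per_cm(length_mb: float, length_cm: float) -> float:
    """Return the average physical distance in kb per centimorgan."""
    return length_mb * 1000.0 / length_cm


average_ratio = kb_per_cm(TOTAL_LENGTH_MB, GENETIC_LENGTH_CM)
print(f"Average ratio: {average_ratio:.1f} kb per cM")
```

This is only a whole-chromosome average; it says nothing about how recombination varies between regions.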

**Figure 1: Physical map of rice chromosome 1.**

Statistics for the nucleotide sequence of rice chromosome 1 are summarized in Table 1. The non-overlapping sequence covers 43,276,883 nucleotides. In this sequence, 6,756 genes were either identified or predicted. Thus, the average gene density of chromosome 1 is about one gene per 6.4 kb. If this distribution is assumed to be similar throughout the whole genome, then the total number of genes in the rice genome (400 Mb) is roughly 62,500. This number is 2.5 times larger than the gene total of Arabidopsis¹⁰. But this difference might easily be the result of an overestimate of rice genes, because it assumes that there is a uniform distribution of genes along the chromosomes.
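The gene density and the genome-wide extrapolation above can be reproduced with a few lines of arithmetic. This Python sketch uses only the figures quoted in the paragraph; the variable names are illustrative. It shows that the 62,500 estimate follows from the rounded density of 6.4 kb per gene, and that the unrounded density gives a slightly lower figure.

```python
# Figures quoted in the text.
SEQUENCED_BP = 43_276_883     # non-overlapping sequence of chromosome 1
PREDICTED_GENES = 6_756       # genes identified or predicted
GENOME_SIZE_BP = 400_000_000  # genome size assumed in the text (400 Mb)

# Average spacing between genes, in base pairs.
bp_per_gene = SEQUENCED_BP / PREDICTED_GENES

# Extrapolation with the rounded density used in the text (6.4 kb per gene).
genes_rounded = GENOME_SIZE_BP / 6_400

# Extrapolation with the unrounded density.
genes_unrounded = GENOME_SIZE_BP / bp_per_gene

print(f"Density: one gene per {bp_per_gene / 1000:.1f} kb")
print(f"Extrapolated genes (rounded density): {genes_rounded:,.0f}")
print(f"Extrapolated genes (unrounded density): {genes_unrounded:,.0f}")
```

Both figures rest on the same uniform-density assumption that the text itself flags as a likely source of overestimation.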

Table 1 Compositional analysis of the sequence of rice chromosome 1

Cytogenetic analysis has indicated clear differences in the content of heterochromatin in each of the 12 rice chromosomes, and chromosome 1 shows the least amount of heterochromatic material¹¹. The average exon size is comparable to that of Arabidopsis, but the average intron size is about 3.6 times larger. This means that, although the longer introns engender larger gene sizes in rice, the average transcriptome size is similar in both species. The G + C content of coding and noncoding regions in rice is higher than in Arabidopsis—the rice coding regions are especially (G + C)-rich. This characteristic is reflected by the biased usage of G/C at the third position of codons within predicted genes (Supplementary Table 1). Buoyant density experiments have shown that rice genes are localized in (G + C)-rich islands that occupy 24% of the genome¹². When we plotted the average G + C values against chromosomal position in chromosome 1, however, we did not detect any CpG islands, indicating a neutral nucleotide distribution. The ratios of physical to genetic distance on the short and the long arms are 214 kb cM^-1 (r² = 0.983) and 288 kb cM^-1 (r² = 0.976), respectively, suggesting that the rate of recombination differs along the two arms of the chromosome.
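The compositional measures discussed above (overall G + C content, G + C at third codon positions, and G + C plotted along the chromosome) can be computed as in the following Python sketch. The function names, the window parameters and the short toy sequence are assumptions made for illustration; they are not taken from the original analysis.

```python
from typing import List, Tuple


def gc_fraction(seq: str) -> float:
    """Fraction of G or C among the unambiguous bases (A, C, G, T) of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    return sum(b in "GC" for b in bases) / len(bases)


def gc3_fraction(cds: str) -> float:
    """G + C fraction at third codon positions of an in-frame coding sequence."""
    return gc_fraction(cds.upper()[2::3])


def windowed_gc(seq: str, window: int, step: int) -> List[Tuple[int, float]]:
    """G + C fraction in successive windows, as (start, fraction) pairs."""
    return [
        (start, gc_fraction(seq[start:start + window]))
        for start in range(0, len(seq) - window + 1, step)
    ]


# Toy in-frame coding sequence (hypothetical, for illustration only).
toy_cds = "ATGGCCGCGTTCTGA"
overall_gc = gc_fraction(toy_cds)
third_position_gc = gc3_fraction(toy_cds)
profile = windowed_gc(toy_cds, window=5, step=5)
print(overall_gc, third_position_gc, profile)
```

On real data the windows would span kilobases rather than a handful of bases, and the window size strongly affects whether local (G + C)-rich regions are visible.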

We compared our finished sequence (493,729 bp from the distal end of the short chromosome arm) with 127,550 indica sequence contigs assembled from the whole-genome shotgun sequences of the Beijing Genomics Institute (BGI, http://btn.genomics.org.cn/rice/) using the japonica sequence as a query for basic BLASTN (basic local alignment search tool) analysis (Fig. 2). We could detect the corresponding indica sequence in about 78% of the whole region. But there were 65 gaps in the aligned contigs, and a total of 110,389 bases (22%) of japonica sequence could not be identified in the indica assembly. This may partly reflect the sequence difference between the two subspecies, although some artefacts in the whole-genome shotgun assembly cannot be ruled out. Among the 96 predicted genes in this region of the completed japonica sequence, 55 genes are intact, 33 genes are partially predicted and 8 genes are not predicted in the corresponding indica draft sequence. Relative identities near the repeat (retrotransposon-like) regions are lower than in the other regions, indicating a misassembly in the sequence.
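The percentages in this comparison follow directly from the counts given above. The Python sketch below recomputes them; the variable names are illustrative. It also shows that the partially predicted and unpredicted genes together make up about 43% of the 96 genes in the region.

```python
# Counts quoted in the text for the compared region.
REGION_BP = 493_729     # finished japonica sequence used as the query
UNMATCHED_BP = 110_389  # japonica bases not found in the indica assembly

# Status of the 96 predicted japonica genes in the indica draft.
gene_status = {"intact": 55, "partial": 33, "not predicted": 8}

unmatched_fraction = UNMATCHED_BP / REGION_BP
matched_fraction = 1.0 - unmatched_fraction

total_genes = sum(gene_status.values())
incomplete_fraction = (
    gene_status["partial"] + gene_status["not predicted"]
) / total_genes

print(f"Matched: {matched_fraction:.0%}, unmatched: {unmatched_fraction:.0%}")
print(f"Incomplete or missing genes: {incomplete_fraction:.0%} of {total_genes}")
```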

**Figure 2: Comparison between the Nipponbare finished sequence and the *indica* draft sequence.**

Direct comparison with the japonica draft sequence could not be made because the sequence data are not in the public domain. But previously, 4,467 genes were predicted from a set of 99 BAC contigs assigned to chromosome 1 (ref. 6). It is likely that an estimated 2,835–4,211 gaps (either 63 gaps per megabase or 10% of 42,109 total gaps) for this chromosome prevented an accurate prediction of the number of genes. Not surprisingly, only half of the genes predicted contain complete coding regions. In addition, no basis was provided for the assignment of genes to chromosomes⁶.

We used an automated annotation system, RiceGAAS¹³, to characterize the gene composition of chromosome 1 (Supplementary Fig. 2, http://RiceGAAS.dna.affrc.go.jp/chromosome1/). The distribution of genes along both arms of the chromosome indicates higher density (18–19 genes per 100 kb) in distal as compared with proximal regions (10–12 genes per 100 kb). This was verified by experimental results obtained by mapping 977 expressed sequence tags onto chromosome 1 (ref. 2). Among the 6,756 predicted genes, 2,073 (31%) were functionally characterized by homology to known proteins using BLASTP, whereas 69% of the predicted genes corresponded to proteins with no known function (Table 2). The protein signature search program InterPro detected protein domains in 3,660 (54%) of the total predicted genes (see http://RiceGAAS.dna.affrc.go.jp/chromosome1/). In particular, 1,170 (33%) of 3,600 hypothetical proteins showed domain homology, suggesting that these proteins may correspond to newly identified proteins in rice. BLASTN analysis was done using the cereal EST entries from the EST database at the National Center for Biotechnology Information (NCBI). Exon regions from all predicted genes were used as queries, and 546,723 unclustered ESTs from wheat, maize, barley and sorghum were searched using a threshold probability value of 10^-5. A total of 2,985 predicted genes, including 756 hypothetical proteins, have cereal homologues. Thus, among the 6,756 predicted genes, 4,803 (71%) show some evidence of homology to a domain, a functional site, a cereal EST or a protein.
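Counting the predicted genes that have a cereal EST match at a threshold of 10^-5 amounts to filtering search hits by their probability (E) value and collecting the distinct query identifiers. The Python sketch below illustrates this under stated assumptions: it assumes the hits are available in the 12-column tab-separated BLAST tabular layout, in which the first column is the query identifier and the eleventh is the E value, and it treats the threshold as inclusive. The original analysis does not state its output format or whether the cut-off was inclusive, and the gene and EST identifiers here are hypothetical.

```python
from typing import Iterable, Set

# Column positions in 12-column BLAST tabular output (assumed input format).
QUERY_COLUMN = 0
EVALUE_COLUMN = 10


def queries_with_hits(tabular_lines: Iterable[str],
                      max_evalue: float = 1e-5) -> Set[str]:
    """Return the query IDs that have at least one hit with E <= max_evalue."""
    matched = set()
    for line in tabular_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        if float(fields[EVALUE_COLUMN]) <= max_evalue:
            matched.add(fields[QUERY_COLUMN])
    return matched


def make_row(query: str, subject: str, evalue: str) -> str:
    """Build one hypothetical tabular row; only query and E value matter here."""
    return "\t".join([query, subject, "90.0", "150", "10", "0",
                      "1", "150", "1", "150", evalue, "100"])


example_rows = [
    make_row("gene_a", "est_1", "1e-50"),
    make_row("gene_b", "est_2", "0.002"),  # fails the 1e-5 threshold
    make_row("gene_c", "est_3", "3e-12"),
]
matched_queries = queries_with_hits(example_rows)
print(sorted(matched_queries))
```

Because each query is counted once regardless of how many ESTs it matches, the result is a count of genes with homologues rather than a count of hits.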

Table 2 Functional classification of the proteins encoded on rice chromosome 1

The predicted proteins found on chromosome 1 were categorized into gene families by BLASTP, using a threshold probability score of 10^-20 over more than 50% of the length of the gene. The most abundant gene family was the serine/threonine receptor kinase family with 132 members distributed along the chromosome (Fig. 3a). A cluster of this gene family was observed at the distal end of the short arm, although some members of the cluster seemed to be pseudogenes. The highest number of tandem repeats detected at a single site was a cluster of ten copies of the hypothetical gene family located on the short arm of chromosome 1. These results are summarized in Fig. 3b, which shows a dot matrix plot of chromosome 1, indicating the predicted genes with significant homology to a given gene. On this plot, which disregards self-homology, a clear diagonal line was obtained, indicating that a significant number of genes are duplicated and arrayed in tandem.
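One way to turn pairwise BLASTP results into gene families is to link any two proteins whose hit passes both criteria given above (a probability score of 10^-20 or better, and an alignment covering more than 50% of the gene) and then take the connected groups. The Python sketch below does this with a union-find structure. The single-linkage grouping, the use of the query length as the denominator for coverage, and all names and example values are assumptions made for illustration; the text does not specify how the families were assembled.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

Hit = Tuple[str, str, float, int]  # (query, subject, E value, aligned length)


def gene_families(hits: Iterable[Hit], lengths: Dict[str, int],
                  max_evalue: float = 1e-20,
                  min_coverage: float = 0.5) -> List[List[str]]:
    """Group genes into families by single linkage over qualifying hits."""
    parent = {gene: gene for gene in lengths}

    def find(gene: str) -> str:
        # Follow parent links to the root, halving the path along the way.
        while parent[gene] != gene:
            parent[gene] = parent[parent[gene]]
            gene = parent[gene]
        return gene

    for query, subject, evalue, aligned_len in hits:
        if query == subject:
            continue  # disregard self-homology
        coverage = aligned_len / lengths[query]
        if evalue <= max_evalue and coverage > min_coverage:
            parent[find(query)] = find(subject)

    groups = defaultdict(list)
    for gene in lengths:
        groups[find(gene)].append(gene)
    # Largest families first; members sorted for a stable result.
    return sorted((sorted(m) for m in groups.values()),
                  key=lambda members: (-len(members), members))


# Hypothetical protein lengths and pairwise hits.
example_lengths = {"g1": 300, "g2": 310, "g3": 290, "g4": 500}
example_hits = [
    ("g1", "g2", 1e-40, 250),  # passes both criteria
    ("g2", "g3", 1e-25, 200),  # passes both criteria
    ("g3", "g4", 1e-30, 100),  # coverage too low
    ("g4", "g1", 1e-10, 400),  # E value too weak
]
families = gene_families(example_hits, example_lengths)
print(families)
```

Single linkage can chain distantly related proteins into one family through intermediates, so stricter grouping rules would give smaller families.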

**Figure 3: Analysis of gene families and gene clusters.**

To determine whether any of the proteins on rice chromosome 1 are not present in Arabidopsis, the 6,756 predicted proteins were queried in BLASTP searches against all the Arabidopsis proteins in the Munich Information Center for Protein Sequence (MIPS) database using a threshold probability score of 10^-5. Among 3,161 positive queries, 824 showed strong similarities (probability value less than 10^-100) to proteins found in Arabidopsis, whereas 3,595 sequences (53%) did not have positive BLASTP hits with predicted Arabidopsis proteins at a probability threshold of 10^-5. Only 27 of these sequences had homology to known proteins and among them, only Bowman–Birk trypsin inhibitor and cytochrome f (chloroplast) were clearly found in rice chromosome 1. This suggests that almost all of the known proteins found in rice chromosome 1 are also found in Arabidopsis. Among the hypothetical proteins, 3,051 genes have no counterpart in Arabidopsis and 442 (15%) genes have grass orthologues. Analysis of the draft sequence also showed that half of the predicted genes have no homologues in Arabidopsis^5,6. Although many of these hypothetical genes could be artefacts resulting from prediction errors, functional characterization of these genes in the future may identify grass-specific or even rice-specific genes.

We also observed rice chloroplast genes in sequential order on the chromosomal DNA. For example, at 149.1 cM we identified 3,564 bp of sequence that matched the rice chloroplast sequence with only a 3-bp difference. This sequence contains three genes¹⁴, PSII cytochrome b₅₅₉, cytochrome f and the chloroplast envelope membrane protein ORF230. We also detected 85 putative transfer RNA genes using tRNAscan-SE¹⁵. Analysis of the retrotransposable elements and DNA intermediate transposons, including miniature inverted-repeat transposable elements (MITEs)¹⁶, using RepeatMasker is given in Table 1 and summarized in Supplementary Fig. 3. MITEs have a tendency to be dispersed along the chromosome, whereas the retrotransposons and other autonomous type DNA-mediated transposable elements are clustered in the pericentromeric region. Among retroelements, Ty3/Gypsy-type elements are the most frequent (2,157), followed by Ty1/Copia-type elements (384). The sum of the lengths of these three repetitive elements is 6.0 Mb, corresponding to 13% of chromosome 1.

There are at least three compelling reasons for obtaining finished high-quality sequence for the complete rice genome: first, the ability to determine gene function is highly dependent on having accurate sequences; second, as a model plant for the cereal grasses, the complete rice sequence will directly affect what can be accomplished with the other cereal grasses; and last, the identification of genes responsible for agronomic traits of economic importance requires precise map-based genomic sequence. Chromosome 1 contains many biologically important genes. More than 20 gene loci have been identified by genetic analysis, including genes controlling dwarfing and fertility. One of these genes, sd1, has been cloned and shown to encode one of the enzymes in gibberellic acid synthesis¹⁷.

The complete genomic sequence of chromosome 1 has yielded several findings that would be observed only using a clone-by-clone sequencing strategy. Gene families comprising active and inactive members and sets of tandemly repeated genes seem to be common features of chromosome 1. This redundancy may account for the unexpectedly large number of predicted genes on this chromosome. The intergenic repetitive fraction of the genome is not well understood and is frequently described as ‘junk’. Repetitive sequences are usually removed or separated from other sequences before whole-genome shotgun assembly because they can cause global misassembly. But we know that functional genes are found in repetitive sequences and that transposable elements embedded in the repetitive sequences can restructure genomes, can control gene action and are likely to be involved in generating some of the allelic variation that has been selected in plants.

In addition, high-quality finished sequence provides the only real opportunity to study gene regulation, because most of the essential regulatory sequences fall outside the transcribed regions and our analysis of a restricted region of the genome showed that 43% of the genes predicted from whole-genome shotgun sequence methods were incomplete. Our results and those from the sequencing of rice chromosome 4 (ref. 18) show clearly the importance of the finished sequence. The IRGSP has an immediate goal of sequencing the rice genome to a minimum standard of the high-throughput genomic sequence (HTG) phase 2 level by the end of 2002 and is committed to a long-term goal of obtaining finished high-quality sequence for the whole genome.

Methods

Chromosome sequencing

We sequenced the whole chromosome 1 of Oryza sativa ssp. japonica, variety Nipponbare, from 390 overlapping PAC/BAC clones. Initially, we constructed a sequence-ready physical map using the RGP Sau3AI PAC and MboI BAC libraries¹⁹. We also used HindIII or EcoRI BAC libraries constructed by Clemson University Genomics Institute (CUGI), and BAC clones with draft sequence data provided by Monsanto for gap filling in particular. We carried out shotgun sequencing of RGP and CUGI PAC/BAC clones to obtain sequence data with tenfold overlap. For Monsanto BAC clones²⁰, we complemented the available draft sequence (fivefold redundancy) with an additional fivefold overlap sequence (http://rgp.dna.affrc.go.jp/genomicdata/seqstrategy/newstrategy.html).

After the initial assembly of sequence data, stretches of poor or ambiguous quality and apparent gap regions were identified for further sequencing to obtain greater than 99.99% sequence accuracy. But despite extensive efforts to improve the sequence quality and to fill the gaps, 4 of the 390 PAC/BAC clones sequenced are still at phase 1 (GenBank, http://www.ncbi.nlm.nih.gov/HTGS/) because the consensus sequence could not be ordered correctly owing to numerous repeats. The remainder comprises 16 phase 2 and 370 phase 3 clones. The nine contigs for chromosome 1 representing the non-overlapping segments of continuous sequence were conjoined by inserting into the gap regions nucleotides that were calculated on the basis of the results of FISH experiments. All of the sequence information of chromosome 1 has been submitted to the DNA Data Bank of Japan (DDBJ, http://www.ddbj.nig.ac.jp/) with the accession number BA000010 (Con Division).
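The clone counts and the accuracy target above can be checked with simple arithmetic. The Python sketch below tallies the clones by HTG phase and converts the 99.99% accuracy target into an upper bound on errors. Applying that bound across the whole 43,276,883 nucleotides of non-overlapping sequence is an illustrative assumption, since the target strictly describes finished sequence; the variable names are also illustrative.

```python
# Clone counts by HTG phase, as quoted in the text.
clones_by_phase = {"phase 1": 4, "phase 2": 16, "phase 3": 370}
total_clones = sum(clones_by_phase.values())
finished_fraction = clones_by_phase["phase 3"] / total_clones

# An accuracy of 99.99% corresponds to at most 1 error per 10,000 bases.
MAX_ERROR_RATE = 1 / 10_000
SEQUENCED_BP = 43_276_883  # non-overlapping sequence of chromosome 1

# Illustrative upper bound if the target held over the whole sequence.
max_expected_errors = SEQUENCED_BP * MAX_ERROR_RATE

print(f"Clones: {total_clones}, phase 3 share: {finished_fraction:.1%}")
print(f"Upper bound on errors: about {max_expected_errors:,.0f}")
```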

Gene prediction and functional classification

We carried out gene prediction using our in-house automated gene prediction system RiceGAAS¹³. The algorithm for gene domain prediction in RiceGAAS was designed by combining several prediction programs including GENSCAN²¹ for maize, GENSCAN²¹ for Arabidopsis, RiceHMM (http://rgp.dna.affrc.go.jp/RiceHMM/index.html) and the exon-finding program MZEF (http://argon.cshl.org/genefinder/), with homology search results from BLASTN and BLASTX (http://www.ncbi.nlm.nih.gov/BLAST/). These results were merged and integrated for gene prediction. Domain search was done using InterPro (http://www.ebi.ac.uk/interpro/scan.html), and repeats were identified using RepeatMasker (http://ftp.genome.washington.edu/cgi-bin/RepeatMasker). The predicted proteins were used to query the nonredundant protein database using BLASTP and categorized according to functional categories defined for Arabidopsis by MIPS (http://mips.gsf.de/cgi-bin/proj/thal/filter_funcat.pl?all) with a threshold probability value of 10^-20.

References

1. Harushima, Y. et al. A high-density rice genetic linkage map with 2275 markers using a single F2 population. Genetics 148, 479–494 (1998).
2. Wu, J. et al. A comprehensive rice transcript map containing 6591 expressed sequence tag sites. Plant Cell 14, 525–535 (2002).
3. Yamamoto, K. & Sasaki, T. Large-scale EST sequencing in rice. Plant Mol. Biol. 35, 135–144 (1997).
4. Sasaki, T. & Burr, B. International rice genome sequencing project: the effort to completely sequence the rice genome. Curr. Opin. Plant Biol. 3, 138–141 (2000).
5. Yu, J. et al. A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science 296, 79–91 (2002).
6. Goff, S. et al. A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Science 296, 92–100 (2002).
7. Cheng, Z. et al. Functional rice centromeres are marked by a satellite repeat and a centromere-specific retrotransposon. Plant Cell 14, 1691–1704 (2002).
8. Dong, F. et al. Rice (Oryza sativa) centromeric regions consist of complex DNA. Proc. Natl Acad. Sci. USA 95, 8135–8140 (1998).
9. Richards, E. J. & Ausubel, F. M. Isolation of a higher eukaryotic telomere from Arabidopsis thaliana. Cell 53, 127–136 (1988).
10. The Arabidopsis Genome Initiative. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408, 796–820 (2000).
11. Cheng, Z. et al. Toward a cytological characterization of the rice genome. Genome Res. 11, 2133–2141 (2001).
12. Barakat, A., Carels, N. & Bernardi, G. The distribution of genes in the genomes of Gramineae. Proc. Natl Acad. Sci. USA 94, 6857–6861 (1997).
13. Sakata, K. et al. RiceGAAS: an automated annotation system and database for rice genome sequence. Nucleic Acids Res. 30, 98–102 (2002).
14. Hiratsuka, J. et al. The complete sequence of rice (Oryza sativa) chloroplast genome: Intermolecular recombination between distinct tRNA genes accounts for a major plastid DNA inversion during the evolution of the cereals. Mol. Gen. Genet. 217, 185–194 (1989).
15. Lowe, T. & Eddy, S. R. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964 (1997).
16. Wessler, S. R., Bureau, T. E. & White, S. E. LTR-retrotransposons and MITEs: important players in the evolution of plant genomes. Curr. Opin. Genet. Dev. 5, 814–821 (1995).
17. Sasaki, A. et al. A mutant gibberellin-synthesis gene in rice. Nature 416, 701–702 (2002).
18. Feng, Q. et al. Sequence and analysis of rice chromosome 4. Nature this issue.
19. Baba, T. et al. Construction and characterization of rice genome libraries: PAC library of japonica variety, Nipponbare, and BAC library of indica variety, Kasalath. Bull. Natl. Inst. Agrobiol. Resour. (Japan) 14, 41–52 (2000).
20. Barry, G. The use of the Monsanto draft rice genome sequence in research. Plant Physiol. 125, 1164–1165 (2001).
21. Burge, C. & Karlin, S. Prediction of complete gene structures in human genomic DNA. J. Mol. Biol. 268, 78–94 (1997).

Acknowledgements

We thank Monsanto for the BAC contig information, BAC clones and their sequence data; R. Wing of Clemson University Genomics Institute and Novartis for the rice Nipponbare BAC library and its fingerprint data, respectively; M. Hattori for technical assistance; B. Burr and F. Burr for critically reading the manuscript; T. Slezak for comments; and K. Eguchi for encouragement.

Author information

Authors and Affiliations

Rice Genome Research Program, National Institute of Agrobiological Sciences, and Institute of the Society for Techno-innovation of Agriculture, Forestry and Fisheries, 1-2, Kannondai 2-chome, Tsukuba, Ibaraki, 305-8602, Japan
Takuji Sasaki, Takashi Matsumoto, Kimiko Yamamoto, Katsumi Sakata, Tomoya Baba, Yuichi Katayose, Jianzhong Wu, Yoshiaki Nagamura, Baltazar A. Antonio, Hiroyuki Kanamori, Satomi Hosokawa, Masatoshi Masukawa, Koji Arikawa, Yoshino Chiden, Mika Hayashi, Masako Okamoto, Tsuyu Ando, Hiroyoshi Aoki, Kohei Arita, Masao Hamada, Chizuko Harada, Saori Hijishita, Mikiko Honda, Yoko Ichikawa, Atsuko Idonuma, Masumi Iijima, Michiko Ikeda, Maiko Ikeno, Sachie Ito, Tomoko Ito, Yuichi Ito, Yukiyo Ito, Aki Iwabuchi, Kozue Kamiya, Wataru Karasawa, Satoshi Katagiri, Ari Kikuta, Noriko Kobayashi, Izumi Kono, Kayo Machita, Tomoko Maehara, Hiroshi Mizuno, Tatsumi Mizubayashi, Yoshiyuki Mukai, Hideki Nagasaki, Marina Nakashima, Yuko Nakama, Yumi Nakamichi, Mari Nakamura, Nobukazu Namiki, Manami Negishi, Isamu Ohta, Nozomi Ono, Shoko Saji, Kumiko Sakai, Michie Shibata, Takanori Shimokawa, Ayahiko Shomura, Jianyu Song, Yuka Takazaki, Kimihiro Terasawa, Kumiko Tsuji, Kazunori Waki, Harumi Yamagata, Hiroko Yamane, Shoji Yoshiki, Rie Yoshihara, Kazuko Yukawa, Huisun Zhong & Masahiro Yano
Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, 411-8540, Japan
Yoshihito Niimura, Hisakazu Iwama & Takashi Gojobori
Department of Horticulture, University of Wisconsin-Madison, Madison, Wisconsin, 53706, USA
Zhukuan Cheng & Jiming Jiang
Department of Bioinformatics, Tokyo Medical and Dental University, 1-5-45 Yushima, Bunkyo-ku, Tokyo, 113-8510, Japan
Toshinori Endo & Hidetaka Ito
Rice Genome Sequencing Project, National Institute of Agricultural Science and Technology, RDA, 249 Seodun-dong, Suwon, 441-707, Korea
Jang Ho Hahn, Ho-Il Kim & Moo-Young Eun

Authors

Takuji Sasaki
View author publications
You can also search for this author in PubMed Google Scholar
Takashi Matsumoto
View author publications
You can also search for this author in PubMed Google Scholar
Kimiko Yamamoto
View author publications
You can also search for this author in PubMed Google Scholar
Katsumi Sakata
View author publications
You can also search for this author in PubMed Google Scholar
Tomoya Baba
View author publications
You can also search for this author in PubMed Google Scholar
Yuichi Katayose
View author publications
You can also search for this author in PubMed Google Scholar
Jianzhong Wu
View author publications
You can also search for this author in PubMed Google Scholar
Yoshihito Niimura
View author publications
You can also search for this author in PubMed Google Scholar
Zhukuan Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Yoshiaki Nagamura
View author publications
You can also search for this author in PubMed Google Scholar
Baltazar A. Antonio
View author publications
You can also search for this author in PubMed Google Scholar
Hiroyuki Kanamori
View author publications
You can also search for this author in PubMed Google Scholar
Satomi Hosokawa
View author publications
You can also search for this author in PubMed Google Scholar
Masatoshi Masukawa
View author publications
You can also search for this author in PubMed Google Scholar
Koji Arikawa
View author publications
You can also search for this author in PubMed Google Scholar
Yoshino Chiden
View author publications
You can also search for this author in PubMed Google Scholar
Mika Hayashi
View author publications
You can also search for this author in PubMed Google Scholar
Masako Okamoto
View author publications
You can also search for this author in PubMed Google Scholar
Tsuyu Ando
View author publications
You can also search for this author in PubMed Google Scholar
Hiroyoshi Aoki
View author publications
You can also search for this author in PubMed Google Scholar
Kohei Arita
View author publications
You can also search for this author in PubMed Google Scholar
Masao Hamada
View author publications
You can also search for this author in PubMed Google Scholar
Chizuko Harada
View author publications
You can also search for this author in PubMed Google Scholar
Saori Hijishita
View author publications
You can also search for this author in PubMed Google Scholar
Mikiko Honda
View author publications
You can also search for this author in PubMed Google Scholar
Yoko Ichikawa
View author publications
You can also search for this author in PubMed Google Scholar
Atsuko Idonuma
View author publications
You can also search for this author in PubMed Google Scholar
Masumi Iijima
View author publications
You can also search for this author in PubMed Google Scholar
Michiko Ikeda
View author publications
You can also search for this author in PubMed Google Scholar
Maiko Ikeno
View author publications
You can also search for this author in PubMed Google Scholar
Sachie Ito
View author publications
You can also search for this author in PubMed Google Scholar
Tomoko Ito
View author publications
You can also search for this author in PubMed Google Scholar
Yuichi Ito
View author publications
You can also search for this author in PubMed Google Scholar
Yukiyo Ito
View author publications
You can also search for this author in PubMed Google Scholar
Aki Iwabuchi
View author publications
You can also search for this author in PubMed Google Scholar
Kozue Kamiya
View author publications
You can also search for this author in PubMed Google Scholar
Wataru Karasawa
View author publications
You can also search for this author in PubMed Google Scholar
Satoshi Katagiri
View author publications
You can also search for this author in PubMed Google Scholar
Ari Kikuta
View author publications
You can also search for this author in PubMed Google Scholar
Noriko Kobayashi
View author publications
You can also search for this author in PubMed Google Scholar
Izumi Kono
View author publications
You can also search for this author in PubMed Google Scholar
Kayo Machita
View author publications
You can also search for this author in PubMed Google Scholar
Tomoko Maehara
View author publications
You can also search for this author in PubMed Google Scholar
Hiroshi Mizuno
View author publications
You can also search for this author in PubMed Google Scholar
Tatsumi Mizubayashi
View author publications
You can also search for this author in PubMed Google Scholar
Yoshiyuki Mukai
View author publications
You can also search for this author in PubMed Google Scholar
Hideki Nagasaki
View author publications
You can also search for this author in PubMed Google Scholar
Marina Nakashima
View author publications
You can also search for this author in PubMed Google Scholar
Yuko Nakama
Yumi Nakamichi
Mari Nakamura
Nobukazu Namiki
Manami Negishi
Isamu Ohta
Nozomi Ono
Shoko Saji
Kumiko Sakai
Michie Shibata
Takanori Shimokawa
Ayahiko Shomura
Jianyu Song
Yuka Takazaki
Kimihiro Terasawa
Kumiko Tsuji
Kazunori Waki
Harumi Yamagata
Hiroko Yamane
Shoji Yoshiki
Rie Yoshihara
Kazuko Yukawa
Huisun Zhong
Hisakazu Iwama
Toshinori Endo
Hidetaka Ito
Jang Ho Hahn
Ho-Il Kim
Moo-Young Eun
Masahiro Yano
Jiming Jiang
Takashi Gojobori

Corresponding author

Correspondence to Takuji Sasaki.

Ethics declarations

Competing interests

The authors declare that they have no competing financial interests.

Supplementary information

Supplementary Figure 1 (PDF 2226 kb)

Supplementary Figure 2 (PDF 4047 kb)

Supplementary Figure 3 (PDF 238 kb)

Supplementary Figure Legend (DOC 21 kb)

Supplementary Table 1-1 (XLS 11 kb)

Supplementary Table 1-2 (XLS 11 kb)

Supplementary Table 1-3 (XLS 23 kb)

Rights and permissions

This article is distributed under the terms of the Creative Commons Attribution-Non-Commercial-Share Alike licence (http://creativecommons.org/licenses/by-nc-sa/3.0/), which permits distribution and reproduction in any medium, provided the original author and source are credited. This licence does not permit commercial exploitation, and derivative works must be licensed under the same or similar licence.

About this article

Cite this article

Sasaki, T., Matsumoto, T., Yamamoto, K. et al. The genome sequence and structure of rice chromosome 1. Nature 420, 312–316 (2002). https://doi.org/10.1038/nature01184

Received: 04 April 2002
Accepted: 19 September 2002
Issue Date: 21 November 2002
DOI: https://doi.org/10.1038/nature01184
