Genome sequencing and analysis of Aspergillus oryzae

Machida, Masayuki; Asai, Kiyoshi; Sano, Motoaki; Tanaka, Toshihiro; Kumagai, Toshitaka; Terai, Goro; Kusumoto, Ken-Ichi; Arima, Toshihide; Akita, Osamu; Kashiwagi, Yutaka; Abe, Keietsu; Gomi, Katsuya; Horiuchi, Hiroyuki; Kitamoto, Katsuhiko; Kobayashi, Tetsuo; Takeuchi, Michio; Denning, David W.; Galagan, James E.; Nierman, William C.; Yu, Jiujiang; Archer, David B.; Bennett, Joan W.; Bhatnagar, Deepak; Cleveland, Thomas E.; Fedorova, Natalie D.; Gotoh, Osamu; Horikawa, Hiroshi; Hosoyama, Akira; Ichinomiya, Masayuki; Igarashi, Rie; Iwashita, Kazuhiro; Juvvadi, Praveen Rao; Kato, Masashi; Kato, Yumiko; Kin, Taishin; Kokubun, Akira; Maeda, Hiroshi; Maeyama, Noriko; Maruyama, Jun-ichi; Nagasaki, Hideki; Nakajima, Tasuku; Oda, Ken; Okada, Kinya; Paulsen, Ian; Sakamoto, Kazutoshi; Sawano, Toshihiko; Takahashi, Mikio; Takase, Kumiko; Terabayashi, Yasunobu; Wortman, Jennifer R.; Yamada, Osamu; Yamagata, Youhei; Anazawa, Hideharu; Hata, Yoji; Koide, Yoshinao; Komori, Takashi; Koyama, Yasuji; Minetoki, Toshitaka; Suharnan, Sivasundaram; Tanaka, Akimitsu; Isono, Katsumi; Kuhara, Satoru; Ogasawara, Naotake; Kikuchi, Hisashi

doi:10.1038/nature04300

Letter
Published: 22 December 2005

Genome sequencing and analysis of Aspergillus oryzae

Masayuki Machida¹,
Kiyoshi Asai²,
Motoaki Sano¹,
Toshihiro Tanaka³,
Toshitaka Kumagai²,
Goro Terai^2,20,
Ken-Ichi Kusumoto⁴,
Toshihide Arima⁵,
Osamu Akita⁵,
Yutaka Kashiwagi⁴,
Keietsu Abe⁶,
Katsuya Gomi⁶,
Hiroyuki Horiuchi⁷,
Katsuhiko Kitamoto⁷,
Tetsuo Kobayashi⁸,
Michio Takeuchi⁹,
David W. Denning¹⁰,
James E. Galagan¹¹,
William C. Nierman^12,13,
Jiujiang Yu¹⁴,
David B. Archer¹⁵,
Joan W. Bennett¹⁶,
Deepak Bhatnagar¹⁴,
Thomas E. Cleveland¹⁴,
Natalie D. Fedorova¹²,
Osamu Gotoh²,
Hiroshi Horikawa³,
Akira Hosoyama³,
Masayuki Ichinomiya⁷,
Rie Igarashi³,
Kazuhiro Iwashita⁵,
Praveen Rao Juvvadi⁷,
Masashi Kato⁸,
Yumiko Kato³,
Taishin Kin²,
Akira Kokubun³,
Hiroshi Maeda⁶,
Noriko Maeyama³,
Jun-ichi Maruyama⁷,
Hideki Nagasaki²,
Tasuku Nakajima⁶,
Ken Oda⁵,
Kinya Okada²,
Ian Paulsen¹²,
Kazutoshi Sakamoto⁵,
Toshihiko Sawano³,
Mikio Takahashi³,
Kumiko Takase¹,
Yasunobu Terabayashi¹,
Jennifer R. Wortman¹²,
Osamu Yamada⁵,
Youhei Yamagata⁶,
Hideharu Anazawa¹⁷,
Yoji Hata¹⁸,
Yoshinao Koide¹⁹,
Takashi Komori²⁰,
Yasuji Koyama²¹,
Toshitaka Minetoki²²,
Sivasundaram Suharnan²³,
Akimitsu Tanaka²⁴,
Katsumi Isono³,
Satoru Kuhara²⁵,
Naotake Ogasawara²⁶ &
…
Hisashi Kikuchi³

Nature volume 438, pages 1157–1161 (2005)Cite this article

35k Accesses
964 Citations
14 Altmetric
Metrics details

Abstract

The genome of Aspergillus oryzae, a fungus important for the production of traditional fermented foods and beverages in Japan, has been sequenced. The ability to secrete large amounts of proteins and the development of a transformation system¹ have facilitated the use of A. oryzae in modern biotechnology^2,3,4. Although both A. oryzae and Aspergillus flavus belong to the section Flavi of the subgenus Circumdati of Aspergillus, A. oryzae, unlike A. flavus, does not produce aflatoxin, and its long history of use in the food industry has proved its safety. Here we show that the 37-megabase (Mb) genome of A. oryzae contains 12,074 genes and is expanded by 7–9 Mb in comparison with the genomes of Aspergillus nidulans⁵ and Aspergillus fumigatus⁶. Comparison of the three aspergilli species revealed the presence of syntenic blocks and A. oryzae-specific blocks (lacking synteny with A. nidulans and A. fumigatus) in a mosaic manner throughout the genome of A. oryzae. The blocks of A. oryzae-specific sequence are enriched for genes involved in metabolism, particularly those for the synthesis of secondary metabolites. Specific expansion of genes for secretory hydrolytic enzymes, amino acid metabolism and amino acid/sugar uptake transporters supports the idea that A. oryzae is an ideal microorganism for fermentation.

You have full access to this article via your institution.

Download PDF

The genome and population genomics of allopolyploid Coffea arabica reveal the diversification history of modern coffee cultivars

Article Open access 15 April 2024

Jarkko Salojärvi, Aditi Rambani, … Patrick Descombes

Elucidation of genes enhancing natural product biosynthesis through co-evolution analysis

Article 12 April 2024

Xinran Wang, Ningxin Chen, … Xiaozhou Luo

A pan-genome of 69 Arabidopsis thaliana accessions reveals a conserved genome structure throughout the global species range

Article Open access 11 April 2024

Qichao Lian, Bruno Huettel, … Raphael Mercier

Main

Sequencing of the A. oryzae genome was accomplished using the whole-genome shotgun (WGS) approach. The 37-Mb genome was predicted to contain a total of 12,074 genes encoding proteins with a length greater than 100 amino acid residues (see Methods). The genome was confirmed to comprise eight chromosomes (chromosomes 1–8 in decreasing size), the assignment of which is different from a previous report⁷ (Supplementary Table S1 and schematic drawing in Supplementary Fig. S1). Interestingly, the A. oryzae genome contained numerous stretches (1,750) of (A + T)-rich sequence (that is, >90% A + T composition in 50 nucleotides or longer), 6–9 times more than for A. fumigatus (197) and A. nidulans (308).

The A. oryzae genome is larger than those of A. fumigatus and A. nidulans by approximately 34% and 29%, respectively. Syntenic analysis of the three aspergilli revealed the presence of syntenic blocks and A. oryzae-specific blocks of sequence (lacking synteny with the two other aspergilli) in a mosaic manner throughout the A. oryzae genome (Fig. 1). Phylogenetic analysis of the three aspergilli using the whole-genome data showed that A. nidulans branched off earlier than A. oryzae and A. fumigatus⁵. Thus, the increase in genome size seems to be due to an A. oryzae lineage-specific acquisition of sequence, rather than loss of sequence in A. nidulans and A. fumigatus. If, on the other hand, A. nidulans and A. fumigatus are assumed to have lost 7–9 Mb of sequence after branching off from their A. oryzae-like ancestor, a greater proportion of syntenic blocks would be conserved between each of them and A. oryzae than between the two. However, we observed an almost equal proportion of syntenic blocks in the three species. This suggests that the genome size differences are largely due to sequence acquisition in A. oryzae. The expansion in genome size appears to be characteristic of the organisms closely related to A. oryzae, as the estimated genome size of its close relatives A. flavus (W. Nierman, personal communication) and Aspergillus niger⁸ is comparable to that of A. oryzae.

Figure 1: **Distribution and expression of the genes on chromosome 1.**

Using the cluster of orthologous group (COG)⁹ classification, most of the gene family expansion in the A. oryzae genome as compared to A. fumigatus was found to have occurred in those predicted to have roles in metabolism (C to Q), of which those for secondary metabolism (Q) are most significantly increased (Fig. 2; see also Supplementary Table S2). No significant differences were observed in the number of genes for any other COG category in comparison with A. nidulans and A. fumigatus, except for the genes involved in defence mechanisms (V) and extracellular structures (M).

Figure 2: **Comparison of relative gene numbers for each COG.**

These secondary metabolism genes are enriched in regions lacking synteny with either A. fumigatus or A. nidulans (P = 9.8 × 10^-32, see Fig. 1 for chromosome 1 and Supplementary Fig. S2 for all eight chromosomes), and the genes having expressed sequence tags (ESTs) are considerably enriched in the syntenic regions (P = 4.1 × 10^-134). Many more cytochrome P450 genes were observed in A. oryzae (149) compared with A. nidulans (102) and A. fumigatus (65) (Table 1). Of the polyketide synthase (PKS) genes, a specific expansion of WA-like PKS genes was observed (Supplementary Table S3). In addition, the A. oryzae genome contained a variety of homologues of trichothecene hydroxylases, isotrichodermin hydroxylases and trichodiene oxygenases, as well as pisatin demethylases that are used by plant pathogenic fungi (for example, Nectria haematococca, Fusarium spp.) for detoxification of antimicrobial agents¹⁰. This is consistent with the close phylogenetic relationship of A. oryzae with the opportunistic plant pathogen A. flavus.

Table 1 Redundancy of the cytochrome P450 genes in aspergilli

Full size table

Although genes predicted to be involved in the aflatoxin synthetic pathway are present in A. oryzae, no ESTs of these genes were detected except for aflJ and norA (Akao, T. et al., unpublished data), whereas ESTs for all 25 of the aflatoxin pathway genes were found in A. flavus¹¹. A. oryzae might have been selected as a non-toxigenic strain either during the long history of its industrial use or from the beginning.

In A. oryzae, all of the COG categories related to metabolism show an expansion of gene content (Fig. 2), the highest increase of which was observed for those involved in phenylalanine/tryptophan degradation (2 and 6 in Supplementary Fig. S3) and toluene/m-cresol/p-cymene degradation (9, 11 and 12 in Supplementary Fig. S3). This was based on the analysis using the Saccharomyces cerevisiae metabolic map as a reference. BAT1 and BAT2, which contribute to the metabolism of hydrophobic amino acids lysine and serine, are also over-represented (see Supplementary Fig. S4 for the entire metabolic pathways). There is also a significant expansion in the ATP-binding cassette (ABC), the amino acid-polyamine-organocation (APC) and the major facilitator superfamily (MFS) transporter genes (Supplementary Table S4), which are concerned with multidrug resistance, transport of amino acids and transport of sugars, respectively.

Within the koji culture, A. oryzae grows on the surface of solid material such as steamed rice or ground soybean, where amino acids and sugars are deficient at the beginning. The need for A. oryzae to get access to external nitrogen sources effectively and to degrade proteins and starches seems consistent with the observed expansion of the metabolism and transporter-related gene families. Judging from the EST data, the genes for alcohol dehydrogenase, pyruvate decarboxylase and sugar transporters are typical examples of the A. oryzae genes that are transcribed most strongly (Akao, T. et al., unpublished data). The strong expression of such genes might also have been enhanced through various adaptations¹² during the course of domestication.

Aspergilli possess more sensor histidine kinases (13–15) than S. cerevisiae (1) and Schizosaccharomyces pombe (3), whereas histidine-containing phosphotransfer factors and response regulators are found in similar numbers. Aspergillus histidine kinases are classified into nine families (HK1–9), of which the HK8 orthologue is absent in Neurospora crassa and the sequenced plant pathogens Cochliobolus heterostrophus, Gibberella moniliformis, Fusarium graminearum and Magnaporthe grisea. Whereas A. fumigatus, A. nidulans, N. crassa and the plant pathogens possess a single HK6 gene (Nik-1 in N. crassa) that is essential for growth in high osmotic pressure, A. oryzae has two additional homologues. Continuous culturing under high osmolarity conditions (possibly through koji cultures) may have led to A. oryzae acquiring the additional Nik-1 homologues. There are three MAPKKs and MAPKKKs in the genomes of the three Aspergillus species and N. crassa. However, whereas A. nidulans and A. fumigatus possess four MAPKs and A. oryzae five, N. crassa, F. graminearum and M. grisea possess only three. Thus, A. oryzae may possess the most complex signal transduction cascade among the four filamentous fungi.

A. oryzae has the largest expansion of hydrolytic genes among the three aspergilli (Supplementary Table S5). The genomes of A. oryzae, A. fumigatus and A. nidulans contain 135, 99 and 90 secreted proteinase genes, respectively, which constitute roughly 1% of the total genes in each genome (Supplementary Table S6). All of the proteinase genes found in A. fumigatus and/or A. nidulans have orthologues in A. oryzae except for the one encoding aminopeptidase. On the other hand, several A. oryzae proteinase genes are missing in A. fumigatus and A. nidulans. Similarly, A. oryzae possesses more secretory proteinase genes that function in acidic pH, including aspartic proteinase, pepstatin-insensitive proteinase, serine type carboxypeptidase and aorsin (Supplementary Table S6). These increases may reflect A. oryzae's adaptation to acidic pH during the course of its domestication.

The phylogenetic tree of secretory aspartic proteinases from the three aspergilli genomes (Fig. 3) shows six homologous clusters (yellow boxes) distributed on all chromosomes other than chromosome 7. Their features, including intron conservation, are similar to each other except for cluster 4, which shows the highest diversity. Each cluster contains four member genes (blue boxes), namely three orthologues from each Aspergillus species and an extra A. oryzae-specific homologue. All of the extra A. oryzae homologues are located in the A. oryzae-specific regions, whereas the orthologous clusters are located in the common regions, except for AO070319000053 of cluster 4. It is interesting to note that the clustering feature of the orthologues and extra homologues for aspartic proteinases is also conserved with the genes for carboxypeptidases (Supplementary Fig. S5a) and metalloproteinases. In contrast, the number of genes encoding intracellular enzymes (Supplementary Fig. S5b), including serine proteinases, is consistent in the three aspergilli. A similar expansion pattern was also observed for the genes for maltases (Supplementary Fig. S5c) and extracellular α-glucosidases. Besides the secretory hydrolases, some metabolic genes, including those in glucose fermentation and lysine biosynthesis, showed a similar gene expansion pattern (Supplementary Fig. S6).

Figure 3: **Phylogenetic analysis of aspartic proteinases.**

It is well known that A. oryzae has three α-amylase genes (taka-amylase genes: amyA, amyB and amyC)¹³ that have almost identical nucleotide sequences with only one and two mismatches in the 5′-flanking and coding regions, respectively. The amyA gene has a transposon-like element at its 5′-flanking region, and the amyB and amyC genes have highly similar nucleotide sequences spanning approximately 5 kilobases (kb), including an incomplete transposon sequence at their 5′-flanking region. Phylogenetic analysis supports gene duplication to account for the expansion of the three α-amylase genes after A. oryzae branched off from the other two Aspergillus species (Supplementary Fig. S5d)—this is in clear contrast to the mode of gene expansion for the secretory proteinases mentioned above.

In contrast to the overall increase in the number of proteinases, A. oryzae has fewer glycosyl hydrolases with a cellulose-binding domain (five genes) or a starch-binding domain (glaA¹⁴) to digest insoluble cellulose or raw and granular starch, respectively (Supplementary Table S5). Apparently, no additional enzymes for accessing carbohydrates are required during fermentation in contrast to those, including knottins, found in A. fumigatus, which seems appropriate for its ecological niche of rotting vegetable matter.

Protein folding in the endoplasmic reticulum is assisted by chaperones (for example, BiP, calnexin) and foldases (three protein-disulphide isomerase family proteins and a peptidyl-prolyl cis–trans isomerase). As in other fungi, however, there is no calreticulin homologue (Supplementary Table S7). Major secretory component genes, which alter the efficiency of protein secretion, were identified in all three aspergilli with an exception of the A. fumigatus SSS1 homologue (Supplementary Table S8).

In comparison to the common regions, the A. oryzae-specific regions contained 1.7 times lower density of genes homologous to those in eukaryotes other than A. fumigatus and A. nidulans. In a search for bacterial homologues, we found two genes (AO070319000101 and AO070319000102) in an A. oryzae-specific region with highest sequence similarity to those of Agrobacterium tumefaciens (AGR_L_1864 (biotin carboxylase) and AGR_L_1866, hypothetical protein genes with E-values of 0.0 and 1 × 10^-119, respectively). Because the two genes are adjacently located in both A. oryzae and A. tumefaciens (Supplementary Fig. S7a), and the two A. oryzae genes reside in a ‘bacterial cluster’ (Supplementary Fig. S7b), they are suggested to have been laterally transferred.

The expansion of A. oryzae-specific homologues might be the result of genome-wide duplication, as observed in yeast. The speciation of Aspergillus was estimated to have taken place approximately 20 million years ago¹⁵ and was later than the whole-genome duplication event in yeast, which was estimated to have taken place 150 million years ago¹⁶. We were unable to observe any extended stretch of region within the A. oryzae genome that showed a certain degree of similarity to another stretch of region despite the fact that we observed synteny among the three aspergilli (Fig. 1) and that segmentally duplicated stretches were detected by the same method within the S. cerevisiae genome. Thus, the increase in the genome size of A. oryzae relative to A. fumigatus and A. nidulans does not appear to be due to chromosomal duplication. The large segmental duplication, if any, must have taken place much earlier than the separation of the three aspergilli, and the similarity between the duplicated regions might have been completely lost by extensive sequence alterations and rearrangements. However, if the three aspergilli had a common ancestor possessing the expanded gene families found in A. oryzae, both A. nidulans and A. fumigatus must independently have lost approximately 3,000 genes in common with the putative common ancestor.

The mosaic structure of the genome, considered to be evidence for horizontal gene transfer¹⁷, was found by synteny analysis of the A. oryzae genome and was further characterized by the localization of the EST expression (see above) of non-metabolic genes (P = 1.78 × 10^-95 and 1.32 × 10^-51 for information and storage (J to B) and cellular function and signalling (D to O), respectively) and the genes of high codon adaptation index (top 5% genes, P = 9.8 × 10^-28). The phylogenetic distance between the genes in the orthologous cluster and the A. oryzae-specific ones was similar to that between the genes of Aspergillus and the other genera belonging to Sordariomycetes. The statistical analysis by ref. 18 of some A. oryzae-specific homologues of aspartic proteinase, carboxypeptidase, maltase, pyruvate decarboxylase and lysine-ketoglutarate reductase/saccharopine dehydrogenase showed P-values of between 0.000 and 0.004. The results indicated phylogenetic inconsistency of these genes. These results, together with the above discussion, imply that the A. oryzae-specific genes have been transferred by a similar mechanism observed for an asexual pathogenic fungus¹⁹, in which chromosomes are transferred between genetically isolated clonal lines. It has been reported that yeast chromosomes are rearranged frequently under starved culture conditions and that (A + T)-rich sequences or transfer RNA often mediate such rearrangements²⁰. Our EST analysis shows that the expression profile in solid-state cultivation is similar to that observed when a carbon source is omitted²¹. These reports suggest that the acquired foreign DNA has been rearranged in a short period of time by large-scale solid-state cultivation since A. oryzae was domesticated from an ancestor of A. flavus²².

It is tempting to speculate that the gene expansion of A. oryzae is explained by horizontal gene transfer; however, at this moment we cannot exclude the possibility of massive gene loss in the two other Aspergillus species. Future comparative analyses with more closely related species would provide more insight into the scenario of the genome evolution of A. oryzae, including that which occurred during the centuries of domestic cultivation.

Methods

Strain and DNA preparation

Aspergillus oryzae RIB40 (National Research Institute of Brewing Stock Culture and ATCC-42149) was used as the DNA donor. Genomic DNA preparation and removal of mitochondrial DNA was performed as described by refs 23 and 24, respectively.

Genome sequencing

The genome sequencing of A. oryzae was accomplished using the WGS approach by accumulating raw sequence reads of approximately ×9 depth of coverage. Contigs generated were mapped by Southern hybridization onto chromosomes separated by PFGE. Linkage between contigs was analysed by fingerprinting and PCR methods. Sequence assembly was validated with high-density end sequences of bacterial artificial chromosome (BAC) and cosmid clones and by Optical Mapping (OpGen). See Supplementary Information for details.

Gene prediction and annotation

Genes were predicted in the A. oryzae genome based on the homologies to known genes in the public database, ESTs of A. oryzae and A. flavus, and the statistical features of the genes by applying a combination of gene-finding software. Transfer RNAs were identified using tRNAScan-SE²⁵. Repeated sequences were detected using RepeatMasker (Smit, A. F. A. and Green, P., http://ftp.genome.washington.edu/RM/RepeatMasker.html). The homologues of the proteins of aspergilli, N. crassa, M. grisea, Gibberella zeae, Penicillium and Paecilomyces are searched for by running BlastX with a threshold value of E ≤ 1 × 10^-10. The resultant candidates of homologues are evaluated by ALN²⁶, which predicts the precise gene structures by aligning the Blast hits and the protein sequences. ALN takes into account frameshift errors, coding potentials and signals for translational initiation, termination and splicing. Of the 6,586 genes thus predicted by ALN, 489 highly reliable genes were adopted into a learning set for GeneDecoder²⁷ and GlimmerM²⁸ software that work based on the statistic features of genes. GeneDecoder also integrates the information for splice sites provided by the ESTs, which are aligned with the genome sequence by SIM4 (ref. 29). Fivefold cross-validation of the gene finders trained by the above data set showed sensitivity/specificity for the exon prediction of 0.74/0.53 and 0.66/0.59 for GeneDecoder and GlimmerM, respectively, and those for coding sequences of 0.93/0.90 and 0.92/098. Genes partially supported by ESTs were predicted by GeneDecoder and those without any support by the known genes or ESTs were predicted by GlimmerM. The numbers of genes predicted by ALN, GlimmerM and GeneDecoder were 5,367, 6,983 and 1,713, respectively. All of the predicted protein-coding genes were annotated by searching against the COG database⁹ using BlastP, followed by manual corrections.

Synteny analysis

Orthologues between A. oryzae and either A. nidulans or A. fumigatus were identified using the best bi-directional hit method (BlastP with a bit score greater than 200). In addition, putative homologous regions between the species were identified by TBlastX with a bit score greater than 100. Orthologues and homologous regions between the contigs of two species were aligned to make a contiguous block, until no orthologues or homologous regions were found within the range of 10 kb. A region of conserved synteny was defined as the longest contiguous block that contained at least one orthologue and one additional orthologue or homologous region.

COG analysis

The number of genes for each COG category was analysed by a BlastP search using the amino acid sequences in the COG set⁹ with the bit score of ≥60.

Gene localization

Distribution of all predicted genes and the genes with ESTs that were obtained from mycelia grown in either liquid-rich medium, liquid-starved medium or solid-state cultivation (Akao, T. et al., unpublished data) were analysed by counting the corresponding genes in a 5-kb window. Distributions of non-metabolic genes, secondary metabolism genes, extra A. oryzae-specific homologues that have homology (bit score ≥100) to orthologues identified by best bi-directional match between A. oryzae and either A. fumigatus or A. nidulans, as well as A. oryzae-specific genes without homology to either A. fumigatus or A. nidulans genes (bit score <100) were analysed in the same way by applying a window size of 15 kb.

Statistical analyses

Localization of the secondary metabolism genes at the A. oryzae-specific regions was evaluated by the one-tailed P-value based on the binomial distribution with the sample size of 413. Localization of the genes with EST expression, non-metabolic genes and the top 5% of genes with a high CAI value at the syntenic regions was evaluated in the same way with sample sizes of 33,77, 1,839 and 703, respectively. The analyses were performed when A. oryzae-specific regions were detected by comparing the A. oryzae and A. fumigatus genomes. The phylogenetic inconsistency was statistically analysed by the method described in ref. 18 using data sets consisting of the genes of the three aspergilli and three species belonging to Sordariomycetes or Eurotiomycetes other than Aspergillus. The reference and test data sets included the A. oryzae gene in the orthologous cluster and the extra A. oryzae-specific homologue, respectively.

References

Gomi, K., Iimura, Y. & Hara, S. Integrative transformation of Aspergillus oryzae with a plasmid containing the Aspergillus nidulans argB gene. Agric. Biol. Chem. 51, 2549–2555 (1987)
CAS Google Scholar
Christensen, T. et al. High level expression of recombinant genes in Aspergillus oryzae. Bio/Technology 6, 1419–1422 (1988)
CAS Google Scholar
Ward, P. P. et al. Production of biologically active recombinant human lactoferrin in Aspergillus oryzae. Bio/Technology 10, 784–789 (1992)
CAS PubMed Google Scholar
Tsuchiya, K. et al. High level expression of the synthetic human lysozyme gene in Aspergillus oryzae. Appl. Microbiol. Biotechnol. 38, 109–114 (1992)
Article CAS Google Scholar
Galagan, J. E. et al. Sequencing of Aspergillus nidulans and comparative analysis with A. fumigatus and A. oryzae. Nature doi:10.1038/nature04341 (this issue)
Nierman, W. et al. Genomic sequence of the pathogenic and allergenic filamentous fungus Aspergillus fumigatus. Nature doi:10.1038/nature04332 (this issue)
Kitamoto, K., Kimura, K., Gomi, K. & Kumagai, C. Electrophoretic karyotype and gene assignment to chromosomes of Aspergillus oryzae. Biosci. Biotechnol. Biochem. 58, 1467–1470 (1994)
Article CAS Google Scholar
Archer, D. B. & Dyer, P. S. From genomics to post-genomics in Aspergillus. Curr. Opin. Microbiol. 7, 499–504 (2004)
Article CAS Google Scholar
Tatusov, R. L. et al. The COG database: an updated version includes eukaryotes. BMC Bioinformatics 4, 41 (2003)
Article Google Scholar
van den Brink, H. M., van Gorcom, R. F., van den Hondel, C. A. & Punt, P. J. Cytochrome P450 enzyme systems in fungi. Fungal Genet. Biol. 23, 1–17 (1998)
Article CAS Google Scholar
Yu, J., Whitelaw, C. A., Nierman, W. C., Bhatnagar, D. & Cleveland, T. E. Aspergillus flavus expressed sequence tags for identification of genes with putative roles in aflatoxin contamination of crops. FEMS Microbiol. Lett. 237, 333–340 (2004)
PubMed Google Scholar
Ferea, T. L., Botstein, D., Brown, P. O. & Rosenzweig, R. F. Systematic changes in gene expression patterns following adaptive evolution in yeast. Proc. Natl Acad. Sci. USA 96, 9721–9726 (1999)
Article ADS CAS Google Scholar
Sakaguchi, K., Takagi, M., Horiuchi, H. & Gomi, K. in Applied Molecular Genetics in Filamentous Fungi (eds Kinghorn, J. R. & Turner, G.) 54–99 (Blackie Academic & Professional, Glasgow, 1992)
Book Google Scholar
Hata, Y. et al. Nucleotide sequence and expression of the glucoamylase-encoding gene (glaA) from Aspergillus oryzae. Gene 108, 145–150 (1991)
Article CAS Google Scholar
Hori, H. & Osawa, S. Origin and evolution of organisms as deduced from 5S ribosomal RNA sequences. Mol. Biol. Evol. 4, 445–472 (1987)
CAS PubMed Google Scholar
Langkjaer, R. B., Cliften, P. F., Johnston, M. & Piskur, J. Yeast genome duplication was followed by asynchronous differentiation of duplicated genes. Nature 421, 848–852 (2003)
Article ADS CAS Google Scholar
Prade, R. A., Griffith, J., Kochut, K., Arnold, J. & Timberlake, W. E. In vitro reconstruction of the Aspergillus (= Emericella) nidulans genome. Proc. Natl Acad. Sci. USA 94, 14564–14569 (1997)
Article ADS CAS Google Scholar
Lawrence, J. G. & Hartl, D. L. Inference of horizontal genetic transfer from molecular data: an approach using the bootstrap. Genetics 131, 753–760 (1992)
CAS PubMed PubMed Central Google Scholar
Masel, A. M., He, C., Poplawski, A. M., Irwin, J. A. G. & Manners, J. M. Molecular evidence for chromosome transfer between biotypes of Colletotrichum gloeosporioides. Mol. Plant–Microbe Interact. 9, 339–348 (1996)
Article CAS Google Scholar
Dunham, M. J. et al. Characteristic genome rearrangements in experimental evolution of Saccharomyces cerevisiae. Proc. Natl Acad. Sci. USA 99, 16144–16149 (2002)
Article ADS CAS Google Scholar
Maeda, H. et al. Transcriptional analysis of genes for energy catabolism and hydrolytic enzymes in the filamentous fungus Aspergillus oryzae using cDNA microarrays and expressed sequence tags. Appl. Microbiol. Biotechnol. 65, 74–83 (2004)
Article CAS Google Scholar
Geiser, D. M., Pitt, J. I. & Taylor, J. W. Cryptic speciation and recombination in the aflatoxin-producing fungus Aspergillus flavus. Proc. Natl Acad. Sci. USA 95, 388–393 (1998)
Article ADS CAS Google Scholar
Iimura, Y., Gomi, K., Uzu, H. & Hara, S. Transformation of Aspergillus oryzae through plasmid-mediated complementation of the methionine-auxotrohpic mutation. Agric. Biol. Chem. 51, 323–328 (1987)
CAS Google Scholar
Watson, J. & Thompson, W. F. Purification and restriction endonuclease analysis of plant nuclear DNA. Methods Enzymol. 118, 57–75 (1986)
Article CAS Google Scholar
Lowe, T. M. & Eddy, S. R. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964 (1997)
Article CAS Google Scholar
Gotoh, O. Homology-based gene structure prediction: simplified matching algorithm using a translated codon (tron) and improved accuracy by allowing for long gaps. Bioinformatics 16, 190–202 (2000)
Article CAS Google Scholar
Asai, K., Itou, K., Ueno, Y. & Yada, T. Recognition of human genes by stochastic parsing. Pac. Symp. Biocomput. 3, 228–239 (1998)
Google Scholar
Majoros, W. H., Pertea, M., Antonescu, C. & Salzberg, S. L. GlimmerM, Exonomy and Unveil: three ab initio eukaryotic genefinders. Nucleic Acids Res. 31, 3601–3604 (2003)
Article CAS Google Scholar
Florea, L., Hartzell, G., Zhang, Z., Rubin, G. M. & Miller, W. A computer program for aligning a cDNA sequence with a genomic DNA sequence. Genome Res. 8, 967–974 (1998)
Article CAS Google Scholar
Thompson, J. D., Gibson, T. J., Plewniak, F., Jeanmougin, F. & Higgins, D. G. The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 25, 4876–4882 (1997)
Article CAS Google Scholar

Download references

Acknowledgements

The authors are grateful to N. Hall and H. Hagiwara for discussions and critical reading of the manuscript. We thank M. Tadenuma and T. Ishikawa of the Brewing Society of Japan for the office work necessary for the collaborative research work of companies, national institutes and universities. Author Contributions M.S., T.T., K. Kusumoto, T.A., Y. Kashiwagi, H. Horikawa, A.H., R.I., Y. Kato, A.K., N.M., T.S., K.T., S.S., K. Isono, S.K., N.O., H.K. and M.M. sequenced the A. oryzae genome; the genes were computationally predicted and annotated from the A. oryzae genome by K.A., T. Kumagai, G.T., J.Y., D.B., T.E.C., O.G., T. Kin, H.N. and T. Komori; M.S., K. Kusumoto, T.A., O.A., Y. Kashiwagi, K.A., K.G., H. Horiuchi, K. Kitamoto, T. Kobayashi, M. Takeuchi, D.W.D., D.A., J.W.B., M.I., K. Iwashita, P.R.J., M.K., H.M., J.M., T.N., K. Oda, I.P., K.S., Y.T., O.Y., Y.Y., H.A., Y.H., Y. Koide, Y. Koyama, T.M., A.T. and M.M. contributed correction and hand annotation of the predicted genes; K.A., T. Kumagai, G.T., D.W.D., J.E.G., W.C.N., N.D.F., T. Kin, H.N., Y.T., J.W., T. Komori and M.M. analysed gene localization and development of the A. oryzae genome.

Author information

Authors and Affiliations

Institute for Biological Resources and Functions, National Institute of Advanced Industrial Science and Technology (AIST), Higashi 1-1-1, Ibaraki, 305-8566, Tsukuba, Japan
Masayuki Machida, Motoaki Sano, Kumiko Takase & Yasunobu Terabayashi
Computational Biology Research Center, AIST, 2-42 Aomi, 135-0064, Tokyo, Koto-ku, Japan
Kiyoshi Asai, Toshitaka Kumagai, Goro Terai, Osamu Gotoh, Taishin Kin, Hideki Nagasaki & Kinya Okada
National Institute of Technology and Evaluation, Nishihara 2-49-10, 151-0066, Shibuya-ku, Tokyo, Japan
Toshihiro Tanaka, Hiroshi Horikawa, Akira Hosoyama, Rie Igarashi, Yumiko Kato, Akira Kokubun, Noriko Maeyama, Toshihiko Sawano, Mikio Takahashi, Katsumi Isono & Hisashi Kikuchi
National Food Research Institute, 2-1-12 Kannondai, 305-8642, Ibaraki, Tsukuba, Japan
Ken-Ichi Kusumoto & Yutaka Kashiwagi
National Research Institute of Brewing, 3-7-1 Kagamiyama, 739-0046, Higashihiroshima, Hiroshima, Japan
Toshihide Arima, Osamu Akita, Kazuhiro Iwashita, Ken Oda, Kazutoshi Sakamoto & Osamu Yamada
Tohoku University, 1-1 Tsutsumidori-Amamiyamachi, 981-8555, Aoba-ku, Sendai, Japan
Keietsu Abe, Katsuya Gomi, Hiroshi Maeda, Tasuku Nakajima & Youhei Yamagata
The University of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, 113-8657, Tokyo, Japan
Hiroyuki Horiuchi, Katsuhiko Kitamoto, Masayuki Ichinomiya, Praveen Rao Juvvadi & Jun-ichi Maruyama
Nagoya University, 464-8601, Nagoya, Furo-cho, Chikusa-ku, Japan
Tetsuo Kobayashi & Masashi Kato
Tokyo University of Agriculture and Technology, Saiwai-cho 3-5-8, Fuchu, 183-0054, Tokyo, Japan
Michio Takeuchi
The University of Manchester, M23 9PL, Manchester, UK
David W. Denning
Broad Institute of MIT and Harvard, 320 Charles Street, Massachusetts, 02142, Cambridge, USA
James E. Galagan
The Institute for Genomic Research, Maryland, 20850, Rockville, USA
William C. Nierman, Natalie D. Fedorova, Ian Paulsen & Jennifer R. Wortman
Department of Biochemistry and Molecular Biology, The George Washington University School of Medicine, 2300 Eye Street NW, DC 20037, Washington, USA
William C. Nierman
USDA/ARS Southern Regional Research Center, 1100 Robert E. Lee Boulevard, Louisiana, 70124, New Orleans, USA
Jiujiang Yu, Deepak Bhatnagar & Thomas E. Cleveland
School of Biology, University of Nottingham, NG7 2RD, Nottingham, UK
David B. Archer
Tulane University, Louisiana, 70118, New Orleans, USA
Joan W. Bennett
Kyowa Hakko Kogyo Co. Ltd, 1-6-1 Otemachi, Chiyoda-ku, 100-8185, Tokyo, Japan
Hideharu Anazawa
Research Institute, Gekkeikan Sake Co. Ltd, 24 Shimotoba-koyanagi-cho, 612-8361, Fushimi-ku, Kyoto, Japan
Yoji Hata
Amano Enzyme Inc., 4-179-35, Sue-cho, Kakamigahara, Gifu, 509-0108, Japan
Yoshinao Koide
INTEC Web and Genome Informatics Corporation, 1-3-3 Shinsuna, 136-8637, Koto-ku, Tokyo, Japan
Goro Terai & Takashi Komori
Kikkoman Corporation, 399 Noda, 278-0037, Noda, Chiba, Japan
Yasuji Koyama
Ozeki Co, Ltd., 4-9 Imadudezaike-cho, Nishinomiya, Hyogo, 663-8227, Japan
Toshitaka Minetoki
Axiohelix, 2-45, Aomi, 135-0064, Koto-ku, Tokyo, Japan
Sivasundaram Suharnan
Higeta Shoyu, Co. Ltd., 2-8 Chuo-cho, Choshi, 288-8680, Chiba, Japan
Akimitsu Tanaka
Kyushu University, Hakozaki 6-10-1, 812-8581, Higashi-ku, Fukuoka, Japan
Satoru Kuhara
Nara Institute of Science and Technology, 8916-5, Takayama, 630-0101, Nara Ikoma, Japan
Naotake Ogasawara

Authors

Masayuki Machida
View author publications
You can also search for this author in PubMed Google Scholar
Kiyoshi Asai
View author publications
You can also search for this author in PubMed Google Scholar
Motoaki Sano
View author publications
You can also search for this author in PubMed Google Scholar
Toshihiro Tanaka
View author publications
You can also search for this author in PubMed Google Scholar
Toshitaka Kumagai
View author publications
You can also search for this author in PubMed Google Scholar
Goro Terai
View author publications
You can also search for this author in PubMed Google Scholar
Ken-Ichi Kusumoto
View author publications
You can also search for this author in PubMed Google Scholar
Toshihide Arima
View author publications
You can also search for this author in PubMed Google Scholar
Osamu Akita
View author publications
You can also search for this author in PubMed Google Scholar
Yutaka Kashiwagi
View author publications
You can also search for this author in PubMed Google Scholar
Keietsu Abe
View author publications
You can also search for this author in PubMed Google Scholar
Katsuya Gomi
View author publications
You can also search for this author in PubMed Google Scholar
Hiroyuki Horiuchi
View author publications
You can also search for this author in PubMed Google Scholar
Katsuhiko Kitamoto
View author publications
You can also search for this author in PubMed Google Scholar
Tetsuo Kobayashi
View author publications
You can also search for this author in PubMed Google Scholar
Michio Takeuchi
View author publications
You can also search for this author in PubMed Google Scholar
David W. Denning
View author publications
You can also search for this author in PubMed Google Scholar
James E. Galagan
View author publications
You can also search for this author in PubMed Google Scholar
William C. Nierman
View author publications
You can also search for this author in PubMed Google Scholar
Jiujiang Yu
View author publications
You can also search for this author in PubMed Google Scholar
David B. Archer
View author publications
You can also search for this author in PubMed Google Scholar
Joan W. Bennett
View author publications
You can also search for this author in PubMed Google Scholar
Deepak Bhatnagar
View author publications
You can also search for this author in PubMed Google Scholar
Thomas E. Cleveland
View author publications
You can also search for this author in PubMed Google Scholar
Natalie D. Fedorova
View author publications
You can also search for this author in PubMed Google Scholar
Osamu Gotoh
View author publications
You can also search for this author in PubMed Google Scholar
Hiroshi Horikawa
View author publications
You can also search for this author in PubMed Google Scholar
Akira Hosoyama
View author publications
You can also search for this author in PubMed Google Scholar
Masayuki Ichinomiya
View author publications
You can also search for this author in PubMed Google Scholar
Rie Igarashi
View author publications
You can also search for this author in PubMed Google Scholar
Kazuhiro Iwashita
View author publications
You can also search for this author in PubMed Google Scholar
Praveen Rao Juvvadi
View author publications
You can also search for this author in PubMed Google Scholar
Masashi Kato
View author publications
You can also search for this author in PubMed Google Scholar
Yumiko Kato
View author publications
You can also search for this author in PubMed Google Scholar
Taishin Kin
View author publications
You can also search for this author in PubMed Google Scholar
Akira Kokubun
View author publications
You can also search for this author in PubMed Google Scholar
Hiroshi Maeda
View author publications
You can also search for this author in PubMed Google Scholar
Noriko Maeyama
View author publications
You can also search for this author in PubMed Google Scholar
Jun-ichi Maruyama
View author publications
You can also search for this author in PubMed Google Scholar
Hideki Nagasaki
View author publications
You can also search for this author in PubMed Google Scholar
Tasuku Nakajima
View author publications
You can also search for this author in PubMed Google Scholar
Ken Oda
View author publications
You can also search for this author in PubMed Google Scholar
Kinya Okada
View author publications
You can also search for this author in PubMed Google Scholar
Ian Paulsen
View author publications
You can also search for this author in PubMed Google Scholar
Kazutoshi Sakamoto
View author publications
You can also search for this author in PubMed Google Scholar
Toshihiko Sawano
View author publications
You can also search for this author in PubMed Google Scholar
Mikio Takahashi
View author publications
You can also search for this author in PubMed Google Scholar
Kumiko Takase
View author publications
You can also search for this author in PubMed Google Scholar
Yasunobu Terabayashi
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer R. Wortman
View author publications
You can also search for this author in PubMed Google Scholar
Osamu Yamada
View author publications
You can also search for this author in PubMed Google Scholar
Youhei Yamagata
View author publications
You can also search for this author in PubMed Google Scholar
Hideharu Anazawa
View author publications
You can also search for this author in PubMed Google Scholar
Yoji Hata
View author publications
You can also search for this author in PubMed Google Scholar
Yoshinao Koide
View author publications
You can also search for this author in PubMed Google Scholar
Takashi Komori
View author publications
You can also search for this author in PubMed Google Scholar
Yasuji Koyama
View author publications
You can also search for this author in PubMed Google Scholar
Toshitaka Minetoki
View author publications
You can also search for this author in PubMed Google Scholar
Sivasundaram Suharnan
View author publications
You can also search for this author in PubMed Google Scholar
Akimitsu Tanaka
View author publications
You can also search for this author in PubMed Google Scholar
Katsumi Isono
View author publications
You can also search for this author in PubMed Google Scholar
Satoru Kuhara
View author publications
You can also search for this author in PubMed Google Scholar
Naotake Ogasawara
View author publications
You can also search for this author in PubMed Google Scholar
Hisashi Kikuchi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Masayuki Machida.

Ethics declarations

Competing interests

The genome sequence has been submitted to DDBJ under the accession numbers AP007150–AP007177. Reprints and permissions information is available at npg.nature.com/reprintsandpermissions. The authors declare no competing financial interests.

Supplementary information

Supplementary Methods

This describes the methods that were used for the library construction, sequencing, mapping contigs, southern hybridization and pulse field gel electrophoresis (DOC 44 kb)

Supplementary Figure 1

Schematic drawing of the A. oryzae genome sequence assembly. The 8 chromosomes of A. oryzae by the genome sequencing are represented from short arm to long arm together with the PFGE image of the A. oryzae chromosomes. (PPT 141 kb)

Supplementary Figure 2

Distribution and expression of genes throughout the A. oryzae genome. Non metabolism, Q genes, Extra homologues, AO specific genes, Genes with EST and Gene density are aligned with the eight A. oryzae chromosomes together with synteny to the A. fumigatus and A. nidulans genomes. (PDF 131 kb)

Supplementary Figure 3

Gene redundancy of pathway members. The amino acid sequences of S. cerevisiae genes for metabolic pathways members were obtained by SGD database and were searched by BLASTP with E-value of =< 10^-20. (PDF 19 kb)

Supplementary Figure 4

Metabolic pathway view of the gene redundancy of pathway members. The gene redundancy of each member of the metabolic pathways in A. oryzae genome as compared to A. fumigatus and A. nidulans was analysed, and was superimposed on the metabolic pathway map. (PDF 100 kb)

Supplementary Figure 5

Phylogenetic analysis of proteinases and glucosidases. Phylogenetic relationship of carboxypeptidases, ATP-dependent proteinases, maltase and α-amylase genes from the three Aspergilli were analysed. (PDF 25 kb)

Supplementary Figure 6

Phylogenetic analysis of metabolic genes. Phylogenetic relationship of pyruvate decarboxylase, saccharopine dehydrogenase, homoaconitase and saccharopine dehydrogenase (NADP+, L-glutamate forming) in lysine biosynthesis from the three Aspergilli were analysed. (PDF 21 kb)

Supplementary Figure 7

Analysis of a pair of syntenic genes highly homologous between A. oryzae and A. tumefaciens. This figure shows the alignment of the pair of syntenic genes from the two species. Phylogenetic analysis using amino acid sequences of the genes from variety of organisms was carried out. (PDF 31 kb)

Supplementary Figures Legends

This file contains figure legends for Supplementary Figures 1–7. (DOC 37 kb)

Supplementary Tables

This file contains Supplementary Tables 1–8. (DOC 124 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Machida, M., Asai, K., Sano, M. et al. Genome sequencing and analysis of Aspergillus oryzae. Nature 438, 1157–1161 (2005). https://doi.org/10.1038/nature04300

Download citation

Received: 18 May 2005
Accepted: 06 October 2005
Issue Date: 22 December 2005
DOI: https://doi.org/10.1038/nature04300

This article is cited by

Aspergillus oryzae PrtR alters transcription of individual peptidase genes in response to the growth environment
- Rika Numazawa
- Yukako Tanaka
- Youhei Yamagata
Applied Microbiology and Biotechnology (2024)
High-throughput droplet microfluidics screening and genome sequencing analysis for improved amylase-producing Aspergillus oryzae
- Qinghua Li
- Jinchang Lu
- Jian Chen
Biotechnology for Biofuels and Bioproducts (2023)
Insights into the capability of the lignocellulolytic enzymes of Penicillium parvum 4-14 to saccharify corn bran after alkaline hydrogen peroxide pretreatment
- Liangkun Long
- Wei Wang
- Shaojun Ding
Biotechnology for Biofuels and Bioproducts (2023)
Effective production of kojic acid in engineered Aspergillus niger
- Liu Wu
- Licheng Zhang
- Hao Liu
Microbial Cell Factories (2023)
A recombinant Aspergillus oryzae fungus transmitted from larvae to adults of Anopheles stephensi mosquitoes inhibits malaria parasite oocyst development
- Leila Kianifard
- Ab. Matteen Rafiqi
- Serdar Uysal
Scientific Reports (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.