A multi-omic map of the lipid-producing yeast Rhodosporidium toruloides

Zhu, Zhiwei; Zhang, Sufang; Liu, Hongwei; Shen, Hongwei; Lin, Xinping; Yang, Fan; Zhou, Yongjin J.; Jin, Guojie; Ye, Mingliang; Zou, Hanfa; Zhao, Zongbao K.

doi:10.1038/ncomms2112

Download PDF

Article
Open access
Published: 09 October 2012

A multi-omic map of the lipid-producing yeast Rhodosporidium toruloides

Zhiwei Zhu^1,2,
Sufang Zhang^1,3,
Hongwei Liu⁴,
Hongwei Shen^1,3,
Xinping Lin^1,2,
Fan Yang^1,5,
Yongjin J. Zhou^1,2,
Guojie Jin^1,2,
Mingliang Ye⁴,
Hanfa Zou⁴ &
…
Zongbao K. Zhao^1,3

Nature Communications volume 3, Article number: 1112 (2012) Cite this article

25k Accesses
284 Citations
19 Altmetric
Metrics details

Subjects

A Corrigendum to this article was published on 29 July 2013

This article has been updated

Abstract

Triacylglycerols are among the most attractive alternative raw materials for biofuel development. Current oil plant-based technologies are limited in terms of triacylglycerol production capacity and rate. These limitations may be circumvented by biotransformation of carbohydrates into lipids; however, our understanding of microbial oleaginicity remains limited. Here we present the results of a multi-omic analysis of Rhodosporidium toruloides, a robust triacylglycerol-producing fungus. The assembly of genome and transcriptome sequencing data reveals a genome of 20.2 Mb containing 8,171 protein-coding genes, the majority of which have multiple introns. Genes including a novel fatty acid synthase are predicted to participate in metabolic pathways absent in non-oleaginous yeasts. Transcriptomic and proteomic data suggest that lipid accumulation under nitrogen-limited conditions correlates with the induction of lipogenesis, nitrogenous compound recycling, macromolecule metabolism and autophagy. The multi-omic map of R. toruloides therefore provides a valuable resource for efforts to rationally engineer lipid-production pathways.

Genomic insights into the lifestyles, functional capacities and oleagenicity of members of the fungal family Trichosporonaceae

Article Open access 17 February 2020

Habibu Aliyu, Olga Gorte, … Katrin Ochsenreither

Lipid and DHA-production in Aurantiochytrium sp. – Responses to nitrogen starvation and oxygen limitation revealed by analyses of production kinetics and global transcriptomes

Article Open access 19 December 2019

Tonje M. B. Heggeset, Helga Ertesvåg, … Inga Marie Aasen

Untargeted metabolomics analysis of Ralstonia eutropha during plant oil cultivations reveals the presence of a fucose salvage pathway

Article Open access 12 July 2021

Björn Gutschmann, Martina C. E. Bock, … Sebastian L. Riedel

Introduction

Triacylglycerols (TAGs) are neutral lipids that are naturally produced via fatty acid biosynthetic pathways. TAGs are used as foods, animal feeds and feedstocks for chemical syntheses¹. Recently, derivatives of TAGs, such as long-chain fatty acid methyl esters² and alkanes³, have been explored as drop-in biofuels. However, traditional technologies that are based on oil crops and plants are limited in terms of their production capacity and rate. The harnessing of microbial fatty acid biosynthetic machineries may circumvent those limitations^4,5.

Some oleaginous species are exceptionally robust in converting carbohydrates into lipids, such as TAGs⁶. Yeast species of the anamorphic genus Rhodotorula have often been used for transformation of renewable resources into microbial oils^7,8,9. Rhodosporidium toruloides (formerly known as Rhodotorula glutinis or Rhodotorula gracilis), which is a non-pathogenic, pink-coloured basidiomycetous fungus, can accumulate lipids to more than 70% of its dry cell weight^6,10. This 'red' yeast is also a good producer of carotenoids¹¹ and biotechnologically important enzymes, such as cephalosporin esterase¹² and epoxide hydrolase¹³. More importantly, R. toruloides has excellent tolerance of inhibitory compounds that are found in biomass hydrolysates¹⁴. Thus, R. toruloides is a unique yeast strain of great biotechnological potential. Although a copious literature presenting the results of lipid production by oleaginous species exists^6,15, the understanding of the molecular basis of microbial oleaginicity and the sparse capacity for rational strain engineering is limited, largely because genetic backgrounds of naturally oleaginous species remain poorly documented.

Next-generation DNA sequencing technologies enable rapid genome decoding, high-throughput transcriptome sequencing (RNA-seq)¹⁶ and gene expression profiling (3′-Tag sequencing)¹⁷. Furthermore, advances in mass spectrometry-based proteomic analysis allow for the identification of thousands of proteins in a single experiment. The mystery of microbial oleaginicity may be unearthed using a natural lipid producer and an in-depth study that integrates the above-mentioned state-of-the-art technologies^18,19,20.

Here we present the results of genomic sequencing of R. toruloides, and differential transcriptomic and proteomic analyses of samples that were prepared under various culture conditions. We show that lipid accumulation under nitrogen-limited conditions is tightly connected with cellular processes that are related to lipogenesis, macromolecule metabolism and autophagy. We also find novel genes and pathways (such as fatty acid synthesis and lipid storage) that are committed to oleaginicity. Much information is consistent at different omic levels, and such multi-omic data significantly enriches our understanding of the molecular basis of lipid accumulation. This should facilitate the engineering of a natural lipid producers and other microorganisms for improved efficiency, robustness and economics of the microbial production of fatty acid derivatives.

Results

Genomic and transcriptomic sequencing and annotation

The genome of the haploidic strain R. toruloides NP11 was sequenced using the Illumina sequencer platform (Supplementary Table S1), and the de novo sequence assembly resulted in a 20.2-Mb genome with an average coverage of 96× (Supplementary Fig. S1) and 94 scaffolds (>1 kb, L50=575 kb, Supplementary Table S2). After PCR-based gap closure, the genome was narrowed down to 34 scaffolds, which included one ribosomal DNA unit sequence (Supplementary Fig. S2 and Supplementary Table S2). The genome was organized into 16 chromosomes that ranged from 650 to 1900 kb (Supplementary Fig. S3 and Supplementary Methods). To validate the completeness of the genome assembly and to define the transcriptional landscape of R toruloides, the transcriptome was sequenced using RNA-seq, which generated 6.7 M paired-end reads (Supplementary Table S1). It was found that 79% of the sequenced complementary DNA (cDNA) fragments and 98.6% of the assembled expressed sequence tags could be mapped to the genome. Moreover, except for KOG1185 and KOG1394, which are absent in basidiomycetous fungi, 246 of the 248 core eukaryotic genes²¹ were found. These results indicated that the assembled genome encompassed a very high percentage of the gene space.

The repetitive sequences constituted 1.66% of the current assembly (Supplementary Table S3), and RNA-seq alignment combined with de novo gene prediction, homology search and manual curation resulted in the prediction of 8,171 protein-coding genes (Supplementary Fig. S4 and Supplementary Table S4). The gene-coding regions were considerably dense with an average of 2.6 kb per gene. In addition, similar to other basidiomycetous fungi, split genes, which have short and multiple introns, were predominant, and 7,938 genes were found to have at least one intron. Moreover, most genes with 5′- and 3′-untranslated regions were annotated with the assistance of RNA-seq (Supplementary Table S4). It was found that 90% of the gene models had more than 30% of their length covered by RNA-seq reads, which were mapped to the annotated open-reading frames using the Bowtie software²², and 38% of the annotated proteins had at least one unique peptide that was present at the proteomic level (Supplementary Table S5). RNA-seq data indicated that 1,371 genes encoded two or more transcript isoforms, suggesting that the transcriptome was more complex than that of the static genomic data set (Supplementary Fig. S5). For the functional annotation, 79% of the R. toruloides proteins had homologous proteins or conserved domains of known proteins (Supplementary Table S5). Approximately 60% of proteins were annotated with Gene Ontology (GO) terms, and 32% of proteins were found with KEGG (Kyoto Encyclopedia of Genes and Genomes) orthology numbers.

On the basis of these annotations, we reconstructed the central and lipid metabolism of this organism (Fig. 1). In comparison with Saccharomyces cerevisiae, R. toruloides contains ATP:citrate lyase (ACL), and the mitochondrial β-oxidation (MBO) and carotenoid biosynthetic pathways. Although cytoplasmic acetyl-CoA synthase can produce acetyl-CoA from acetate, in oleaginous fungi and higher eukaryotes, acetyl-CoA, which results from the cleavage of citrate by ACL, is a major source for lipid synthesis²³. Unlike β-oxidation in peroxisomes, the MBO pathway recovers more energy through the degradation of fatty acids by using the flavin adenine dinucleotide (FAD) cofactor, which couples with the respiratory chain to produce ATP. Moreover, MBO-associated enzymes are required to metabolize branched-chain amino acids to produce acetyl-CoA^24,25. Thus, MBO may provide alternative acetyl-CoA and energy sources for fatty acid synthesis by degrading amino acids and membrane lipids (vide infra). Two genes that code putative carotenoid synthesis-related enzymes, phytoene synthase (PSY1) and phytoene dehydrogenase (CRTI), were identified, revealing the genetic basis for the formation of the pink-coloured pigments by this organism (Supplementary Fig. S6 a).

**Figure 1: Reconstruction of central and lipid metabolism pathways of *R. toruloides*.**

A novel fatty acid synthase (FAS) system was found in R. toruloides (Fig. 2). FASs are typically classified into two variants, the dissociated type II system and the integrated type I multi-enzyme. In fungi, FASs are usually composed of eight distinct domains and organized into two subunits or one polypeptide²⁶. Although split FASs are common in fungi, FAS in R. toruloides is split into a novel form that consists of two subunits: the β-subunit (Fas1), which contains acyltransferase and enoyl reductase domains, and the α-subunit (Fas2), which includes all other domains (Fig. 2a). Two acyl carrier protein (ACP) domains of 76% sequence identity are present in Fas2, and these two ACPs share high similarity with ACPs from other species (Supplementary Figure S7). Unique peptides from different ACPs were also found by LC-MS/MS (liquid chromatography–mass spectrometry) during proteomic analysis (Fig. 2b; Supplementary Fig. S8), which unambiguously indicated the expression of the FAS with two ACPs. As ACP has a key role in iterative substrate shuttling in the chain-elongation process, the presence of tandem ACPs may improve the efficacy of fatty acid biosynthesis by providing a higher intermediate concentration in the reaction chamber of FAS²⁷.

**Figure 2: *R. toruloides* fatty acid synthase system.**

Transcriptomic analysis

Early biochemical studies demonstrated that nitrogen limitation was a major regulator for initiating lipid overproduction (Supplementary Fig. S6 d,f,g)^6,10. In oleaginous fungi, mitochondrial NAD⁺-dependent isocitrate dehydrogenase (IDH) is dependent on adenosine monophosphate (AMP), and under nitrogen limitation conditions the cellular AMP level is lowered because of AMP deaminase activity, which leads to an impaired activity of IDH. Consequently, mitochondrial citrate accumulates and permeates out from the mitochondria to the cytosol where it serves as a substrate for ACL to form acetyl-CoA for fatty acid biosynthesis. In addition, cytosolic NADP⁺-dependent malic enzyme (ME) is considered as the key enzyme that supplements nicotinamide adenine dinucleotide phosphate (NADPH) for de novo lipogenesis. In addition to lipid accumulation, nitrogen limitation is expected to mediate a more global response, such as the release of nitrogen catabolite repression (NCR)²⁸ and the induction of autophagy by the target of rapamycin (TOR) signalling cascades^29,30.

We grew chemostat cultures of R. toruloides to examine the influence of nitrogen starvation on this organism (Table 1). In a minimal medium with abundant ammonium (MM), cells accumulated lipids to 22.8%, and the residual ammonium concentration was 47±1 mM. In minimal medium with trivial amounts of ammonium (MM-N), the cellular lipid content was 33.3%. Furthermore, the oil yield, which was based on consumed sugar (Y_SP), in the MM-N medium was 3.6-fold higher than that in the MM medium. These results clearly indicated that nitrogen limitation promoted lipid accumulation and that the metabolic flux was repartitioned for lipid biosynthesis.

Table 1 Parameters and results of R. toruloides cultures.

Full size table

The 3′-tag digital gene expression (DGE) profiling experiment produced more than three million tags from each RNA sample that was prepared from cells cultured in the MM or MM-N media (Supplementary Table S1). Of those tags, 78% from the MM sample and 63% from the MM-N sample mapped to annotated transcripts, and 89% from the MM sample and 74% from the MM-N sample mapped to the genome (Supplementary Table S6). In addition, 5.7% of those tags mapped to antisense transcripts of predicted genes, which led to the observation that 74% of the total genes exhibited antisense transcription (Supplementary Table S6). These data indicated that antisense transcription is widespread in R. toruloides. Approximately 12% of the reads that uniquely mapped to sense transcripts were not aligned to the 3′-most NlaIII site (Supplementary Figure S9), which suggested that transcription at the 3′-end was heterogeneous. For transcript quantification, the tags that mapped to sense sequences were counted and compared. Of the 7,045 genes that were expressed in these two samples (Supplementary Table S6), 1,177 and 853 genes were significantly upregulated and downregulated, respectively (Fig. 3a). It was obvious that nitrogen limitation influenced a variety of cellular processes such as metabolism, localization and transportation (Fig. 3b). NCR-related genes were induced (Fig. 3c), and MEP2, which is the orthologue of the S. cerevisiae high-affinity ammonia transporter, was upregulated tenfold. In addition, three genes that coded amino acid permeases (RHTO_00398, RHTO_00931 and RHTO_01882) were upregulated. The central nitrogen metabolism machinery, which is composed of glutamate dehydrogenase (GDH), glutamine synthase (GLN) and glutamate synthase (GOGAT/GLT), is the hub of cellular nitrogen utilization, and it was discovered that the transcription of GDH1, GDH2 and GLN1 were upregulated under nitrogen-limited conditions. It was also found that the expression of genes involved in urate and urea degradation was upregulated, whereas the expression of genes related to the utilization of other poor nitrogen sources was not significantly altered (Supplementary Table S7). Furthermore, genes coding proteases (PRB1 and PEP4) and amino acid permeases (RHTO_05155 and RHTO_02128) that are linked to vacuolar protein degradation were induced. As Pep4 activates other vacuolar hydrolases, this elevated expression level would enhance the degradation of macromolecules and organelles. Indeed, autophagy-related genes (ATG1, ATG2, ATG20 and ATG8) and vacuolar-type ATPases (VMA5, VMA9 and VMA11), vacuolar lipase (ATG15) and α-mannosidase (AMS1) were also upregulated (Supplementary Table S7), which suggested that the cells initiated the turnover and recycling of intracellular components. The downregulated genes were related to primary metabolism, macromolecule metabolism and cellular metabolism, especially nucleobase, nucleoside, nucleotide and nucleic acid metabolic processes (GO: 0006139) and ribonucleoprotein complex biogenesis (GO: 0022613). Ribosome biogenesis is responsive to nutrient starvation through an evolutionarily conserved TOR signalling mechanism³¹, and for the 65 genes that were sampled by DGE and coded cytoplasmic ribosomal proteins, 39 genes were downregulated (Fig. 3d, blue), which suggested that inhibition of the TOR complex suppressed the transcription of ribosomal components. Together, nitrogen limitation triggered a set of rescue pathways that included the induction of nitrogenous compound uptake, nitrogen assimilation, alternative nitrogen-source utilization, vacuolar protein degradation and autophagy, and inhibition of nitrogenous macromolecule biosynthesis.

**Figure 3: Differential transcriptome of *R. toruloides* in response to nitrogen limitation.**

The regulation of central metabolism and lipid metabolism at the transcriptional level was also investigated (Fig. 1; Supplementary Table S8). FAS in the MM-N sample was upregulated by 4.7- and 1.8-fold for FAS1 and FAS2, respectively, but there were no significant differences in the expression of genes that were involved in the late stage of TAG biosynthesis. Isoprenoids are a variety of compounds (for example, steroids, carotenoids and terpenes) that are derived from the common precursor dimethylallyl diphosphate and its isomer 3-isopentenyl diphosphate. Four genes that are involved in isoprenoid backbone synthesis were upregulated in the MM-N sample (Fig. 1). Indeed, the pink colour development of the culture was an indication of carotenoid biosynthesis. As lipids are highly reduced compounds, the supplement of a reducing power, mainly NADPH, is equally important to that of the precursor acetyl-CoA⁶. Activities of glucose-6-phosphate dehydrogenase (ZWF1), 6-phosphogluconate dehydrogenase (GND1), NADP⁺-dependent isocitrate dehydrogenase (IDP1) and ME (ME1) can generate NADPH. Although ZWF1 and IDP1 were upregulated by 1.9- and 4.8-fold, respectively, the expression of GND1 and ME1 were downregulated by 6.8- and 4.5-fold, respectively. The reduced transcriptional level of ME1 was consistent with early observations that indicated the expression of ME was suppressed during nitrogen exhaustion^32,33. Interestingly, the expression of some genes that were involved in lipid degradation (that is, lipolysis, the β-oxidation cycle, and glyoxylate shunt) were also upregulated (Fig. 1), which might be caused by free fatty acids being released by an elevated autophagy process³⁴.

We also compared the highly expressed genes of cells that were cultured in yeast-extract peptone dextrose (YPD), MM or MM-N media (Fig. 4; Supplementary Table S10). It was found that the numbers of highly expressed genes were similar, and approximately half of those genes were shared by the three samples (Fig. 4b, group G). The shared genes included those coding ribosomal proteins, actin, ACP (RHTO_01048), TAL1 of the phosphopentose pathway, MDH1 and IDH2 of the tricarboxylic acid (TCA) cycle and GAPDH, ENO1 and FBA1 of glycolysis. Forty-two genes were highly expressed exclusively in the MM-N sample (group C) and included those encoding FAS1& 2, PYC1, GDH1 and acyl-CoA-binding protein (RHTO_03853). Genes coding ACL1 and pyruvate dehydrogenase components (LAT1 and LPD1) were highly expressed in the MM and MM-N samples (group E), where lipid contents were greater than 20%.

**Figure 4: Gene expression levels of *R. toruloides* cultured in different media.**

Comparative proteomic analysis

In an earlier study, we performed comparative proteomic analysis of R. toruloides based on the proteome database of S. cerevisiae. As a result, 184 proteins were identified, and 46 of those proteins were found to change significantly during the lipid-production stage³⁵. Here we re-analysed raw LC-MS/MS data that was based on the newly annotated protein data set that had been generated from the genomic data of R. toruloides. Although the previous analysis only cumulatively observed 184 proteins, in this study we observed 3,108 proteins, which amounted to roughly a 17-fold increment over the previous data set (Supplementary Table S11). Thus, 38% of annotated genes were confirmed at the proteomic level. We were able to identify 2,057 proteins, which included over 32-fold more proteins than our previous data set (Fig. 5a; Supplementary Table S11). The intersection between the '24 h' and the '96 h' samples included approximately 100 more proteins than those of the other two intersections, which indicated that samples at the lipid-production stages more physiologically resembled each other. Further semi-quantitative analysis revealed 538 proteins that significantly varied in abundance between any two samples (Fig. 5b and Supplementary Data 1). A good correlation was observed between protein expression levels of the '24 h' and the '96 h' samples, which had a Pearson's correlation coefficient of 0.79 (Fig. 5c) and suggested that many proteins were constantly expressed during lipid accumulation under nitrogen-limited conditions.

Significantly higher levels were found for proteins linked to nitrogen metabolism under nitrogen-limited conditions. Specifically, proteins involved in nitrogenous compound transportation (Mep2, RHTO_07825, RHTO_07915, RHTO_00398, Dur3 and Nrt2), alternative nitrogen source utilization (proline, urate, urea and nitrate) and central nitrogen metabolism (Gdh1, Gln1, Gln2 and Glt1) were found at higher levels in the lipid-production samples than those in the 'Seed' sample (Fig. 5d), which suggested that nitrogen limitation alleviated the NCR process. On the other hand, the levels of ribosomal proteins, aminoacyl transfer RNA synthases, translation initiation and elongation factors, and protein folding-related proteins were reduced (Supplementary Data 1), which indicated that the protein biosynthesis machinery was suppressed. The increased levels of vacuolar degradation- and autophagy-related proteins, such as vacuolar proteases (Cpb1, Rpb1 and Pep4), vacuolar-type ATPases (Vma2, Vma4, Vma13 and Stv1) and α-mannosidase (Ams1) suggested that an activated autophagy process was involved in responding to nitrogen starvation. The above-mentioned proteomic observations were also in good agreement with the differential transcriptomic data.

Proteins involved in lipogenesis, especially for de novo fatty acid biosynthesis, were found at elevated levels (Fig. 5e). These proteins included Acl1, Me1, Acc1, Fas1 and Fas2 for fatty acid synthesis and glycerol-3-phosphate dehydrogenase isoform 2 (Gpd2) and glycerol-3-phosphate acyltransferase (Gat1) for glycerolipid synthesis. Cytosolic NADP⁺-dependent ME is the key enzyme that supplements NADPH for de novo lipogenesis^6,32,33. Interestingly, although the transcription of ME1 was downregulated, the protein level of Me1 was significantly higher in the lipid production samples, which suggested that the regulation of ME activity was complicated. The level of acetyl-CoA synthase (Acs1) was increased by 2.2- and 2.1-fold in the '24 h' and '96 h' samples, respectively (Fig. 6a); however, levels of acetyl-CoA C-acetyltransferase (Erg10) and acetyl-CoA hydrolase (Ach1), which are the branch point enzymes for acetyl-CoA, were decreased. These data demonstrated that the acetyl-CoA flux was switched to the synthesis of fatty acids but not of isoprenoids. In fungi, a perilipin-like protein was characterized as a lipid-droplet protein that protected lipids from degradation³⁶. RHTO_05627 was a perilipin family protein³⁷ that showed significantly higher levels at lipid-production stages, and the levels of proteins that were related to MBO were also increased, which was consistent with the upregulation of β-oxidation at the transcription level.

**Figure 6: Expression of acetyl-CoA- and pyruvate-associate enzymes.**

The levels of proteins associated with the TCA cycle (that is, Idh1, Sdh1, Sdh2 and Mdh1) decreased in the lipid-production samples (Fig. 5f), which was in agreement with early biochemical observations⁶, and the levels of proteins involved in pyruvate metabolism were significantly altered (Figs 5f and 6b). In the lipid-production samples, higher levels of pyruvate kinase (Pyk1) and pyruvate dehydrogenase components (Pdb1 and Lpd1) were found, whereas a lower level of pyruvate decarboxylase (Pdc1) was observed, which suggested that pyruvate was channelled to acetyl-CoA. Pyruvate carboxylase (PYC) functions as a key enzyme for the anaplerosis of the TCA cycle and gluconeogenesis. PYC was also proposed to be an essential part of the transhydrogenation machinery, which is composed of PYC, ME and malate dehydrogenase for the conversion of NADH to NADPH⁶. As the level of phosphoenolpyruvate carboxykinase (Pck1) that is responsible for the transformation of pyruvate to phosphoenolpyruvate was reduced, an increased level of Pyc1 should be mainly related to the NADPH supply for lipogenesis (Fig. 6b). Interestingly, enzymes catalysing these irreversible reactions in glycolysis (Hxk2 and Pyk1) were elevated. Moreover, the key enzyme in glycolysis, phosphofructokinase (Pfk1), was also augmented by 1.4-fold (P-value <0.006, n=3 two-tailed, Student's t-test) at '24 h' and by 1.3-fold (P-value <0.19, n=3 two-tailed, Student's t-test) at '96 h'. This evidence suggested that the glycolysis pathway would be slightly upregulated at lipid-production stages.

Discussion

Long-chain fatty acid derivatives have significant promise as drop-in biofuels and chemicals as replacements for petroleum products. In this regard, natural microbial hosts that produce neutral lipids such as TAGs should be explored, and the engineering of model microorganisms should be studied. The robustness of lipid production by the basidiomycetous fungus R. toruloides has been well-documented^10,14,38. Auxotrophic mutants were achieved by physical and chemical mutagenesis³⁹, and efficient transformation of R. toruloides with artificial plasmids has been demonstrated⁴⁰. The current study disclosed the genome of R. toruloides based on deep sequencing and RNA-seq-assisted annotation, and further transcriptomic and proteomic analyses provided integrated information to discuss, for the first time and in much more detail, the molecular mechanism of fungi oleaginicity, nitrogen metabolism and the relationship between these processes.

Some features of the R. toruloides genome that were linked to oleaginicity included the presence of ACL, split FAS, the perilipin family protein, and the MBO and carotenoid biosynthetic pathways. In R. toruloides, the expression level of ACL1 was extremely high (Supplementary Table S8), which led to elevated citrate lysis activity for acetyl-CoA production. ACL1 is an essential gene for oleaginicity and is absent in some Saccharomycetes fungi. Y. lipolytica, which is the only known species of the Saccharomyces that contains ACL, is an oleaginous yeast⁴¹. The presence of split FAS with two ACPs seemed conserved among oleaginous red yeasts (Supplementary Fig. S10). Tandem ACPs were distributed broadly in polyketide synthases, and it was demonstrated that the yield of polyunsaturated fatty acids were dependent on the number of ACP domains⁴². Although it was unclear whether tandem ACPs in FAS would improve fatty acid production, this novel domain composition was to the best of our knowledge, first identified in FAS. Neutral lipids are stored in lipid droplets, which are specialized organelles that are coated with a phospholipid monolayer and proteins. Perilipin, a diverse family of lipid-droplet coating proteins, governs lipolysis³⁷. In fungi, a perilipin-like protein MPL1 was characterized as a lipid-droplet protein that protected lipids from degradation³⁶. The MPL1-like perilipins were exclusively found in pezizomycotinal ascomycetes such as Y. lipolytica, and the open-reading frame YALI0F24167g.aa (perilipin family protein, E-value 1.7e−05) of Y. lipolytica was identified in the most abundant fraction of the lipid-droplet proteome⁴³. We found a new class of perilipin proteins, which was composed of the amino terminal perilipin and carboxy terminal apolipoprotein domains, in basidiomycetes (Supplementary Fig. S11). During lipid accumulation, the expression of perilipin was consistent with the lipid amount and lipid-droplet volume (Fig. 5e), which suggested its importance to lipid accumulation.

Nitrogen limitation has been extensively practiced to initiate lipid overproduction for oleaginous species^6,10. Nitrogen limitation leads to amino acid (especially glutamine and leucine) starvation in the cytosol, which inhibits the activity of the TOR complex 1, a global regulator of nutrient signalling, by the Npr2/3 (ref. 31) or Gtr1/2 complexes (orthologues of the Rag protein)⁴⁴. The conserved TOR complexes were present in R. toruloides (Fig. 7a) and the expression of NPR2 was indeed downregulated in this organism on nitrogen starvation (Supplementary Table S7). Inactivated TORC1 then inhibits translation initiation and ribosome biogenesis and activates autophagy (Fig. 7b). As a result, protein synthesis is blocked and autophagy-related proteolysis is activated, which recycles amino acids. In S. cerevisiae, NCR-related genes are regulated by TOR signalling in response to nitrogen starvation through controlling the translocation of GATA transcription factors⁴⁵. In the basidiomycetous fungus Cryptococcus neoformans, the GATA-type transcription factor Gat1 was also found to be responsible for nitrogen regulation⁴⁶. Eleven putative GATA-type transcription factors were found in R. toruloides, and the HGATAR (H=A, C or T, and R=A or G) motif, which is recognized by GATA factors, was overrepresented in the upstream regions of those upregulated nitrogen metabolic genes (Supplementary Fig. S12). Therefore, it would be interesting to characterize the roles of GATA factors in the transcriptional regulation of nitrogen metabolism in R. toruloides. Ribosome biogenesis and protein translation, the read-outs of TOR signalling, were also inhibited (Fig. 3d and Supplementary Data 1); thus, nitrogen starvation resulted in blocked biosynthesis of nitrogenous macromolecules (that is, nucleic acids and protein). Inhibited TOR signalling cascades can also activate autophagy²⁹, and a number of autophagy-related genes were induced on nitrogen starvation. As a result, free amino acids and fatty acids were expected to be released on degradation of proteins and membrane lipids, respectively. The free fatty acids might contribute to the activation of the MBO pathway for alternative acetyl-CoA production, and the free amino acids could contribute to relieving nitrogen stress. At the same time, autophagy was also demonstrated to have roles in lipid-droplet biogenesis⁴⁷ by making more room for lipid-droplet formation through the clearance of organelles or portions thereof (Supplementary Fig. S6d,g).

**Figure 7: TOR signalling links nitrogen regulation and lipid accumulation.**

Except for the upregulated transcription of FAS1, FAS2 and PYC1, under nitrogen-limiting conditions, some key genes for lipogenesis, such as ACL1, ACC1, ME1, IDH1 and IDH2, were not transcriptionally regulated, which was consistent with the transcriptomic analyses of the oleaginous yeast Y. lipolytica¹⁹. Comparative proteomic analysis would suggest that the regulation of these genes was post-transcriptional. We also found that cells at the lipid-production stage had elevated lipid-droplet wrapping, intensified glycolysis, an impaired TCA cycle and enhanced metabolic fluxes at branch points where pyruvate and acetyl-CoA were channelled into fatty acid biosynthesis. In addition, regulation of nitrogen metabolism at the transcriptional and proteomic levels included the following: (1) upregulation of genes involved in assimilation and transport of nitrogenous sources and genes for degradation/recycling of proteins or nitrogenous compounds, and (2) decreases in the biosynthesis of proteins and macromolecules.

Although major evidence was found in the current study that support the classical opinion on nitrogen-limitation-mediated lipid production^6,23, we now have a significantly broader and clearer view of microbial oleaginicity. This multi-omic information should aid the scientific community in genetic manipulation and metabolic engineering of lipid producers for the production of fatty acid derivatives that include tailor-made lipids, advanced fuel molecules⁴⁶, microdiesel⁵ and hydrocarbons^5,49.

Methods

Strain background

The parent of the sequenced haploid strain R. toruloides NP11 was Y4 (ref. 10) and was derived from R. toruloides CGMCC 2.1389, which is identical to CBS 6016. Haploid NP11 (MAT A1) and NP5-2 (MAT A2) were isolated from basidiospores that were germinated from teliospores of Y4 (Supplementary Fig. S13).

Cell culture conditions and DNA/RNA sample preparation

For genomic DNA preparation, R. toruloides was cultivated in YPD liquid medium (10 g l⁻¹ yeast extract, 20 g l⁻¹ peptone and 20 g l⁻¹ dextrose) at 30 °C. Genomic DNA isolation was performed by following a previous protocol⁵⁰. For cell disruption, the culture pellets were ground in liquid nitrogen or digested with MAN5C⁵¹ followed by SDS and protease K (Takara) treatment. DNA was extracted by phenol (pH 8.0) and chloroform:isoamyl alcohol (24:1) and precipitated by ethanol (75%, v/v). The DNA samples were then dissolved in TE buffer (10 mM Tris–Cl, 1 mM EDTA, pH 7.6), incubated in the presence of RNase A (DNase-free, 20 μg ml⁻¹, Genview) at 37 °C for 30 min and extracted again. High-quality DNA (optical density (OD)_260/280=1.8~2.0) that did not smear during electrophoresis was obtained, and the samples were lyophilized and delivered for genome sequencing.

For massively parallel cDNA sequencing (RNA-seq), a chemostat culture of R. toruloides was run in a 3-l stirred tank reactor (Baoxing Bio-Engineering Equipment, Shanghai, China) that was equipped with an Inpro 6800 O₂ sensor and 405-DPAS-SC-K8S/225 pH electrode (Mettler Toledo). The cells were grown in YPD medium at 30 °C, pH 5.6, the airflow was 100 l h⁻¹ (approximately 1 VVM) and the stirrer speed was 600 r.p.m., which maintained the dissolved oxygen at up to 70% of air saturation. The dilution rate was set at 0.5 h⁻¹, the feeding rate was maintained at 0.83 l h⁻¹ and the working volume was 1.55 l, which made the growth rate similar to exponential growth in a shaking flask and resulted in a generation time of 2 h. Steady-state samples (OD_{600 nm} ~1.7) were harvested by centrifugation (6,200g at 4 °C for 5 min) and cell pellets were frozen in liquid nitrogen. Total RNA was isolated using Trizol (Invitrogen) after grinding and was then treated with DNase I (RNase-free, NEB) to remove residual DNA⁵². The quality of the RNA was checked using an Agilent 2100 bioanalyser.

For the 3′-tag DGE profiling, chemostat cultures of R. toruloides that were grown in minimal media were performed as aforementioned. The minimal medium (MM) was composed of 25 g l⁻¹ glucose, 0.5 g l⁻¹ MgSO₄·7H₂O, 3 g l⁻¹ KH₂PO₄ and 5 g l⁻¹ (NH₄)₂SO₄, whereas the nitrogen-limited medium (MM-N) contained the same components of the MM medium, but contained only 0.2 g l⁻¹ (NH₄)₂SO₄ as a nitrogen source and 6.3 g l⁻¹ K₂SO₄ to supplement the equivalent sulphur element. The culture conditions were set at 30 °C and pH 5.6, which was adjusted with 2 M KOH and 2% H₂SO₄. The dissolved oxygen was kept above 85% of air saturation, the stirrer speed was 500 r.p.m. and the airflow was 100 l h⁻¹. The dilution rate was 0.085 h⁻¹, which was much lower than that of the YPD culture. Sample harvesting and total RNA isolation were the same as that performed for RNA-seq, and the residual glucose and ammonium in the culture supernatants, dry cell weight and lipid content were measured as described previously¹⁰.

Genome sequencing and assembly

Four R. toruloides genomic DNA libraries with insert sizes of 200 bp, and 2 and 6 kb (Supplementary Table S1) were constructed and sequenced in four lanes using the Illumina GAII or GAIIx sequencing platforms at BGI-Shenzhen. Approximately 5 Gb of data were obtained and assembled using the SOAPdenovo software⁵³ to generate 94 scaffolds that consisted of a total length of 20.2 Mb, close to the genome size of Rhodotorula graminis or Sporobolomycetes roseus (Supplementary Fig. S14). The GenomeWalker Universal kit (Clontech) was used for genome walking to close the gaps between the scaffolds, and 34 scaffolds were obtained. Reads were aligned to the genome assembly using the Bowtie software²² to estimate the sequencing depth. The CEGMA pipeline²¹ was used to identify core eukaryotic genes.

Massively parallel cDNA sequencing

Initially, 10 μg of RNA was used to prepare the sequencing library using the Illumina mRNA-Seq 8-Sample Prep Kit (Illumina). The paired-end cDNA library with an insert length of 200±20 bp was sequenced using the Illumina HiSeq2000, and a total of 6.7 M paired-end reads of 90/90 bp in length were obtained (Supplementary Table S1). RNA-seq reads were de novo assembled using SOAPdenovo (v1.04), and 78,979 contigs and 14,830 unigenes (≥200 bp), which were scaffolded and closed with paired-end reads, were obtained and re-aligned to the genomic sequence by BLAT⁵⁴. A majority (98.6%) of the 17,814 assembled contigs (≥200 bp) could be mapped to the genome with more than 95% identity and 90% coverage. The raw reads were also mapped to the genome sequence using TopHat⁵⁵ and assembled ab initio using Cufflinks (v1.0.1)⁵⁶.

Genome annotation

Initial gene prediction was done as described (Supplementary Methods). The first round of predictions and alignments were combined using GLEAN⁵⁷ to generate a consensus gene set with 6,469 gene models. An extra training using the GlimmerHMM and SNAP software was carried out to achieve sensitivity of ~90% and accuracy of ~60%, and were assessed with 269 independently annotated genes that were derived from the RNA-seq assembly. The gene predictions were also implemented using the Augustus software with the assistance of RNA-seq (http://augustus.gobics.de/binaries/readme.rnaseq.html), and all data were visualized and manually curated using the Argo software (http://www.broadinstitute.org/annotation/argo/). The final set of genes was selected mainly on the basis of expression and homology (Supplementary Fig. S4). For those genes that were merely predicted ab initio, consensus models from at least two predictions were reserved. The 3′- and/or 5′-untranslated regions were annotated if expressed, and finally 8,171 gene models were obtained.

For gene expression analysis, the Cufflinks pipeline was used to estimate gene expression abundance and identify transcript isoforms when using option '-g'. Furthermore, the expression levels of genes and transcripts were normalized to FPKM (Fragments Per Kilobase of exon model per Million mapped fragments). Genes with alternatively spliced transcripts (1,371 were found) were also identified in silico. For gene function annotation, every predicted gene was annotated using the BLAST search engine against the non-redundant protein database at NCBI, the UniProtKB/Swiss-Prot database, the UniProtKB/Trembl database (http://www.uniprot.org/) and KEGG genes database (http://www.genome.jp/kegg/genes.html). The domain and family of each protein was annotated via searching the InterPro database at EBI using InterProScan (http://www.ebi.ac.uk/Tools/pfa/iprscan/), and Gene Ontology annotations were assigned using Blast2GO (http://www.blast2go.com/b2ghome). Gene function was classified by the eukaryotic orthologous group (KOG, http://www.ncbi.nlm.nih.gov/COG/), and the automatic assignment of metabolic pathway was implemented by KAAS (http://www.genome.jp/tools/kaas/).

3′-Tag DGE profiling

The NlaIII libraries were prepared using the Illumina Tag Profiling Sample Prep Kits (Illumina) and were sequenced on the Illumina GAIIx platform with read lengths of 35 bp. More than three million reads were obtained (Supplementary Table S1) and analysed in silico using the in-house Perl pipeline that was developed by BGI-Shenzhen. In brief, a database with all sense and antisense tags (CATG+17 nucleotides) of the annotated transcripts was created. The raw reads were processed to filter out low-quality tags and adaptor sequences, and subsequently, clean tags that contained CATG and an additional 17 bp were counted and mapped to the tag database of transcriptomic or genomic sequences without more than 1 bp mismatch. For transcript quantification, tags that occurred only once or had multiple hits to transcripts were discarded, and tags that mapped to the same transcript regardless of their positions were counted and normalized to tags per million (TPM). To identify differentially expressed genes, the rigorous algorithm Audic and Claverie's⁵⁸ test, which was described previously, was used to test for significance. In addition, the false discovery rate (FDR) adjustment was used to control the error rate, a threshold of 0.001 for FDR and a fold change of 2 was set as the criteria for differentially expressed genes. We found 1,177 and 853 genes that were upregulated and downregulated, respectively.

Proteomic analysis

Proteomic data from a previous study were analysed using the newly annotated protein data set³⁵. MS/MS spectra data were searched against the protein database of R. toruloides using SEQUEST as described³⁵. The results were filtered using the SFOER software with optimized criteria⁵⁹, and the corresponding FDR was below 1%. In total, 3,108 proteins were identified using at least one peptide, and 2,057 proteins that occurred more than twice in the triplicate analyses were counted as expressed proteins. For semi-quantitative comparison, the spectral counts of proteins that were identified in all three replications were used and normalized, and combined statistical tests were applied to assess the significance⁶⁰. Proteins with a fold change >2, a P-value <0.05 for the t-test and a G-value >1.35 for the G-test (corresponding to 3.9 for raw spectral counts) of each of the two samples were assigned as differentially expressed proteins, and this analysis led to 538 proteins that showed significant level changes.

Additional information

Accession codes: This Whole Genome Shotgun Sequencing project has been deposited into DDBJ/EMBL/GenBank under the accession number ALAU00000000. The version that is described in this paper is the first version, ALAU01000000. Raw reads of the WGS sequencing and RNA-seq analysis have been deposited into the NCBI Sequence Read Archive (SRX156633–SRX156637). DGE profiling data have been deposited into NCBI's Gene Expression Omnibus and was given the accession number GSE39023. The proteomic data that is associated with this manuscript may be downloaded from ProteomeCommons.org Tranche using the following hash: HzAX+3G+zoH9LngqJ+29NyszSltcjGUyxKWjNhCIRKOab/G0LG9Kw15quSpAteJOio7+ V2tR9O3vt9pyHyu1O9/I54gAAAAAAAA43A==. All data are also available at http://www.bioconversion.dicp.ac.cn/EWEB/ZZW-data.html.

How to cite this article: Zhu, Z et al. A multi-omic map of the lipid-producing yeast Rhodosporidium toruloides. Nat. Commun. 3:1112 doi: 10.1038/ncomms2112 (2012).

Accession codes

Accessions

EMBL/GenBank/DDBJ

ALAU00000000

Gene Expression Omnibus

GSE39023

Change history

29 July 2013
The original version of this Article contained a typographical error in the spelling of the author Hanfa Zou, which was incorrectly given as Hanfan Zou. This has now been corrected in both the PDF and HTML versions of the Article.

References

Biermann, U., Bornscheuer, U., Meier, M. A. R., Metzger, J. O. & Schäfer, H. J. Oils and fats as renewable raw materials in chemistry. Angew. Chem. In. Ed. 50, 3854–3871 (2011).
Article Google Scholar
Li, H., Cann, A. F. & Liao, J. C. Biofuels: biomolecular engineering fundamentals and advances. Annu. Rev. Chem. Biomol. Eng. 1, 19–36 (2010).
Article Google Scholar
Madsen, A. T., Ahmed, E. H., Christensen, C. H., Fehrmann, R. & Riisager, A. Hydrodeoxygenation of waste fat for diesel production: Study on model feed with Pt/alumina catalyst. Fuel 90, 3433–3438 (2011).
Article Google Scholar
Fortman, J. L. et al. Biofuel alternatives to ethanol: pumping the microbial well. Trends Biotechnol. 26, 375–381 (2008).
Article Google Scholar
Steen, E. J. et al. Microbial production of fatty-acid-derived fuels and chemicals from plant biomass. Nature 463, 559–562 (2010).
Article ADS Google Scholar
Ratledge, C. & Wynn, J. P. The biochemistry and molecular biology of lipid accumulation in oleaginous microorganisms. Adv. Appl. Microbiol. 51, 1–51 (2002).
Article Google Scholar
Kosa, M. & Ragauskas, A. J. Lipids from heterotrophic microbes: advances in metabolism research. Trends Biotechnol. 29, 53–61 (2011).
Article Google Scholar
Ageitos, J. M., Vallejo, J. A., Veiga-Crespo, P. & Villa, T. G. Oily yeasts as oleaginous cell factories. Appl. Microbiol. Biotechnol. 90, 1219–1227 (2011).
Article Google Scholar
Beopoulos, A., Nicaud, J. M. & Gaillardin, C. An overview of lipid metabolism in yeasts and its impact on biotechnological processes. Appl. Microbiol. Biotechnol. 90, 1193–1206 (2011).
Article Google Scholar
Li, Y., Zhao, Z. & Bai, F. High-density cultivation of oleaginous yeast Rhodosporidium toruloides Y4 in fed-batch culture. Enzyme Microb. Technol. 41, 312–317 (2007).
Article Google Scholar
Buzzini, P. et al. Carotenoid profiles of yeasts belonging to the genera Rhodotorula, Rhodosporidium, Sporobolomyces, and Sporidiobolus. Can. J. Microbiol. 53, 1024–1031 (2007).
Article Google Scholar
Politino, M., Tonzi, S. M., Burnett, W. V., Romancik, G. & Usher, J. J. Purification and characterization of a cephalosporin esterase from Rhodosporidium toruloides. Appl. Environ. Microbiol. 63, 4807–4811 (1997).
Google Scholar
Botes, A. L. Affinity purification and characterization of a yeast epoxide hydrolase. Biotechnol. Lett. 21, 511–517 (1999).
Article Google Scholar
Hu, C., Zhao, X., Zhao, J., Wu, S. & Zhao, Z. K. Effects of biomass hydrolysis by-products on oleaginous yeast Rhodosporidium toruloides. Bioresource Technol. 100, 4843–4847 (2009).
Article Google Scholar
Papanikolaou, S. & Aggelis, G. Lipids of oleaginous yeasts. Part II: technology and potential applications. Eur. J. Lipid Sci. Tech. 113, 1052–1073 (2011).
Article Google Scholar
Nagalakshmi, U. et al. The transcriptional landscape of the yeast genome defined by RNA sequencing. Science 320, 1344–1349 (2008).
Article ADS Google Scholar
Morrissy, A. S. et al. Next-generation tag sequencing for cancer gene expression profiling. Genome Res. 19, 1825–1835 (2009).
Article Google Scholar
Liu, H. et al. The proteome analysis of oleaginous yeast Lipomyces starkeyi. FEMS Yeast Res. 11, 42–51 (2011).
Article Google Scholar
Morin, N. et al. Transcriptomic analyses during the transition from biomass production to lipid accumulation in the oleaginous yeast Yarrowia lipolytica. PLoS One 6, e27966 (2011).
Article ADS Google Scholar
Radakovits, R. et al. Draft genome sequence and genetic transformation of the oleaginous alga Nannochloropis gaditana. Nat. Commun. 3, 686 (2012).
Article Google Scholar
Parra, G., Bradnam, K., Ning, Z., Keane, T. & Korf, I. Assessing the gene space in draft genomes. Nucleic Acids Res. 37, 289–297 (2008).
Article Google Scholar
Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009).
Article Google Scholar
Boulton, C. A. & Ratledge, C. Correlation of lipid accumulation in yeasts with possession of ATP: citrate lyase. J. Gen. Microbiol. 127, 169–176 (1981).
Google Scholar
Maggio-Hall, L. A. & Keller, N. P. Aspergillus nidulans. Mol. Microbiol. 54, 1173–1185 (2004).
Article Google Scholar
Vorapreeda, T., Thammarongtham, C., Cheevadhanarak, S. & Laoteng, K. Alternative routes of acetyl-CoA synthesis identified by comparative genomic analysis: Involvement in lipid production of oleaginous yeast and fungi. Microbiology 158, 217–228 (2012).
Article Google Scholar
Jenni, S. et al. Structure of fungal fatty acid synthase and implications for iterative substrate shuttling. Science 316, 254–261 (2007).
Article ADS Google Scholar
Johansson, P. et al. Inhibition of the fungal fatty acid synthase type I multienzyme complex. Proc. Natl Acad. Sci. USA 105, 12803–12808 (2008).
Article ADS Google Scholar
Crespo, J. L. The TOR-controlled transcription activators GLN3, RTG1, and RTG3 are regulated in response to intracellular levels of glutamine. Proc. Natl Acad. Sci. USA 99, 6784–6789 (2002).
Article ADS Google Scholar
Wullschleger, S., Loewith, R. & Hall, M. N. TOR signaling in growth and metabolism. Cell 124, 471–484 (2006).
Article Google Scholar
Hardwick, J. S., Kuruvilla, F. G., Tong, J. K., Shamji, A. F. & Schreiber, S. L. Rapamycin-modulated transcription defines the subset of nutrient-sensitive signaling pathways directly controlled by the Tor proteins. Proc. Natl Acad. Sci. USA 96, 14866–14870 (1999).
Article ADS Google Scholar
Biggins, S., Neklesa, T. K. & Davis, R. W. A genome-wide screen for regulators of TORC1 in response to amino acid starvation reveals a conserved Npr2/3 complex. PLoS Genet. 5, e1000515 (2009).
Article Google Scholar
Wynn, J. P., Hamid, A. B. A. & Ratledge, C. The role of malic enzyme in the regulation of lipid accumulation in filamentous fungi. Microbiology 145, 1911–1917 (1999).
Article Google Scholar
Zhang, Y., Adams, I. P. & Ratledge, C. Malic enzyme: the controlling activity for lipid production? Overexpression of malic enzyme in Mucor circinelloides leads to a 2.5-fold increase in lipid accumulation. Microbiology 153, 2013–2025 (2007).
Article Google Scholar
Singh, R. et al. Autophagy regulates lipid metabolism. Nature 458, 1131–1135 (2009).
Article ADS Google Scholar
Liu, H. et al. Comparative proteomic analysis of Rhodosporidium toruloides during lipid accumulation. Yeast 26, 553–566 (2009).
Article Google Scholar
Wang, C. & St. Leger, R. J. The Metarhizium anisopliae perilipin homolog MPL1 regulates lipid metabolism, appressorial turgor pressure, and virulence. J. Biol. Chem. 282, 21110–21115 (2007).
Article Google Scholar
Bickel, P. E., Tansey, J. T. & Welte, M. A. PAT proteins, an ancient family of lipid droplet proteins that regulate cellular lipid stores. Biochim. Biophys. Acta 1791, 419–440 (2009).
Article Google Scholar
Evans, C. T. & Ratledge, C. Influence of nitrogen metabolism on lipid accumulation by Rhodosporidium toruloides CBS 14. J. Gen. Microbiol. 130, 1705–1710 (1984).
Google Scholar
Tully, M. Enrichment of mutants of Rhodosporidium toruloides by the use of inositol starvation. J. Basic Microbiol. 25, 683–686 (1985).
Article Google Scholar
Tully, M. & Gilbert, H. J. Transformation of Rhodosporidium toruloides. Gene 36, 235–240 (1985).
Article Google Scholar
Hynes, M. J. & Murray, S. L. ATP-citrate lyase is required for production of cytosolic acetyl coenzyme A and development in Aspergillus nidulans. Eukaryot. Cell 9, 1039–1048 (2010).
Article Google Scholar
Jiang, H. et al. The role of tandem acyl carrier protein domains in polyunsaturated fatty acid biosynthesis. J. Am. Chem. Soc. 130, 6336–6337 (2008).
Article Google Scholar
Athenstaedt, K. et al. Lipid particle composition of the yeast Yarrowia lipolytica depends on the carbon source. Proteomics 6, 1450–1459 (2006).
Article Google Scholar
Sancak, Y. et al. The Rag GTPases bind raptor and mediate amino acid signaling to mTORC1. Science 320, 1496–1501 (2008).
Article ADS Google Scholar
Beck, T. & Hall, M. N. The TOR signalling pathway controls nuclear localization of nutrient-regulated transcription factors. Nature 402, 689–692 (1999).
Article ADS Google Scholar
Kmetzsch, L. et al. The GATA-type transcriptional activator Gat1 regulates nitrogen uptake and metabolism in the human pathogen Cryptococcus neoformans. Fungal Genet. Biol. 48, 192–199 (2011).
Article Google Scholar
Rodriguez-Navarro, J. A. & Cuervo, A. M. Autophagy and lipids: tightening the knot. Semin. Immunopathol. 32, 343–353 (2010).
Article Google Scholar
Knothe, G. 'Designer' biodiesel: optimizing fatty ester composition to improve fuel properties. Energy Fuels 22, 1358–1364 (2008).
Article Google Scholar
Schirmer, A., Rude, M. A., Li, X., Popova, E. & del Cardayre, S. B. Microbial biosynthesis of alkanes. Science 329, 559–562 (2010).
Article ADS Google Scholar
Burke, D., Dawson, D. & Stearns, T. Methods in Yeast Genetics: A Cold Spring Harbor Laboratory Course Manual. (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 2000).
Yang, F., Zhang, S., Jin, G., Lin, X. & Zhao, Z. K. Purification and characterization of a â-1,3-glucomannanase expressed in Pichia pastoris. Enzyme Microb. Technol. 49, 223–228 (2011).
Article Google Scholar
Yang, F., Tan, H., Zhou, Y., Lin, X. & Zhang, S. High-quality RNA preparation from Rhodosporidium toruloides and cDNA library construction therewith. Mol. Biotechnol. 47, 144–151 (2010).
Article Google Scholar
Li, R. et al. De novo assembly of human genomes with massively parallel short read sequencing. Genome Res. 20, 265–272 (2009).
Article Google Scholar
Kent, W. J. BLAT—the BLAST-like alignment tool. Genome Res. 12, 656–664 (2002).
Article Google Scholar
Trapnell, C., Pachter, L. & Salzberg, S. L. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25, 1105–1111 (2009).
Article Google Scholar
Trapnell, C. et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat. Biotechnol. 28, 511–515 (2010).
Article Google Scholar
Elsik, C. G. et al. Creating a honey bee consensus gene set. Genome Biol. 8, R13 (2007).
Article Google Scholar
Audic, S. & Claverie, J.- M. The significance of digital gene expression profiles. Genome Res. 7, 986– 995 (1997).
Article Google Scholar
Jiang, X., Jiang, X., Han, G., Ye, M. & Zou, H. Optimization of filtering criterion for SEQUEST database searching to improve proteome coverage in shotgun proteomics. BMC Bioinformatics 8, 323 (2007).
Article Google Scholar
Heinecke, N. L., Pratt, B. S., Vaisar, T. & Becker, L. PepC: proteomics software for identifying differentially expressed proteins based on spectral counting. Bioinformatics 26, 1574–1575 (2010).
Article Google Scholar

Download references

Acknowledgements

We acknowledge the sequencing, assembly and associated in silico analyses from the Beijing Genomics Institute at Shenzhen, China. We also thank Dr Yonghong Li for the preparation of the transmission electron microscope images. This work was funded by the National Basic Research and Development Program of China (2011CB707405), the Natural Sciences Foundation of China (31000052, 31170060), the Knowledge Innovation Program of Chinese Academy of Sciences (KSCX2-EW-G-1-3) and the Directors' Program of Dalian Institute of Chemical Physics, CAS.

Author information

Authors and Affiliations

Division of Biotechnology, Dalian Institute of Chemical Physics, CAS, 457 Zhongshan Road, Dalian, 116023, China
Zhiwei Zhu, Sufang Zhang, Hongwei Shen, Xinping Lin, Fan Yang, Yongjin J. Zhou, Guojie Jin & Zongbao K. Zhao
Graduate University of Chinese Academy of Sciences, 19A Yuquanlu, Beijing, 100049, China
Zhiwei Zhu, Xinping Lin, Yongjin J. Zhou & Guojie Jin
Division of Biomass Conversion and Bio-Energy, Dalian National Laboratory for Clean Energy, 457 Zhongshan Road, Dalian, 116023, China
Sufang Zhang, Hongwei Shen & Zongbao K. Zhao
Key Laboratory of Separation Sciences for Analytical Chemistry, Dalian Institute of Chemical Physics, CAS, 457 Zhongshan Road, Dalian, 116023, China
Hongwei Liu, Mingliang Ye & Hanfa Zou
School of Biological Engineering, Dalian Polytechnic University, 1 Qinggongyuan, Dalian 116034, China.,
Fan Yang

Authors

Zhiwei Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Sufang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Hongwei Liu
View author publications
You can also search for this author in PubMed Google Scholar
Hongwei Shen
View author publications
You can also search for this author in PubMed Google Scholar
Xinping Lin
View author publications
You can also search for this author in PubMed Google Scholar
Fan Yang
View author publications
You can also search for this author in PubMed Google Scholar
Yongjin J. Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Guojie Jin
View author publications
You can also search for this author in PubMed Google Scholar
Mingliang Ye
View author publications
You can also search for this author in PubMed Google Scholar
Hanfa Zou
View author publications
You can also search for this author in PubMed Google Scholar
Zongbao K. Zhao
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Z.K.Z. conceived the project. Z.K.Z. and Z.Z. designed the study. Z.Z. performed most of the experiments and analyses unless specified otherwise. S.Z. isolated the haploid strains and participated in sample preparation, genome annotation and data analysis; H.L., M.Y. and H.Z. participated in the proteomic data analysis; H.S. prepared the chemostat cultures; X.L. and F.Y. participated in the genome annotation; Y.Z. participated in data analysis; G.J. prepared the scanned electrical microscope images; and Z.Z. and Z.K.Z. wrote the manuscript.

Corresponding authors

Correspondence to Hanfa Zou or Zongbao K. Zhao.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures S1-S14, Supplementary Tables S1-S11, Supplementary Methods and Supplementary References (PDF 2175 kb)

Supplementary Data 1

Differentially expressed proteins between "seed", "24 h" and "96 h" (XLS 263 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/

Reprints and permissions

About this article

Cite this article

Zhu, Z., Zhang, S., Liu, H. et al. A multi-omic map of the lipid-producing yeast Rhodosporidium toruloides. Nat Commun 3, 1112 (2012). https://doi.org/10.1038/ncomms2112

Download citation

Received: 21 February 2012
Accepted: 04 September 2012
Published: 09 October 2012
DOI: https://doi.org/10.1038/ncomms2112

This article is cited by

Evaluating oleaginous yeasts for enhanced microbial lipid production using sweetwater as a sustainable feedstock
- Valériane Malika Keita
- Yi Qing Lee
- Dong-Yup Lee
Microbial Cell Factories (2024)
Research progress on carotenoid production by Rhodosporidium toruloides
- Zhuo-Ting Xie
- Bing-Qian Mi
- Zhi-Wei Ye
Applied Microbiology and Biotechnology (2024)
Enhanced glycerol assimilation and lipid production in Rhodotorula toruloides CBS14 upon addition of hemicellulose primarily correlates with early transcription of energy-metabolism-related genes
- Giselle C. Martín-Hernández
- Mikołaj Chmielarz
- Volkmar Passoth
Biotechnology for Biofuels and Bioproducts (2023)
Integrative omics analyses of the ligninolytic Rhodosporidium fluviale LM-2 disclose catabolic pathways for biobased chemical production
- Nathália Vilela
- Geizecler Tomazetto
- Fabio Marcio Squina
Biotechnology for Biofuels and Bioproducts (2023)
Bioconversion of non-food corn biomass to polyol esters of fatty acid and single-cell oils
- Guang-Lei Liu
- Xian-Ying Bu
- Ya-Jun Liu
Biotechnology for Biofuels and Bioproducts (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.