Introduction

The recent advances in chemical biology and functional genomics have given rise to new methods and opportunities for drug discovery, and in turn new methods for the generation of small-molecule libraries1,2,3,4,5. Combinatorial chemistry, the method of choice for assembling chemically synthesized libraries, has also undergone rapid development over the last few years6,7,8. Yet, despite providing almost unlimited functional group diversity, chemically synthesized libraries of small molecules still lack a suitable, straightforward method for their decoding. Split and pool protocols allow for the chemical synthesis of libraries with up to 105 members9,10; however, the paths to identifying the active members of such libraries are often laborious and complicated11. In contrast, biologically synthesized libraries of small molecules are often several-fold larger in size and allow for very straightforward identification of the active members12,13,14. When combined with an in vivo screen, biological libraries become part of a powerful, rapid and facile method for the screening of a large number of compounds against a chosen target4,14,15,16.

Biological libraries are principally polypeptides, typically embedded within, or fused to larger molecules to overcome the host cell's catabolic machinery. Alternatively, intracellular stability is achieved by constraining the ends of the peptide with non-covalent and covalent interactions, with varying results17,18. We have pursued intracellular backbone cyclization to generate bio-stable peptide libraries. This procedure, termed SICLOPPS19,20, utilizes the Synechocystis sp PCC6803 DnaE split intein21. By rearranging the order of the elements of the intein, an active cis-intein (IC:target peptide:IN) is yielded that upon splicing results in cyclization of the target protein/peptide sequence (Fig. 1).

Figure 1: SICLOPPS mechanism.
figure 1

The expressed fusion protein folds to form an active intein. An N-to-S acyl shift at the target N-terminal intein junction produces a thioester, which undergoes transesterification with a side-chain nucleophile (serine or cysteine, X = O or S) at the C-terminal intein junction to form a lariat intermediate. An asparagine side chain liberates the cyclic product as a lactone, and an X-to-N acyl shift generates the thermodynamically favored lactam product in vivo.

Any target can be incorporated into the SICLOPPS vector without limitations on sequence identity by engineering restriction sites into the C-terminal (IC) and N-terminal inteins (IN). Cyclic peptide characterization and protein purification is aided by a chitin-binding domain (CBD) fused to the C terminus of IN (Fig. 2).

Figure 2: Construction of SICLOPPS vectors.
figure 2

The cloning instructions have been kept general so that any expression vector may be used in the construction of the SICLOPPS plasmid as long as it contains four contiguous restriction sites [A]–[D]. i, the IC domain is cloned between restriction sites [A] and [B]; ii, the IN domain is cloned into the IC vector between restriction sites [C] and [D]; iii, introduction of an AflII restriction site at the 5′ end of the IN gene allows peptide sequences targeted for cyclization to be cloned between the BglI/MfeI and AflII sites. Introduction of the KpnI at the 3′ end of IN allows the introduction of affinity tags. iv, an affinity tag (CBD) is cloned into the SICLOPPS vector between the KpnI and [D] restriction sites.

To utilize SICLOPPS in the biosynthesis of cyclic peptide libraries19, degenerate oligonucleotides encoding those peptides need to be introduced between the IC and IN genes, while making sure that the correct reading frame is maintained throughout. A PCR-based technique was developed in which the random oligonucleotides of the library are incorporated into the forward primer (library primer; see Fig. 3) between the region that binds the 3′ end of IC and the 5′ end of IN. The reverse primer anneals to the chitin-CBD, amplifying the IN-CBD fusion gene (CBDr primer; see Fig. 3). The variable segment was encoded in the form NNS where N represents any of the four DNA bases (A, C, G or T) and S represents C or G. The NNS sequence generates 32 codons and encodes all 20 amino acids while eliminating the ochre (UAA) and opal (UGA) stop codons from the library. The intein chemistry requires the first amino acid to be a nucleophilic cysteine or serine. There are no limits on the number of amino acids in the target peptide, allowing cyclic peptides of various sizes. Typically, five variable amino acids are used because the theoretical number of library members at DNA levels (34 million) is within the number of transformants that can be readily achieved.

Figure 3: Construction of the library vectors.
figure 3

i, a library primer encoding the 5′ end of IC, a degenerate sequence and the 3′ end of IN is used to amplify an IN-CBD fragment; ii, the resulting CR product was found to contain mismatched pairs, and so is subjected to a second round of amplification with a zipper primer. The amplified fragments and the SICLOPPS vector are digested with the corresponding restriction enzymes. iii, ligation of the vector with the library insert results in the SICLOPPS library plasmids.

After the first PCR, half of the amplified DNA fragments were found to contain mismatches in the random nucleotide region, probably owing to the sequence complexity of the library. A second PCR using a “zipper” primer corresponding to the 3′ end of IC was therefore used to ensure the annealing of all DNA sequences to their complimentary strand. These pieces of DNA were incorporated into the SICLOPPS plasmid using standard techniques, thus generating the desired library (Fig. 3).

We have demonstrated the flexibility of the procedure toward peptide length and composition, making it a versatile method for the intracellular generation of a library of stable cyclic peptides. SICLOPPS has been successfully combined with a reverse two-hybrid system in our laboratories to yield a method for the genetic selection of inhibitors of protein–protein interactions15,16.

Materials

Reagents

  • ElectroMAX DH5α-E (Invitrogen, # 11319-019)

  • Synechocystis sp PCC6803 genomic DNA (ATCC, # 27184)

  • Any expression vector containing four contiguous restriction sites corresponding to [A], [B], [C] and [D] in the PCR primers (in our laboratories we use the pARCBD-p plasmid vector)20

  • pTYB1 plasmid vector (New England Biolabs, # N6701S)

  • PCR primers can be synthesized “in house” if a DNA synthesizer is available, otherwise they are ordered (IDT in the USA, Sigma in the UK). The oligonucleotides below are listed 5′ to 3′:

    • ICf primer: [restriction site A]—ATG GTT AAA GTT ATC GGT CGT CGT TCC C

    • ICr primer: [restriction site B]—ATT GTG CGC AAT CGC CCC AT

    • INf primer: [restriction site C]—GCC TCA GTT TTG GC

    • INr primer: [restriction site D]—TTA TTT AAT AGT CCC AGC GTC

    • IN AflIIf primer: [restriction site C]—TGC TTA AGT TTT GGC ACC

    • IN AflIIr primer: GGT GCC AAA ACT TAA GCA—[restriction site C]

    • IN KpnIf primer: TAC TTG ACG CTG GTA CCA TTA AAT AA—[restriction site D]

    • IN KpnIr primer: [restriction site D]—TTA TTT AAT GGT ACC AGC GTC AAG TA

    • CDBf primer: GGG GTA CCA TTA AAA CGA CAA ATC CTG GTG TA

    • CBDr primer: [restriction site D]—TCA TTG AAG CTG CCA CAA GG

    • C+5 library primer: G GAA TTC GCC AAT GGG GCG ATC GCC CAC AAT TGC NNS NNS NNS NNS NNS TGC TTA AGT TTT GGC

    • Zipper primer: GGA ATT CGC CAA TGG GGC GAT CGC C

  • Unless otherwise stated, all regents are bought from Fisher Scientific

  • Lauria-Bertani (LB) broth: 10 g bacto-tryptone, 5 g bacto-yeast extract, 10 g NaCl and 1 liter water. Autoclaved22

    Critical

    It is important to autoclave the denoted reagents to prevent contamination.

  • Deionized water. Autoclaved

  • 10% (v/v) glycerol solution: 10 ml glycerol and 90 ml deionized water. Autoclaved

  • 10 × thermophilic DNA polymerase buffer and Taq DNA polymerase (Promega, # M3171)

  • dNTP mix (dATP, dCTP, dGTP and dTTP, 10 mM) (Promega, # U1511)

  • QiaQuick PCR purification, Plasmid midi kit and QiaQuick gel extraction kits (QIAGEN, # 28104, # 27104 and # 28704)

  • Agarose, molecular biology grade (CAS, # 9012-36-6)

  • Ethidium bromide (CAS, # 1239-45-8)

    Caution

    Ethidium bromide is a known carcinogen and should be handled and disposed of with care.

  • Appropriate 10 × restriction enzyme buffer, 100 × BSA and restriction enzymes corresponding to restriction sites [A], [B], [C] and [D] in the PCR primers (New England Biolabs)

  • Shrimp alkaline phosphatase (Promega, # M8201)

  • T4 DNA ligase and 10 × ligation buffer (Promega, # M1801)

  • PhiX174 DNA/HaeII and Lambda DNA/HinDIII markers (Promega, # G1761 and # G1711)

  • Quick change site-directed mutagenesis kit (Stratagene)

  • 0.2% (w/v) arabinose solution (CAS, # 87-72-9)

  • Chitin affinity resin (New England Biolabs, # S6651S)

  • Chitin buffer: 20 mM Tris·HCl, pH 7.8, 1 mM TCEP and 0.5 mM NaCl

  • Sodium dodecyl sulfate (SDS) gel-loading buffer: 100 mM Tris·HCl, pH 6.8, 200 mM dithiothreitol, 4% (w/v) SDS, 0.2% (w/v) bromophenol blue and 20% (v/v) glycerol

  • SDS-PAGE gel: 15% resolving gel and 4% stacking gel (Bio-Rad Laboratories, # 161-1103EDU)

  • 1 × SDS-PAGE Tris-glycine electrophoresis buffer: 5 × solution: 15.1 g Tris base, 94 g glycine and 5 g SDS made up to 1 liter with deionized water

  • SDS-PAGE stain solution: 0.25 g Coomassie brilliant blue R250 (CAS, # 6104-58-1) in 45 ml methanol, 45 ml water and 10 ml glacial acetic acid

  • SDS-PAGE de-stain solution: 45 ml methanol, 45 ml water and 10 ml glacial acetic acid

Equipment

  • Thermal cycler programmed with the amplification protocol

  • Incubator/shaker at 37 °C

  • Electroporator

  • Protein and nucleic acid electrophoresis equipment

  • Gel imaging equipment

  • UV transilluminator

  • UV spectrophotometer

  • 0.2 cm gene pulser cuvettes (Bio-Rad Laboratories, # 165-2083)

  • Membrane filter paper, 0.025 μm pore size, 25 mm diameter (Millipore, # VSWP02500)

Reagent setup

  • 1 × Tris-acetate EDTA buffer 50 × solution: 242 g Tris base, 57.1 ml glacial acetic acid and 100 ml EDTA, pH 8.0, make up to 1 liter final volume with water.

  • SOC medium 20 g bacto-tryptone, 5 g bacto-yeast extract and 0.5 g NaCl. Add 10 ml of 25 mM KCl. Adjust to pH 7.0 with 5 N NaOH. Dilute to 1 liter with distilled water. Autoclave. Add 2 M MgCl2 and 20 ml of 1 M glucose before use.

  • LB agar plates 15 g bacto-agar added to LB broth. Autoclave. Allow the agar to cool to 60 °C and add the appropriate antibiotic. Dispense in 100 × 15 mm Petri dishes.

  • Library agar plates LB agar (as above) dispensed in 245 × 245 × 25 bio-assay dish (NUNC, # 240835).

Procedure

Cloning of IC and IN genes

Timing 2 days

  1. 1

    To clone the IC and IN gene inserts, set up two PCRs containing the following:

    Table 1
  2. 2

    Run a PCR with the following program:

    Table 2
  3. 3

    Run an agarose gel to verify the presence of amplified DNA of the correct size in the reaction mixture22.

    Caution

    Ethidium bromide is a carcinogen; therefore, the gel should be handled with care and disposed off accordingly.

  4. 4

    Purify the product of each PCR with the QiaQuick PCR kit (as instructed by the manufacturer).

    Pause point

    The PCR product should be stored overnight at −20 °C.

  5. 5

    Digest the PCR products with the corresponding restriction enzymes ([A] and [B] for IC; [C] and [D] for IN) according to the manufacturer's protocol (typically, mix 5 μl of 10 × digestion buffer with 5 μl of BSA, 25 μl of purified PCR product, 2 μl of restriction enzyme and 13 μl water. Incubate for 4 h at 37 °C). Recover and purify the digested PCR products with the QiaQuick PCR kit.

    Pause point

    The DNA should be stored overnight at −20 °C.

Ligation and electroporation

Timing 12 days

  1. 6

    Digest the vector with the chosen restriction enzymes [A] and [B] corresponding to the sites on the IC gene PCR products above. Again, refer to the manufacturer's instructions corresponding to the chosen restriction enzymes.

  2. 7

    Add 1 μl of shrimp alkaline phosphatase to the digested vector (to dephosphorylate the termini). Incubated the mixture at 37 °C for 1 h. Inactivate the enzyme by incubating at 65 °C for 30 min.

  3. 8

    Separate the digested vector backbone from the digested fragment on an agarose gel in 1 × Tris-acetate EDTA (TAE) buffer as outlined above. Identify the vector backbone (by size) under a UV transilluminator and cut out of the gel with a blade.

  4. 9

    Purify the excised band using a QiaQuick gel extraction kit as instructed by the manufacturer. Determine the DNA concentration of the vector and IC gene insert using a UV spectrophotometer.

    Pause point

    The DNA should be stored overnight at −20 °C.

  5. 10

    Electrocompetent DH5α-E cells can be prepared as described in the literature22,23 or bought (Invitrogen).

  6. 11

    Set up the following ligation mixtures, one for the actual ligation, the other, without insert, as a control:

    Table 3
  7. 12

    Incubate both reactions at room temperature (25 °C) for 4 h. Heat-inactivate the T4 ligase by incubating at 65 °C for 30 min.

  8. 13

    Add 3 μl of the ligation mixture to 100 μl of electrocompetent cells. Transfer the mixture to an electroporation cuvette. Apply electric pulse with an electroporator as instructed by the manufacturer. Immediately add 900 μl of SOC medium and transfer to a 10 ml culture tube. Repeat for the negative control. Allow the cells to recover in an incubator/shaker for 1 h at 37 °C.

  9. 14

    Plate 100 μl of the solution on LB agar plates containing the appropriate antibiotic.

    Pause point

    Incubate overnight at 37 °C.

  10. 15

    Pick eight random colonies and inoculate them into separate culture tubes containing 3 ml of LB broth and the appropriate antibiotic for the marker on the vector.

    Pause point

    Grow to stationary phase overnight.

    Troubleshooting

  11. 16

    Purify the plasmids with the QiaGen plasmid midi kit.

  12. 17

    Identify those E. coli colonies containing the plasmids and the IC gene insert by PCR. Use the ICf and ICr primers to amplify the plasmid solutions obtained in Step 16 following the same PCR protocol described in Step 1 (see above). Run an agarose gel to identify the plasmids containing the IC gene (look for amplified gene products on the gel).

  13. 18

    Confirm the sequence integrity of the IC gene in the vector by DNA sequencing.

    Pause point

    If an “in house” DNA sequencing facility is not available, this process may take several days.

    Critical Step

    Despite the delay it may cause in the completion of this protocol, we strongly advise against proceeding further without checking the integrity of the inserted sequence. All the following sequencing steps should be treated in a similar manner.

  14. 19

    Insert the IN gene into the IC-containing vector between digestion sites [C] and [D] following Steps 6–18 of the procedure outlined above for the insertion of the IC gene.

  15. 20

    After insertion into the vector, confirm the integrity of the IN gene by DNA sequencing.

  16. 21

    Use the manufacturer's instructions to introduce an AflII restriction site in the above vector with the Quick Change site-directed mutagenesis kit using the IN Aflf and IN Aflr primers.

  17. 22

    Use the Quick Change site mutagenesis kit with the IN Kpnf and IN Kpnr primers to introduce a KpnI restriction site in the above vector.

  18. 23

    Confirm the vector identity by DNA sequencing.

  19. 24

    Introduce a CBD (or any other chosen affinity tag) into the SICLOPPS plasmid using a similar protocol (Steps 6–18) as that outlined for the cloning and introduction of the IC gene. Insert the PCR product (CBDf, CDBr primers, pTYB1 template) into the SICLOPPS plasmid between the KpnI and [D] digestion sites.

  20. 25

    Confirm the sequence identity of the SICLOPPS-CBD vector by DNA sequencing.

SICLOPPS library construction

Timing 4 days

  1. 26

    The library primer used here codes for a cysteine + 5 random amino acid, hexapeptide library. You can design your own SICLOPPS cyclic peptide library by changing the sequence of the library primer. Remember to include a cysteine or serine at amino-acid position 1 to act as the nucleophile to initiate intein processing (see Fig. 1).

  2. 27

    Set up the following PCR mixture in a 200 μl PCR tube:

    Table 4
  3. 28

    Run a PCR with the following program:

    Table 5

    Critical Step

    Do not use lower annealing temperatures, as the resulting constructs have been found to contain frameshifts downstream of the library sequence.

  4. 29

    Purify the PCR products with the QiaQuick PCR purification kit. Determine the DNA concentration of the purified PCR product with a UV spectrophotometer.

  5. 30

    Set up and run a second PCR with a “zipper” primer and the above PCR product as template to ensure the annealing of all library members to their corresponding strands (see INTRODUCTION and Fig. 3), as follows:,

    Table 6
    Table 7
  6. 31

    Digest the fragment with BglI (or MfeI) and [D] following the protocol supplied by the manufacturer of the restriction enzymes.

  7. 32

    Digest the SICLOPPS-CBD vector with BglI (or MfeI) and [D]. Treat with shrimp alkaline phosphatase and purify the linear vector backbone on an agarose gel in 1 × TAE buffer as above.

    Critical Step

    We have found that small changes in the vector to insert ratio in the ligation mixture have large effects on transformation efficiency and therefore the size of the library. Therefore, optimal conditions need to be determined for every plasmid/insert pair.

  8. 33

    Combine 100, 150, 200, 250 and 300 ng of digested library insert with 50, 85 and 125 ng of digested and dephosphorylated vector to give a total of 15 ligation reactions. To each reaction, add 1 μl of 10 × ligation buffer and water to a final volume of 9 μl. Prepare control ligations for each vector concentration (do not add insert). Add 1 μl of T4 DNA ligase to each reaction and controls.

    Pause point

    Incubate at room temperature overnight.

  9. 34

    In the morning, inactivate the enzyme by incubating at 65 °C for 30 min.

  10. 35

    Dialyze each ligation mixture by carefully placing the reaction mixture as a drop (with a micropipette) in the center of a Millipore membrane filter in a beaker of 50 ml deionized water for an hour. Carefully retrieve the drop with a micropipette.

  11. 36

    Transform the DNA from each ligation mixture and control reaction into DH5α-E cells as described above (Steps 13 and 14). After recovery of the cells, plate 10−4 and 10−6 dilutions for every ligation condition and controls.

    Pause point

    Incubate overnight at 37 °C.

  12. 37

    Subtract the number of colonies on the control plates from those on the library plates to determine the best ligation condition (giving the most colonies). Repeat the ligation with the optimal ratio of vector to insert.

  13. 38

    Dialyze the DNA and transform the library into DH5α-E as in Step 35.

  14. 39

    After recovery at 37 °C for an hour, plate 10−6 dilution on an agar plate containing the antibiotic marker corresponding to the SICLOPPS plasmid. Plate the rest of the cells onto a library agar plate (the reference plate whose bacterial colonies will be used for further applications) containing the same antibiotic marker.

    Pause point

    Incubate both plates overnight at 37 °C.

    Troubleshooting

  15. 40

    Determine library size by counting the number of colonies on the 10−6 dilution plate (or the dilution plate containing the smallest number of colonies).

  16. 41

    Scrape the colonies off the library plate into 10 ml LB with a glass scraper. Pellet the cells in a clinical centrifuge (1300g, 10 min). Resuspend the cells in 2 ml LB. The SICLOPPS library is now in hand.

    Pause point

    To store the library for future use, place 0.5 ml aliquots in 1.7 ml Eppendorf tubes. Freeze in liquid nitrogen and store at −78 °C.

  17. 42

    (OPTIONAL) If you require the library for a biological screen or a genetic selection, purify the plasmids from 500 μl of the resuspended cells with a Qiagen plasmid midi kit.

  18. 43

    (OPTIONAL) Transform the above purified plasmids into the reporter strain. Initiate cyclic peptide production by adding the appropriate inducer (arabinose, IPTG) according to the promoter on your chosen vector. DNA sequencing of the corresponding SICLOPPS plasmid readily identifies active cyclic peptides.

Purification and analysis of individual library members

Timing 2 days

  1. 44

    Inoculate 100 ml of LB broth with cells containing the SICLOPPS plasmid of interest. Incubate at 37 °C with agitation until OD600 of 0.6 is reached. Induce SICLOPPS with the appropriate compound (IPTG, arabinose, etc.) for the promoter/repressor in your vector. Continue to incubate at 37 °C for 2 h.

  2. 45

    Pellet the cells by centrifugation (12,000g; 6,000 r.p.m. in a Sorvall evolution RC centrifuge, Sorvell SLC-6000 rotor) for 10 min. Resuspend the pellet in 4 ml of chitin buffer and lyse the cells by sonication. Pellet the insoluble debris by centrifugation (85,000g; 16,000 r.p.m. in a Sorvall evolution RC centrifuge, Sorvell SLC-6000 rotor) for 30 min at 4 °C.

  3. 46

    Add the soluble fraction to a column containing 200 μl of chitin beads and wash the column with 5 ml of chitin buffer.

    Pause point

    Leave the inteins process on the chitin column overnight at room temperature.

  4. 47

    Remove around five beads of resin from the column and suspend in a tube with 100 μl of SDS-PAGE loading buffer. Incubate in a boiling water bath for 10 min.

  5. 48

    Centrifuge the samples (110,000g; 18,000 r.p.m. in a Sorvall evolution RC centrifuge, Sorvell SLC-6000 rotor) for 10 min and load 2 μl onto the SDS-PAGE gel. Run with Tris-glycine buffer at 250 V until the tracking dye is 5 mm from the bottom of the gel. Remove the gel and cover with stain solution, carefully heat the gel and stain in a microwave oven for 2–3 min until the gel is stained.

    Caution

    Take care not to let the stain solution boil over when using the microwave. Also the stain solution will be extremely hot, so handle with care.

  6. 49

    Remove the stain solution and cover the gel with de-stain solution and agitate until protein bands are visible. As some SICLOPPS libraries may produce peptides that are too small to be visible on the Tris/glycine gel, the processed intein fragments (IN = 14 kDa, IC = 4 kDa) serve as suitable markers of cyclization (Fig. 4).

    Figure 4: SDS-PAGE data.
    figure 4

    Lane 1 shows an unprocessed intein, lanes 2 and 3 show processed inteins, randomly selected from a C + 5 peptide library.

  7. 50

    (OPTIONAL) To isolate cyclic peptides, repeat Steps 44 and 45 using 1 liter of LB broth, inoculating with 10 ml of overnight culture of the colony containing the SICLOPPS plasmid of interest. After the second centrifugation (having resuspended the cells in 40 ml of chitin buffer), extract the soluble fraction with 40 ml of n-BuOH. Separate the organic phase and remove the solvent in vacuo. The pure peptide obtained is further analyzed or purified by HPLC.

Troubleshooting

Problem: The ligation/transformation step is not working.

Solution: Optimize the insert/vector ratio for the ligation step as outlined in Steps 32 and 33. Also make sure the T4 DNA ligase buffer is at room temperature and that all its components are fully dissolved in solution.

Problem: Library size is much lower than expected.

Solution: As library size depends on transformation efficiency, ensure that steps are taken to maximize this process22, such as using maximum efficiency electrocompetent cells, and pre-warming the SOC medium to 37 °C and adding it rapidly after the electric pulse. Be sure to optimize the ligation step between the library insert and SICLOPPS vector as outlined in Steps 32 and 33.

Timing

This procedure is expected to take around 3–4 weeks to implement, depending on the time taken for DNA sequencing.

Anticipated results

In our hands, the above procedure yields cyclic peptide libraries of between 107 and 108 transformants. It should be noted that despite the relative promiscuity of the Synechocystis sp PCC6803 DnaE intein, not all peptide sequences are cyclized. The cyclization efficiency of a library is sequence dependent and can be probed by MALDI mass spectrometry or SDS-PAGE analysis of random individual members.

As well as peptides, SICLOPPS has also been used to cyclize the E. coli enzyme dihydrofolate reductase (DHFR)20. DHFR had been previously circularly permuted24,25,26 and cyclized via a disulfide bond27. The cyclic DHFR produced by SICLOPPS was shown to be resistant to proteolysis and had steady-state kinetic parameters, and substrate, cofactor and methotrexate dissociation constants that were indistinguishable from those of the wild-type enzyme at 25 °C. Activity assays conducted after preincubation of wild-type and cyclic DHFR at 65 °C showed an improvement in the thermostability of the cyclic enzyme.

Despite lower transformation efficiencies, application of this split-intein strategy in eukaryotes is of interest. To this end, SICLOPPS has been ported for use in mammalian cells28 and efforts are underway in our laboratories to port this system into Saccharomyces cerevisiae.