Synopsis

Subject Categories: Functional genomics

Molecular Systems Biology 2 Article number: 2006.0008  doi:10.1038/msb4100050
Published online: 21 February 2006
Citation: Molecular Systems Biology 2:2006.0008

Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection

There is a Report associated with this document.

Tomoya Baba1,2, Takeshi Ara1, Miki Hasegawa1,3, Yuki Takai1,3, Yoshiko Okumura1, Miki Baba1, Kirill A Datsenko4, Masaru Tomita1, Barry L Wanner4 & Hirotada Mori1,2

  1. Institute for Advanced Biosciences, Keio University, Tsuruoka City, Yamagata, Japan
  2. Graduate School of Biological Sciences, Nara Institute of Science and Technology, Ikoma, Nara, Japan
  3. CREST, JST (Japan Science and Technology), Kawaguchi, Saitama, Japan
  4. Department of Biological Sciences, Purdue University, West Lafayette, IN, USA

Correspondence to: Barry L Wanner4 Department of Biological Sciences, Purdue University, West Lafayette, IN 47907-2054, USA. Tel.: +1 765 494 8034; Fax: +1 765 494 0876; E-mail: Email: blwanner@purdue.edu

Correspondence to: Hirotada Mori1,2 Graduate School of Biological Sciences, Nara Institute of Science and Technology, 8916-5 Takayama, Ikoma, Nara 630-0101, Japan. Tel.: +81 743 72 5660; Fax: +81 743 72 5669; E-mail: Email: hmori@gtc.naist.jp

Received 28 September 2005; Accepted 7 December 2005; Published online 21 February 2006

Top

Article highlights

  • A complete set of single-gene deletions of all non-essential genes has been created in E. coli K-12 by replacing coding regions with a kanamycin resistance cassette.
  • Deletions were designed to create in-frame, non-polar mutations upon eviction of the resistance cassette by use of the FLP recombinase.
  • These mutants called the Keio collection provide a new resource for genome-wide functional genomics of the E. coli model cell and whole cell modeling.

Top

Synopsis

The long-term goal of biomedical research has always been the complete understanding of biological systems. In the last century, reductionist approaches proved immensely powerful in elucidating many biochemical, genetic, and molecular mechanisms. In this century, we are entering a more synthetic phase in which we will accomplish the goal of completely understanding biological systems in their incredible living complexity. This understanding will be expressed in a number of models, ranging from traditional biological understanding (where individuals construct models in their heads) to formal mathematical models. In any case, reaching a complete understanding requires an unprecedented standardization and completeness of data, greatly improved methods of accessing and linking information, and improved techniques and approaches for mathematical modeling.

E. coli K-12 is the best-characterized organism at the molecular level. In the accompanying report, we describe its highly accurate sequence (Hayashi et al, 2006), perhaps more accurate than of any genome of similar size, maybe even error free. Determination of a highly accurate sequence provided the impetus for re-annotation of its genome (Riley et al, 2006), which is of fundamental importance to studies not only of E. coli biology but also of other organisms because properties of more than half of its gene products have been experimentally determined.

More than a half-century of experimental investigation has led to the identification of nearly all the metabolic reactions and the small molecule metabolites involved therein. Many of the regulatory circuits have been identified and computational methods for the predication of many regulatory sites are available. It is thus a truism that '... all cell biologists have two cells of interest, the one they are studying and Escherichia coli' (Neidhardt, 1996). E. coli has the further advantage of being a simple unicellular organism without as extensive an elaboration of compartments and transport mechanisms as are present even in simple eukaryotes such as yeast (Figure 7, Holden, 2002). The completeness of our knowledge and the relative simplicity of E. coli provide compelling reasons for choosing it as the first cellular system to be targeted for complete understanding. This was clearly seen by Francis Crick when in 1973 (Crick, 1973) he proposed 'Project K: the complete solution of E. coli.' Of course, his suggestion was hopelessly premature, being before many key technologies, rapid computation, and the web (Crick, 2002).

Figure 7
Figure 7 :  Unfortunately we are unable to provide accessible alternative text for this. If you require assistance to access this image, or to obtain a text description, please contact npg@nature.com

Electron micrograph of E. coli K-12 by Melvin L Demphilis and Julius Adler. Republished with permission (Holden, 2002).

Full figure and legend (73K)Figures & Tables index

With a goal towards complete understanding of E. coli as a simple cellular system, we have begun the construction of uniformly designed and comprehensively prepared resources. Here, we describe a complete set of precisely defined, single-gene deletions of nonessential E. coli K-12 genes. These mutants were constructed by using a PCR gene replacement method similar to the one used to create a nearly complete set of yeast gene mutants (Giaever et al, 2002), except by using E. coli cells carrying a plasmid expressing the highly efficient lambda Red recombinase (Datsenko and Wanner, 2000) (Figure 8).

Figure 8
Figure 8 :  Unfortunately we are unable to provide accessible alternative text for this. If you require assistance to access this image, or to obtain a text description, please contact npg@nature.com

PCR gene replacement strategy. (A) Gene targeting fragment encoding kanamycin resistance with short homology extensions (H1 and H2) is generated by PCR by using priming sites P1 and P2 (Step 1). Gene targeting fragment is introduced into E. coli K-12 BW25113 expressing the Red recombinase from pKD46 (Step 2). Kanamycin-resistant transformants are selected (Step 3). Transformants are verified by PCR (Step 4). (B) Elimination of the resistance cassette by use of the FLP recombinase plasmid pCP20 is expected to leave behind a 102-nt 'scar' encoding a 34-residue peptide (Step 1). The scar region is amplified and sequenced to be sure no mutations, especially 1-nt deletions (Datsenko and Wanner, 2000), were introduced (Step 2).

Full figure and legend (233K)Figures & Tables index

Deletions were obtained for 3985 of 4288 targeted genes. Based on finding mutants with the predicted structures, the majority of these 3985 genes are probably nonessential. Because a small fraction (ca. 0.2%) of cells are predicted to contain genetic duplications (Anderson and Roth, 1977), a small number of these 3985 genes may in fact be essential. The majority of the 303 genes for which no mutants were obtained are candidates for essential genes, at least under our selection conditions (aerobic growth on a complex medium at 37°C).

In bacteria, genes are often arranged in operons that are transcribed as a unit and in which neighboring genes frequently overlap a few to several nucleotides. In such arrangements, mutation of a single gene can simultaneously affect function of neighboring or downstream genes. To circumvent these kinds of problems, mutants were designed taking into account gene organization to avoid affecting properties of more than one gene simultaneously. All mutants contain a kanamycin resistance cassette in place of the gene coding region. In most cases, the coding region from the 2nd through the 7th codon from the C-terminus has been deleted. The kanamycin resistance gene is oriented for expression of downstream genes (Figure 8A). Further, the mutants were constructed by use of a resistance cassette that can be easily eliminated (Datsenko and Wanner, 2000). The resultant kanamycin-sensitive derivatives are predicted to encode a small in-frame peptide in place of the mutated gene, in order to reduce effects on expression of downstream genes (Figure 8B).

Results of profiling the mutants for growth on synthetic and rich media are described in the manuscript. These mutants provide a new basic resource not only for systematic functional genomics studies but also experimental data source for systems biology approaches. By providing this resource openly to the research community, the authors hope to contribute to worldwide efforts directed towards a comprehensive understanding of the E. coli K-12 model cell. Accordingly, we are making the entire mutant collection freely available for nonprofit, noncommercial use via GenoBase (http://ecoli.aist-nara.ac.jp) for cost of duplication and shipping fees. Commercial and for-profit investigators should contact one of the corresponding authors directly.

Top

Acknowledgements

This work was supported by a Grant-in-Aid for Scientific Research on Priority Areas from the Ministry of Education, Culture, Sports, Science, and Technology of Japan, a grant from CREST, JST (Japan Science and Technology), and in part from NEDO (New Energy and Industrial Technology Development Organization) and from Tsuruoka City and Yamagata Prefecture governments. BLW is supported by NIH GM62662. We thank Miki Naba, Daisuke Kido, Narith Chy, Toru Kodama, Koji Komatsu, and Prof. Kazuyuki Shimizu from the Kyushu Institute of Technology for help in measuring growth of the glycolysis gene deletion mutants.

Top

References

  1. Anderson RP, Roth JR (1977) Tandem genetic duplications in phage and bacteria. Annu Rev Microbiol31: 473–505 | Article | PubMed | ISI | ChemPort |
  2. Crick F (2002) Letter to Barry L. Wanner dated October 1, 2002.
  3. Crick FHC (1973) Project K: 'The complete solution of E. coli'. Perspectives in Biology and Medicine (17): 67–70; Baltimore: Johns Hopkins University Press. | ISI |
  4. Datsenko KA, Wanner BL (2000) One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products. Proc Natl Acad Sci USA97: 6640–6645 | Article | PubMed | ChemPort |
  5. Giaever G, Chu AM, Ni L, Connelly C, Riles L, Veronneau S, Dow S, Lucau-Danila A, Anderson K, Andre B, Arkin AP, Astromoff A, EL Bakkoury M, Bangham R, Benito R, Brachat S, Campanaro S, Curtiss M, Davis K, Deutschbauer A, Entian KD, Flaherty P, Foury F, Garfinkel DJ, Gerstein M, Gotte D, Guldener U, Hegemann JH, Hempel S, Herman Z, Jaramillo DF, Kelly DE, Kelly SL, Kotter P, LaBonte D, Lamb DC, Lan N, Liang H, Liao H, Liu L, Luo C, Lussier M, Mao R, Menard P, Ooi SL, Revuelta JL, Roberts CJ, Rose M, Ross-Macdonald P, Scherens B, Schimmack G, Shafer B, Shoemaker DD, Sookhai-Mahadeo S, Storms RK, Strathern JN, Valle G, Voet M, Volckaert G, Wang CY, Ward TR, Wilhelmy J, Winzeler EA, Yang Y, Yen G, Youngman E, Yu K, Bussey H, Boeke JD, Snyder M, Philippsen P, Davis RW, Johnston M (2002) Functional profiling of the Saccharomyces cerevisiae genome. Nature418: 387–391 | Article | PubMed | ISI | ChemPort |
  6. Hayashi K, Morooka N, Yamamoto Y, Fujita K, Isono K, Choi S, Ohtsubo E, Baba T, Wanner BL, Mori H, Horiuchi T (2006) Highly accurate genome sequences of the Escherichia coli K-12 strains MG1655 and W3110. Mol Syst Biol 2006.000716738553 doi:10.1038/msb4100049
  7. Holden C (2002) Cell biology: alliance launched to modelE. coli. Science297: 1459–1460 | ChemPort |
  8. Neidhardt FC (1996) The enteric bacterial cell and the age of bacteria. In Escherichia coli and Salmonella Cellular and Molecular Biology, Neidhardt FC, Curtiss III R, Ingraham JL, Lin ECC, Low KB, Magasanik B, Reznikoff WS, Riley M, Schaechter M and Umbarger HE (eds), 2 edn, pp 1–4. Washington, DC: ASM Press
  9. Riley M, Abe T, Arnaud MB, Berlyn MB, Blattner FR, Chaudhuri RR, Glasner JD, Mori H, Horiuchi T, Keseler IM, Kosuge T, Perna NT, PlunkettIII G, Rudd KE, Serres MH, Thomas GH, Thomson NR, Wishart DS, Wanner BL (2006) Escherichia coli K-12: a cooperatively developed annotation snapshot—2005. Nucleic Acids Res34: 1–9 | Article | PubMed | ISI | ChemPort |

Extra navigation

.
ADVERTISEMENT