Insertion sequencing (INSeq) is a method for determining the insertion site and relative abundance of large numbers of transposon mutants in a mixed population of isogenic mutants of a sequenced microbial species. INSeq is based on a modified mariner transposon containing MmeI sites at its ends, allowing cleavage at chromosomal sites 16–17 bp from the inserted transposon. Genomic regions adjacent to the transposons are amplified by linear PCR with a biotinylated primer. Products are bound to magnetic beads, digested with MmeI and barcoded with sample-specific linkers appended to each restriction fragment. After limited PCR amplification, fragments are sequenced using a high-throughput instrument. The sequence of each read can be used to map the location of a transposon in the genome. Read count measures the relative abundance of that mutant in the population. Solid-phase library preparation makes this protocol rapid (18 h), easy to scale up, amenable to automation and useful for a variety of samples. A protocol for characterizing libraries of transposon mutant strains clonally arrayed in a multiwell format is provided.
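The mapping and counting logic described above can be illustrated with a minimal Python sketch. It is not the INSeq analysis pipeline. It assumes a hypothetical read layout (sample barcode, then the 16–17 bp genomic flank, then the transposon end), a hypothetical 4-nt barcode, a placeholder transposon-end sequence, and exact-match lookup in a toy genome held as a single forward-strand string. The names `parse_read`, `count_insertions` and `relative_abundance` are illustrative only.

```python
from collections import Counter, defaultdict

BARCODE_LEN = 4           # assumed barcode length (hypothetical)
TN_END = "ACAGGTTG"       # placeholder transposon-end sequence (hypothetical)
FLANK_LENGTHS = (16, 17)  # MmeI cleaves 16-17 bp from the transposon (from the text)


def parse_read(read):
    """Split a read into (barcode, genomic flank); return None if malformed."""
    barcode, rest = read[:BARCODE_LEN], read[BARCODE_LEN:]
    idx = rest.find(TN_END)
    # The flank is whatever precedes the transposon end; it must be 16 or 17 bp.
    if idx not in FLANK_LENGTHS:
        return None
    return barcode, rest[:idx]


def count_insertions(reads, genome):
    """Count reads per sample barcode and insertion coordinate.

    Uses exact matching on the forward strand only (a simplification);
    flanks that are absent or occur more than once are skipped.
    """
    counts = defaultdict(Counter)
    for read in reads:
        parsed = parse_read(read)
        if parsed is None:
            continue
        barcode, flank = parsed
        pos = genome.find(flank)
        if pos == -1 or genome.find(flank, pos + 1) != -1:
            continue
        # Under the assumed layout the transposon abuts the end of the flank.
        counts[barcode][pos + len(flank)] += 1
    return counts


def relative_abundance(site_counts):
    """Convert per-site read counts into fractions of the sample total."""
    total = sum(site_counts.values())
    return {site: n / total for site, n in site_counts.items()}
```

In this sketch, each barcode keys one sample, and the fraction of that sample's reads at a coordinate stands in for the relative abundance of the corresponding mutant. A real analysis would also need to handle both strands, sequencing errors and repeated regions.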
- Supplementary Table 1 (137K)
- Supplementary Dataset 1 (15M): INSeq_analysis.zip, containing the INSeq data analysis pipeline and README.txt file.