As the catalogue of sequenced genomes and metagenomes continues to grow, massively parallel approaches for the comprehensive and functional analysis of gene products and regulatory elements are becoming increasingly valuable. Current strategies to synthesize or clone complex libraries of DNA sequences are limited by the length of the DNA targets, throughput and cost. Here, we show that long-adapter single-strand oligonucleotide (LASSO) probes can capture and clone thousands of kilobase DNA fragments in a single reaction. As proof of principle, we simultaneously cloned over 3,000 bacterial open reading frames (ORFs) from Escherichia coli genomic DNA (spanning 400- to 5,000-bp targets). Targets were enriched up to a median of around 60-fold compared with non-targeted genomic regions. At a cutoff of three times the median non-target reads per kilobase of genetic element per million reads, around 75% of the targeted ORFs were successfully captured. We also show that LASSO probes can clone human ORFs from complementary DNA, and an ORF library from a human-microbiome sample. LASSO probes could be used for the preparation of long-read sequencing libraries and for massively multiplexed cloning.
This work was supported in part by the Shriners Hospitals for Children (B.P. and L.T.), a Prostate Cancer Foundation Young Investigator award (H.B.L.), and National Institutes of Health Grants R01EB012521 (B.P.), K01DK087770 (B.P.) and 1U24AI118633 (H.B.L.).
Supplementary sequence data.