An ultra-low-input native ChIP-seq protocol for genome-wide profiling of rare cell populations

Brind’Amour, Julie; Liu, Sheng; Hudson, Matthew; Chen, Carol; Karimi, Mohammad M.; Lorincz, Matthew C.

doi:10.1038/ncomms7033

Article
Published: 21 January 2015


Nature Communications volume 6, Article number: 6033 (2015)

Abstract

Combined chromatin immunoprecipitation and next-generation sequencing (ChIP-seq) has enabled genome-wide epigenetic profiling of numerous cell lines and tissue types. A major limitation of ChIP-seq, however, is the large number of cells required to generate high-quality data sets, precluding the study of rare cell populations. Here, we present an ultra-low-input micrococcal nuclease-based native ChIP (ULI-NChIP) and sequencing method to generate genome-wide histone mark profiles with high resolution from as few as 10³ cells. We demonstrate that ULI-NChIP-seq generates high-quality maps of covalent histone marks from 10³ to 10⁶ embryonic stem cells. Subsequently, we show that ULI-NChIP-seq H3K27me3 profiles generated from E13.5 primordial germ cells isolated from single male and female embryos show high similarity to recent data sets generated using 50–180 × more material. Finally, we identify sexually dimorphic H3K27me3 enrichment at specific genic promoters, thereby illustrating the utility of this method for generating high-quality and -complexity libraries from rare cell populations.

Introduction

Chromatin immunoprecipitation followed by next-generation sequencing (ChIP-seq) is a widely used approach to study genome-wide DNA–protein interactions. While such experiments have yielded significant insights, standard ChIP-seq protocols require ~10⁷ cells^1,2,3, precluding their use on rare cell populations. In recent years, scaled-down ChIP-Chip⁴ and ChIP-seq procedures^5,6,7,8,9,10 were developed for inputs ranging from 10³ to 10⁶ cells. However, most include crosslinking (XChIP) and pre-amplification of ChIP material before library construction^5,6,7, which can reduce library complexity and generate PCR artefacts¹¹. Despite advances, few groups have generated high-quality data from rare in vivo cell populations using these methods. Three groups recently published data sets from purified primordial germ cells (PGCs) pooled from the gonadal ridges of mouse embryos^10,12,13. The large amount of input material used in these analyses, however, is prohibitive for studies involving single embryos or very rare cell types.

The reduced number of steps and improved resolution relative to XChIP make micrococcal nuclease (MNase)-based ‘native’ ChIP (NChIP) an attractive alternative for studying histone modifications in rare cells. A low-input NChIP-seq method to generate high-quality, high-resolution sequencing libraries was recently described⁸, but libraries built from <10⁵ cells using this method had low complexity and high levels of duplicates. We therefore sought to develop an improved NChIP procedure that would generate high-complexity libraries from significantly smaller amounts of input material.

Here, we present a flexible and robust ultra-low-input (ULI) NChIP-seq method optimized for chromatin isolated from as few as 10³ cells. H3K9me3 and H3K27me3 NChIP-seq libraries generated from 10³ to 10⁵ mouse embryonic stem cells (ESCs) yield results comparable to those previously generated from 10⁶ ESCs. We further validated our approach by generating sex-specific H3K27me3 NChIP-seq data sets from 10³ PGCs isolated from the gonadal ridges of single male and female E13.5 embryos. The maps generated have higher complexity and resolution than previously published data sets^12,13. Moreover, by intersecting our NChIP-seq data sets with RNA-seq libraries generated from 10³ male and female E13.5 PGCs, we identified a subset of genes involved in meiosis and transforming growth factor-β receptor signalling that show sex-specific differences in expression and H3K27me3 enrichment in their promoter regions.

Results

Complexity of ULI-NChIP-seq libraries from 10³ to 10⁵ cells

To improve the yield of chromatin isolated from small samples, we optimized a dilution-based NChIP-seq procedure that can easily be adjusted to cell sample size. A comparison of our method with standard NChIP-seq and low-input XChIP-seq protocols highlighting steps improved to prevent sample loss is presented in Fig. 1a. ULI-NChIP-seq allows for sorting of cells directly into a detergent-based nuclear isolation buffer, thereby enabling extended sample storage or pooling of samples. Importantly, unlike most low-input XChIP-seq methods, no pre-amplification of ChIP material is required before library construction, minimizing the generation of PCR artefacts.

Figure 1: **A NChIP-seq protocol to generate genome-wide chromatin maps from low cell numbers.**

Using this protocol, we prepared H3K9me3 NChIP-seq libraries from 10³–10⁵ ESCs (Supplementary Fig. 1a and Supplementary Methods). To serve as a reference, we also generated an H3K9me3 library from 10⁶ ESCs using a previously described NChIP-seq (‘gold-standard’) protocol¹⁴. All libraries were indexed, pooled and paired-end sequenced (100 bp reads). Depending on the number of libraries pooled on a single lane, we obtained 45–145 million reads per library. We evaluated library complexity by comparing the total number of distinct reads with the number of duplicate and unaligned reads in each library (Supplementary Fig. 1b). Unmapped reads represented 7–15% of all reads, independent of sequencing depth or input size, suggesting that the low number of PCR cycles (8–10) used for library amplification introduced relatively few PCR artefacts. The H3K9me3 library prepared from 10⁶ cells was sequenced the deepest (~145 million reads) and also had the highest proportion of duplicates (28%). Independent of sequencing depth (45–100 million reads) or the number of input cells, ULI libraries prepared from 10³ to 10⁵ cells had a total of 21–25% uniquely and multi-aligned duplicate reads, suggesting that these libraries were sufficiently complex for deeper sequencing (Supplementary Fig. 1a). As we are comparing libraries with different sequencing depths, we used the PreSeq package¹⁵ to extrapolate and compare the potential complexity of our libraries (Fig. 1b). Although our H3K9me3 libraries built from 10³ to 10⁵ cells display a lower potential complexity than our ‘gold-standard’ library (Fig. 1b, top panel), all could potentially be sequenced several times deeper than the ~20 million distinct reads recommended to generate high-quality profiles for such broad chromatin marks.

In addition, we prepared H3K27me3 NChIP-seq libraries from 10³ to 10⁵ ESCs using similar conditions, and obtained 29–42 million distinct reads per library, with ~10% unmapped reads and only 3–8% total duplicate reads in each case (Supplementary Fig. 1c). As for the H3K9me3 libraries, using PreSeq¹⁵ to extrapolate the potential complexity of these libraries indicates that, even with the lowest input, all of the H3K27me3 libraries could be sequenced to several times the depth required to obtain high-quality profiles (Fig. 1c).

To determine whether this method can be used to create profiles for active histone marks, we next generated ULI-NChIP-seq data for H3K4me3. As this promoter-enriched chromatin mark is less abundant than H3K9me3 and H3K27me3, H3K4me3 libraries were amplified for 2–4 additional PCR cycles in order to obtain sufficient material for sequencing (Supplementary Fig. 1a). Deep sequencing (37.7 million reads) of an H3K4me3 library built from 10⁵ cells showed under 10% of unaligned reads and 36% total duplicate reads (Supplementary Fig. 1d). Shallow sequencing of H3K4me3 libraries prepared from 5 × 10³ and 1 × 10⁴ cells (9.5 and 7.8 million reads, respectively) showed an increased proportion of unaligned reads (55–70%), indicative of lower complexity libraries. As this was a shallow round of sequencing, the proportion of duplicate reads remains very low (<5%). Extrapolation of potential library complexity indicates that, despite the increased proportion of unaligned reads, deeper sequencing of these libraries could generate enough reads to saturate H3K4me3 peaks (Fig. 1d).

Correlation between ULI and standard ChIP-seq libraries

Visual inspection of NChIP-seq profiles from randomly chosen regions shows similar enrichment in libraries built from 10³ to 10⁶ cells (Fig. 2a–c). We compared H3K9me3 enrichment in genome-wide 2 kb bins, and calculated Pearson correlation coefficients to assess the similarity between ULI and standard NChIP-seq libraries (Fig. 2d and Supplementary Fig. 2a). H3K9me3 libraries built from 10³ to 10⁵ cells had correlations ranging from 0.83 to 0.9 when compared with ‘gold-standard’ H3K9me3 NChIP-seq. As expected, low-input libraries had modestly higher background levels, as illustrated by an increase in variance (Supplementary Fig. 2c). We next defined regions enriched for H3K9me3 using MACS (see Methods section). Of all H3K9me3 peaks identified in our ‘gold-standard’ library, 76–85% were also detected in libraries generated from 10³ to 10⁵ cells (Supplementary Fig. 3a,c). Consistent with previous reports showing that specific endogenous retroviruses (ERVs) are marked and silenced by H3K9me3 (refs 1, 16, 17), our ‘gold-standard’ and ULI libraries show H3K9me3 enrichment at the same subset of ERV1 and ERVK subfamilies (Supplementary Fig. 4a), in the unique 1 kb 5′ flank of ERVKs (Supplementary Fig. 4b), and at individual IAP ERVK elements (Supplementary Fig. 4c).
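The binned comparison described above can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the use of read midpoints and the restriction to bins observed in at least one library are assumptions made for illustration. Because all bins have the same width, raw counts and depth-normalized values differ only by a constant factor, which leaves the Pearson coefficient unchanged.

```python
from collections import Counter
from math import sqrt

BIN_SIZE = 2000  # 2 kb bins, as in the text


def bin_counts(read_midpoints, bin_size=BIN_SIZE):
    """Count reads per (chrom, bin index) from (chrom, position) midpoints."""
    counts = Counter()
    for chrom, pos in read_midpoints:
        counts[(chrom, pos // bin_size)] += 1
    return counts


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant vector")
    return cov / sqrt(var_x * var_y)


def library_correlation(counts_a, counts_b):
    """Correlate two libraries over every bin seen in either one.

    Assumption: bins empty in both libraries are ignored here; a
    genome-wide analysis would enumerate all bins of the assembly.
    """
    bins = sorted(set(counts_a) | set(counts_b))
    xs = [counts_a.get(b, 0) for b in bins]
    ys = [counts_b.get(b, 0) for b in bins]
    return pearson(xs, ys)


# Toy example: two libraries with the same per-bin distribution.
lib_a = bin_counts([("chr1", 100), ("chr1", 1500), ("chr1", 2500), ("chr2", 10)])
lib_b = bin_counts([("chr1", 200), ("chr1", 300), ("chr1", 3900), ("chr2", 50)])
r = library_correlation(lib_a, lib_b)
```

In this toy example both libraries place two reads in the first chr1 bin and one read in each of the other two bins, so the coefficient is 1 up to floating-point rounding.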

Figure 2: **Correlation between standard and ULI-NChIP-seq libraries built from 10³ to 10⁵ ESCs.**

Similarly, H3K27me3 libraries built from 10⁴ to 10⁵ cells were highly correlated, with a genome-wide correlation (2 kb bins) of 0.9. Likely owing to a modest increase in background levels, the library built from 10³ cells had correlations of 0.77 and 0.78 to the libraries built from 10⁴ and 10⁵ cells, respectively (Fig. 2d and Supplementary Fig. 2b,c). Regardless, H3K27me3-enriched regions showed good correlation between libraries, with 80% and 70% of peaks detected in our 10⁵ cell input library overlapping with peaks detected in our libraries built from 10⁴ and 10³ cells, respectively (Supplementary Fig. 3b,d). We next compared H3K27me3 enrichment levels around transcription start sites (TSSs), as H3K27me3 marks the promoter regions of bivalent or silenced genes¹. Libraries from all input sizes showed high correlation to each other (0.86–0.96), and H3K27me3 enrichment at gene promoters was correlated with relatively low levels of gene expression, as expected (Supplementary Fig. 5a–c).
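The peak-overlap percentages quoted above can be reproduced in principle with a calculation like the following Python sketch. The overlap criterion (at least 1 bp shared between half-open intervals) and the function names are assumptions, since the text does not state how overlap was defined.

```python
def overlaps(a, b):
    """True if half-open intervals (chrom, start, end) share at least 1 bp."""
    return a[0] == b[0] and a[1] < b[2] and b[1] < a[2]


def fraction_recovered(reference_peaks, query_peaks):
    """Fraction of reference peaks overlapped by at least one query peak.

    Uses a simple all-against-all scan, which is adequate for a sketch;
    a sorted sweep or interval tree would be preferable at genome scale.
    """
    if not reference_peaks:
        raise ValueError("reference peak list is empty")
    hits = sum(
        1 for ref in reference_peaks
        if any(overlaps(ref, q) for q in query_peaks)
    )
    return hits / len(reference_peaks)


# Toy example: 3 of 5 reference peaks are recovered in the query set.
reference = [
    ("chr1", 0, 1000), ("chr1", 5000, 6000), ("chr2", 0, 500),
    ("chr2", 2000, 3000), ("chr3", 0, 100),
]
query = [
    ("chr1", 900, 1200), ("chr1", 5500, 5600),
    ("chr2", 400, 450), ("chr2", 3000, 3500),
]
recovered = fraction_recovered(reference, query)
```

Here the chr2 peak ending at 3000 does not count as overlapping the query peak starting at 3000, because the intervals are treated as half-open.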

As H3K4me3 is a narrow chromatin mark present at the promoter region of actively transcribed and bivalent genes, we compared H3K4me3 enrichment around TSSs (±500 bp) of ULI-NChIP-seq to ENCODE¹⁸ libraries (built from E14 ESCs). We obtained high Pearson correlation coefficients between our libraries built from 5 × 10³ to 10⁵ cells (0.90–0.96) and good correlations (0.71–0.83) to ENCODE libraries (Fig. 2e). The lower correlations to ENCODE libraries are presumably due in part to the large difference in sequencing depth, as well as to the different ESC lines and antibodies used. Visual inspection of ChIP-seq profiles reveals that the same promoters are generally marked, but with varying intensities (Fig. 2c). While preliminary attempts to generate H3K4me3 profiles from 10³ cells did not yield sufficient coverage (data not shown), further optimization of the ChIP conditions for this mark will likely improve the resolution of signal above background.

Sex-specific H3K27me3 profiles in PGCs from single embryos

As low-input methods are particularly useful for the study of cell types present in limited numbers in vivo, we validated our method on PGCs, the precursors to mature gametes. To determine whether sex-specific H3K27me3 profiles correlate with sex-specific gene expression, we used ULI-NChIP-seq data sets prepared from 10³ PGCs purified from the gonads of single male and female E13.5 embryos¹⁹. Comparison with previously published low-input H3K27me3 data sets generated from 5.2 × 10⁴ to 1.8 × 10⁵ PGCs in two independent studies^12,13 reveals that our method yielded similar or greater sequencing depth while minimizing total duplicate generation (<15%) (Fig. 3a). Of note, while a fraction of the reads labelled as duplicates are likely owing to preferred MNase cleavage sites, the use of paired-end rather than single-end sequencing for ULI-NChIP-seq allows for improved discrimination between technical (PCR) and biological duplicate reads. While H3K27me3 enrichment patterns around and upstream of the HoxC cluster are broadly similar to those described by Ng et al.¹³ and Lesch et al.¹² (Fig. 3b), our method yields higher resolution maps, likely owing to a combination of a high number of distinct reads, longer reads and a lower number of PCR amplification cycles used during library construction. In addition, fragmentation of chromatin using MNase generates smaller and more uniformly sized fragments than does sonication of crosslinked chromatin, while the use of paired-end sequencing allows for the determination of true fragment size. Relative H3K27me3 enrichment around all annotated TSSs (±2 kb) was similar to previously published data^12,13 (Fig. 3c), with Pearson correlations between 0.68 and 0.85. Of note, the more deeply sequenced of the two female libraries from Lesch et al.¹² showed greater correlation to the female H3K27me3 data set generated using ULI-NChIP-seq¹⁹ (0.69) than to its replicate library (0.51) (Fig. 3c).

Figure 3: **High-resolution gender-specific H3K27me3 profiles generated from E13.5 PGCs isolated from single embryos.**

Intriguingly, while male and female E13.5 PGCs have distinct differentiation programs and transcription patterns^12,20,21, our results indicate that their H3K27me3 distribution profiles are broadly similar (Supplementary Fig. 6 and ref. 19). Using our ULI-NChIP-seq data sets, we therefore sought to identify sex-specific H3K27me3-marked promoters associated with gene silencing in E13.5 PGCs. In both males and females, H3K27me3 around TSSs was associated with low levels of transcription (Fig. 4a,b and ref. 19). Most genic promoters harbouring H3K27me3 in male PGCs are also marked in females and vice versa, with approximately two-thirds of those also marked in ESCs (Supplementary Fig. 7). Interestingly, a relatively large number of promoters (~1,500) are enriched for H3K27me3 exclusively in male PGCs, while a smaller number (~270) are enriched exclusively in female PGCs. While most of the genes marked in a sex-specific manner are silenced in both male and female PGCs, we identified a subset of sex-specific H3K27me3-marked genes that show an inverse relationship with expression in PGCs (Fig. 4c–e and Supplementary Tables 1 and 2). In accordance with female E13.5 PGCs preparing to initiate meiosis I and male PGCs undergoing mitotic arrest^22,23, several meiotic genes, including Lfhg and Stra8 (ref. 24), show a higher level of expression in female PGCs and, conversely, a higher level of H3K27me3 in male PGCs (Supplementary Fig. 8 and Supplementary Tables 1 and 2). On the other hand, only a small number of male-specific genes, including transforming growth factor-β receptor binding factors Lefty1 and Lefty2, are marked by H3K27me3 in female PGCs exclusively (Supplementary Fig. 8 and Supplementary Tables 1 and 2), consistent with the recent observation that Nodal signalling is activated specifically in males²⁵. Taken together, these results reveal that at this stage in PGC development, the polycomb pathway may be engaged more frequently in the male germ line to regulate germ cell-specific genes.

Figure 4: **Gender-specific H3K27me3 profiles from E13.5 PGCs isolated from single embryos.**

Discussion

We present a rapid ULI-NChIP-seq procedure, which can be carried out with as few as 10³ cells, without sacrificing complexity or resolution^5,6,7,8,9,10. Despite the small input size, libraries generated with this method show high resolution and complexity comparable to libraries built with 10⁶ cells. Indexing and pooling multiple libraries per sequencing lane not only minimizes sequencing costs but also eliminates the need for pre-amplification of raw ChIP material, which in combination with low PCR cycles at the library construction step reduces the fraction of duplicates and unaligned reads generated. Moreover, the protocol presented here is flexible, allowing freezing, storing and pooling of samples prepared on different days, a valuable feature when working with in vivo samples. ULI-NChIP-seq may also be useful for analysis of non-histone proteins, including transcription factors, that can be immunoprecipitated in the absence of crosslinking²⁶.

Using this ULI-NChIP-seq method, we generated H3K27me3 libraries in PGCs isolated from single male and female embryos¹⁹. While these data sets are correlated with previously published data generated from PGCs pooled from multiple embryos^12,13, ULI-NChIP-seq data sets show improved resolution and a reduced proportion of reads flagged as duplicates, highlighting the benefit of minimizing the number of library amplification cycles and paired-end sequencing. Intersection of our high-resolution NChIP-seq libraries with low-input RNA-seq profiles allowed us to identify a subset of differentially expressed genes that are marked in a sex-specific manner by H3K27me3 in E13.5 PGCs, including both previously identified targets of polycomb group (PcG)-dependent silencing and novel candidates.

While it is possible to pool rare samples to generate ChIP-seq libraries, obtaining sufficient cell numbers for previously published ‘low-input’ protocols (>10⁴ cells) can be impractical. For example, in our recently published study¹⁹, only ~3 × 10³ and ~6 × 10³ SSEA1+ PGCs could be purified by fluorescence-activated cell sorting (FACS) from single male and female wild-type embryos, respectively. In genetically manipulated animals, cell viability can be impacted, decreasing sample yield further. Furthermore, embryos with the desired genotype may represent only a small fraction of each litter, so the ULI-NChIP-seq method presented here minimizes the breeding colony size required for genome-wide analyses. As multiple histone marks can be profiled simultaneously with transcription in individual embryos, the variability inherent in studies of cell types that are in the process of transcriptional reprogramming in association with developmental stage is also minimized. ULI-NChIP-seq should also be useful for studies of clinical samples, where cell numbers are frequently limiting.

Methods

Cell culture and isolation

TT2 mouse ESCs²⁷ were cultured in DMEM supplemented with 15% fetal bovine serum (HyClone), 20 mM HEPES, 0.1 mM non-essential amino acids, 0.1 mM 2-mercaptoethanol, 100 U ml⁻¹ penicillin, 0.05 mM streptomycin, leukemia inhibitory factor and 2 mM L-glutamine on gelatinized plates. Trypsinized cells were either FACS-sorted or aliquoted in nuclear isolation buffer (Sigma, N3408) containing protease inhibitor cocktail (Roche), flash-frozen and stored at −80 °C for a few weeks to a few months.

‘Gold-standard’ NChIP

For ‘gold-standard’ NChIP^14,16, 10⁶ cells were resuspended in douncing buffer (10 mM Tris-HCl, pH 7.5, 4 mM MgCl₂, 1 mM CaCl₂ and protease inhibitor cocktail) and homogenized through a syringe. Chromatin was digested in 2 U μl⁻¹ MNase (Worthington Biochemicals) at 37 °C for 5 min, and the reaction was quenched by 0.5 M EDTA. Chromatin was resuspended in hypotonic buffer (0.2 mM EDTA, pH 8.0, 0.1 mM benzamidine, 0.1 mM phenylmethylsulfonyl fluoride, 1.5 mM dithiothreitol and 1 × protease inhibitor cocktail (PIC)) and incubated for 1 h on ice. Cellular debris was pelleted and the supernatant was recovered. Chromatin was pre-cleared with 20 μl of 1:1 protein A:protein G Dynabeads (Life Technologies) and immunoprecipitation was carried out with antibody–bead complexes (5 μl Active Motif no. 39161 H3K9me3 antibody and 20 μl 1:1 protein A:protein G Dynabeads) overnight at 4 °C. IPed complexes were washed twice with 400 μl of ChIP wash buffer I (20 mM Tris-HCl, pH 8.0, 0.1% SDS, 1% Triton X-100, 2 mM EDTA and 150 mM NaCl) and twice with 400 μl of ChIP wash buffer II (20 mM Tris-HCl, pH 8.0, 0.1% SDS, 1% Triton X-100, 2 mM EDTA and 500 mM NaCl). Protein–DNA complexes were eluted in 200 μl of elution buffer (100 mM NaHCO₃ and 1% SDS) for 2 h at 68 °C. IPed material was purified by phenol chloroform and 5 ng of raw ChIP material was processed for library construction.

ULI-NChIP

A detailed, step-by-step procedure is presented in Supplementary Methods. We based our chromatin preparation on a previously published procedure for MNase chromatin fragmentation and library construction from single cells²⁸. TT2 mouse ESCs were either FACS-sorted directly in nuclear isolation buffer (Sigma; <20,000 cells) or pelleted and re-suspended in nuclear isolation buffer (Sigma). Depending on input size, chromatin was fragmented for 5–7.5 min using MNase at 21 or 37 °C, and diluted in NChIP immunoprecipitation buffer (20 mM Tris-HCl pH 8.0, 2 mM EDTA, 15 mM NaCl, 0.1% Triton X-100, 1 × EDTA-free protease inhibitor cocktail and 1 mM phenylmethanesulfonyl fluoride (Sigma)). Chromatin was pre-cleared with 5 or 10 μl of 1:1 protein A:protein G Dynabeads (Life Technologies) and IPed with 0.25 or 1 μg of H3K9me3 (Active Motif no. 39161), H3K27me3 (Diagenode pAb-069–050) or pan-H3 (Sigma, I8140) antibody–bead complexes overnight at 4 °C. IPed complexes were washed twice with 400 μl of ChIP wash buffer I (20 mM Tris-HCl, pH 8.0, 0.1% SDS, 1% Triton X-100, 0.1% deoxycholate, 2 mM EDTA and 150 mM NaCl) and twice with 400 μl of ChIP wash buffer II (20 mM Tris-HCl, pH 8.0, 0.1% SDS, 1% Triton X-100, 0.1% deoxycholate, 2 mM EDTA and 500 mM NaCl). Protein–DNA complexes were eluted in 30 μl of ChIP elution buffer (100 mM NaHCO₃ and 1% SDS) for 2 h at 68 °C. IPed material was purified by phenol chloroform and ethanol-precipitated, and raw ChIP material was re-suspended in 10 mM Tris-HCl pH 8.0. As material obtained after ChIP is minimal, DNA concentration was not measured in samples before library construction. For optimal results, raw ChIP material was re-purified with 1.8 × volume of Ampure XP DNA purification beads (Agencourt) before library construction.

RNA extraction and double-stranded cDNA preparation

Total RNA was extracted from frozen aliquots of 10³ cells using TRIzol (Invitrogen, AM9738) according to the manufacturer’s manual. Residual genomic DNA was removed by treatment with DNase I (Promega), and ribosomal RNA was depleted using the RiboMinus Transcriptome Isolation kit (Invitrogen) according to the manufacturer’s low-input protocol. First strand cDNA synthesis was carried out using Superscript III (Invitrogen 18080-093) with T4 protein 32 and a combination of random 15-mers and oligo dT (NEB), followed by second strand cDNA synthesis using the Klenow polymerase (NEB) in the presence of RNase H. Double-stranded cDNA was fragmented using a BioRuptor (Diagenode) for 15 min (low power mode, 30 s on and 30 s off).

Library construction

For ‘gold-standard’ H3K9me3 NChIP-seq, 5 ng of raw ChIP material was used for library construction. For ULI-NChIP-seq, 85% of the raw ChIP material was used for library construction. Illumina libraries were constructed using a modified custom paired-end protocol²⁸. In brief, samples were end-repaired (1 × T4 DNA ligase buffer, 0.4 mM dNTP mix, 2.25 U T4 DNA polymerase, 0.75 U Klenow DNA polymerase and 7.5 U T4 polynucleotide kinase; 30 min at 21–25 °C), A-tailed (1 × NEB buffer 2, 0.4 mM dNTPs and 3.75 U of Klenow (exo-); 30 min at 37 °C) and ligated (1 × rapid DNA ligation buffer, 1 mM Illumina PE adapters and 1,600 U DNA ligase; 1–8 h at 21–25 °C). Ligated fragments were amplified using indexed primers (Illumina) for 8–10 PCR cycles. DNA was purified with 1.8 × volume Ampure XP DNA purification beads between each step.

Sequencing and alignment

Amplified indexed libraries were pooled, size selected on a 2% agarose gel and diluted to a final concentration of 10 nM. Cluster generation and paired-end sequencing (100 bp reads) were performed on the Illumina cluster station and Illumina HiSeq 2000 or Illumina HiSeq 2500 sequencing platforms using Illumina Read 1 and Read 2 primers, and a third custom primer (5′- GATCGGAAGAGCGGTTCAGCAGGAATGCCGAGACCG -3′) to sequence the 6-mer unique index. Sequence reads were mapped to mm9 (NCBI 37) using Burrows-Wheeler Aligner (BWA)²⁹, and duplicate reads were marked using Picard-tools (http://picard.sourceforge.net). Reads passing Illumina’s default chastity filter (total reads) were used to generate library statistics using Samtools Flagstats³⁰, where reads with the exact same sequence are identified as ‘duplicates’, non-duplicate reads with a MapQ>5 are identified as distinct uniquely aligned reads and reads with a MapQ<5 are identified as distinct multi-aligned reads.
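The read categories used for the library statistics can be summarized with a short Python sketch. It is an illustration, not the authors' code: the `Read` record and function names are hypothetical, duplicates are assumed to have been flagged upstream, and a MapQ of exactly 5 (not covered by the thresholds stated above) is assigned to the uniquely aligned class as an assumption.

```python
from collections import Counter
from typing import NamedTuple, Optional

MAPQ_CUTOFF = 5  # threshold from the text; treatment of MapQ == 5 is assumed


class Read(NamedTuple):
    mapq: Optional[int]   # None for an unaligned read
    is_duplicate: bool    # duplicate flag set by an upstream marking step


def classify(read):
    """Assign a read to one of the four categories used for library statistics."""
    if read.mapq is None:
        return "unaligned"
    if read.is_duplicate:
        return "duplicate"  # uniquely and multi-aligned duplicates are pooled
    if read.mapq >= MAPQ_CUTOFF:
        return "distinct_unique"
    return "distinct_multi"


def library_stats(reads):
    """Return the percentage of total reads falling in each category."""
    counts = Counter(classify(r) for r in reads)
    total = sum(counts.values())
    categories = ("unaligned", "duplicate", "distinct_unique", "distinct_multi")
    return {name: 100.0 * counts[name] / total for name in categories}


# Toy library of eight reads.
toy_reads = [
    Read(None, False), Read(30, False), Read(30, True), Read(0, False),
    Read(0, True), Read(60, False), Read(5, False), Read(4, False),
]
stats = library_stats(toy_reads)
```

In practice these fields would be read from the alignment records (the duplicate bit of the SAM flag and the MAPQ column) rather than constructed by hand.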

Data sets

ChIP-seq and RNA-seq data sets prepared for this manuscript are available at the Gene Expression Omnibus repository under the accession number GSE63523. H3K27me3 ChIP-seq data sets prepared from 10³ male and female E13.5 PGCs using ULI-NChIP-seq¹⁹, and low-input RNA-seq data sets are available under the accession GSE60377. ENCODE¹⁸ H3K4me3 data sets generated from E14 ESCs (SRR568477 and SRR568478) and H3K27me3 data sets generated from E13.5 PGCs were obtained from accessions GSE38165 (ref. 13) and SRA027978 (ref. 12).

Data analysis

For analysis of relative ChIP enrichment at unique loci, duplicate reads (with identical coordinates) and reads with a MapQ<5 (multi-aligned reads) were removed. Multi-aligned reads were included for calculating the relative ChIP enrichment at agglomerated transposable elements. Normalization of relative ChIP enrichment was calculated as reads per kilobase per million mapped reads (RPKM)^31,32. For mined data sets using short, single-end reads, reads were extended to 300 bp before generating RPKM values. Potential library complexity was determined using the extrapolate function of the PreSeq package¹⁵. For expression analysis, normalization of RNA-seq read enrichment was calculated as RPKM at exonic regions only (RefSeq transcripts).
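As a worked illustration of the normalization described above, the following Python sketch computes RPKM for a region and extends a short single-end read to a fixed length. The function names are hypothetical, and extending reads in the 3′ direction of their strand is an assumption; the text states only that reads were extended to 300 bp.

```python
def rpkm(read_count, region_length_bp, total_mapped_reads):
    """Reads per kilobase of region per million mapped reads."""
    if region_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("region length and library size must be positive")
    return read_count * 1e9 / (region_length_bp * total_mapped_reads)


def extend_read(start, end, strand, target_len=300):
    """Extend a single-end read to target_len bp along its strand.

    Coordinates are 0-based and half-open. Plus-strand reads grow
    rightwards from their start; minus-strand reads grow leftwards
    from their end, clipped at position 0.
    """
    if strand == "+":
        return start, start + target_len
    if strand == "-":
        return max(0, end - target_len), end
    raise ValueError("strand must be '+' or '-'")


# 50 reads in a 2 kb bin from a library of 25 million mapped reads.
bin_rpkm = rpkm(50, 2000, 25_000_000)
plus_read = extend_read(1000, 1036, "+")
minus_read = extend_read(1000, 1036, "-")
```

With these toy numbers the bin has an RPKM of 1.0, and the 36 bp reads become the intervals (1000, 1300) and (736, 1036).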

Peak calling

Regions enriched for H3K9me3 or H3K27me3 were determined using MACS and MACS2 peak callers on non-duplicate, uniquely aligned reads^33,34. For H3K9me3 peaks, broad domains were identified using MACS2 broadpeaks (P value=0.05) and combined with narrow domains identified with MACS (10⁵ and 10⁶ cells input: P value=0.01; 10³ and 10⁴ cells input: P value=0.02). Peaks closer than 2 kb apart were merged and peaks larger than 0.5 kb were included in our analysis. Similarly, for H3K27me3 peaks, broad regions were called using MACS2 broadpeaks (P value=0.05) and combined with narrower domains identified with MACS (10⁴ and 10⁵ cells input: P value=0.01; 10³ cells input: P value=0.02). Peaks closer than 2 kb apart were merged and peaks larger than 0.5 kb were included in our analysis.
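The post-processing of peak calls described above (combine, merge, size-filter) can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the function name is hypothetical, the broad and narrow calls are assumed to have been concatenated into one list, and merging is assumed to happen before the size filter, an order the text does not specify.

```python
def merge_and_filter(peaks, max_gap=2000, min_width=500):
    """Merge peaks less than max_gap bp apart, then keep those wider than min_width.

    peaks: iterable of (chrom, start, end) tuples with half-open coordinates.
    Overlapping peaks (negative gap) are merged as well.
    """
    merged = []
    for chrom, start, end in sorted(peaks):
        if merged and merged[-1][0] == chrom and start - merged[-1][2] < max_gap:
            prev = merged[-1]
            # Extend the previous peak; max() handles nested peaks.
            merged[-1] = (chrom, prev[1], max(prev[2], end))
        else:
            merged.append((chrom, start, end))
    return [p for p in merged if p[2] - p[1] > min_width]


# Toy example combining hypothetical broad and narrow calls.
calls = [
    ("chr1", 100, 400), ("chr1", 1500, 1900), ("chr1", 10000, 10300),
    ("chr2", 0, 600), ("chr1", 5000, 5200),
]
final_peaks = merge_and_filter(calls)
```

In the toy example the first two chr1 calls lie 1.1 kb apart and are merged into one 1.8 kb region, while the isolated 200 bp and 300 bp calls are dropped by the size filter.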

Additional information

How to cite this article: Brind’Amour, J. et al. An ultra-low-input native ChIP-seq protocol for genome-wide profiling of rare cell populations. Nat. Commun. 6:6033 doi: 10.1038/ncomms7033 (2015).

Accession codes: ChIP-seq and RNA-seq data sets prepared for this manuscript are available at the Gene Expression Omnibus (GEO) repository under the accession number GSE63523. Referenced data sets: GEO repository: GSE60377, GSE38165. Sequence Read Archive (SRA) repository: SRA027978, SRR568477 and SRR568478.

References

Mikkelsen, T. S. et al. Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature 448, 553–560 (2007).
Article CAS ADS Google Scholar
Robertson, G. et al. Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing. Nat. Methods 4, 651–657 (2007).
Article CAS Google Scholar
Barski, A. et al. High-resolution profiling of histone methylations in the human genome. Cell 129, 823–837 (2007).
Article CAS Google Scholar
Dahl, J. A., Reiner, A. H. & Collas, P. Fast genomic muChIP-chip from 1,000 cells. Genome Biol. 10, R13 (2009).
Article Google Scholar
Adli, M., Zhu, J. & Bernstein, B. E. Genome-wide chromatin maps derived from limited numbers of hematopoietic progenitors. Nat. Methods 7, 615–618 (2010).
Article CAS Google Scholar
Hitchler, M. J. & Rice, J. C. Genome-wide epigenetic analysis of human pluripotent stem cells by ChIP and ChIP-Seq. Methods Mol. Biol. 767, 253–267 (2011).
Article CAS Google Scholar
Shankaranarayanan, P. et al. Single-tube linear DNA amplification (LinDA) for robust ChIP-seq. Nat. Methods 8, 565–567 (2011).
Article CAS Google Scholar
Gilfillan, G. D. et al. Limitations and possibilities of low cell number ChIP-seq. BMC Genomics 13, 645 (2012).
Article CAS Google Scholar
Blecher-Gonen, R. et al. High-throughput chromatin immunoprecipitation for genome-wide mapping of in vivo protein-DNA interactions and epigenomic states. Nat. Protoc. 8, 539–554 (2013).
Article Google Scholar
Sachs, M. et al. Bivalent chromatin marks developmental regulatory genes in the mouse embryonic germline in vivo. Cell Rep. 3, 1777–1784 (2013).
Article CAS Google Scholar
Kidder, B. L., Hu, G. & Zhao, K. ChIP-Seq: technical considerations for obtaining high-quality data. Nat. Immunol. 12, 918–922 (2011).
Article CAS Google Scholar
Lesch, B. J., Dokshin, G. A., Young, R. A., McCarrey, J. R. & Page, D. C. A set of genes critical to development is epigenetically poised in mouse germ cells from fetal stages through completion of meiosis. Proc. Natl Acad. Sci. USA 110, 16061–16066 (2013).
Article CAS ADS Google Scholar
Ng, J. H. et al. In vivo epigenomic profiling of germ cells reveals germ cell molecular signatures. Dev. Cell 24, 324–333 (2013).
Article CAS Google Scholar
Maunakea, A. K. et al. Conserved role of intragenic DNA methylation in regulating alternative promoters. Nature 466, 253–257 (2010).
Article CAS ADS Google Scholar
Daley, T. & Smith, A. D. Predicting the molecular complexity of sequencing libraries. Nat. Methods 10, 325–327 (2013).
Article CAS Google Scholar
Karimi, M. M. et al. DNA methylation and SETDB1/H3K9me3 regulate predominantly distinct sets of genes, retroelements, and chimeric transcripts in mESCs. Cell Stem Cell 8, 676–687 (2011).
Article CAS Google Scholar
Matsui, T. et al. Proviral silencing in embryonic stem cells requires the histone methyltransferase ESET. Nature 464, 927–931 (2010).
Article CAS ADS Google Scholar
ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
Liu, S. et al. Setdb1 is required for germline development and silencing of H3K9me3-marked endogenous retroviruses in primordial germ cells. Genes Dev. 28, 2041–2055 (2014).
Article CAS Google Scholar
Jameson, S. A. et al. Temporal transcriptional profiling of somatic and germ cells reveals biased lineage priming of sexual fate in the fetal mouse gonad. PLoS Genet. 8, e1002575 (2012).
Article CAS Google Scholar
Seisenberger, S. et al. The dynamics of genome-wide DNA methylation reprogramming in mouse primordial germ cells. Mol. Cell 48, 849–862 (2012).
Koubova, J. et al. Retinoic acid regulates sex-specific timing of meiotic initiation in mice. Proc. Natl Acad. Sci. USA 103, 2474–2479 (2006).
Spiller, C. M., Bowles, J. & Koopman, P. Regulation of germ cell meiosis in the fetal ovary. Int. J. Dev. Biol. 56, 779–787 (2012).
Baltus, A. E. et al. In germ cells of mouse embryonic ovaries, the decision to enter meiosis precedes premeiotic DNA replication. Nat. Genet. 38, 1430–1434 (2006).
Souquet, B. et al. Nodal signaling regulates the entry into meiosis in fetal germ cells. Endocrinology 153, 2466–2473 (2012).
Kasinathan, S., Orsi, G. A., Zentner, G. E., Ahmad, K. & Henikoff, S. High-resolution mapping of transcription factor binding sites on native chromatin. Nat. Methods 11, 203–209 (2014).
Yagi, T. et al. A novel ES cell line, TT2, with high germline-differentiating potency. Anal. Biochem. 214, 70–76 (1993).
Falconer, E. et al. DNA template strand sequencing of single-cells maps genomic rearrangements at high resolution. Nat. Methods 9, 1107–1112 (2012).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Mortazavi, A., Williams, B. A., McCue, K., Schaeffer, L. & Wold, B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat. Methods 5, 621–628 (2008).
Pepke, S., Wold, B. & Mortazavi, A. Computation for ChIP-seq and RNA-seq studies. Nat. Methods 6, S22–S32 (2009).
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
Feng, J., Liu, T., Qin, B., Zhang, Y. & Liu, X. S. Identifying ChIP-seq enrichment using MACS. Nat. Protoc. 7, 1728–1740 (2012).
Rugg-Gunn, P. J., Cox, B. J., Ralston, A. & Rossant, J. Distinct histone modifications in stem cell lines and tissue lineages from the early mouse embryo. Proc. Natl Acad. Sci. USA 107, 10783–10790 (2010).

Acknowledgements

We would like to thank the BC Genome Sciences Center and UBC Sequencing Center for Illumina sequencing, and the ubcFLOW cytometry facility for FACS analyses. We also thank Ester Falconer, Aaron Bogutz and Ulrike Nauman for their helpful discussions, and Martin Hirst for the ‘gold-standard’ NChIP protocol. This work was supported by Canadian Institutes of Health Research grants 77805 and 92090 to M.C.L. M.M.K. was supported by a Michael Smith Foundation for Healthcare Research postdoctoral fellowship.

Author information

Authors and Affiliations

Department of Medical Genetics, Life Sciences Institute, The University of British Columbia, Vancouver, V6T 1Z3, British Columbia, Canada
Julie Brind’Amour, Sheng Liu, Matthew Hudson, Carol Chen, Mohammad M. Karimi & Matthew C. Lorincz
Biomedical Research Centre, The University of British Columbia, Vancouver, V6T 1Z3, British Columbia, Canada
Mohammad M. Karimi

Authors

Julie Brind’Amour
Sheng Liu
Matthew Hudson
Carol Chen
Mohammad M. Karimi
Matthew C. Lorincz

Contributions

Experiments were designed by J.B. and M.C.L. ULI-NChIP-seq and RNA-seq libraries were prepared by J.B., S.L. and M.H., and ‘gold-standard’ ChIP was performed by C.C. Data analyses were performed by J.B., M.M.K. and S.L.

Corresponding author

Correspondence to Matthew C. Lorincz.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Figures 1-8, Supplementary Tables 1-2, Supplementary Methods (PDF 1662 kb)

About this article

Cite this article

Brind’Amour, J., Liu, S., Hudson, M. et al. An ultra-low-input native ChIP-seq protocol for genome-wide profiling of rare cell populations. Nat Commun 6, 6033 (2015). https://doi.org/10.1038/ncomms7033

Received: 03 July 2014
Accepted: 04 December 2014
Published: 21 January 2015
DOI: https://doi.org/10.1038/ncomms7033
