Single-cell m6A mapping in vivo using picoMeRIP–seq

Li, Yanjiao; Wang, Yunhao; Vera-Rodriguez, Maria; Lindeman, Leif Christopher; Skuggen, Linda Ellevog; Rasmussen, Erik M. K.; Jermstad, Ingunn; Khan, Shaista; Fosslie, Madeleine; Skuland, Trine; Indahl, Marie; Khodeer, Sherif; Klemsdal, Eva Kristine; Jin, Kang-Xuan; Dalen, Knut Tomas; Fedorcsak, Peter; Greggains, Gareth D.; Lerdrup, Mads; Klungland, Arne; Au, Kin Fai; Dahl, John Arne

doi:10.1038/s41587-023-01831-7

Download PDF

Brief Communication
Open access
Published: 22 June 2023

Single-cell m⁶A mapping in vivo using picoMeRIP–seq

Nature Biotechnology volume 42, pages 591–596 (2024)Cite this article

11k Accesses
8 Citations
45 Altmetric
Metrics details

Subjects

Abstract

Current N⁶-methyladenosine (m⁶A) mapping methods need large amounts of RNA or are limited to cultured cells. Through optimized sample recovery and signal-to-noise ratio, we developed picogram-scale m⁶A RNA immunoprecipitation and sequencing (picoMeRIP–seq) for studying m⁶A in vivo in single cells and scarce cell types using standard laboratory equipment. We benchmark m⁶A mapping on titrations of poly(A) RNA and embryonic stem cells and in single zebrafish zygotes, mouse oocytes and embryos.

Pooled multicolour tagging for visualizing subcellular protein dynamics

Article Open access 19 April 2024

Improving prime editing with an endogenous small RNA-binding protein

Article Open access 03 April 2024

Genome assembly in the telomere-to-telomere era

Article 22 April 2024

Main

N⁶-Methyladenosine (m⁶A) is the most prevalent endogenous modification of mRNAs in eukaryotes^1,2. The m⁶A modification is involved in regulating post-transcriptional RNA processes, including splicing³, export⁴, stability⁵, turnover⁶ and translation⁷, and has key roles in cell differentiation and reprogramming⁸, gametogenesis⁹, embryogenesis¹⁰, stress response¹¹, tumorigenesis¹² and cellular integrity maintenance by silencing endogenous retrovirus-derived RNAs¹³.

Since the first publications of m⁶A mapping methods in 2012 (m⁶A RNA immunoprecipitation and sequencing (MeRIP–seq¹⁴) and m⁶A-seq¹⁵), several techniques have been developed: antibody-based PA-m⁶A-seq¹⁶, miCLIP¹⁷, m⁶A-CLIP¹⁸ and m⁶A-LAIC-seq¹⁹ and antibody-free DART-seq²⁰, MAZTER-seq²¹, m⁶A-REF-seq²² and m⁶A-SEAL²³. Immunoprecipitation (IP)-based methods do not provide single-nucleotide resolution or m⁶A stoichiometry but can estimate position based on RRACH motif and can be used for differential enrichment analysis with tools such as DESeq2. Mapping of m⁶A typically requires large amounts of input material. The lowest starting amount reported to date is 10 ng of total RNA using the DART-seq technique²⁰, and recently, single-cell DART-seq was also demonstrated²⁴. However, DART-seq requires APOBEC1-YTH expression in cells to induce C-to-U deamination at sites adjacent to m⁶A residues, thus limiting its application to cultured cells²⁴. Despite these advances, there is still a need for highly sensitive and single-cell m⁶A mapping methods applicable to in vivo cell types.

To this end, we developed a sensitive picogram-scale MeRIP–seq (picoMeRIP–seq) method that is also suitable for single-cell MeRIP–seq (Fig. 1a) and benchmarked m⁶A mapping on titrations of mouse liver poly(A)-selected RNAs, spike-in control RNAs and mouse embryonic stem (mES) cells and in single zebrafish zygotes, single mouse oocytes and preimplantation embryos. First, we performed optimization of experimental parameters using mouse liver poly(A)-selected RNA and Pdzd8 mRNA as a positive control and Rdh10 mRNA as a negative control based on published data¹⁵. We assessed the effects of several experimental conditions on the signal-to-noise (S/N) ratio (Methods). We have previously shown that optimizing the S/N ratio is critical for successful downscaling of chromatin IP²⁵, and we reasoned that it would be equally important for RNA IP. This is based on the rationale that when scaling down the amount of starting material, while the surfaces available for nonspecific binding of RNA (the surface of plastic tubes and paramagnetic beads) are kept consistent, this results in a relative increase in the carryover of nonspecifically bound RNA and hence a reduction in S/N ratio. When scaling down the input amount, the S/N ratio is improved by the following: (1) increasing the detergent (SDS) and salt (NaCl) concentrations and roughly vortexing rather than gently rotating head over tail, suggesting that chemically and physically stringent washing is able to remove more of the nonspecifically bound material (Fig. 1a, step 4, Extended Data Fig. 1a and Methods); (2) using low-binding tubes to reduce carryover of nonspecifically bound material at the plastic surface of the tube and to reduce loss of RNA (Fig. 1a, steps 1–5, Extended Data Fig. 1b and Methods) and (3) thoroughly assessing commercially available antibodies to m⁶A and finding that anti-m⁶A from Millipore has a superior S/N ratio (Fig. 1a, step 3, Extended Data Fig. 1c and Methods).

**Fig. 1: Development of picoMeRIP–seq.**

Furthermore, we established reliable RNA fragmentation by sonication (Fig. 1a, step 2, Extended Data Fig. 1d and Methods). Finally, conventional preparation of RNA libraries requires conversion of the RNA starting material to cDNA and uses either RNA or DNA adaptor ligation to the target RNA or DNA molecules, which has relatively low efficiency. For epitranscriptome-wide mapping, we tailored Takara Bio’s Switching Mechanism At the 5′ end of RNA Template (SMART) library preparation protocol to a single-tube procedure (Fig. 1a, step 5, Extended Data Fig. 1e and Methods) with an efficient and simple workflow to generate stranded Illumina sequencing-ready libraries in a few hours. After Illumina sequencing, the resulting data were analyzed with standard preprocessing, read mapping and filtering (Fig. 1a, steps 6–8). Peak calling was performed with a commonly used model-based method, MACS²⁶, providing m⁶A peaks for further downstream analyses (Fig. 1a, steps 9 and 10, and Methods).

We assessed the performance of picoMeRIP–seq on a titration of input RNA amounts (10 ng, 1 ng and 100 pg of mouse liver poly(A)-selected RNA) and compared it to published data. picoMeRIP–seq generated consistent profiles between replicates and for reduced starting amounts and reproduced published data (Fig. 1b). A high degree of transcriptome-wide correlation of sequencing reads was observed between replicates and between different starting amounts (Pearson correlation coefficients of 0.82–0.96; Fig. 1c). On average, the numbers of m⁶A peaks called from different amounts of starting material were 11,895 (10 ng), 12,079 (1 ng) and 6,661 (100 pg) (Supplementary Table 1). Peak overlap between different starting amounts and between replicates was high and on par with, or better than, the overlap between replicates of published data from 3 μg of total RNA (Fig. 1d,e and Extended Data Fig. 2a)²⁷. picoMeRIP–seq (10 ng) identified about 87% of previously published peaks from liver RNA (3 μg of total RNA)²⁷, supporting the reliability of our method (Fig. 1d). Furthermore, we validated the specificity of our method by showing that de novo motif analysis of data obtained from 10 ng, 1 ng and 100 pg of mouse liver RNA all identified the well-known m⁶A motif RRACH as the most significantly enriched motif (Extended Data Fig. 2b)^15,27, and around 96% of m⁶A peaks from each sample had the RRACH motif (Supplementary Table 1). Last, mouse liver picoMeRIP–seq data presented clear m⁶A enrichment at the vicinity of the stop codon (Extended Data Fig. 2c), consistent with previous reports on m⁶A distribution^15,27.

Next, we assessed the effect of a key computational analysis parameter (the q value reported by MACS) on the reliability of identified m⁶A peaks using the following four evaluation factors: (1) fraction of peaks with a RRACH motif, (2) fraction of peaks that are located in either the stop codon or 3′ untranslated region (UTR), (3) fraction of peaks identified in two biological replicates and (4) fraction of peaks identified by picoMeRIP–seq and previously published work using 3 μg of RNA²⁷. As expected, the number of identified peaks decreased when increasing the stringency of the statistical significance cutoff (that is, lower q value) for peak calling, and the effect of this was comparable between picoMeRIP–seq data from 10 ng, 1 ng and 100 pg and published data (Extended Data Fig. 3a). Of note, the minor effect of q value cutoffs ranging from <0.05 to <1 × 10^–100 on the four evaluation factors listed above supported robustness of the picoMeRIP–seq data (Extended Data Fig. 3b–e). Furthermore, both for published data and for picoMeRIP–seq data, we observed that peaks supported by two biological replicates had higher fractions of RRACH motifs and higher fractions of m⁶A peaks with 3′-end transcript occupancy than peaks only supported by one biological replicate (Extended Data Fig. 3f,g).

Thereafter, we performed further experimental assessments of the level of specificity and background of picoMeRIP. To assess the quantitative performance of picoMeRIP, two control RNAs with (Gaussia luciferase (GLuc)) and without (Cypridina luciferase (CLuc)) m⁶A modifications were mixed at different ratios to obtain five samples with different methylation levels (100%, 80%, 50%, 20% and 0%) that were used for picoMeRIP–quantitative PCR (picoMeRIP–qPCR; Extended Data Fig. 4a). We achieved high agreement (R = 0.99) between the expected m⁶A levels and the experimentally observed m⁶A levels (Extended Data Fig. 4b). Furthermore, we spiked in the two control RNAs into mouse liver mRNA samples at a 1:1 ratio (GLuc:CLuc) and performed picoMeRIP. qPCR analysis showed high m⁶A signal compared to unmodified background for both spike-in controls and for previously validated m⁶A-positive (Pdzd8) and m⁶A-negative (Rdh10) liver transcripts (Extended Data Fig. 4c). Next, we compared the number of m⁶A peaks and peak signal strength of picoMeRIP–seq from wild-type (WT) and METTL3-deficient mES cells, including spiked-in control RNAs. We made use of a published Mettl3-knockout (KO) mES cell line⁸ and confirmed the absence of the METTL3 m⁶A writer protein by western blotting (Extended Data Fig. 4d). To identify m⁶A peaks specific to either WT or KO mES cells, we performed picoMeRIP–seq on (1) mouse liver mRNA, (2) mouse liver mRNA and WT mES cell mRNA added in a 1:1 ratio and (3) mouse liver mRNA and KO mES cell mRNA added in a 1:1 ratio (Extended Data Fig. 4e). This allowed for quantitative comparison of m⁶A signal between WT and METTL3-deficient mES cells. We identified 7,404 peaks specific to WT and only 1,915 peaks specific to METTL3-deficient mES cells (Extended Data Fig. 4f) and found that METTL3-deficient mES cell-specific peaks showed significantly lower m⁶A signal than WT-specific peaks (Extended Data Fig. 4g). These results are in agreement with previous reports demonstrating that METTL3 deficiency leads to incomplete removal of m⁶A methylation activity¹⁰. In parallel, assessment of the m⁶A-modified GLuc and unmodified CLuc control RNAs spiked into the samples showed low false discovery rates (Extended Data Fig. 4h). Together, these data support high specificity and utility of picoMeRIP–seq in m⁶A detection.

Next, we applied picoMeRIP–seq to single zebrafish zygotes for proof-of-principle single-cell m⁶A profiling. With the aim of starting from intact single cells and reducing loss as much as possible, we combined cell lysis and removal of both rRNA and DNA into a one-tube procedure (Fig. 1a, step 1, and Methods). Each single-cell experiment yielded from 195,976 to 641,234 (with an average of 400,079) uniquely aligned and deduplicated read pairs (Supplementary Table 1). We used picoMeRIP–seq data from pools of multiple zygotes as a reference. Genome browser assessment of picoMeRIP–seq data from single zygotes showed m⁶A profiles similar to those obtained with pools of zygotes (Fig. 1f). All metagene profiles of m⁶A enrichment displayed a clear enrichment at the vicinity of the stop codon, in agreement with previous studies (Fig. 1g)^14,15,28. We observed high transcriptome-wide correlation (Pearson correlation coefficient) between data from pools of zygotes (>0.98; Extended Data Fig. 5a,d), between data from single zygotes and pools of zygotes (0.83–0.88; Extended Data Fig. 5b,d) and between biological replicates of single zygotes (0.93–0.97; Extended Data Fig. 5c,d). The sensitivity of m⁶A peak detection improved with increasing numbers of reads (Extended Data Fig. 5e,f). By combining the data from five single-zygote experiments, we detected 8,516 peaks (Extended Data Fig. 5e). The numbers of detected peaks from single-zygote picoMeRIP–seq data were close to the expected maximum, as estimated by peak calling from randomly downsampled data from pools of 10 zygotes (Extended Data Fig. 5f). We observed a high fraction of peak overlap between picoMeRIP–seq data from single zygotes and data from pools of zygotes (89–95%) and a high fraction of m⁶A transcript overlap between picoMeRIP–seq data from single zygotes and published large-scale data²⁸ (78–85%), indicating a high level of specificity (Extended Data Fig. 6a). Finally, as further support of specificity, all picoMeRIP–seq experiments, also from single zygotes, demonstrated RRACH as the most significantly enriched motif (Extended Data Fig. 6b).

To further demonstrate the versatility of our method, we applied picoMeRIP–seq to mES cells sorted by fluorescence-activated cell sorting. We showed a high level of reproducibility for selected loci (Extended Data Fig. 7a) and by transcriptome-wide correlation analysis of picoMeRIP–seq data from 1,000 to 10 cells (Extended Data Fig. 7b). Assessment of m⁶A peak overlap showed high reproducibility between three biological replicates for 1,000 cells and 100 cells and also good overlap between 1,000 cells and 10 cells (Extended Data Fig. 7c). Although increased variation between three replicates of ten cells was observed (Extended Data Fig. 7b) and peak overlap was reduced (Extended Data Fig. 7c), significantly enriched motifs in m⁶A peaks (Extended Data Fig. 7d) and metagene profiles of m⁶A enrichment (Extended Data Fig. 7e) from ten-cell experiments were in agreement with previous studies in mES cells¹⁰. In addition, we tried to apply picoMeRIP–seq to single mES cells but did not obtain sufficient libraries for sequencing.

Next, to benchmark picoMeRIP–seq, we applied it to single mouse germinal vesicle (GV)-stage oocytes, metaphase II (MII)-stage oocytes and single embryos at the zygote, two-cell, eight-cell and blastocyst stages to generate m⁶A maps. Each experiment resulted in 1.1 million–5.1 million uniquely mapped and deduplicated reads (Supplementary Table 1). High library complexity allowed for a resolution sufficient to assess m⁶A enrichment at individual loci in the transcriptomes of single oocytes and single embryos (Fig. 2a) and even allowed for peak calling, demonstrating that m⁶A marking is a distinct feature in single cells. On average, 12,901 m⁶A peaks were identified for each stage (Fig. 2b and Supplementary Table 1), and 4,677 (GV), 3,764 (MII), 3,555 (zygote), 4,389 (two-cell), 6,140 (eight-cell) and 6,104 (blastocyst) gene transcripts that were m⁶A modified were identified (Fig. 2b). Principal component analysis (PCA) revealed that single-oocyte and single-embryo m⁶A data contained sufficient information for accurate clustering according to cell identity and could even distinguish closely related oocyte and embryo stages (Fig. 2c), demonstrating the power of picoMeRIP–seq to identify cell populations. In the future, higher-throughput analysis will likely allow further evaluation of heterogeneity between single cells. Metagene profiles showed typical distribution of m⁶A enrichment near stop codons for all oocyte and embryo stages (Fig. 2d). The m⁶A consensus motif RRACH was identified from the called peaks for all picoMeRIP–seq experiments (Fig. 2e and Extended Data Fig. 8a), consistent with our bulk embryo work²⁹. Enrichment at a certain genomic region compared to what would be expected by chance was assessed (Extended Data Fig. 8b). The stop codon and 3′ UTR showed high m⁶A enrichment for all oocyte and embryo stages. However, GV oocytes presented with the highest enrichment, and the enrichment decreased in MII oocytes, zygotes and two-cell embryos before increasing again in eight-cell and blastocyst-stage embryos (Extended Data Fig. 8b). A similar trend was observed for the fraction of m⁶A peaks where, on average, 39% of GV oocyte m⁶A peaks mapped to the stop codon or 3′ UTR before dropping to 25%, 26% and 24% in MII oocytes, zygotes and two-cell embryos, respectively, and increasing again to 36–37% in eight-cell and blastocyst-stage embryos (Extended Data Fig. 8b). The m⁶A enrichment in protein-coding sequence (CDS) regions was relatively more stable across all oocyte and embryo stages, but the fraction of m⁶A peaks at CDS regions also reached the lowest levels in two-cell embryos. It is unclear whether the dynamics of m⁶A are associated with maternal transcript degradation and/or other maternal-to-zygotic transition events, which needs further exploration. Addressing m⁶A stoichiometry in oocytes and embryos would require the development of new technology. Gene ontology (GO) analysis suggested that m⁶A was marking transcripts of known biological relevance to early embryo development. All investigated oocyte and embryo stages showed that genes marked by m⁶A were enriched in GO terms involved in transcription-related processes (Extended Data Fig. 9a). Genes marked by m⁶A in certain oocyte and embryo stages were enriched in GO terms such as cell proliferation, apoptosis, RNA splicing and embryonic development. By contrast, genes not marked by m⁶A were enriched in GO terms involving basic metabolic processes.

**Fig. 2: Profiling m⁶A methylation in single mouse oocytes and preimplantation embryos.**

As m⁶A is also present in noncoding RNAs, including retrotransposon-derived RNAs¹³, we assessed the capacity of single-oocyte/single-embryo picoMeRIP–seq to study retrotransposons. The enrichment of m⁶A across several types of retrotransposons in mouse GV and MII oocytes, zygotes and two-cell, eight-cell and blastocyst-stage embryos and other tissues was analyzed (Methods). The two retrotransposon subfamilies L1Md_A and L1Md_T of the LINE-1 family were frequently enriched for m⁶A signal in several different mouse tissues and mES cells but not in oocytes or before the eight-cell-stage (Extended Data Fig. 9b). Notably, the MTA subfamily, the evolutionarily younger member of the mammalian apparent LTR retrotransposons (MaLRs) family, was specifically enriched for m⁶A in GV oocytes to two-cell embryos (Extended Data Fig. 9c). Recent work supports dynamic m⁶A enrichment in the transcripts of transposable elements^29,30. MaLR retrotransposable elements are mainly transcribed in oocytes and early embryos, with MTA sequences reported to have a notable expression in oocytes³¹. MTA sequences are maternally expressed, and their RNA is largely degraded around the major zygotic genome activation (ZGA; Extended Data Fig. 9c)^29,31. One may speculate that m⁶A marking of MTA sequences could play a role in stage-specific expression through marking these sequences for maternal transcript degradation. Moreover, the high abundance of the ZGA-related retrotransposon MERVL³¹ was associated with frequent m⁶A marking in single two-cell embryos (Extended Data Fig. 9d), in agreement with bulk embryo work²⁹, suggesting a potential regulatory role of m⁶A-marked MERVL transcripts in ZGA.

In summary, we have developed and benchmarked a picogram-scale method for small-scale and single-cell m⁶A mapping without the use of specialized equipment, allowing it to be readily adopted by many laboratories. We anticipate that picoMeRIP–seq will enable m⁶A profiling of scarce cell types from in vivo sources, such as biopsies from healthy and diseased tissues. With a sensitivity that allows for single-oocyte and single-embryo studies, picoMeRIP–seq will open up the study of the m⁶A landscape of human oocytes and preimplantation embryos in relation to fertility and developmental defects.

Methods

Ethics statement

Zebrafish and mouse experiments were approved by the Animal Research Committee of the Norwegian Food Safety Authority (Forsøksdyrforvaltningens tilsyns- og søknadssystem (FOTS) IDs: 10898 and 24911). Experimental procedures conformed to the ARRIVE guidelines and were conducted in accordance with the ethical guidelines in Directive 2010/63/EU of the European Parliament on the protection of animals used for scientific purposes and Norwegian legislations.

Antibodies and tubes

The following antibodies to m⁶A were used in the experiments: Millipore, ABE572; New England Biolabs (NEB), E1610S; Diagenode, C15200082-50; Synaptic Systems, 202003.

The following low-binding tubes were used in the experiments: Axygen Maxymum Recovery 1.5-ml low-bind tubes (VWR.no, 525-0230); Axygen Maxymum Recovery 0.6-ml low-bind tubes (VWR.no, 525-0229); Axygen Maxymum Recovery 0.2-ml low-bind tubes (VWR.no, 732-0679).

Zebrafish zygotes, mouse liver, mES cells, mouse oocytes and embryo collection

Total RNA was extracted from 100 zebrafish zygotes using TRIzol reagent (Thermo Fisher Scientific) and eluted in 100 μl of RNase-free water. Then, 10-μl and 5-μl samples were taken for ten and five zygotes, respectively, and volumes were adjusted to 12 μl with nuclease-free water. Single zebrafish zygotes were manually picked and distributed into 12 μl of 1× lysis buffer (Takara). Finally, the samples were snap-frozen in liquid nitrogen and stored at −80 °C until further processing.

For Extended Data Fig. 7, mES cells (R1) were purchased from ATCC (SCRC-1011). Twelve microliters of 1× lysis buffer (Takara) was dispensed into each well of a 96-well PCR plate, and cells were sorted into each well according to the manufacturer’s instructions using a BD FACSMelody cell sorter (BD Biosciences). Plates were sealed with sealing films and immediately stored at −80 °C until further processing. Mettl3^–/– and WT control mES cell lines were gifted from S. Geula et al., Jacob H. Hanna laboratory, Weizmann Institute of Science⁸. Mycoplasma testing was performed on a regular basis, and all cell lines were free of Mycoplasma.

Mice were housed in individually ventilated cages (IVC, Scanbur) with a stable light/dark cycle (7:00 to 19:00), with 55 ± 5% relative humidity at 22 ± 2 °C with free access to water and standard rodent chow diet (2018S; 58 E% carbohydrate, 18 E% fat, 24 E%; Teklad Global 18% Protein Rodent Diet, Envigo). The presence of pathogens was monitored quarterly in accordance with the Federation of European Laboratory Animal Science Association guidelines. Animals were specific pathogen free according to the Federation of European Laboratory Animal Science Association recommendations (specific pathogen-free status).

To collect GV oocytes, 8-week-old C57BL6/N females were injected with 5 U of pregnant mare serum gonadotropin (PMSG), and 48 h after PMSG injection, ovaries were dissected, and oocytes were isolated by puncturing the follicles. The procedure was performed in M2 medium supplemented with 0.2 mM 3-isobutyl-1-methylxanthine (a cyclic nucleotide phosphodiesterase inhibitor; Sigma) to prevent the oocytes from further progress to GV breakdown. The cumulus cells were gently removed by pipetting, and the oocytes were briefly exposed to acidic Tyrode’s solution (Sigma) to remove the zona pellucida, followed by three washes in M2 medium.

To collect MII oocytes, 4- to 5-week-old C57BL6/N females were injected with 5 U of PMSG followed by 5 U of human chorionic gonadotropin (hCG) 45 h after PMSG injection. The oviducts were dissected 20–22 h later and transferred to a clean dish containing M2 medium (Sigma). The oviduct ampulla was identified under a stereomicroscope to isolate MII oocytes containing the cumulus mass. Oocytes were treated with 0.3 mg ml^–1 hyaluronidase dissolved in M2 medium to remove the cumulus cells and were exposed to acidic Tyrode’s solution (Sigma) for a few seconds to remove the zona pellucida. Finally, MII oocytes were washed in M2 medium.

To collect early embryos, female mice were superovulated by hormone injection (5 U of PMSG followed by 5 U of hCG 45 h later) and transferred to cages with C57BL/6N males (8 weeks old) for mating. At 27–28 h (zygote), 39–43 h (two cell), 68–70 h (eight cell) and 92–94 h (blastocyst) after hCG administration, female mice were killed by cervical dislocation. Embryos were flushed from the reproductive tract into HEPES-buffered CZB medium and transferred to acidic Tyrode’s solution (Sigma) for a few seconds to remove the zona pellucida, followed by three washes in M2 medium.

The mouse oocytes and embryos were manually picked and sorted into 12 μl of 1× lysis buffer (Takara). The samples were snap-frozen in liquid nitrogen and stored at −80 °C until further use.

Total RNA of C57BL/6N mouse livers was extracted using TRIzol reagent. Poly(A)⁺ RNA was selected twice with a Dynabeads mRNA purification kit (Thermo Fisher Scientific). Identification of ribosomal RNA contamination was conducted using an Agilent 2100 Bioanalyzer according to the user manual.

Real-time qPCR

cDNA was synthesized from m⁶A-immunoprecipitated RNA using SuperScript VILO master mix (Thermo Fisher Scientific), and real-time qPCR was conducted using Fast SYBR Green master mix (Thermo Fisher Scientific) following the manufacturer’s protocol.

The following Pdzd8 and Rdh10 primer sequences were used for real-time qPCR.

Positive m⁶A control Pdzd8:

Forward primer, 5´-GTGGTTCTCTCATAGGACATAAAG-3´

Reverse primer, 5´-CAAAGCCAGTTATCAATACAGTCA-3´

Negative m⁶A control Rdh10:

Forward primer, 5´-AGTGTAGTGCTCTGTTGTGT-3´

Reverse primer, 5´-CGCTGATCTCAAACTGACATC-3´

To calculate the S/N ratio, the following formula was used:

$$\begin{array}{l}{\mathrm{S}}/{\mathrm{N}}\, {\mathrm{ratio}}=\\ \left[\right.2^{(C_t\, {\mathrm{input}}\,(Pdzd8\, ({\mathrm{corrected}}))-C_t\, {\mathrm{IP}}\, (Pdzd8))}\left.\right]/\left[\right.2^{(C_t\, {\mathrm{input}}\,(Rdh10\, ({\mathrm{corrected}}))-C_t\, {\mathrm{IP}}\, (Rdh10))}\left.\right].\end{array}$$

C_t (cycle threshold) input (corrected) = (C_t input – log₂ (10)). We subtract log₂ (10) when the input represents 1/10th of the amount used for RNA immunoprecipitation. This is in order to correct for the difference in starting amount used for the input, and is only applied if using a different amount of starting material for the input as compared to the RNA IP. When the same amount of starting material is used for both the input and the RNA IP, there will be no correction for the input amount.

Single-cell picoMeRIP–seq

The following procedures were performed in a UV decontaminated LAF bench.

rRNA and DNA depletion

For single-tube rRNA and DNA depletion, we performed rRNA depletion using an NEBNext rRNA depletion kit (NEB) with some modifications to the user manual. Specifically, 3 µl of the RNA/probe master mix was added to a 12-µl sample, which was then subjected to a temperature ramp from 95 °C for 2 min to 22 °C at a rate of −0.1 °C s^–1, followed by a 5-min hold at 22 °C. Next, 5 µl of the RNase H reaction mix was added to the samples and incubated at 37 °C for 30 min, after which 30 µl of DNase I digestion mix was added and incubated at 37 °C for an additional 30 min. The resulting samples were purified using 2.2× volume of RNAClean XP beads, washed twice with 80% freshly prepared ethanol and eluted with 78 μl of nuclease-free water. Finally, 2 μl of RiboLock RNase inhibitor (40 U µl^–1) was added to the sample to prevent RNA degradation, resulting in a sample volume of 80 μl.

RNA fragmentation by sonication

A UP100H Ultrasonic Processor (Hielscher) with a 2-mm probe was used to sonicate the samples, using pulse settings with 0.5-s cycles and 27% power. The samples underwent n × 30 s sonication cycles, with 30 s of sonication followed by 30 s on ice for each cycle. The numbers (n) of sonication cycles used in this study for different amounts of input were optimized and can be found in Supplementary Table 1. For mouse liver samples, 10 ng and 100 pg were used to construct input libraries after sonication. RNA from pools of zygotes was used for input controls for single zebrafish zygotes (Supplementary Table 1). In the case of single mouse oocytes and early embryos, 10% of multiple oocyte/embryo RNA was removed and served as input control (Supplementary Table 1). To the samples consisting of 80 μl, 20 µl of 5× IP buffer (50 mM Tris-HCl (pH 7.5), 750 mM NaCl, 0.5% (vol/vol) NP-40 and 5 U µl^–1 RiboLock RNase inhibitor) was added to make a final volume of 100 µl for sonication.

Antibody–bead incubation

Before use, Dynabeads (Invitrogen) were washed by taking 20 μl of beads and washing them twice with 1× IP buffer (200 µl of 5× IP buffer supplemented with 800 µl of nuclease-free water) by vortexing, quickly centrifuging on a MiniGalaxy and placing on a magnetic rack before discarding the supernatant. In a separate tube, the antibody was diluted by taking 4 µl of anti-m⁶A, 16 µl of 5× IP buffer and 60 µl of nuclease-free water and mixing by gentle vortexing. The antibody-containing solution was added to the washed beads, and the antibody–beads were incubated overnight with head-over-tail rotation on a HulaMixer at 4 °C (40 r.p.m.). After conducting antibody testing and comparison experiments, anti-m⁶A from Millipore (ABE572) was selected for use in all subsequent experiments.

IP and washes

Antibody-coated beads were captured on the tube wall in a magnetic rack. The supernatant from the antibody–bead incubation was discarded. Antibody-coated beads were washed twice with 200 μl of 1× IP buffer by vortexing (four times for 5 s each) to remove unbound antibodies that would otherwise compete for binding to the epitope. At the end of the second wash, the antibody-coated beads were transferred to 0.2-ml PCR tubes. From 200 μl, a volume of 10 μl of homogenous antibody-coated bead solution was transferred to each PCR tube. The tubes were quickly centrifuged on a MiniGalaxy and placed in a magnetic rack for at least 2 min or until the solution became clear. After removing the supernatant, 100 μl of sonicated sample RNA solution was added to each antibody–bead-containing tube, and the samples were incubated with head-over-tail rotation on a HulaMixer at 4 °C for 2 h (40 r.p.m.). Tubes were quickly centrifuged on a MiniGalaxy and placed in a magnetic rack. The supernatant was removed, and the RNA–antibody–bead complexes were washed four times in the following solutions, quickly spun and placed in a magnetic rack in between washes: washed once with ice-cold medium-stringency RIPA buffer (10 mM Tris-HCl (pH 8.0), 300 mM NaCl, 1 mM EDTA, 0.5 mM EGTA, 1% (vol/vol) Triton X-100, 0.2% (vol/vol) SDS and 0.1% (vol/vol) sodium deoxycholate), washed twice with ice-cold high-stringency RIPA buffer (10 mM Tris-HCl (pH 8.0), 350 mM NaCl, 1 mM EDTA, 0.5 mM EGTA, 1% (vol/vol) Triton X-100, 0.23% (vol/vol) SDS and 0.1% (vol/vol) sodium deoxycholate) and washed once with ice-cold medium-stringency RIPA buffer. After the four washes, tubes were quickly spun and placed in a magnetic rack, and the supernatant was discarded. The RNA–antibody–bead complexes were then resuspended in 100 μl of 1× IP buffer and incubated for 5 min. The samples were then quickly spun and placed in a magnetic rack, and the supernatant was removed. The RNA–antibody–bead complexes were resuspended in 147.9 μl of elution buffer (5 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.05 % (vol/vol) SDS and 1 U µl^–1 RiboLock RNase inhibitor). Proteinase K (2.1 μl; NEB) was added to each tube, and tubes were then incubated on a Thermomixer at 1,200 r.p.m. and 55 °C for 1.5 h. After incubation, the tubes were briefly centrifuged and incubated further on a Thermomixer at 80 °C for 20 min to inactivate the Proteinase K. The samples were then placed in a magnetic rack for 2–3 min, and the supernatant containing the m⁶A-immunoprecipitated RNA was transferred to a new 1.5-ml low-binding tube. The remaining beads were resuspended again in 147.9 μl of elution buffer, and 2.1 μl of Proteinase K was added. The samples were placed immediately in a Thermomixer at 1,200 r.p.m. and 55 °C for 5 min, followed by inactivation of Proteinase K on a Thermomixer at 80 °C for 20 min. The tubes were then placed back in a magnetic rack for 2–3 min, and the supernatant was collected and pooled with the first supernatant in the same 1.5-ml low-binding tube to recover as much of the m⁶A-immunoprecipitated RNA as possible, resulting in a total volume of about 300 μl.

Ethanol precipitation

For both input and immunoprecipitated RNA samples, nuclease-free water was added to each tube to result in a final volume of 400 μl. Next, 40 μl of 3 M sodium acetate (pH 5.2; Thermo Fisher Scientific) and 10 μl of linear acrylamide 5 mg μl^–1 (Thermo Fisher Scientific) were added, followed by 1,000 μl of ice-cold 100% ethanol. The samples were vigorously vortexed without centrifugation or spinning and immediately placed at –80 °C for at least 2 h or overnight until completely frozen. Samples were recovered from –80 °C and allowed to briefly thaw on ice, and it was visually confirmed that all samples had thawed before starting centrifugation. The samples were centrifuged at 20,000g at 4 °C for 15 min, and the supernatant was carefully removed without disturbing the visible pellet. The pellet was then washed twice with 1 ml of ice-cold 75% ethanol. For washes, 75% ethanol was added, and the tube was gently vortexed until the pellet detached from the bottom; centrifugation was repeated as described above. After the last wash, as much as possible of the supernatant was removed, the tube lid was left open until all ethanol had evaporated, and the dried pellet was resuspended in 7 μl of nuclease-free water.

Library preparation and sequencing

With modifications to the manufacturer’s protocol, as described earlier, the SMART-Seq stranded kit (Takara, 634442) was used to construct sequencing libraries. For the fragmented input or immunoprecipitated RNA, we performed the protocol without the fragmentation step. After the first PCR amplification and following AMPure bead purification, we resuspended the beads by adding 46.5 µl of nuclease-free water and skipped the ribosomal cDNA depletion protocol in Section D. We then incubated the samples at room temperature for 5 min to allow time for rehydration and recovered 46 µl of supernatant from each sample. We continued following the protocol until completion. The libraries were assessed for quantity using KAPA library quantification kits (Roche), and size distribution was assessed using TapeStation D1000 ScreenTape (Agilent Technologies). In combination, this information provided for good estimation of pooling at equimolar ratios. The pooled libraries were sequenced on a NovaSeq system (Illumina) with 50-base pair (bp) paired-end mode.

Spike-in controls

Spike-in control RNAs and qPCR primers are from the EpiMark N⁶-methyladenosine enrichment kit. Before adding the spike-in control RNAs to an RNA sample for picoMeRIP, each control RNA was diluted to 0.001 fmol μl^–1, and 1 μl of the diluted control RNA was added. For the picoMeRIP–qPCR titration experiment, each control RNA was diluted to 1 fmol in 100 μl. The two control RNAs were then mixed together at the indicated ratio used for picoMeRIP–qPCR (Extended Data Fig. 4a).

Western blotting

Western blotting was performed as previously described³². Total proteins were extracted using RIPA lysis buffer (Thermo Scientific, 89900) containing protease inhibitor cocktail (Sigma-Aldrich, P8340) and phenylmethylsulfonyl fluoride (Sigma-Aldrich, P7626). Protein samples were denatured and resolved by Bolt Bis-Tris Plus gels (Invitrogen). Separated proteins were transferred onto nitrocellulose membranes and detected with primary antibodies to METTL3 (Abcam, ab195352) and GAPDH (Abcam, ab125247). The following secondary antibodies were used: donkey anti-mouse horseradish peroxidase (HRP; Abcam, ab6820) and donkey anti-rabbit (HRP; Abcam, ab6802). Blots were developed by enhanced chemiluminescence (Thermo Fisher Scientific, 32209 and 34095) and scanned with a Bio-Rad ChemiDoc XRS+ system.

Sequencing data processing, m⁶A peak identification and motif analysis

The code used for quality check, alignment and filtering of sequencing reads, identification of m⁶A peaks and m⁶A consensus motifs and abundance estimation of gene transcripts is available at GitHub (https://github.com/Augroup/MeRipBox).

Quality of raw sequencing reads was assessed using FastQC (v0.11.8; https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) with the default parameters. After trimming sequencing adapters and low-quality bases with Cutadapt (v1.8.1)³³ with the parameter ‘-q 20,20 -m 20 –max-n 0.01 –trim-n’, the clean read pairs were mapped to the reference genomes (mm10 for mouse and danRer11 for zebrafish) and reference sequences (for the two spike-in RNA controls, obtained from the manual for the EpiMark N⁶-methyladenosine enrichment kit) using HISAT2 (v2.1.0)³⁴ with the parameter ‘-5 8 –no-mixed –no-discordant’. Multiply mapped read pairs (that is, more than one genomic locus per read pair as reported by HISAT2) were discarded. We further filtered out PCR duplicates by using SAMtools (v1.9)³⁵ and read mates that overlapped with the genomic coordinates of ribosomal RNAs by using BEDTools (v2.28.0)³⁶. These uniquely aligned, deduplicated and non-rRNA reads were used for m⁶A peak calling.

We identified m⁶A peaks using a model-based method called MACS (v2.1.2)²⁶ with the mode ‘callpeak’, the parameter ‘–keep-dup all -B –nomodel –call-summits’ and estimated transcriptome sizes of ‘-gsize 242010196’ for mouse and ‘-gsize 117608789’ for zebrafish. The statistical significance cutoff for the identified m⁶A peaks was a q value of <0.05. The processed reads and identified m⁶A peaks are summarized in Supplementary Table 1.

Focusing on the 400-bp region where the center is the genomic coordinate of m⁶A peak summits reported by MACS2, we searched for consensus motifs by using Homer (v4.11.1)³⁷ findMotifsGenome.pl with the parameter ‘-rna -len 5,6,7,8’. The genomic strand of m⁶A peak summits was deduced based on their overlap with the annotated transcripts. The P values for all motifs were calculated and reported by Homer under the assumptions described at the Homer website (http://homer.ucsd.edu/homer/motif/). For mouse ES cell, oocyte and embryo samples, the four or five base positions starting with the GGAC motif were plotted in the corresponding figure panels.

Based on the gene annotation libraries (Gencode vM23 for mouse and Ensembl v100 for zebrafish), gene transcript expression abundance (transcript per million (TPM)) of input samples was estimated using StringTie (v1.3.5)³⁸ with the parameter ‘-e -A’.

Read density pileup visualization, Pearson correlation, m⁶A signal and PCA

We further removed the unpaired read mate from the reads that were used for m⁶A peak calling. We made the genome coverage bigWig format files (bin size = 10 bp, normalized by reads per kilobase per million reads (RPKM)) using deepTools (v3.2.0)³⁹ bamCoverage with the parameter ‘-bs 10 –normalizeUsing RPKM’. Based on these bigWig files, we plotted the distribution of read density along exonic regions of the selected transcripts, which had higher expression abundance (by input samples) than other transcripts of a gene, and also performed transcriptome-wide (exonic region) Pearson correlation analysis using deepTools multiBigwigSummary (‘bins’ mode and window size = 1 kilobase (kb), exonic region) and plotCorrelation.

Based on the bigWig files of input and IP samples, for each 10-bp bin, we defined the m⁶A signal as the log₂ ratio of (IP RPKM + 1) over (input RPKM + 1) and performed PCAs for single mouse oocytes and embryo samples using deepTools multiBigwigSummary (‘bins’ mode and window size = 1 kb, exonic region) and plotPCA.

m⁶A peak annotation and m⁶A gene definition

Using BEDTools intersect, all m⁶A peaks were assigned a genomic feature by looking at the overlap between the peak summit and annotated genomic features (Gencode vM23 for mouse and Ensembl v100 for zebrafish), including (1) stop codon, which was defined as the region from the upstream 200 bp to downstream 200 bp surrounding the annotated stop codon, (2) 3′ UTR, (3) 5′ UTR, (4) CDS, (5) exon, (6) intron and (7) intergenic. If the peak summit of an m⁶A peak overlapped with more than one genomic feature, we chose only the one with the following order of priority: stop codon, 3′ UTR, 5′ UTR, CDS, exon, intron and intergenic.

Using MetaPlotR⁴⁰, we generated the metagene profiles of m⁶A peak summits along the 5′ UTR, CDS and 3′ UTR of protein-coding genes. For each gene, only one transcript/isoform with highest expression abundance was used for plotting.

We defined a gene as m⁶A modified/marked if any of its genomic features (stop codon, 3′ UTR, 5′ UTR, CDS, exon and intron) overlapped with ≥1 m⁶A peak summit.

GO analyses

For each developmental stage of mouse oocytes and early embryos, we only chose an m⁶A gene if any of its genomic features (stop codon, 3′ UTR, 5′ UTR, CDS and exon) overlapped with ≥1 m⁶A peak summit in both biological replicates. The top 1,000 statistically significant (based on P values of assigned m⁶A peaks) m⁶A genes per developmental stage were used for GO analyses with the online tool DAVID (v6.8)⁴¹ using all mouse genes as the background. Only GO terms with a P value of <0.05 in the library ‘GOTERM_BP_DIRECT’ were selected for visualization. As a comparison, we also performed GO analyses for the 1,000 randomly sampled genes, which were expressed (TPM > 1) and did not overlap with any m⁶A peaks for each developmental stage.

m⁶A enrichment on retrotransposon-derived RNAs

Mouse retrotransposon annotation was obtained on 5 March 2021 using the UCSC Table Browser with the settings ‘clade=Mammal, genome=Mouse, assembly=GRCm38/mm10, group=Variation and Repeats, track=RepeatMasker, table=rmsk’. For each retrotransposon subfamily, we used only the genomic locus/copy if (1) its length was ≥90% of the full-length reference consensus sequence and (2) it had <50% overlap percentage with exons of annotated genes.

For a given retrotransposon locus, the mean RPKM value (from input samples) across all genomic bins (size = 10 bp) overlapping with this locus was denoted as its expression value, and the mean m⁶A signal value across all genomic bins (size = 10 bp) overlapping with this locus was denoted as its m⁶A signal value. At each developmental stage of mouse oocytes and embryos, for each retrotransposon locus, the mean expression value and the mean m⁶A signal value across two biological replicates were calculated. For each retrotransposon subfamily, the fraction of loci with >0 m⁶A signal value was calculated.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All sequencing data generated in this study are available in the Gene Expression Omnibus (GEO) under accession number GSE184893 (ref. ⁴²). The m⁶A-seq data of eight mouse tissues were obtained from Genome Sequence Archive in BIG Data Center, Beijing Institute of Genomics (BIG), Chinese Academy of Sciences (http://bigd.big.ac.cn/gsa) under accession number CRA001962 (ref. ²⁷). The processed m⁶A-seq dataset of bulk zebrafish zygotes was from GEO under accession numbers GSM2088167 and GSM2088177 (ref. ²⁸). Source data are provided with this paper.

Code availability

The major analysis scripts in this study were integrated into the bioinformatics pipeline MeRipBox, which is available at GitHub (https://github.com/Augroup/MeRipBox)⁴³.

References

Zhao, B. S., Roundtree, I. A. & He, C. Post-transcriptional gene regulation by mRNA modifications. Nat. Rev. Mol. Cell Biol. 18, 31–42 (2017).
Article CAS PubMed Google Scholar
Zaccara, S., Ries, R. J. & Jaffrey, S. R. Reading, writing and erasing mRNA methylation. Nat. Rev. Mol. Cell Biol. 20, 608–624 (2019).
Article CAS PubMed Google Scholar
Xiao, W. et al. Nuclear m⁶A reader YTHDC1 regulates mRNA splicing. Mol. Cell 61, 507–519 (2016).
Article CAS PubMed Google Scholar
Fustin, J. M. et al. RNA-methylation-dependent RNA processing controls the speed of the circadian clock. Cell 155, 793–806 (2013).
Article CAS PubMed Google Scholar
Wang, X. et al. N⁶-Methyladenosine-dependent regulation of messenger RNA stability. Nature 505, 117–120 (2014).
Article PubMed Google Scholar
Ke, S. et al. m⁶A mRNA modifications are deposited in nascent pre-mRNA and are not required for splicing but do specify cytoplasmic turnover. Genes Dev. 31, 990–1006 (2017).
Article CAS PubMed PubMed Central Google Scholar
Wang, X. et al. N⁶-Methyladenosine modulates messenger RNA translation efficiency. Cell 161, 1388–1399 (2015).
Article CAS PubMed PubMed Central Google Scholar
Geula, S. et al. Stem cells. m⁶A mRNA methylation facilitates resolution of naive pluripotency toward differentiation. Science 347, 1002–1006 (2015).
Article CAS PubMed Google Scholar
Zheng, G. et al. ALKBH5 is a mammalian RNA demethylase that impacts RNA metabolism and mouse fertility. Mol. Cell 49, 18–29 (2013).
Article CAS PubMed Google Scholar
Batista, P. J. et al. m⁶A RNA modification controls cell fate transition in mammalian embryonic stem cells. Cell Stem Cell 15, 707–719 (2014).
Article CAS PubMed PubMed Central Google Scholar
Xiang, Y. et al. RNA m⁶A methylation regulates the ultraviolet-induced DNA damage response. Nature 543, 573–576 (2017).
Article CAS PubMed PubMed Central Google Scholar
Huang, H., Weng, H. & Chen, J. m⁶A modification in coding and non-coding RNAs: roles and therapeutic implications in cancer. Cancer Cell 37, 270–288 (2020).
Article CAS PubMed PubMed Central Google Scholar
Liu, J. et al. The RNA m⁶A reader YTHDC1 silences retrotransposons and guards ES cell identity. Nature 591, 322–326 (2021).
Article CAS PubMed Google Scholar
Meyer, K. D. et al. Comprehensive analysis of mRNA methylation reveals enrichment in 3′ UTRs and near stop codons. Cell 149, 1635–1646 (2012).
Article CAS PubMed PubMed Central Google Scholar
Dominissini, D. et al. Topology of the human and mouse m⁶A RNA methylomes revealed by m⁶A-seq. Nature 485, 201–206 (2012).
Article CAS PubMed Google Scholar
Chen, K. et al. High-resolution N⁶-methyladenosine (m⁶A) map using photo-crosslinking-assisted m⁶A sequencing. Angew. Chem. Int. Ed. 54, 1587–1590 (2015).
Article CAS Google Scholar
Linder, B. et al. Single-nucleotide-resolution mapping of m⁶A and m⁶Am throughout the transcriptome. Nat. Methods 12, 767–772 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ke, S. et al. A majority of m⁶A residues are in the last exons, allowing the potential for 3′ UTR regulation. Genes Dev. 29, 2037–2053 (2015).
Article CAS PubMed PubMed Central Google Scholar
Molinie, B. et al. m⁶A-LAIC-seq reveals the census and complexity of the m⁶A epitranscriptome. Nat. Methods 13, 692–698 (2016).
Article CAS PubMed PubMed Central Google Scholar
Meyer, K. D. DART-seq: an antibody-free method for global m⁶A detection. Nat. Methods 16, 1275–1280 (2019).
Article CAS PubMed PubMed Central Google Scholar
Garcia-Campos, M. A. et al. Deciphering the ‘m⁶A code’ via antibody-independent quantitative profiling. Cell 178, 731–747 (2019).
Article CAS PubMed Google Scholar
Zhang, Z. et al. Single-base mapping of m⁶A by an antibody-independent method. Sci. Adv. 5, eaax0250 (2019).
Article CAS PubMed PubMed Central Google Scholar
Wang, Y., Xiao, Y., Dong, S., Yu, Q. & Jia, G. Antibody-free enzyme-assisted chemical approach for detection of N⁶-methyladenosine. Nat. Chem. Biol. 16, 896–903 (2020).
Article CAS PubMed Google Scholar
Tegowski, M., Flamand, M. N. & Meyer, K. D. scDART-seq reveals distinct m⁶A signatures and mRNA methylation heterogeneity in single cells. Mol. Cell 82, 868–878 (2022).
Article CAS PubMed PubMed Central Google Scholar
Dahl, J. A. et al. Broad histone H3K4me3 domains in mouse oocytes modulate maternal-to-zygotic transition. Nature 537, 548–552 (2016).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Y. et al. Model-based analysis of ChIP–seq (MACS). Genome Biol. 9, R137 (2008).
Article PubMed PubMed Central Google Scholar
Liu, J. et al. Landscape and regulation of m⁶A and m⁶Am methylome across human and mouse tissues. Mol. Cell 77, 426–440 (2020).
Article CAS PubMed Google Scholar
Zhao, B. S. et al. m⁶A-dependent maternal mRNA clearance facilitates zebrafish maternal-to-zygotic transition. Nature 542, 475–478 (2017).
Article CAS PubMed PubMed Central Google Scholar
Wang, Y. et al. The RNA m⁶A landscape of mouse oocytes and preimplantation embryos. Nat. Struct. Mol. Biol. 30, 703–709 (2023).
Wu, Y. et al. N⁶-Methyladenosine regulates maternal RNA maintenance in oocytes and timely RNA decay during mouse maternal-to-zygotic transition. Nat. Cell Biol. 24, 917–927 (2022).
Peaston, A. E. et al. Retrotransposons regulate host genes in mouse oocytes and preimplantation embryos. Dev. Cell 7, 597–606 (2004).
Article CAS PubMed Google Scholar
Jin, K. X. et al. N⁶-Methyladenosine (m⁶A) depletion regulates pluripotency exit by activating signaling pathways in embryonic stem cells. Proc. Natl Acad. Sci. USA 118, e2105192118 (2021).
Marcel, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal 17, 10–12 (2011).
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).
Article CAS PubMed PubMed Central Google Scholar
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central Google Scholar
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Article CAS PubMed PubMed Central Google Scholar
Heinz, S. et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell 38, 576–589 (2010).
Article CAS PubMed PubMed Central Google Scholar
Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ramirez, F., Dundar, F., Diehl, S., Gruning, B. A. & Manke, T. deepTools: a flexible platform for exploring deep-sequencing data. Nucleic Acids Res. 42, W187–W191 (2014).
Article CAS PubMed PubMed Central Google Scholar
Olarerin-George, A. O. & Jaffrey, S. R. MetaPlotR: a Perl/R pipeline for plotting metagenes of nucleotide modifications and other transcriptomic sites. Bioinformatics 33, 1563–1564 (2017).
Article CAS PubMed PubMed Central Google Scholar
Huang, D. W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4, 44–57 (2009).
Article CAS PubMed Google Scholar
Li, Y., Wang, Y., Klungland, A., Au, K., Dahl, J. A. Single-cell m⁶A mapping in vivo using picoMeRIP–seq. Gene Expression Omnibus https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE184893 (2023).
Li, Y., Wang, Y., Klungland, A., Au, K., Dahl, J. A. Single-cell m⁶A mapping in vivo using picoMeRIP–seq. GitHub https://github.com/Augroup/MeRipBox (2023).

Download references

Acknowledgements

We thank P. Aleström for planning and supervising the collection of zebrafish zygotes. We thank G. Luo for discussions. We thank J. H. Hanna for Mettl3^–/– and WT control mES cell lines. We thank the Norwegian Transgenic Center for help with animal housing and oocyte collection. We thank the Norwegian Sequencing Centre (Oslo University Hospital and University of Oslo; www.sequencing.uio.no) and the Genomics Core Facility (Oslo University Hospital; https://oslo.genomics.no) for high-throughput sequencing. K.F.A. and Y.W. acknowledge an institutional fund from the Department of Biomedical Informatics, The Ohio State University; an institutional fund from the Department of Computational Medicine and Bioinformatics, University of Michigan; and the National Institutes of Health (R01HG008759, R01HG011469 and R01GM136886). M.L. was supported by the Danish National Research Foundation (grant DNRF115). This work was supported by the South‐Eastern Norway Regional Health Authority, Early Career Grant 2016058 and Grant 2018063, and by the Research Council of Norway, FRIPRO grant 289467 (to J.A.D.). Y.L. was supported by a UiO:Life Science convergence environment grant (to A.K., G.D.G., P.F. and J.A.D.).

Author information

Sherif Khodeer
Present address: KU Leuven-University of Leuven, Department of Development and Regeneration, Leuven Institute for Single-Cell Omics (LISCO), Leuven Stem Cell Institute, Leuven, Belgium
These authors contributed equally: Yanjiao Li, Yunhao Wang.

Authors and Affiliations

Department of Microbiology, Oslo University Hospital, Rikshospitalet, Oslo, Norway
Yanjiao Li, Maria Vera-Rodriguez, Linda Ellevog Skuggen, Madeleine Fosslie, Sherif Khodeer, Eva Kristine Klemsdal, Kang-Xuan Jin, Arne Klungland & John Arne Dahl
Department of Molecular Medicine, Institute of Basic Medical Sciences, University of Oslo, Oslo, Norway
Yanjiao Li
Department of Biomedical Informatics, The Ohio State University, Columbus, OH, USA
Yunhao Wang & Kin Fai Au
Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
Yunhao Wang & Kin Fai Au
Department of Reproductive Medicine, Oslo University Hospital, Rikshospitalet, Oslo, Norway
Maria Vera-Rodriguez, Trine Skuland, Marie Indahl, Peter Fedorcsak & Gareth D. Greggains
Faculty of Veterinary Medicine, Norwegian University of Life Sciences, Aas, Norway
Leif Christopher Lindeman & Erik M. K. Rasmussen
Norwegian Transgenic Centre, Institute of Basic Medical Sciences, University of Oslo, Oslo, Norway
Ingunn Jermstad, Shaista Khan & Knut Tomas Dalen
Division of Gynaecology and Obstetrics, Institute of Clinical Medicine, Faculty of Medicine, University of Oslo, Oslo, Norway
Trine Skuland, Marie Indahl & Peter Fedorcsak
Center for Chromosome Stability, Department of Cellular and Molecular Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Mads Lerdrup
Department of Microbiology, Institute of Clinical Medicine, Faculty of Medicine, University of Oslo, Oslo, Norway
Arne Klungland
Biomedical Informatics Shared Resources, The Ohio State University, Columbus, OH, USA
Kin Fai Au

Authors

Yanjiao Li
View author publications
You can also search for this author in PubMed Google Scholar
Yunhao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Maria Vera-Rodriguez
View author publications
You can also search for this author in PubMed Google Scholar
Leif Christopher Lindeman
View author publications
You can also search for this author in PubMed Google Scholar
Linda Ellevog Skuggen
View author publications
You can also search for this author in PubMed Google Scholar
Erik M. K. Rasmussen
View author publications
You can also search for this author in PubMed Google Scholar
Ingunn Jermstad
View author publications
You can also search for this author in PubMed Google Scholar
Shaista Khan
View author publications
You can also search for this author in PubMed Google Scholar
Madeleine Fosslie
View author publications
You can also search for this author in PubMed Google Scholar
Trine Skuland
View author publications
You can also search for this author in PubMed Google Scholar
Marie Indahl
View author publications
You can also search for this author in PubMed Google Scholar
Sherif Khodeer
View author publications
You can also search for this author in PubMed Google Scholar
Eva Kristine Klemsdal
View author publications
You can also search for this author in PubMed Google Scholar
Kang-Xuan Jin
View author publications
You can also search for this author in PubMed Google Scholar
Knut Tomas Dalen
View author publications
You can also search for this author in PubMed Google Scholar
Peter Fedorcsak
View author publications
You can also search for this author in PubMed Google Scholar
Gareth D. Greggains
View author publications
You can also search for this author in PubMed Google Scholar
Mads Lerdrup
View author publications
You can also search for this author in PubMed Google Scholar
Arne Klungland
View author publications
You can also search for this author in PubMed Google Scholar
Kin Fai Au
View author publications
You can also search for this author in PubMed Google Scholar
John Arne Dahl
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.L., A.K. and J.A.D. conceived the study and designed the experiments. Y.L. developed the picoMeRIP–seq method, with contributions from M.V.-R., L.E.S., E.K.K. and M.F. under the supervision of J.A.D. Y.L. performed all single-cell picoMeRIP–seq experiments, supervised by J.A.D. Y.W. performed the bioinformatics analysis with assistance from Y.L. and M.L. K.F.A. supervised the data analysis. Y.L., S. Khodeer. and K.-X.J. cultured and collected mESCs. Y.L. planned and collected mouse material with help from I.J., S. Khan., T.S. and M.I. Mouse material collection was supervised by G.D.G., K.T.D., P.F. and J.A.D. Y.L. planned and collected zebrafish zygotes with assistance from E.M.K.R. and L.C.L. Zebrafish zygote collection was supervised by J.A.D. Y.L., Y.W., A.K., K.F.A. and J.A.D. prepared the manuscript with input from all authors.

Corresponding authors

Correspondence to Arne Klungland, Kin Fai Au or John Arne Dahl.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Peer review

Peer review information

Nature Biotechnology thanks Gene Yeo and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Optimization and development of picoMeRIP-seq.

a, qPCR assessment of signal-to-noise ratio for the evaluation of the effect of increased stringency of the wash buffers. Mouse liver polyA selected RNA (10 ng) was used in these experiments. The previously validated m⁶A positive (Pdzd8) and negative (Rdh10) loci were used to calculate the signal-to-noise ratio (See Methods). Data are presented as mean ± standard deviation (SD), with n = 3 independent experiments for each experimental condition. Unpaired, two-tailed t-test was used. b, qPCR assessment of signal-to-noise ratio for the evaluation of the effect of low-binding tubes. Mouse liver polyA selected RNA (10 ng) was used in these experiments. Data are presented as mean ± SD values, with n = 3 independent experiments for each experimental condition. Unpaired, two-tailed t-test was used. c, qPCR assessment of signal-to-noise ratio for the optimized picoMeRIP method with different antibodies. Mouse liver polyA selected RNA (10 ng) was used in these experiments. Data are presented as mean ± SD values, with n = 6; 7; 4; 4 independent experiments for Millipore, NEB, Diagenode, Synaptic Systems antibodies, respectively. Unpaired, two-tailed t-test was used. d, Synthetic gel image from an Agilent 2100 Bioanalyzer electrophoreses run showing the size of sequencing libraries generated from picoMeRIP-seq of 10 ng, 1 ng and 100 pg mouse liver polyA selected RNA. Library preparation adds 139 bp to the size of the immunoprecipitated RNA fragments. The top line indicates the applied number of sonication cycles with the Hielscher UP100H sonicator. Each cycle is 30 seconds sonication plus 30 seconds on ice. e, Tailored SMART library preparation with a single-tube protocol to include as much as possible of the immunoprecipitated RNA. The SMART scN6 Primer (green) random primers allows the generation of cDNA from all immunoprecipitated RNA fragments. After the reverse transcription, the SMART Stranded Adapter (pink) will be linked with the cDNA product. Next, a first round of PCR amplification (PCR 1) adds full-length Illumina adapters, including barcodes. To minimize the number of samples to be processed downstream, individual samples can be pooled together for a second round of PCR amplification (PCR 2) using primers universal to all libraries. The final library is compatible with any Illumina sequencing platform. Created with BioRender.com.

Extended Data Fig. 2 Assessment of picoMeRIP-seq using mouse liver RNA.

a, Venn diagrams showing overlap of m⁶A peaks from two replicates of published work starting with 3 µg total RNA (Liu et al²⁷), and for two replicates of picoMeRIP-seq experiments starting with either 10 ng or 1 ng of polyA RNA. b, Consensus motifs identified within m⁶A peaks from picoMeRIP-seq experiments with titration of the starting amount of polyA RNA. c, Metagene profiles showing the enrichment of m⁶A peaks along protein-coding gene transcripts.

Extended Data Fig. 3 Evaluation of the reliability and robustness of picoMeRIP-seq by testing a range of different statistical significance cutoffs (that is, q value reported by MACS).

a, Number of m⁶A peaks. b, Fraction of peaks with RRACH motif. c, Fraction of peaks which are located at stop codon or 3’ UTR. d, Fraction of peaks overlapping between two biological replicates. e, Fraction of peaks overlapping between picoMeRIP-seq (10 ng, biological replicate 1) and published data²⁷ (3 µg RNA, biological replicate 1). f, Fraction of peaks with RRACH motif for the peaks that are identified in both biological replicates (shared) and the peaks that are only supported by one replicate (unique), respectively. g, Fraction of peaks that are located at stop codon or 3’ UTR for the shared peaks and unique peaks, respectively.

Extended Data Fig. 4 Characterization of specificity and background of picoMeRIP.

a, Top panel, schematic illustration of experimental setup for titration of two control RNAs, m⁶A-modified GLuc, and unmodified CLuc, for picoMeRIP and qPCR assessment. Bottom panel, quantified immunoprecipitation levels of m⁶A-modified GLuc, and unmodified CLuc, after picoMeRIP. picoMeRIP signal was normalized to 100% GLuc. Data are presented as mean ± SD values, with n = 2 independent experiments for each experimental condition. Created with BioRender.com. b, Experimentally observed m⁶A levels of m⁶A-modified control RNA as compared to the expected ones based on the titration experiment described in panel a. Data are presented as mean ± SD values, with n = 2 independent experiments for each experimental condition. R is the correlation coefficient in regression analysis. c, qPCR assessment of picoMeRIP m⁶A signal over background with two control RNAs used as spike-ins, m⁶A-modified GLuc, and unmodified CLuc, and for the previously validated m⁶A positive (Pdzd8) and negative (Rdh10) liver transcripts. Data are presented as mean ± SD values, with n = 3 independent experiments for each experimental condition. d, Western blot of METTL3 and GAPDH for WT and Mettl3 deficient (“knock-out” (KO)) mES cell lines. GAPDH was used as loading control. Two independent western blot experiments were performed with similar results. e, Schematic illustration of experimental setup to compare number of m⁶A peaks and peak signal strength of picoMeRIP-seq from WT and Mettl3 deficient (KO) mES cells, including spiked-in control RNAs. Mouse liver mRNA and mES cell mRNA were added in a 1:1 ratio. Two control RNAs were used as spike-ins, m⁶A-modified GLuc, and unmodified CLuc, and added at a 1:1 ratio. Created with BioRender.com. f, Number of picoMeRIP-seq m⁶A peaks uniquely identified in WT and Mettl3 deficient (KO) mES cells, and co-identified in mouse liver and WT mES cells. g, Box plots of m⁶A signal for comparison of picoMeRIP-seq peaks uniquely identified in WT and Mettl3 deficient (KO) mES cells. Middle vertical line (bold) depicts the median, and the 25^th percentile to 75^th percentile is shown. Whiskers extend to the most extreme data point within 1.5 interquartile range of the quartiles. The p value was calculated using a Wilcoxon rank sum test (two-sided). The corresponding m⁶A peak numbers for each of three groups are shown in panel f. h, Assessment of false discovery rate of picoMeRIP-seq by spike-in m⁶A-modified GLuc, and unmodified Cluc control RNAs.

Source data

Extended Data Fig. 5 picoMeRIP-seq from single zebrafish zygotes.

a, b, c, Scatter plots showing the correlation of picoMeRIP-seq experiments for (a) pooled zygotes; (b) pooled zygotes and single zygote; and (c) single zygote. The scatter plots are plotted using the natural log of read coverage (RPKM + 1). r, Pearson correlation coefficient. d, Heatmap comparing picoMeRIP-seq experiments from pooled zygotes and single zygote. r, Pearson correlation coefficient. e, Number of m⁶A peaks. Peak detection when combining the data from five single-zygote experiments is also shown. The numbers of uniquely mapped, deduplicated, non-rRNA-derived sequencing reads are shown inside the bars within parentheses. f, Number of m⁶A peaks called from different picoMeRIP-seq experiments from pools of zygotes and for single zygotes shown as full columns, as reference. Number of m⁶A peaks called from randomly down-sampled data from a picoMeRIP-seq experiments from a pool of ten zygotes is indicated with solid black lines. Down-sampling was performed to produce equal reads numbers as that originally yielded from each of the indicated experiments. The numbers of detected peaks from single-zygote picoMeRIP-seq is close to the expected maximum as estimated by peak calling from randomly down-sampled data from pools of ten zygotes. As a control, peak calling was also performed from randomly down-sampled data from input.

Extended Data Fig. 6 Comparison of picoMeRIP-seq from single zebrafish zygotes and from pools of zygotes.

a, Fraction of overlap of m⁶A peaks between single-zygote picoMeRIP-seq experiments and picoMeRIP-seq from pools of zygotes. The topmost row is the fraction of overlap of m⁶A transcripts with previously published work (Zhao et al²⁸, GEO accession: GSM2088167 and GSM2088177). b, Consensus motifs identified within m⁶A peaks from single-zygote picoMeRIP-seq experiments and for picoMeRIP-seq from pools of zygotes.

Extended Data Fig. 7 picoMeRIP-seq from FACS sorted mouse ES cells.

a, Genome browser snap shots of two transcripts with m⁶A enrichment (transcript ID: ENSMUST00000026865.14 for Jade1, ENSMUST00000105344.7 for Tcf3). b, Transcriptome-wide correlation analyses (sequencing read coverage) between picoMeRIP-seq experiments from 1,000, 100 and 10 mouse ES cells. c, Heatmap showing fraction of overlap of m⁶A peaks between picoMeRIP-seq experiments with 1,000, 100 and 10 mouse ES cells. Peaks identified in the samples indicated at the bottom of the plot were used as the reference when calculating fraction of overlap of peaks from samples indicated at the left side of the plot. d, Consensus motifs identified within m⁶A peaks from picoMeRIP-seq experiments with 1,000, 100 and 10 mouse ES cells. e, Metagene profiles showing the enrichment of m⁶A peaks along protein-coding gene transcripts for 1,000, 100 and 10 mouse ES cells.

Extended Data Fig. 8 Consensus motifs and peak annotation from single mouse oocytes and preimplantation embryos.

a, Consensus motifs identified within m⁶A peaks of biological replicate 2. b, Top, pie charts show genomic annotation of m⁶A peaks. Bottom, bar plots show the peak enrichment score, which was calculated as the log₂ ratio of the observed over expected peak numbers.

Extended Data Fig. 9 GO analysis and enrichment of m⁶A on retrotransposon-derived RNAs in mouse oocytes and embryos.

a, Gene Ontology (GO) analysis for m⁶A-marked transcripts (+) and for transcripts not marked by m⁶A (-). The modified Fisher exact p values were (that is EASE score) reported by DAVID (see https://david.ncifcrf.gov/helps/functional_annotation.html). b, Enrichment of m⁶A in transcripts from selected retrotransposon subfamilies shown for mouse oocytes, embryos, ES cells (ESCs) and eight mouse tissues. MeRIP-seq data for mouse tissues are from Liu et al.²⁷. c, d, Expression and m⁶A signal profiles across the internal sequences of MTA (c); and MERVL (d). The expression is RPKM calculated by deepTools (see Methods).

Supplementary information

Reporting Summary

Supplementary Table 1

Supplementary Table 1.

Source data

Source Data Extended Data Fig. 4

Unprocessed scan of western blots for Extended Data Fig. 4d.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, Y., Wang, Y., Vera-Rodriguez, M. et al. Single-cell m⁶A mapping in vivo using picoMeRIP–seq. Nat Biotechnol 42, 591–596 (2024). https://doi.org/10.1038/s41587-023-01831-7

Download citation

Received: 30 September 2021
Accepted: 17 May 2023
Published: 22 June 2023
Issue Date: April 2024
DOI: https://doi.org/10.1038/s41587-023-01831-7

Subjects

Abstract

Similar content being viewed by others

Main

Methods

Ethics statement

Antibodies and tubes

Zebrafish zygotes, mouse liver, mES cells, mouse oocytes and embryo collection

Real-time qPCR

Single-cell picoMeRIP–seq

rRNA and DNA depletion

RNA fragmentation by sonication

Antibody–bead incubation

IP and washes

Ethanol precipitation

Library preparation and sequencing

Spike-in controls

Western blotting

Sequencing data processing, m6A peak identification and motif analysis

Read density pileup visualization, Pearson correlation, m6A signal and PCA

m6A peak annotation and m6A gene definition

GO analyses

m6A enrichment on retrotransposon-derived RNAs

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links

Sequencing data processing, m⁶A peak identification and motif analysis

Read density pileup visualization, Pearson correlation, m⁶A signal and PCA

m⁶A peak annotation and m⁶A gene definition

m⁶A enrichment on retrotransposon-derived RNAs