Identification of G-quadruplex clusters by high-throughput sequencing of whole-genome amplified products with a G-quadruplex ligand

Yoshida, Wataru; Saikyo, Hiroki; Nakabayashi, Kazuhiko; Yoshioka, Hitomi; Bay, Daniyah Habiballah; Iida, Keisuke; Kawai, Tomoko; Hata, Kenichiro; Ikebukuro, Kazunori; Nagasawa, Kazuo; Karube, Isao

doi:10.1038/s41598-018-21514-7

Download PDF

Article
Open access
Published: 15 February 2018

Identification of G-quadruplex clusters by high-throughput sequencing of whole-genome amplified products with a G-quadruplex ligand

Wataru Yoshida¹,
Hiroki Saikyo¹,
Kazuhiko Nakabayashi ORCID: orcid.org/0000-0003-2927-0963²,
Hitomi Yoshioka¹,
Daniyah Habiballah Bay^1,3,
Keisuke Iida⁴,
Tomoko Kawai²,
Kenichiro Hata²,
Kazunori Ikebukuro⁵,
Kazuo Nagasawa⁵ &
…
Isao Karube¹

Scientific Reports volume 8, Article number: 3116 (2018) Cite this article

4830 Accesses
30 Citations
4 Altmetric
Metrics details

Subjects

Abstract

G-quadruplex (G4) is a DNA secondary structure that has been found to play regulatory roles in the genome. The identification of G4-forming sequences is important to study the specific structure-function relationships of such regions. In the present study, we developed a method for identification of G4 clusters on genomic DNA by high-throughput sequencing of genomic DNA amplified via whole-genome amplification (WGA) in the presence of a G4 ligand. The G4 ligand specifically bound to G4 structures on genomic DNA; thus, DNA polymerase was arrested on the G4 structures stabilised by G4 ligand. We utilised the telomestatin derivative L1H1-7OTD as a G4 ligand and demonstrated that the efficiency of amplification of the G4 cluster regions was lower than that of the non-G4-forming regions. By high-throughput sequencing of the WGA products, 9,651 G4 clusters were identified on human genomic DNA. Among these clusters, 3,766 G4 clusters contained at least one transcriptional start site, suggesting that genes are regulated by G4 clusters rather than by one G4 structure.

Direct genome-wide identification of G-quadruplex structures by whole-genome resequencing

Article Open access 14 October 2021

Identifying genome-wide off-target sites of CRISPR RNA–guided nucleases and deaminases with Digenome-seq

Article 18 January 2021

Defining genome-wide CRISPR–Cas genome-editing nuclease activity with GUIDE-seq

Article 12 November 2021

Introduction

G-quadruplex (G4) is a DNA secondary structure composed of two or more stacking G-quartets, a planar array of four guanine bases connected by a Hoogsteen hydrogen bond, and stabilised by a monovalent cation¹. In genomic DNA, G4-forming sequences were first described in immunoglobulin switch regions and telomeric DNA at the ends of chromosomes^2,3 and have since been identified in several regulatory regions, such as transcription factor binding sites and promoters^4,5,6. In promoter regions, G4-forming sequences are involved in transcriptionally activating^7,8,9 or silencing gene expression^10,11. In addition, G4s have been reported to be involved in replication¹², DNA recombination¹³ and splicing processes¹⁴.

Identification of G4-forming sequences in the genome is necessary to elucidate the biological functions of G4. In silico analysis has revealed that the putative G-quadruplex forming sequences (PQS) are enriched in promoters, CpG islands, 5′UTRs, first exons, first exon/intron junctions and nuclease-hypersensitive sites^15,16,17. Furthermore, putative duplex-derived interstrand G4-forming sequences have been identified^18,19. However, the use of computational analysis alone is not sufficient to identify G4 regions precisely. We previously identified 1998 G4-forming sequences in mouse CpG islands using fluorescent-labelled G4 ligand with a mouse CpG island microarray^20,21. Clusters of G4-forming sequences that promote transcription and replication-dependent DNA damage induced by a G4 ligand have been identified in oncogenes and tumour-suppressor genes by ChIP-Seq of the DNA damage marker γH2AX²². Recently, 716,310 G4-forming sequences stabilised by G4 ligand pyridostatin (PDS) and 525,890 G4-forming sequences stabilised by K⁺ were identified in the human genome by combining polymerase stop assay with Illumina next-generation sequencing (G4-seq)²³.

In this study, we performed a whole genome amplification (WGA)²⁴ in the presence of a G4 ligand, followed by high-throughput sequencing of WGA products to identify G4 clusters in human genomic DNA. It has been reported that DNA polymerase was arrested on G4 structures stabilised by G4 ligand²⁵. This led us to hypothesise that genomic DNA would be amplified by WGA, except for G4 clusters, in the presence of a G4 ligand. Hence, we could identify G4 clusters by analysing the WGA products using high-throughput sequencing technologies.

Results

Analysis of inhibitory activity of G4 ligand against DNA polymerase extension on G4-forming sequences

The G4 ligand 7OTD is a telomestatin derivative that binds to the top of the G-tetrad structure through π-stacking and electrostatic interactions^26,27,28. To investigate whether 7OTD inhibits DNA polymerase extension on G4-forming regions, polymerase chain reaction (PCR) was performed on the human genomic DNA in the presence of 7OTD. For amplifying G4-forming regions, PCR primers for c-MYC, c-KIT, BCL2 and VEGFA G4 regions in the human genomic DNA were designed. For amplifying non-G4-forming regions, PCR primers for MBD3L3, CD4, CNDP2, and SOD1 were designed. In the absence of 7OTD, all of these regions were accurately amplified from genomic DNA by PCR (Fig. 1). The G4-forming and non-G4-forming regions were amplified in the presence of less than 100 nM 7OTD. In the presence of 1 µM 7OTD, the non-G4-forming regions were amplified, whereas the G4-forming ones were not. These results demonstrate that 7OTD specifically inhibits DNA polymerase extension on G4-forming regions in PCR.

Measurement of removal efficiency of BODIPY-labelled 7OTD from genomic DNA

PCR is a suitable method to confirm the efficiency of WGA of target regions; however, owing to the interference of 7OTD with PCR in the G4-forming regions, 7OTD should be removed from WGA products before PCR analysis. To evaluate the removal efficiency of 7OTD from genomic DNA, 10 µM BODIPY-labelled 7OTD was incubated with human genomic DNA. Then, LiCl solution (final concentration 4 M) was added to the mixture since lithium ions destabilise G4 structures²⁹. The genomic DNA was purified by gel filtration and then the fluorescence intensity of BODIPY was measured to calculate the efficiency of removal of the residual BODIPY-labelled 7OTD from the genomic DNA. The results showed that 4.5% of the fluorescence intensity was detected in the purified samples, indicating that more than 95% of BODIPY-labelled 7OTD was successfully removed from genomic DNA by LiCl and gel filtration.

In the PCR analysis, 1 µM 7OTD inhibited PCR on G4-forming regions, but 100 nM 7OTD did not interfere with the reaction. Therefore, we assumed that 1 µM 7OTD is a suitable concentration for the WGA reaction, since the amount of 7OTD remaining after the purification would be less than 50 nM, which would not subsequently inhibit the PCR. To confirm that the residual 7OTD would not inhibit PCR on G4-forming regions, PCR was performed using genomic DNA purified from a mixture of 1 µM 7OTD and genomic DNA. The results showed that no PCR inhibition for c-MYC, c-KIT, BCL2 and VEGFA G4 regions was detected (Fig. 2).

Whole genome amplification in the presence of 7OTD

WGA based on multiple displacement amplification using Phi29 DNA polymerase was utilised. In this system, the average product length is typically greater than 10-kb. When HeLa genomic DNA was amplified by WGA in the absence of 7OTD, WGA products of around 23-kb were obtained (Fig. 3). On the other hand, when HeLa genomic DNA was amplified by the WGA in the presence of 1 µM 7OTD, WGA products were not detected in 1% agarose gel electrophoresis. The G4-seq analysis revealed that the human genome contains 716,310 G4-forming sequences stabilised by a G4 ligand²³. These results suggest that Phi29 DNA polymerase would be arrested on numerous G4-forming regions on genomic DNA and its inhibition would reduce the yield of total WGA products.

To analyse whether 7OTD specifically inhibits the extension of DNA polymerase on G4-forming regions in WGA, the WGA products were purified by LiCl and gel filtration and analysed by PCR. After gel filtration of the products amplified by WGA in the absence of 7OTD, the approximately 23-kb WGA product was not detected in 1% agarose electrophoresis since the column is not suitable for long-DNA purification. However, the G4-forming regions and non-G4-forming regions were amplified from the purified WGA products by PCR; thus, the products amplified by WGA in the presence of 7OTD were analysed by PCR (Fig. 4). Although the non-G4-forming regions were amplified by PCR, only slight PCR amplification of the G4-forming regions was detected. Quantitative analysis of the band intensity revealed that the average WGA efficiencies in the non-G4-forming and G4-forming regions were 50% and 17%, respectively. These results demonstrate that the efficiency of amplification of the G4-forming regions by WGA was lower than that of the non-G4-forming regions in the presence of 7OTD.

High-throughput sequencing of the WGA products for identification of G4 clusters in human genomic DNA

To identify the G4-forming regions on the human genome, high-throughput sequencing of the WGA products was performed on an Illumina HiSeq X10 platform using the paired-end mode (150 bp x2). The WGA products were purified before PCR analysis, as described above. In contrast, the WGA products were directly used as templates for high-throughput sequencing, without any purification because DNA polymerase-based reactions would be inhibited on the G4-forming regions in the presence of 7OTD during the library preparation and sequencing reaction. High-throughput sequencing yielded 337 million reads (35 × depth of coverage) and 311 million reads (32 × depth of coverage) for 7OTD and control libraries, respectively.

First, the mapped reads were counted per 200 bp window, with a sliding size of 200 for the entire genome. Consistent with the PCR analysis of the WGA products, a decrease of the mapped reads in the 7OTD library was detected in c-MYC, c-KIT, BCL2 and VEGFA G4 regions (Fig. 5 and Fig. S5). In contrast, specific peaks at the G4-forming sequences were not detected in the regions because the decrease of the mapped reads was detected over the whole region. The occurrence of PQS predicted by G4Hunter is 5.02 per 10 kbp in the human genome¹⁷. In contrast, 35, 15, 26 and 60 PQS were predicted in the 10 kbp region of c-MYC, c-KIT, BCL2 and VEGFA G4, respectively. Clusters of G4-forming sequences that promote transcription and replication-dependent DNA damage induced by a G4 ligand have been identified by ChIP-Seq for the DNA damage marker γH2AX²². ChIP-Seq demonstrated that the γH2AX domains were enriched on chromosomes that have high PQS frequencies. Our sequencing results also demonstrated that the mean depth of coverage of the 7OTD library was lower than that of the control library on chromosomes 16, 17, 19, 20 and 22, which have high PQS frequencies (Fig. S6). These results indicated that clusters of G4-forming sequences would be identified by counting the mapped reads over large windows in our sequencing results.

We then counted the mapped reads per 1.0, 2.5, 5.0 and 10 kbp windows with sliding sizes of 1.0, 2.5, 5.0 and 10 kbp, respectively. The correlation coefficients between PQS frequencies and ratios of the reads for the 7OTD library to the reads for the control library in 1.0, 2.5, 5.0 and 10 kbp windows were −0.52, −0.61, −0.65 and −0.66, respectively. Therefore, counted mapped reads per 10 kbp windows were utilised to identify G4 clusters (Supplementary Dataset 1). On the 25 γH2AX-enriched genes, the average PQS numbers was 24 per 10 kbp, the average G4 numbers identified by G4-seq using PDS was 9.8 per 10 kbp and the average ratio of the reads for the 7OTD library to the reads for the control library was 0.292 (Supplementary Dataset 2). In contrast, the average PQS numbers was 2.4 per 10 kbp, the average G4 numbers identified by G4-seq was 1.1 per 10 kbp and the average ratio of the reads was 1.92 on the two γH2AX-negative genes (Supplementary Dataset 3). Therefore, we defined the threshold for identification of G4 clusters as a PQS number is ≥24 per 10 kbp, the number of G4 identified by G4-seq using PDS is ≥10 and the ratio is ≤0.292. By these criteria, we identified 9,651 G4 clusters in the human genome (Supplementary Dataset 4). In the 9,651 G4 clusters, the average ratio of reads, number of PQS and number of G4 identified by G4-seq using PDS was 0.133, 39.8 and 14.9, respectively.

G4-seq using PDS identified 716,310 G4-forming sequences, whereas G4-seq using K⁺ identified 525,890 G4-forming sequences in human genomic DNA. This suggests that there are G4-forming sequences for which G4 folding is induced by a G4 ligand. G4 formations may be induced by G4 binding proteins in cells. Therefore, extraction of the ligand-inducible G4 clusters would be important. To extract G4 clusters, we used G4-seq results performed in K⁺-stabilised condition. The average number of G4 was 6.2 per 10 kbp on the 25 γH2AX-enriched genes. We defined the threshold for identification of the ligand-inducible G4 clusters from 9,651 G4 clusters as ≤6 G4 identified by G4-seq using K⁺. Using this criterion, we extracted 1,622 ligand-inducible G4 clusters (Supplementary Dataset 5).

Among identified 9,651 G4 clusters, 3,766 G4 clusters (39.0%) contained at least one transcriptional start site (Supplementary Dataset 6). In the entire genome, 25301 windows (8.3%) contain at least one transcriptional start site, indicating that G4 clusters are enriched for transcriptional start sites. It has been reported that ATRX interacts with PQS clusters to regulate gene expression³⁰. These results, therefore, suggest that genes are regulated by G4 clusters rather than by one G4 structure.

Discussion

ChIP-Seq for the DNA damage marker γH2AX is useful to identify G4 clusters that fold into G4 structures in vivo; however, the method was applied on only oncogenes and tumour suppressor genes because of the broad coverage of γH2AX signatures²². Moreover, G4 structure formation would be affected by the chromatin state in vivo. In contrast, we identified 9,651 G4 clusters in the whole human genome because our method directly detected G4 clusters that were stabilised by a G4 ligand in vitro. We also demonstrated that 3,766 G4 clusters contain at least one transcriptional start site. The phi29 DNA polymerase used in the WGA reaction is a replicative polymerase from the Bacillus subtilis phage phi29 (Φ29), which has strand displacement and processive synthesis properties³¹, meaning that it can imitate the replication process. G4-forming structures that play critical roles in replication have been identified, such as Rif1-binding sequences³². Therefore, the G4 clusters identified in this study would be involved in not only transcriptional regulation but also DNA replication.

Several structure-specific ligands that have the specificity to bind to different G4 structures have been reported; for example, 4,2-L2H2-6OTD and 5,1-L2H2-6OTD induced an antiparallel topology and a hybrid-type topology of telomeric DNA, respectively³³. In addition, InEt2 and InPr2 G4 ligands specifically stabilised the parallel topology of c-MYC, c-KIT1 and c-KIT2 G4 structures³⁴. The G4 ligand inhibited DNA polymerase extension on c-MYC G4 DNA, whereas no significant amount of stop product of telomeric G4 DNA was observed. Our method may thus contribute to the identification of G4 clusters containing specific G4 structures using topology-specific ligands. Moreover, the binding specificity of a telomestatin derivative against multimeric G4 structures can be improved by multimerization of the G4 ligand³⁵. These results suggest that gene-specific G4 cluster ligands may be developed by multimerization of suitable G4 ligands, with designing the linker lengths.

It has been reported that the Bcl-2 G4 structure and quadruplex structure of C9orf72 repeat were stabilised by DNA methylation^36,37. We reported that the initial elongation efficiency of PCR decreased with increasing DNA methylation levels in VEGFA and RET G4-forming sequences³⁸, indicating that the G4 structures are also stabilised by DNA methylation. These reports suggest that our method could be applied to detect epigenetic modification by identifying G4 clusters stabilised by DNA methylation.

Methods

Analysis of inhibitory activity of 7OTD on PCR

Human genomic DNA was purified from HeLa (RBRC-RCB0007, RIKEN) cells using DNeasy blood and tissue kit (Qiagen). As G4-forming regions, c-MYC, c-KIT, BCL2 and VEGFA G4-forming regions were used. As non-G4-forming regions, MBD3L3, CD4, CNDP2, and SOD1 regions were used. All PCR primers (Table S1) were designed by Primer 3^39,40. PCR was performed using 0, 1, 10, 100 or 1000 nM 7OTD, 500 nM each primer, 100 ng of human genomic DNA, 250 µM each dNTP and 0.5 U Ex Taq HS (Takara) with a buffer [25 mM TAPS (N-Tris(hydroxymethyl)methyl-3-aminopropanesulfonic acid), 2 mM MgCl₂, 0.1 mM DTT, 5% DMSO (pH 9.3)] in a 20-µL solution. The thermocycling conditions were as follows: 95 °C for 5 min, followed by 35 cycles of 95 °C for 30 s, 59 °C for 30 s and 72 °C for 30 s. The PCR products were analysed by 2% agarose gel electrophoresis.

Measurement of removal efficiency of BODIPY-labelled 7OTD from genomic DNA

HeLa genomic DNA (3.1 µg) and BODIPY-labelled 7OTD (10 µM) were mixed in a buffer [10 mM Tris–HCl, 100 mM KCl (pH 7.4)] in a 25-µL solution. After 10 min of incubation, 25-µL of 8 M LiCl was added and then incubated at 95 °C for 5 min. The BODIPY-labelled 7OTD was removed using Illustra MicroSpin G-25 Columns (GE), in accordance with the manufacturer’s protocol. After the purification, an approximately 50-µL sample was obtained. The fluorescence intensity of BODIPY-labelled 7OTD in this sample was measured by a microplate reader (Spark 10 M, Tecan).

Analysis of effect of residual 7OTD on PCR

HeLa genomic DNA (3.1 µg) and 7OTD (1 µM) were mixed in a buffer [10 mM Tris–HCl, 100 mM KCl (pH 7.4)] in a 25-µL solution. After 10 min of incubation, 25-µL of 8 M LiCl was added and then 7OTD was removed, as described above. PCR was performed using 3.2-µL of the purified sample, 500 nM each primer, 250 µM each dNTP and 0.5 U Ex Taq HS (Takara) with a buffer [25 mM TAPS, 2 mM MgCl₂, 0.1 mM DTT, 5% DMSO (pH 9.3)] in a 20-µL solution. The thermocycling conditions were as follows: 95 °C for 5 min, followed by 35 cycles of 95 °C for 30 s, 59 °C for 30 s and 72 °C for 30 s. The PCR products were analysed by 2% agarose gel electrophoresis. As a control, 1.6-µL of the mixture of genomic DNA and 7OTD was used as a template for PCR.

Whole genome amplification in the presence of 7OTD

In the presence or absence of 1 µM 7OTD, WGA was performed using REPLI-g Mini Kit (QIAGEN), in accordance with the manufacturer’s protocol. Briefly, 2.5-µL of HeLa genomic DNA (100 ng) was incubated with 2.5-µL of denaturation solution at room temperature for 3 min and then 5-µL of neutralisation buffer was added. Next, 40-µL of master mix containing phi29 DNA polymerase with 7OTD was added and incubated at 30 °C for 16 h. To remove 7OTD, 25 µL of 8 M LiCl was added to 25-µL of the WGA product and then incubated at 95 °C for 5 min. The 7OTD was removed by LiCl and Illustra MicroSpin G-25 Columns (GE), as described above. The purified WGA product (3.2-µL) was used as a template for PCR in a 20-μL reaction volume, as described above.

High-throughput sequencing of the WGA products

Sequencing libraries were prepared using the NEBNext Ultra II kit (NEB) and sequenced using the paired-end mode (150 bp x2) on the HiSeq X10 platform (Illumina) at Macrogen Inc. Obtained sequence reads were mapped onto the hg19 reference genome using BWA⁴¹. PCR-duplicate reads were removed using Picard⁴². In total, 311 and 337 million reads were obtained for the 7OTD and control libraries, respectively. Mapped reads per 0.2, 1.0, 2.5, 5.0 or 10 kbp windows with the sliding size of 0.2, 1.0, 2.5, 5.0 or 10 kbp were counted for the entire genome using bedtools 2.26.0, respectively⁴³. The fold-change value and Fisher’s exact test p-value were calculated for mapped read counts per window. G-quadruplex forming sequences were predicted by the G4hunter program¹⁷, with default settings. The mapped reads per window, fold-change of mapped reads, and locations of predicted G4 quadruplex sites were visualised in.igv format on the Integrative Genome Viewer⁴⁴ and on the USCS Genome Browser⁴⁵.

References

Hardin, C. C., Watson, T., Corregan, M. & Bailey, C. Cation-dependent transition between the quadruplex and Watson-Crick hairpin forms of d(CGCG3GCG). Biochemistry 31, 833–841 (1992).
Article CAS PubMed Google Scholar
Sen, D. & Gilbert, W. Formation of parallel four-stranded complexes by guanine-rich motifs in DNA and its implications for meiosis. Nature 334, 364–366 (1988).
Article ADS CAS PubMed Google Scholar
Sundquist, W. I. & Klug, A. Telomeric DNA dimerizes by formation of guanine tetrads between hairpin loops. Nature 342, 825–829 (1989).
Article ADS CAS PubMed Google Scholar
Eddy, J. & Maizels, N. Gene function correlates with potential for G4 DNA formation in the human genome. Nucleic Acids Res. 34, 3887–3896 (2006).
Article CAS PubMed PubMed Central Google Scholar
Bochman, M. L., Paeschke, K. & Zakian, V. A. DNA secondary structures: stability and function of G-quadruplex structures. Nat. Rev. Genet. 13, 770–780 (2012).
Article CAS PubMed PubMed Central Google Scholar
Rhodes, D. & Lipps, H. J. G-quadruplexes and their regulatory roles in biology. Nucleic Acids Res. 43, 8627–8637 (2015).
Article CAS PubMed PubMed Central Google Scholar
Verma, A., Yadav, V. K., Basundra, R., Kumar, A. & Chowdhury, S. Evidence of genome-wide G4 DNA-mediated gene expression in human cancer cells. Nucleic Acids Res. 37, 4194–4204 (2009).
Article CAS PubMed PubMed Central Google Scholar
Catasti, P., Chen, X., Moyzis, R. K., Bradbury, E. M. & Gupta, G. Structure-function correlations of the insulin-linked polymorphic region. J. Mol. Biol. 264, 534–545 (1996).
Article CAS PubMed Google Scholar
Timmer, C. et al. An isothermal titration and differential scanning calorimetry study of the G-quadruplex DNA-insulin interaction. J. Phys. Chem. B. 118, 1784–1790 (2014).
Article CAS PubMed Google Scholar
Brooks, T. A., Kendrick, S. & Hurley, L. H. Making sense of G-quadruplex and i-motif functions in oncogene promoters. FEBS J. 277, 3459–3469 (2010).
Article CAS PubMed PubMed Central Google Scholar
Onel, B. et al. A New G-Quadruplex with Hairpin Loop Immediately Upstream of the Human BCL2 P1 Promoter Modulates Transcription. J. Am. Chem. Soc. 138, 2563–2570 (2016).
Article CAS PubMed PubMed Central Google Scholar
Paeschke, K., Capra, J. A. & Zakian, V. A. DNA replication through G-quadruplex motifs is promoted by the Saccharomyces cerevisiae Pif1 DNA helicase. Cell 145, 678–691 (2011).
Article CAS PubMed PubMed Central Google Scholar
Mani, P., Yadav, V. K., Das, S. K. & Chowdhury, S. Genome-wide analyses of recombination prone regions predict role of DNA structural motif in recombination. PLoS One 4, e4399 (2009).
Article ADS PubMed PubMed Central Google Scholar
Ribeiro, M. M. et al. G-quadruplex formation enhances splicing efficiency of PAX9 intron 1. Hum. Genet. 134, 37–44 (2015).
Article CAS PubMed Google Scholar
Huppert, J. L. & Balasubramanian, S. Prevalence of quadruplexes in the human genome. Nucleic Acids Res. 33, 2908–2916 (2005).
Article CAS PubMed PubMed Central Google Scholar
Eddy, J. & Maizels, N. Conserved elements with potential to form polymorphic G-quadruplex structures in the first intron of human genes. Nucleic Acids Res. 36, 1321–1333 (2008).
Article CAS PubMed PubMed Central Google Scholar
Bedrat, A., Lacroix, L. & Mergny, J. L. Re-evaluation of G-quadruplex propensity with G4Hunter. Nucleic Acids Res. 44, 1746–1759 (2016).
Article PubMed PubMed Central Google Scholar
Cao, K., Ryvkin, P. & Johnson, F. B. Computational detection and analysis of sequences with duplex-derived interstrand G-quadruplex forming potential. Methods 57, 03–10 (2012).
Article CAS Google Scholar
Kudlicki, A. S. G-quadruplexes involving both strands of genomic DNA are highly abundant and colocalize with functional sites in the human genome. PLoS One 11, e0146174 (2016).
Article PubMed PubMed Central Google Scholar
Iida, K. et al. Fluorescent-ligand-mediated screening of G-quadruplex structures using a DNA microarray. Angew. Chem. Int. Ed. Engl. 52, 12052–12055 (2013).
Article CAS PubMed Google Scholar
Bay, D. H. et al. Identification of G-quadruplex structures that possess transcriptional regulating functions in the Dele and Cdc6 CpG islands. BMC Mol. Biol. 18, 17 (2017).
Article PubMed PubMed Central Google Scholar
Rodriguez, R. et al. Small-molecule-induced DNA damage identifies alternative DNA structures in human genes. Nat. Chem. Biol. 8, 301–310 (2012).
Article CAS PubMed PubMed Central Google Scholar
Chambers, V. S. et al. High-throughput sequencing of DNA G-quadruplex structures in the human genome. Nat. Biotechnol. 33, 877–881 (2015).
Article PubMed Google Scholar
Silander, K. & Saarela, J. Whole genome amplification with Phi29 DNA polymerase to enable genetic or genomic analysis of samples of low DNA yield. J. Methods Mol. Biol 439, 1–18 (2008).
Article CAS Google Scholar
Guo, K. et al. Formation of pseudosymmetrical G-quadruplex and i-motif structures in the proximal promoter region of the RET oncogene. J. Am. Chem. Soc. 129, 10220–10228 (2007).
Article CAS PubMed PubMed Central Google Scholar
Tera, M. et al. Synthesis of a potent G-quadruplex-binding macrocyclic heptaoxazole. Chembiochem 10, 431–435 (2009).
Article CAS PubMed Google Scholar
Iida, K. & Nagasawa, K. Macrocyclic polyoxazoles as G-quadruplex ligands. Chem. Rec. 13, 539–548 (2013).
Article CAS PubMed Google Scholar
Chung, W. J. et al. Solution structure of an intramolecular (3 + 1) human telomeric G-quadruplex bound to a telomestatin derivative. J. Am. Chem. Soc. 135, 13495–13501 (2013).
Article CAS PubMed Google Scholar
Woiczikowski, P. B., Kubar, T., Gutiérrez, R., Cuniberti, G. & Elstner, M. Structural stability versus conformational sampling in biomolecular systems: why is the charge transfer efficiency in G4-DNA better than in double-stranded DNA? J. Chem. Phys. 133, 035103 (2010).
Article ADS PubMed Google Scholar
Law, M. J. et al. ATR-X syndrome protein targets tandem repeats and influences allele-specific expression in a size-dependent manner. Cell 143, 367–378 (2010).
Article CAS PubMed Google Scholar
Blanco., L. et al. Highly efficient DNA synthesis by the phage phi 29 DNA polymerase. Symmetrical mode of DNA replication. J. Biol. Chem. 264, 8935–8940 (1989).
CAS PubMed Google Scholar
Kanoh, Y. et al. Rif1 binds to G quadruplexes and suppresses replication over long distances. Nat. Struct. Mol. Biol. 22, 889–897 (2015).
Article CAS PubMed Google Scholar
Sakuma, M. et al. Design and synthesis of unsymmetric macrocyclic hexaoxazole compounds with an ability to induce distinct G-quadruplex topologies in telomeric DNA. Org. Biomol. Chem. 14, 5109–5116 (2016).
Article CAS PubMed Google Scholar
Diveshkumar, K. V. et al. Specific stabilization of c-MYC and c-KIT G quadruplex DNA structures by indolylmethyleneindanone scaffolds. Biochemistry 55, 3571–3585 (2016).
Article CAS PubMed Google Scholar
Abraham Punnoose, J. et al. Adaptive and Specific Recognition of Telomeric G-Quadruplexes via Polyvalency Induced Unstacking of Binding Units. J. Am. Chem. Soc. 139, 7476–7484 (2017).
Article CAS PubMed Google Scholar
Lin, J. et al. Stabilization of G-quadruplex DNA by C-5-methyl-cytosine in bcl-2 promoter: implications for epigenetic regulation. Biochem. Biophys. Res. Commun. 433, 368–373 (2013).
Article CAS PubMed Google Scholar
Zamiri, B., Mirceta, M., Bomsztyk, K., Macgregor, R. B. Jr. & Pearson, C. E. Quadruplex formation by both G-rich and C-rich DNA strands of the C9orf72 (GGGGCC)8•(GGCCCC)8 repeat: effect of CpG methylation. Nucleic Acids Res. 43, 10055–10064 (2015).
CAS PubMed PubMed Central Google Scholar
Yoshida, W. et al. Detection of DNA methylation of G-quadruplex and i-motif-forming sequences by measuring the initial elongation efficiency of polymerase chain reaction. Anal. Chem. 88, 7101–7107 (2016).
Google Scholar
Koressaar, T. & Remm, M. Enhancements and modifications of primer design program Primer3. Bioinformatics 23, 1289–1291 (2007).
Article CAS PubMed Google Scholar
Untergasser, A. et al. Primer3 - new capabilities and interfaces. Nucleic Acids Res. 40, e115 (2012).
Article CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Broad Institute of MIT and Harvard. Picard http://broadinstitute.github.io/picard/ (2017).
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Article CAS PubMed PubMed Central Google Scholar
Robinson, J. T. et al. Integrative genomics viewer. Nat. Biotechnol. 29, 24–26 (2011).
Article CAS PubMed PubMed Central Google Scholar
Kent, W. J. et al. The human genome browser at UCSC. Genome Res. 12, 996–1006 (2002).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Dr. Tomohiko Yamazaki (National Institute for Materials Science, Japan) for the kind gift of HeLa cells. We thank Dr. Amina Bedrat (University of Bordeaux, France) for kind support to run the G4Hunter program. This work was supported in part by Grants-in-Aid for Young Scientists (B) (15K18278 to W.Y.), for Scientific Research (C) (17K06933 to W.Y.), for Scientific Research (B) (23310158 and 26282214 to K. Nagasawa) and for Challenging Exploratory Research (26560444, 16K13094 to K. Nagasawa, and 17K19535 to K.H.) from JSPS. The genome sequence described/used in this research was derived from a HeLa cell line. Henrietta Lacks, and the HeLa cell line that was established from her tumor cells without her knowledge or consent in 1951, have made significant contributions to scientific progress and advances in human health. We are grateful to Henrietta Lacks, now deceased, and to her surviving family members for their contributions to biomedical research. The data generated from this research were submitted to the database of Genotypes and Phenotypes (dbGaP), as a substudy under accession number phs000640. The accession number of this study is phs001450.v1.p1.

Author information

Authors and Affiliations

School of Bioscience and Biotechnology, Tokyo University of Technology, 1404-1 Katakura-machi, Hachioji, Tokyo, 192-0982, Japan
Wataru Yoshida, Hiroki Saikyo, Hitomi Yoshioka, Daniyah Habiballah Bay & Isao Karube
Department of Maternal-Fetal Biology, National Research Institute for Child Health and Development, 2-10-1 Ookura, Setagaya, Tokyo, 157-0074, Japan
Kazuhiko Nakabayashi, Tomoko Kawai & Kenichiro Hata
Biology Department, Umm Al-Qura University, P.O. Box 715, Makkah, 21955, Saudi Arabia
Daniyah Habiballah Bay
Molecular Chirality Research Center, Synthetic Organic Chemistry, Department of Chemistry, Graduate School of Science, Chiba University, 1-33 Yayoi, Inage, Chiba, 263-8522, Japan
Keisuke Iida
Department of Biotechnology and Life Science, Tokyo University of Agriculture and Technology, 2-24-16 Naka-cho, Koganei, Tokyo, 184-8588, Japan
Kazunori Ikebukuro & Kazuo Nagasawa

Authors

Wataru Yoshida
View author publications
You can also search for this author in PubMed Google Scholar
Hiroki Saikyo
View author publications
You can also search for this author in PubMed Google Scholar
Kazuhiko Nakabayashi
View author publications
You can also search for this author in PubMed Google Scholar
Hitomi Yoshioka
View author publications
You can also search for this author in PubMed Google Scholar
Daniyah Habiballah Bay
View author publications
You can also search for this author in PubMed Google Scholar
Keisuke Iida
View author publications
You can also search for this author in PubMed Google Scholar
Tomoko Kawai
View author publications
You can also search for this author in PubMed Google Scholar
Kenichiro Hata
View author publications
You can also search for this author in PubMed Google Scholar
Kazunori Ikebukuro
View author publications
You can also search for this author in PubMed Google Scholar
Kazuo Nagasawa
View author publications
You can also search for this author in PubMed Google Scholar
Isao Karube
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

W.Y. conceived and designed the experiments. H.S., K. Nakabayashi and H.Y. performed the experiments. W.Y., H.S., K. Nakabayashi, H.Y., D.H.B., K. Iida., T.K., K.H., K. Ikebukuro, K. Nagasawa, I.K. analysed the data. W.Y., K. Nakabayashi and D.H.B. wrote the manuscript. All authors reviewed the manuscript.

Corresponding author

Correspondence to Wataru Yoshida.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Dataset 1

Dataset 2

Dataset 3

Dataset 4

Dataset 5

Dataset 6

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yoshida, W., Saikyo, H., Nakabayashi, K. et al. Identification of G-quadruplex clusters by high-throughput sequencing of whole-genome amplified products with a G-quadruplex ligand. Sci Rep 8, 3116 (2018). https://doi.org/10.1038/s41598-018-21514-7

Download citation

Received: 29 July 2016
Accepted: 05 February 2018
Published: 15 February 2018
DOI: https://doi.org/10.1038/s41598-018-21514-7

This article is cited by

Crosstalk between G-quadruplex and ROS
- Songjiang Wu
- Ling Jiang
- Qinghai Zeng
Cell Death & Disease (2023)
Starfish infers signatures of complex genomic rearrangements across human cancers
- Lisui Bao
- Xiaoming Zhong
- Lixing Yang
Nature Cancer (2022)
Quantitative detection of CpG methylation level on G-quadruplex and i-motif-forming DNA by recombinase polymerase amplification
- Masanori Goto
- Yuji Baba
- Wataru Yoshida
Analytical and Bioanalytical Chemistry (2022)
Ubiquitin-mediated DNA damage response is synthetic lethal with G-quadruplex stabilizer CX-5461
- Tehmina Masud
- Charles Soong
- Samuel Aparicio
Scientific Reports (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Analysis of inhibitory activity of G4 ligand against DNA polymerase extension on G4-forming sequences

Measurement of removal efficiency of BODIPY-labelled 7OTD from genomic DNA

Whole genome amplification in the presence of 7OTD

High-throughput sequencing of the WGA products for identification of G4 clusters in human genomic DNA

Discussion

Methods

Analysis of inhibitory activity of 7OTD on PCR

Measurement of removal efficiency of BODIPY-labelled 7OTD from genomic DNA

Analysis of effect of residual 7OTD on PCR

Whole genome amplification in the presence of 7OTD

High-throughput sequencing of the WGA products

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links