Abstract
DNA sequences containing consecutive guanines organized in 4-interspaced tandem repeats can form stable single-stranded secondary structures, called G-quadruplexes (G4). Herein, we report that the Polycomb group protein BMI1 is enriched at heterochromatin regions containing putative G4 DNA sequences, and that G4 structures accumulate in cells with reduced BMI1 expression and/or relaxed chromatin, including sporadic Alzheimer’s disease (AD) neurons. In AD neurons, G4 structures preferentially accumulate in lamina-associated domains, and this is rescued by re-establishing chromatin compaction. ChIP-seq analyses reveal that G4 peaks correspond to evolutionary conserved Long Interspersed Element-1 (L1) sequences predicted to be transcriptionally active. Hence, G4 structures co-localize with RNAPII, and inhibition of transcription can reverse the G4 phenotype without affecting chromatin’s state, thus uncoupling both components. Intragenic G4 structures affecting splicing events are furthermore associated with reduced neuronal gene expression in AD. Active L1 sequences are thus at the origin of most G4 structures observed in human neurons.
Similar content being viewed by others
Introduction
Nucleosomal histone proteins are regulated by post-translational modifications that can impact transcription, replication, and repair. BMI1 is a component of the Polycomb Repressive Complex 1 (PRC1). The PRC1 promotes chromatin compaction and gene silencing in part through its E3-mono-ubiquitin ligase activity mediated by RING1a/b on histone H2A at lysine 119 (H2Aub)1,2,3. The PRC1 is recruited to facultative heterochromatin to maintain repression at developmental and senescence-associated genes4,5,6,7,8. BMI1 is also enriched at constitutive heterochromatin, where it co-purifies with ATRX, HP1, DEK1, and Lamins9. BMI1 inactivation in human dermal fibroblasts (HDFs) results in loss of heterochromatin and transcriptional de-repression of repetitive DNA sequences9. More recently, reduced neuronal expression of BMI1 was found associated with sporadic late-onset Alzheimer’s disease (AD)10. Acute BMI1 inactivation in cultured human neurons can also recapitulate AD-associated hallmarks, including the accumulation of beta-amyloid and hyper-phosphorylated Tau10. Aged mice hemizygous for Bmi1 (Bmi1+/–) also develop, along with some progeroid features, AD-like behavioral, and neuropathological phenotypes11. Furthermore, loss of heterochromatin and genomic instability at repetitive DNA sequences were described as new molecular characteristics present in cortical neurons from both Bmi1+/– mice and AD cases11,12. Loss of heterochromatin and transcriptional activation of specific classes of endogenous retroelements occur in neurodegenerative tauopathies and in animal models of Tau over-expression13,14. Notably, advanced aging is the greatest risk factor to develop AD15,16, and many anomalies present in AD patient’s neurons in situ, such as relaxed heterochromatin and nuclear envelope defects, are considered as hallmarks of aging15,17,18,19. Accordingly, it was proposed that AD may represent an acquired laminopathy19,20.
Interestingly, most genetically inherited progeroid syndromes, such as Hutchinson–Gilford progeria, Werner, Bloom, and Xeroderma pigmentosum, present heterochromatin relaxation and genomic instability phenotypes21,22,23. With the exception of Hutchinson–Gilford progeria, which is linked to mutations in LMNA (encoding for the nuclear envelope protein Lamin A), the other progeroid disease genes encode DNA damage and/or repair proteins. More specifically, Werner (WRN), Xeroderma pigmentosum (XPB, XPD), and Bloom (BLM) gene products encode DNA helicases that can resolve G-quadruplex (G4) DNA’s secondary structures (also called structured DNA or G-quadruplexes) stabilized by Hoogsteen hydrogen bonds between guanines (G)24,25,26. Notably, the DNA-dependent ATPase and helicase ATRX is enriched at repetitive DNA sequences predicted to form G4 structures, and ATRX can physically bind structured DNA in vitro27. Persistent G4 structures have been proposed to represent a threat to genomic stability and gene function by interfering with fork elongation and DNA repair during replication and transcription28,29,30,31. In normal physiological conditions, however, G4 structures may be necessary for the control of gene expression, maintenance of telomeres, and establishment of replication origins32.
We report here that putative G4 DNA sequences are significantly enriched in BMI1 chromatin immunoprecipitation and sequencing (ChIP-seq) data sets, and that BMI1 deficiency in HDFs or neurons results in the induction of G4 structures. In AD neurons with relaxed heterochromatin, G4 structures accumulate at lamina-associated domains (LADs) and peri-nucleolar heterochromatin. Likewise, normal neurons exposed to drugs that open chromatin also present massive accumulation of G4 structures. ChIP-seq analyses of post-mitotic human neurons using the 1H6 antibody further show that G4 structures correspond to “active” evolutionary conserved Long Interspersed Element-1 (L1) sequences. Consistently, inhibition of RNA Polymerase II (RNAPII) can reverse the induction of G4 structures without altering the chromatin compaction state. Importantly, G4 structures present at specific loci in AD neurons are associated with reduced gene expression and perturbed alternative splicing. Chromatin-mediated transcriptional repression of L1 sequences thus prevents excessive formation of G4 structures in human neurons, which otherwise can interfere with gene expression.
Results
Using public BMI1 ChIP-seq data sets, we annotated all BMI1-enriched chromatin regions and tested their propensity to form putative G4 DNA structures using the Quadparser algorithm33. Using four different sets of Quadparser parameters, we found that BMI1 was significantly enriched at chromatin regions predicted to form G4 structures independently of the Quadparser stringency (Fig. 1a). We further investigated whether BMI1 association with G4 motifs was comparable to the one observed in proteins known to physically interact with structured DNA such as the ATRX, XPB, and XPD helicases25,27. We annotated ATRX, XPB, and XPD ChIP-seq peaks alongside BMI1 peaks for the presence of putative G4. While ∼45% of ATRX, XPB, and XPD peaks were associated with putative G4, only ∼6% of BMI1 peaks showed the same association (Fig. 1b), suggesting that BMI1 does not bind G4 structures but is rather enriched at chromatin regions with the propensity to generate G4. We also analyzed the ChIP-seq peaks obtained with the BG4 antibody, which recognizes G4 structures. As reported, we found that ~60% of BG4 peaks contained a putative G4 DNA sequence (Fig. 1b)34. We next compared the putative G4 DNA sequences contained in the BG4, XPB, and the BMI1 peaks. Surprisingly, while ~30% of the putative G4 DNA sequences linked by BG4 were enriched for XPB, only 0.05% were enriched for BMI1 (Fig. 1c). Similarly, only 0.15% of the putative G4 sequences bound by XPB were enriched by BMI1 (Fig. 1c). Since BG4 peaks were reported to be predominant at the promoter of actively transcribed genes, these results suggested that BMI1 peaks containing putative G4 DNA sequences are rarely present in actively transcribed regions.
To test the possibility that BMI1 function is required to prevent the formation of G4 structures, we used the 1H6 and BG4 antibodies, which recognize G4 structures34,35. In early passage normal HDFs, we noticed that the baseline level of 1H6 and BG4 immunoreactivity was relatively low (Fig. 1d and Supplementary Fig. 1b). However, we observed a robust nuclear and modest cytoplasmic immunoreactivity for 1H6 and BG4 in BMI1-knockdown HDFs (shBMI1) (Fig. 1d and Supplementary Fig. 1b), suggesting accumulation of both G4 DNA and G4 RNA structures36,37. To test for the specificity of the signal obtained, we fixed naive cells with paraformaldehyde (PFA) and then exposed them to HCL, an agent that denatures DNA’s secondary structure. HCL is commonly used in transmission electron microscopy and immunohistochemistry on paraffin sections. We found that HCL treatment resulted in non-specific immuno-labeling when compared to cells retaining a native chromatin state (Fig. 1d and Supplementary Fig. 1b). This revealed that G4-specific antibodies should only be used on cells with native chromatin. Pyridostatin is an agent that stabilizes spontaneously forming G4 structures, and we found that 1H6 immuno-labeling was significantly increased after exposing HDFs to pyridostatin (Fig. 1e)38. To further validate our observations, we performed colocalization studies with an antibody against the Werner (WRN) helicase. WRN is predicted to bind and unwind G4 structures39,40,41. In control HDFs, WRN and 1H6 levels were low and did not colocalize (Fig. 1f and Supplementary Fig. 2a). They were, however, significantly induced after BMI1 knockdown or after exposition to pyridostatin (Supplementary Fig. 2a). In both cases, 1H6 and WRN presented a very high coefficient of colocalization (Pearson correlation: 0.79 for shBMI1; 0.70 for shScr + pyridostatin) (Fig. 1f). Relatively strong colocalization between 1H6 and XPB or 53BP1 (but not XPD) was also observed (Fig. 1f and Supplementary Fig. 2a). Importantly, G4 structures were not detected with 1H6 after treatment of HDFs with 10 Gy of gamma rays, which induce DNA double-strand breaks (Fig. 1g)42. To test if G4 structures were associated with the formation of DNA-RNA hybrids (also referred to as R-loops), we analyzed BMI1-knockdown HDFs using the S9.6 antibody43,44. We observed that in both control and BMI1-knockdown HDFs, the S9.6 signal was largely restricted to the nucleolus (Supplementary Fig. 1c) and that, while nucleoli were larger in BMI1-knockdown cells, no significant difference in S9.6 labeling was observed outside the nucleolus (Supplementary Fig. 1c, d). Importantly, while DNAse1 treatment could prevent 1H6 labeling in HDFs (Supplementary Fig. 1e), RNAse H and RNAse A were unable to do so (Supplementary Fig. 1f), thus suggesting that nuclear structures recognized by the 1H6 antibody are DNA-based, and not RNA-DNA hybrids or secondary RNA structures. Taken together, these experiments suggest that nuclear foci visualized using the 1H6 antibody represent genuine G4 structures.
Given that BMI1 is enriched at heterochromatin and BMI1 deficiency results in loss of heterochromatin compaction9, we reasoned that BMI1 activity might counteract the induction of G4 structures through chromatin compaction. The H3K9me3 histone mark labels constitutive heterochromatin regions, which are in general transcriptionally inactive, while the H3K9ac histone mark labels “active” and open chromatin regions. Accordingly, reduced H3K9me3 labeling was observed in HDFs 16 h after transfection with a BMI1-targeting shRNA vector, suggesting loss of heterochromatin (Fig. 1h and Supplementary Fig. 3a). Notably, nearly all 1H6 foci observed following BMI1 knockdown did not colocalize with H3K9me3 (Fig. 1h, Pearson coefficient correlation of –0.135). In contrast, H3K9ac signal intensity was increased upon BMI1 knockdown (Fig. 1h and Supplementary Fig. 3a), and a significant positive correlation (Pearson coefficient correlation of 0.25) was observed between 1H6 and H3K9ac labeling (Fig. 1h)45. This suggested that the induction of G4 structures may be associated with chromatin relaxation. To test our hypothesis, we used histone deacetylase inhibitor(s) (HDACi), which lead to chromatin relaxation by preventing the deacetylation of histones45,46. We found that HDFs treated with sodium butyrate or trichostatin displayed rapid induction of G4 structures within 2 h, which was markedly preceded by robust elevation of H3K9ac levels (Fig. 1i and Supplementary Fig. 3b–d). Pearson correlation analyses at 2 h revealed a near-perfect correlation between H3K9ac and 1H6 labeling, suggesting that most G4 structures were induced following histone acetylation (Fig. 1i and Supplementary Fig. 3b–d).
To test if Bmi1 knockout was also associated with the formation of G4 structures, we analyzed Bmi1−/− mice. The Bmi1 protein is expressed in post-mitotic retinal neurons (Supplementary Fig. 4a), including cone photoreceptors. Cones from Bmi1−/− mice present reduced heterochromatin compaction at post-natal (P) day 2547,48. In wild-type (WT) mice, we found that the baseline level of 1H6 was low in all retinal neurons (Fig. 1j and Supplementary Fig. 4a). In contrast, nuclear 1H6 immunoreactivity was high in S-opsin positive cones of Bmi1−/− mice (Fig. 1j, white arrows and Supplementary Fig. 4a). Notably, 1H6 induction in Bmi1−/− cones also correlated with increased H3K9ac level (Supplementary Fig. 4b, white arrows). Interestingly, 1H6 was also induced in cones from Bmi1−/− mice at P1 (Supplementary Fig. 4c), thus before the onset of retinal degeneration48. These results suggested that BMI1-mediated chromatin compaction prevents excessive formation of G4 structures in human and mouse cells.
Re-establishing chromatin compaction reverses accumulation of G4 structures in AD neurons
Considering that BMI1 expression is reduced in AD brains and in patient-derived AD neurons produced from induced pluripotent stem cells (iPSCs)10, we tested whether the above findings were relevant to AD. We first evaluated the presence of G4 structures in BMI1-knockdown human cortical neurons, an experimental cellular model of AD10. We found that BMI1-knockdown neurons showed reduce H3K9me3 levels, which correlated with a robust accumulation of G4 structures (Fig. 2a). Using frozen brain sections from the hippocampus of three elderly controls and three AD cases, we found that G4 structures also accumulated in AD neurons in situ (Fig. 2b). Likewise, cortical neurons produced from the differentiation of iPSCs from three unrelated AD cases (AD1, AD2, and LG) presented increased H3K9ac levels that correlated with the accumulation of G4 structures, when compared to control iPSC-derived neurons from two unrelated non-demented elderly cases (Ctrl1 and Ctrl2) (Fig. 2c and Supplementary Fig. 5; see Experimental procedures for description of the iPSC lines). DNA damage accumulates in AD neurons in situ11, and 53BP1 nuclear foci label DNA damage49. We found that 1H6 foci weakly colocalized with 53BP1 nuclear foci in AD neurons at day in vitro 30 (DIV30) (Supplementary Fig. 6). By performing a time-course study, we also observed the accumulation of G4 structures in AD neurons at DIV14, thus well before the appearance of 53BP1 nuclear foci at DIV30 (Fig. 2d). This suggested, at least within the timeframe analyzed herein, that G4 structures are not significantly associated with DNA damage in AD neurons.
Next, we performed high-resolution confocal microscopy and 3D reconstruction analyses on AD neurons to reveal the subcellular localization of G4 structures. G4 structures were found to be abundant around the nucleolus and at interspaced puncta located at the nuclear periphery, suggesting accumulation at the nucleolar heterochromatin and at LADs, respectively (Fig. 2e)50,51. Using the neuronal marker βIII-tubulin, we confirmed that G4 structures observed in AD cultures were predominant in neurons (Supplementary Fig. 5a). We further investigated if the modulation of chromatin structure could improve the observed phenotype. Tau over-expression in neurons is sufficient to induce heterochromatin relaxation, and GSK3β is the main kinase that phosphorylates Tau in AD12,52,53. Likewise, p53 accumulates in AD neurons, and p53 can initiate heterochromatin relaxation by inhibiting the expression of the histone Lys9 tri-methyltransferase SUV39H110,54. We thus treated control and AD neurons for 24 h with CHIR99021 (a GSK3β inhibitor) or pifithrin-α (a p53 inhibitor) (Fig. 2f). We found that both treatments significantly reduced H3K9 hyper-acetylation and the accumulation of G4 structures in neurons from the three AD cases (Fig. 2f and Supplementary Fig. 5b–e), revealing that re-establishment of chromatin compaction is sufficient to rescue the G4 phenotype.
1H6 peaks present high similarities with canonical and non-canonical G4 sequences
In order to localize G4 structures on the human genome, we performed ChIP-seq on control and AD neurons at DIV30 using the 1H6 antibody10. We identified 1389 peaks in AD neurons and 1165 peaks in control neurons. Enrichment of 1H6 within the peaks was significantly higher in AD neurons (Fig. 3a and Supplementary Fig. 7a). Peaks in AD neurons were also significantly larger than in controls, covering ~0.6% of the human genome, compared to less than 0.2% (Fig. 3b). While most of the peaks in control neurons were represented in AD samples (Supplementary Fig. 7a), cluster 1 from the AD peaks showed no enrichment in control neurons, revealing a large subgroup of AD-specific peaks (Fig. 3a). Venn diagram distribution revealed that out of the 1389 AD peaks, 737 peaks were AD-specific, and 558 were shared with control neurons. Out of the 1165 control peaks, 504 peaks were control-specific, and 609 were shared with AD neurons (Fig. 3c). The discrepancy in the number of peaks shared by both groups was explained by the occasional presence of two control peaks within a very large and unique AD peak.
We observed by immunofluorescence that most G4 structures induced in AD neurons localized at LADs. Bioinformatic analysis of ChIP-seq data further revealed that ~40% of 1H6 peaks colocalized with constitutive LAD (cLADs) and ~25% with facultative LAD (fLADs) (Fig. 3e)55. AD peaks were also found to colocalized more with cLADs than control peaks. Hence, ~75% of AD peaks indeed colocalized with cLADs or fLADs. The observed percentage could not be explained by the coverage of these domains on the genome since cLADs and fLADs represent ~50% of the human genome (Fig. 3f).
We next compared all 1H6 peaks with the canonical G4 motif. The most stringent predicted canonical G4 sequence contains four interspaced repeats with at least three guanines/repeat, the guanine repeats being separated by a loop of 1–12 nucleotides56. This revealed that 61% of 1H6 peaks in AD neurons and 53% of 1H6 peaks in control neurons colocalized with a predicted canonical G4 sequence (Supplementary Fig. 7b). Importantly, these values presented a Z score nine times higher than a random distribution of all peaks on the human genome when performing a test with 1000 random permutations (Supplementary Fig. 7c). Annotation of all 1H6 peaks revealed that ~38% were located within gene bodies, ~20% at enhancers, and ~5% at CpG islands (Fig. 3d). We also analyzed all peaks using an unbiased motif discovery algorithm. We found that the most statistically significant motifs discovered by MEME Suite contained interspaced G repeats predicted to form non-canonical G4 structures (Fig. 3g, h and Supplementary Fig. 7e)57,58. When probed with a QGRS finder (quadruplex forming G-rich sequences)59, these motifs presented a G score of 3460. When analyzing with MEME the peaks that did not colocalize with a canonical G4 sequence (~45% of all peaks), we found that the most statistically significant motifs also contained interspaced G repeats predicted to form non-canonical G4 structures (Supplementary Fig. 7f). These results indicate that most epitopes recognized by the 1H6 antibody in the ChIP-seq experiment corresponded to structured DNA.
1H6 peaks correspond to evolutionary conserved L1 sequences
Using bioinformatic analyses, we found that the best clustering approach to segregate 1H6 peaks was obtained when testing for the enrichment of repetitive elements. Indeed, upon annotation of all peaks for the presence of repetitive elements, we found that Long Interspersed Elements (LINEs) were present in 95% of peaks from control neurons and in 98% of peaks from AD neurons (Fig. 4a). Furthermore, 75% of the peaks also contained at least one Short Interspersed Element (SINE) (Fig. 4a). Notably, the L1 family was the most enriched for LINEs, and the ALU family was the most enriched for SINEs (Fig. 4b, c). The enrichment for LINEs and SINEs in all 1H6 peaks was much higher than the theoretical percentage obtained when searching for LINEs and SINEs in all predicted G4 sequences of the human genome (Fig. 4a–c, showed with blue histograms). These differences in enrichment could also not be accounted for by a difference in genome coverage between the mean of control and AD 1H6 peaks (~0.3%) and all predicted G4 sequences (~0.3%) (Fig. 3b). Furthermore, we performed a permutation test with 1000 permutations of the peaks and compared them to the various repetitive families and found a significant difference between the random set and the 1H6 peaks giving with a P value associated with this difference (Fig. 4a–c and Supplementary Fig. 8, ***P < 0.001). Among the L1 family, the LIPA3, LIPA2, LIM5, and LIMC4 sequences were the most represented (Fig. 4d). Among the ALU family, the AluSx, AluY, AluSz, and AluSxz sequences were the most represented (Supplementary Fig. 9a). Remarkably, when searching for predicted G4 motifs in the consensus sequences for these repetitions, we found that each contained at least three canonical G4 sequences (Fig. 4e and Supplementary Fig. 9b).
Considering the above findings, we investigated the presence of LINEs or SINEs in intragenic 1H6 peaks. Notably, a unique peak was identified in both control and AD neurons between exons 13 and 14 of the Amyloid beta-precursor protein (APP) gene. In AD neurons, the peak was much larger than in control neurons, spanning two extra-canonical G4 sequences (Fig. 4f). We also noticed the presence of two regions with very high reads density within this unique peak (Fig. 4f, boxed peaks in red). Further analysis revealed that these regions contained four non-canonical G4 sequences presenting a relatively high G4 score (Fig. 4f). Strikingly, we found a near-perfect match between the presence of a large intragenic 1H6 peak and the presence of a unique and evolutionary conserved L1 sequence at the APP locus and all other tested loci (Fig. 4f and Supplementary Fig. 9c). While SINEs were frequently present within or close to 1H6 peaks, they were not as predominant as L1 sequences (Fig. 4f and Supplementary Fig. 9c). Noticeably, the presence of “bystander” SINEs apparently accounted for the enlarged 1H6 peak found in AD neurons at the APP locus (Fig. 4f). These findings suggested that evolutionary conserved L1 sequences represent the source of almost all G4 structures detected in healthy and AD neurons.
Evolutionary conserved L1 sequences are the only LINEs with intact internal promoters, allowing them to be transcribed by RNAPII. To test if the formation of G4 structures was dependent on transcription, we treated control and AD neurons with the RNAPII inhibitor 5,6-dichlorobenzimidazole 1-β-D-ribofuranoside (DRB) for 8 h prior to immunofluorescence analysis (Fig. 4g and Supplementary Fig. 10a, b). We found that inhibition of RNAPII mildly impacted G4 structures in control neurons, but dramatically reversed the accumulation of G4 structures in AD neurons from the 3 AD cases (Fig. 4g and Supplementary Fig. 10a, b). Accordingly, the 1H6 signal largely colocalized with that of RNAPII in AD neurons (Pearson correlation analysis: 0.63) (Fig. 4h). To confirm this, we treated control neurons with HDACi or HDACi + DRB. While HDACi treatment resulted in the induction of G4 structures and H3K9 acetylation, the addition of DRB prevented the induction of G4 structures independently of H3K9 acetylation (Fig. 4i and Supplementary Fig. 10c). These results thus distinguished RNAPII-mediated G4 induction from chromatin relaxation.
G4 structures can perturb splicing and gene expression in AD neurons
To evaluate the biological significance of our findings, we performed a Gene Ontology (GO) analysis of genes associated with 1H6 peaks. Among the 139 control-specific peaks associated with a gene (i.e., lost in AD), there was no pathway enrichment. Among the 259 genes with peaks common to control and AD neurons (i.e., common peaks), there was an overrepresentation of genes involved in de-ubiquitination (Fig. 5a). Among the 269 AD-specific peaks associated with a gene (i.e., gained in AD), there was a general overrepresentation of genes involved in cell–cell adhesion, axonal projection, neurogenesis, and synaptogenesis (Fig. 5a). Notably, comparative RNA-seq analysis of Ctrl1 and Ctrl2 neurons with AD1 and AD2 neurons revealed that a subset of genes with intergenic or intragenic peaks presented modified splicing events (Fig. 5b, c and Supplementary Fig. 11a, b). In the first scenario (lost in AD), a single intergenic peak between SMN1 and NAIP was lost in AD neurons. This resulted in cryptic slicing and/or transcription from SMN1 over the adjacent NAIP locus (Fig. 5b), possibly explaining the presence of aberrant and larger SMN1 protein isoforms in AD neurons (Fig. 5d and Supplementary Fig. 12). In the second and third scenarios (common peaks and gained in AD), the presence of a single intragenic peak was found to be closely associated with alternative splicing of minor isoforms and/or exon exclusion events (Fig. 5b), possibly explaining perturbations in APP and DNAH6 protein isoforms observed in AD neurons (Fig. 5d and Supplementary Fig. 12). Efficient exon splicing can be associated with increased gene expression, while exon exclusion results in the opposite trend, a process called exon-mediated activation of transcription starts61. In addition, G4 structures located in intragenic regions have been shown to represent an obstacle to transcriptional elongation, resulting in reduced gene expression62.
To test if our results matched any of these models, we superimposed all genes having a peak (lost in AD, common and gained in AD) over a Volcano plot distribution of all genes differentially expressed between control and AD neurons. While genes were generally upregulated in AD neurons (2006 upregulated and 1924 downregulated in AD1; 9584 upregulated and 5851 downregulated in AD2), no significant correlation was observed between the presence of a G4 structure and differential gene expression (Supplementary Figs. 13a and 14a). However, we observed a trend toward gene downregulation in genes having a common peak or a peak only present in AD neurons (Supplementary Figs. 13b and 14b). When we analyzed genes with an intragenic G4 structure that was also associated with an alternative splicing event, we found that these were significantly downregulated in AD neurons (Fig. 5e and Supplementary Figs. 11c, 13c, and 14c). These results suggested that intergenic G4 structures may sometime work as “gene insulator” elements in normal conditions and that excessive formation of intragenic G4 structures can perturb alternative splicing and gene expression in AD neurons.
Discussion
We report here that BMI1 inactivation in human cells or mouse photoreceptors resulted in heterochromatin relaxation and induction of G4 structures. A similar phenotype was observed in AD neurons having reduced BMI1 expression. Loss of chromatin compaction in general, as shown using HDACi, also resulted in the induction of G4 structures. Conversely, the formation of G4 structures in AD neurons was reversed by re-establishing chromatin compaction. ChIP-seq analysis using 1H6 further revealed that most significant consensus motifs contained canonical and non-canonical G4 sequences and that 1H6 peaks corresponded to evolutionary conserved L1 sequences. Hence, inhibition of RNAPII allowed us to distinguish transcription-mediated induction of G4 structures from chromatin relaxation. 1H6 enrichment was predominant in AD neurons, and most AD-specific peaks were linked to genes involved in neurogenesis and synaptogenesis. At last, some intragenic peaks enriched in AD neurons were associated with alternative splicing events, exon exclusion, and reduced gene expression.
Herein, we have preferentially used the 1H6 antibody, which was shown to recognize several, but not all, G4 structures in vitro35. We have further validated the specificity of this antibody using several approaches and by showing: (1) specific labeling after pyridostatin treatment, but not after gamma irradiation; (2) colocalization with the Werner and XPB helicases; (3) equivalent results when using the BG4 antibody; (4) specific labeling only under native chromatin conditions; and (5) detection of canonical and non-canonical G4 sequences in ChIP-seq consensus motifs. Previously, high-throughput sequencing of indirect DNA immunoprecipitation experiments with the BG4 antibody suggested that G4 DNA-forming sequences were abundant in gene bodies63,64. ChIP-seq analyses using BG4 further confirmed that, under physiological conditions, G4 structures were enriched at the promoter of some actively transcribed genes, such as C-MYC34. These observations are consistent with our findings in control neurons, where several G4 peaks were detected at intragenic and enhancer regions.
In AD neurons, heterochromatin compaction was reduced, and we found that G4 structures preferentially accumulated at LADs and peri-nucleolar heterochromatin. Likewise, heterochromatin compaction was reduced and G4 structures were induced in BMI1-knockdown neurons. Notably, it was found that most LINEs are localized at LADs65, and we indeed found that about all 1H6 peaks corresponded to L1 sequences. In this context, it is notable that BMI1 co-purifies with LMNA and LMNB2, alongside with architectural heterochromatin proteins9, suggesting that BMI1 may, directly or indirectly, be important for the stabilization of LADs. L1 sequences represent nearly 17% of the human genome66. While these sequences can be autonomously transcribed by RNAPII from a 5’ UTR promoter, it is estimated that out of the 500,000 L1 sequences present in the human genome, less than 5000 have an intact internal promoter67. Notably, it was found that the 3’ UTR of L1 sequences can form G4 structures in vitro and that this feature is conserved among mammals, suggesting a possible functional role for the propagation of retro-transposons68,69. Strikingly, we have found that only evolutionary conserved L1 sequences, i.e., predicted to be transcriptionally active, were associated with 1H6 peaks. This is consistent with our observations that the accumulation of G4 structures required functional RNAPII activity. We also showed that upon RNAPII inhibition, chromatin relaxation could be uncoupled from the accumulation of G4 structures. This supports a model where chromatin-mediated inhibition of L1 sequences transcription is the primary mechanism regulating the formation of G4 structures in post-mitotic neurons.
A recent study revealed that many genes in AD brains displayed alternative splicing events70, but the underlying mechanism remained unknown. Nevertheless, part of these splicing anomalies was mimicked in neurons over-expressing Tau70, thus possibly linking Tau-mediated chromatin relaxation to specific alternative splicing events. Here, we found that some intragenic 1H6 peaks enriched in AD neurons were associated with alternative splicing events and reduced gene expression, thus providing a plausible mechanism for a fraction of genes with perturbed splicing in AD. The overrepresentation of 1H6 peaks in genes involved in neurogenesis, axonal guidance, and synaptogenesis in AD samples is also intriguing considering the reported inability of AD neurons to guide their projections and form functional synapses71,72,73,74,75,76,77,78. It is also intriguing considering that most genes with an exon-mediated activation of transcription starts structure are also enriched for brain development, neurogenesis, and synaptogenesis61. Further work is required to better understand the complex interconnections between G4 structures, splicing, and neuronal gene expression.
In conclusion, while specialized DNA helicases have evolved to recognize and resolve G4 structures to promote transcription, replication, and repair, chromatin-mediated transcriptional repression of L1 sequences represents a critical mechanism to prevent excessive formation of G4 structures in human neurons. Considering that heterochromatin relaxation in neurons is also observed in other neurodegenerative diseases, such as amyotrophic lateral sclerosis, frontotemporal dementia, and tauopathies12,19,79,80,81,82,83, these findings have potentially broader implications. Our findings thus expose, for the first time, the nature of DNA sequences capable of forming G4 structures in human neurons, the mechanisms underlying their activation, and the biological impact of their deregulation.
Methods
Human samples and animals
Human pluripotent stem cells were used in accordance with the Canadian Institute Health Research (CIHR) guidelines and approved by the “Comité de Surveillance de la Recherche sur les Cellules Souches” of the CIHR and Maisonneuve-Rosemont Hospital Ethic Committee. Human brain tissues were kindly provided by the Banner Institute. C57Bl/6 Bmi1−/− (The Netherlands Cancer Institute, Amsterdam) and WT (Charles River, St-Constant, Canada) mice were used in accordance with the Animal Care Committee of the Maisonneuve-Rosemont Hospital Research Centre (Approval ID #2014-03, #2012-09). The mice that were used were equally distributed between male and females and they were sacrifices at P1 and P10 for the respective experiments. These mice were kept at the HMR animal facility, with ambient temperature around 22 °C, controlled humidity with a ventilation system and a 12-h dark/light cycle.
Cell cultures
Normal HDFs were purchased from the Coriell Institute. HCA2 cells were kindly provided by the laboratory of Dr Francis Rodier. HDFs were cultured with DMEM/F12 media (Invitrogen) supplemented with 10% FBS (Invitrogen) and non-essential amino acids (Invitrogen). For the HDACi experiments, cells were treated with 5 ng/ml of Trichostatin A (Sigma, T1952-200UL) or 10 mM of Sodium Butyrate (Sigma, 303410-5G), and 5 µM of pyridostatin (Sigma, SML0678-5MG). For the replication and transcription arrest we used respectively: 1 μg/ml of Aphidicolin from Nigrospora sphaerica (Sigma, A0781-1MG) and 40 μM of DRB (Sigma, D1916-10MG). A concentration of 0.2 μg/ml of aphidicolin or of 0.2 mM of Hydroxyurea (Sigma, H8627) was used to induce replication stress. iPSC and neuronal cultures were conducted according to the methods described in10.
Differentiation of pluripotent stem cells into cortical neurons
Differentiation of iPSCs into cortical neurons was performed accordingly to Flamier et al.10.
Ctrl1: iPSC-derived fibroblasts from Coriell Institute #AG04152, from a healthy 82-year-old male patient with no family history of AD.
Ctrl2: iPSC-derived fibroblasts from Coriell Institute #AG09602 from a healthy 92-year-old female patient with no family history of AD.
AD1: iPSC-derived fibroblasts from Coriell Institute #AG08243 from a 72-year-old male patient diagnosed with sporadic AD with no family history of AD.
AD2: iPSC-derived fibroblasts from Coriell Institute #AG08259 from a 90-year-old male patient diagnosed with sporadic AD at the age of 87 with no family history of AD.
LG84: iPSC-derived fibroblasts from Coriell Institute #GM24666 from an 83-year-old male patient diagnosed with sporadic AD at the age of 78 with no family history of AD with a 3/3 APOE genotype.
iPSCs were dissociated using Accutase (Innovative Cell Technology #AT-104) and platted on growth factor reduced Matrigel (Corning #356231) in PeproGrow hES cell media (PeproTech #BM-hESC) supplemented with ROCK inhibitor (Y-27632;10 µM, Cayman Chemical #10005583). Upon 70% of confluency, the medium was changed to DDM supplemented with B27 (1X final), Noggin (10 ng/ml, PeproTech #120-10C) and LDN193189 (0.5 µM; Sigma #SML0559). The medium was changed every day. After 16 days of differentiation as neural progenitors, the medium was changed to DDM/B27 and replenished every day. At day 24 as neural progenitors (corresponding to DIV0 of neural differentiation), half of the medium was changed for Neurobasal A media supplemented with B27 (1X final) and changed again every 3 days. In Fig. 2a, neurons produced from the differentiation of the HUES9 cell line were infected at DIV0 with a lentivirus expressing an shRNA targeting BMI1 or a scramble sequence, as described in Flamier et al.10.
Immunofluorescence
Eyes were extracted and fixed by immersion overnight at 4 °C in 4% PFA/3% sucrose in PBS, pH 7.4. Samples were washed three times in PBS, cryoprotected in PBS/30% sucrose, and frozen in CRYOMATRIX embedding medium (Thermo Shandon, Pittsburgh, PA) or in Tissue-Tek® optimum cutting temperature compound (Sakura Finetek, USA). Then, 5- to 12-µm thick sections were mounted on Super-Frost glass slides (Fisher Scientific) and processed for immunofluorescence. For immunofluorescence labeling, sections were incubated overnight with primary antibody solutions at 4 °C in a humidified chamber. After three washes in PBS, sections were incubated with secondary antibodies for 1 h at room temperature. Slides were mounted on coverslips in DAPI-containing mounting medium (Vector Laboratories CA, H-1200). Confocal microscopy analyses were performed using 60× objectives with an IX81 confocal microscope (Olympus, Richmond Hill, Canada), and images were obtained with Fluoview software version 3.1 (FV10-ASW V. 3.1, Olympus). The cultured cells were fixed for 15 min with 4% PFA, washed three times and then permeabilized for 10 min with 0.25 Triton (Sigma, X100-500ML), cells were then blocked in PBS/2% BSA (Sigma, A7906-100G) for an hour and incubated overnight with the primary antibody. Primary antibodies used in this study are: FITC mouse anti-TRA-1-60 (BD Pharmingen, 560380), rabbit anti-SOX2 (ab97959), goat anti-NANOG (R&D systems, af1997), goat anti-S-Opsin (1:250, Santa Cruz, sc-14363), rabbit anti-H3K9me3 (1:500, Abcam, ab8898), rabbit anti-H3K9ac (1:500, Cell Signaling, 9671S), rabbit anti-WRN (1:100, Santa Cruz, sc-5629), rabbit anti-TFIIH p80 (1:200, Santa Cruz, sc-20696) targeted against XPD, rabbit anti-TFIIH p89 (1:200, Santa Cruz, sc-293) targeted against XPB, rabbit anti-53BP1 (1:100, Novus, NB100-304), rabbit anti-H2Aub (1:200, Cell Signaling, 8240S), rabbit anti-Ki67 (1:1000, Abcam, ab15580), and mouse BG4 (1:333, Absolute antibody, Ab00174-1.1), and 1H6 antibodies recognizing G-quadruplexes. We obtained the 1H6 antibody from The European Research Institute for the Biology of Ageing. After the primary antibodies, slides were washed three times using PBS and incubated with the secondary antibodies for 1 h. The Secondary antibodies are: donkey AlexaFluor488-conjugated anti-mouse (1:1000, Life Technologies, A21202), donkey AlexaFluor488-conjugated anti-rabbit (1:1000, Life Technologies, A21206), donkey AlexaFluor647-conjugated anti-mouse (1:1000, Life Technologies, A31571), goat AlexaFluor Texas red-conjugated anti-rabbit (1:1000, Life Technologies, T2767). Slides were then washed three times with PBS and mounted with coverslips in DAPI-containing mounting medium (Vector Laboratories CA, H-1200).
Quantifications and statistical analysis and reproducibility
For the colocalization study, random lines were drawn on individual cells using FIJI. From these lines, we plotted the intensity profile of each marker accordingly. The data collected were plotted on horizontal graphs with each marker as a separate line for the visualization of the peaks (example Fig. 1h). The data were also plotted in a scatter graph, using GraphPad Prism 5, to visualize the correlation between these two markers. On these sets of pairs, a Pearson correlation was calculated to quantify the correlation. For the co-expression study intensity of the signal for different markers was measured, using a mask on DAPI to identify the nucleus, and then plotted in a scatter plot using GraphPad Prism 5, to visualize the correlation between these two markers. On these sets of pairs, a Pearson correlation was calculated to quantify the correlation. For the expression study, we quantified the mean intensity of each marker in the nucleus area using the DAPI signal to identify that area. Values were plotted with a box and whisker graph. The K-means clustering was done searching for two or three groups with 20 iterations for each run. Statistical differences were analyzed using Student’s t-test for unpaired samples with P ≤ 0.05*, ≤0.01**, ≤0.001***. All the IF experiments (Fig. 1d–j, 2a, c–f, and 4g–i and Supplementary Figs. 1–6 and 10) were repeated a minimum of three independent times with consistent results.
Western blot
Cell extracts were homogenized in the Complete Mini Protease inhibitor cocktail solution (Roche Diagnostics), followed by sonication. Protein material was quantified using the Bradford reagent. Proteins were resolved in 1× Laemelli reducing buffer by SDS-PAGE electrophoresis and transferred to a Nitrocellulose blotting membrane (Bio-Rad). Subsequently, membranes were blocked for 1 h in 5% non-fat milk-1X TBS solution and incubated overnight with primary antibodies. The antibodies used in this study are: mouse anti SMN (1:1000, Santa Cruz, SC-32313), rabbit anti-MOAB (1:1000, Novus, NBP2-13075), mouse anti-B4 (1:1000, Santa Cruz, SC-28365), rabbit anti-ANOA4 (1:1000, Invitrogen, PA5-62785), and rabbit anti-DNAO6 (1:1000, Invitrogen, PA5-57636). Membranes were then washed three times in 1X TBS; 0.05% Tween solution and incubated for 1 h with corresponding horseradish peroxidase-conjugated secondary antibodies. Membranes were developed using the Immobilon Western (Millipore).
Public ChIP-seq analysis and prediction of G-quadruplexes
ChIP-seq data sets were obtained through the GO platform using accession numbers GSE22162 (ATRX and H3K9me3), GSE44849 (XPB and XPD), GSE76688 (BG4), and GSE38273 (BMI1). BMI1 significant peaks were extracted from Meng et al.85. The Model-based Analysis for ChIP-Seq was used to extract significant peaks with a P value cutoff ≤0.05. Peak coordinates were mapped onto hg19 genome reference using SeqMonk v0.34.0 software (Babraham Bioinformatics). Putative G-quadruplexes were predicted using Quadparser algorithm V2 running under python v2.7.11 with indicated parameters for the number of guanines in each stack (G-groups), the number of base pairs between G-groups (loop size), and the number of time the loop and a stack was repeated after the initial stack (Repeats-1)33. For example, to perform a Quadparser on chromosome 1 searching for a stack of five Gs and a loop size of 1–7 with four repeats the scrip and the parameters will be: quadparser.py -f chr1.fa -r ([gG]{5,}\w{1,7}){3,}[gG]{5,}. G-quadruplexes coordinates for each set of parameters were then mapped onto hg19 genome reference using SeqMonk software. Annotation of ChIP-seq peaks with G-quadruplexes was determined by extending them 50 base pairs on each side and counting the number of overlapping predicted G-quadruplexes. SeqMINER was used for H3K9me3 ChIP-seq enrichment heatmap and K-means clustering using default parameters.
ChIP-seq experiment and data analysis
The ChIP-seq experiment was done using the Diagenode kit: “iDeal ChIP-seq kit for Histones” reference number: C01010051. We followed the manufacturer protocol with a minor change: after cross-linking, we added pyridostatin to buffer B in order to stabilize already present G4 structures. Immunoprecipitation was performed using the 1H6 antibody. The quantity and size of DNA fragments were verified on the Bioanalyzer with a “DNA 1000” chip. After precipitation, the DNA libraries were prepared using the “NEBext Fast DNA Library Prep Set for Ion Torrent” reference number E6270S. The libraries were then loaded on the “Ion Chef System” and sequenced on the Ion Torrent. Raw reads were aligned on the human genome Hg19 using the torrent platform. EaSeq software (V. 1.111)86 (http://easeq.net) was used in order to analyze the aligned data, and to call the peaks. EaSeq was also used in order to produce the heatmaps and the train plots with a window of 10 kb. Annotation and the statistical significance were done using the RegionR package (V. 1.18.1)87 using databases downloaded from UCSC table browser88. In order to identify the motifs, we used the MEME-ChIP from the MEME suite (V. 5.0.3)89,90. In order to identify the pathways of the identified genes, we used Gene Ontology (GO) (Panther 14.0)91,92,93. The ChIP-seq data of the 1H6 antibody performed in this publication have been deposited in NCBI’s Gene Expression Omnibus (Edgar et al., 2002) and are accessible through GO Series accession number GSE133113.
Transcriptome analysis
Total RNA from two independent biological samples were extracted from each of the two cell line derived from AD patients along with the two cell lines derived from healthy patients for a total of eight samples, using the standard procedure of Qiagen columns and assayed for RNA integrity. cDNA was prepared according to the manufacturer’s instructions (NEB library) and sequenced using the Illumina platform. Base-calling and feature count were done using Illumina software. For differential expression analysis, Dseq294 was used on R program95. For the gating strategy please refer to Supplementary Fig. 11a–c. The RNA-seq data were deposited and are accessible through GO series accession number GSE162873.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
Public data accession numbers used in this study are: GSE22162, GSE44849, GSE76688, GSE38273. The ChIP-seq data of the 1H6 antibody performed in this publication have been deposited in NCBI’s Gene Expression Omnibus and are accessible through GO Series accession number GSE133113. These data have been used to construct Supplementary Figs. 5–7. The RNA-seq data were deposited and are accessible through GO series accession number GSE162873. All data are also available from the authors upon reasonable request.
References
Buchwald, G. et al. Structure and E3-ligase activity of the Ring-Ring complex of Polycomb proteins Bmi1 and Ring1b. EMBO J. 25, 2465–2474 (2006).
Satijn, D. P. et al. RING1 is associated with the polycomb group protein complex and acts as a transcriptional repressor. Mol. Cell. Biol. (2015) https://doi.org/10.1128/mcb.17.7.4105.
Wang, H. et al. Role of histone H2A ubiquitination in Polycomb silencing. Nature 431, 873–878 (2004).
Kundu, S. et al. Polycomb Repressive Complex 1 generates discrete compacted domains that change during differentiation. Mol. Cell 65, 432–446.e5 (2017) .
Endoh, M. et al. PCGF6-PRC1 suppresses premature differentiation of mouse embryonic stem cells by regulating germ cell-related genes. Elife 6, e21064 (2017).
Ito, T., Teo, Y. V., Evans, S. A., Neretti, N. & Sedivy Correspondence, J. M. Regulation of cellular senescence by Polycomb chromatin modifiers through distinct DNA damage- and histone methylation-dependent pathways. Cell Rep. 22, 3480–3492 (2018).
Bracken, A. P. et al. The Polycomb group proteins bind throughout the INK4A-ARF locus and are disassociated in senescent cells. Genes Dev. 21, 525–530 (2007).
Dietrich, N. et al. Bypass of senescence by the polycomb group protein CBX8 through direct binding to the INK4A-ARF locus. EMBO J. 26, 1637–1648 (2007).
Abdouh, M., Hanna, R., El Hajjar, J., Flamier, A. & Bernier, G. The Polycomb Repressive Complex 1 protein BMI1 is required for constitutive heterochromatin formation and silencing in mammalian somatic cells. J. Biol. Chem. 291, 182–197 (2016).
Flamier, A. et al. Modeling late-onset sporadic Alzheimer’s disease through BMI1 deficiency. Cell Rep. 23, 2653–2666 (2018).
El Hajjar, J. et al. Heterochromatic genome instability and neurodegeneration sharing similarities with Alzheimer’s disease in old Bmi1+/− mice. Sci. Rep. 9, 594 (2019).
Frost, B., Hemberg, M., Lewis, J. & Feany, M. B. Tau promotes neurodegeneration through global chromatin relaxation. Nat. Neurosci. 17, 357–366 (2014).
Sun, W., Samimi, H., Gamez, M., Zare, H. & Frost, B. Pathogenic tau-induced piRNA depletion promotes neuronal death through transposable element dysregulation in neurodegenerative tauopathies. Nat. Neurosci. 21, 1038–1048 (2018).
Guo, C. et al. Tau activates transposable elements in Alzheimer’s disease. Cell Rep. 23, 2874–2880 (2018).
Xia, X., Jiang, Q., McDermott, J. & Han, J.-D. J. Aging and Alzheimer’s disease: comparison and associations from molecular to system level. Aging Cell 17, e12802 (2018).
Guerreiro, R. & Bras, J. The age factor in Alzheimer’s disease. Genome Med. 7, 106 (2015).
Boccardi, V., Pelini, L., Ercolani, S., Ruggiero, C. & Mecocci, P. From cellular senescence to Alzheimer’s disease: the role of telomere shortening. Ageing Res. Rev. (2015) https://doi.org/10.1016/j.arr.2015.04.003.
Reddy, P. H. et al. MicroRNAs, aging, cellular senescence, and Alzheimer’s disease. Prog. Mol. Biol. Transl. Sci. (2017). https://doi.org/10.1016/bs.pmbts.2016.12.009.
Frost, B., Bardai, F. H. & Feany, M. B. Lamin dysfunction mediates neurodegeneration in tauopathies. Curr. Biol. 26, 129–136 (2016).
Frost, B. Alzheimer’s disease: an acquired neurodegenerative laminopathy. Nucleus 7, 275–283 (2016).
Zhang, W. et al. A Werner syndrome stem cell model unveils heterochromatin alterations as a driver of human aging. Science 348, 1160–1163 (2015).
Scaffidi, P. & Misteli, T. Lamin A-dependent nuclear defects in human aging. Science 312, 1059–1063 (2006).
Taimen, P. et al. A progeria mutation reveals functions for lamin A in nuclear assembly, architecture, and chromosome organization. Proc. Natl Acad. Sci. USA 106, 20788–20793 (2009).
Tang, W. et al. The Werner syndrome RECQ helicase targets G4 DNA in human cells to modulate transcription. Hum. Mol. Genet. (2016) https://doi.org/10.1093/hmg/ddw079.
Gray, L. T., Vallur, A. C., Eddy, J. & Maizels, N. G quadruplexes are genomewide targets of transcriptional helicases XPB and XPD. Nat. Chem. Biol. 10, 313–318 (2014).
Sun, H., Karow, J. K., Hickson, I. D. & Maizels, N. The Bloom’s syndrome helicase unwinds G4 DNA. J. Biol. Chem. 273, 27587–27592 (1998).
Law, M. J. et al. ATR-X syndrome protein targets tandem repeats and influences allele-specific expression in a size-dependent manner. Cell 143, 367–378 (2010).
Cheung, I., Schertzer, M., Rose, A. & Lansdorp, P. M. Disruption of dog-1 in Caenorhabditis elegans triggers deletions upstream of guanine-rich DNA. Nat. Genet. (2002) https://doi.org/10.1038/ng928.
McLuckie, K. I. E. et al. G-quadruplex DNA as a molecular target for induced synthetic lethality in cancer cells. J. Am. Chem. Soc. (2013) https://doi.org/10.1021/ja404868t.
De, S. & Michor, F. DNA secondary structures and epigenetic determinants of cancer genome evolution. Nat. Struct. Mol. Biol. (2011) https://doi.org/10.1038/nsmb.2089.
Wang, Y. et al. G-quadruplex DNA drives genomic instability and represents a targetable molecular abnormality in ATRX-deficient malignant glioma. Nat. Commun. 10, 943 (2019).
Maizels, N. & Gray, L. T. The G4 genome. PLoS Genet. 9, e1003468 (2013).
Huppert, J. L. & Balasubramanian, S. Prevalence of quadruplexes in the human genome. Nucleic Acids Res. 33, 2908–2916 (2005).
Hänsel-Hertsch, R. et al. G-quadruplex structures mark human regulatory chromatin. Nat. Genet. (2016) https://doi.org/10.1038/ng.3662.
Henderson, A. et al. Detection of G-quadruplex DNA in mammalian cells. Nucleic Acids Res. 42, 860–869 (2014).
Lam, E. Y. N., Beraldi, D., Tannahill, D. & Balasubramanian, S. G-quadruplex structures are stable and detectable in human genomic DNA. Nat. Commun. 4, 1796 (2013).
Biffi, G., Di Antonio, M., Tannahill, D. & Balasubramanian, S. Visualization and selective chemical targeting of RNA G-quadruplex structures in the cytoplasm of human cells. Nat. Chem. 6, 75–80 (2014).
Rodriguez, R. et al. Small-molecule-induced DNA damage identifies alternative DNA structures in human genes. Nat. Chem. Biol. 8, 301–310 (2012).
Johnson, J. E., Cao, K., Ryvkin, P., Wang, L. S. & Johnson, F. B. Altered gene expression in the Werner and Bloom syndromes is associated with sequences having G-quadruplex forming potential. Nucleic. Acids Res. (2009) https://doi.org/10.1093/nar/gkp1103.
Fry, M. & Loeb, L. A. Human Werner syndrome DNA helicase unwinds tetrahelical structures of the fragile X syndrome repeat sequence d(CGG)(n). J. Biol. Chem. (1999) https://doi.org/10.1074/jbc.274.18.12797.
Sidorova, J. M. Roles of the Werner syndrome RecQ helicase in DNA replication. DNA Repair (2008) https://doi.org/10.1016/j.dnarep.2008.07.017.
Rogakou, E. P., Pilch, D. R., Orr, A. H., Ivanova, V. S. & Bonner, W. M. DNA double-stranded breaks induce histone H2AX phosphorylation on serine 139. J. Biol. Chem. (1998) https://doi.org/10.1074/jbc.273.10.5858.
Boguslawski, S. J. et al. Characterization of monoclonal antibody to DNA RNA and its application to immunodetection of hybrids. J. Immunol. Methods 89, 123–130 (1986).
Phillips, D. D. et al. The sub-nanomolar binding of DNA-RNA hybrids by the single-chain Fv fragment of antibody S9.6. J. Mol. Recognit. 26, 376–381 (2013).
Bhaskara, S. et al. Hdac3 is essential for the maintenance of chromatin structure and genome stability. Cancer Cell (2010) https://doi.org/10.1016/j.ccr.2010.10.022.
Blander, G. & Guarente, L. The Sir2 family of protein deacetylases. Annu. Rev. Biochem. 73, 417–435 (2004).
Chatoo, W. et al. The Polycomb Group Gene Bmi1 regulates antioxidant defenses in neurons by repressing p53 pro-oxidant activity. J. Neurosci. (2009) https://doi.org/10.1523/jneurosci.5303-08.2009.
Barabino, A. et al. Loss of Bmi1 causes anomalies in retinal development and degeneration of cone photoreceptors. Development 143, 1571–1584 (2016).
Panier, S. & Boulton, S. J. Double-strand break repair: 53BP1 comes into focus. Nat. Rev. Mol. Cell Biol. (2014) https://doi.org/10.1038/nrm3719.
Leemans, C. et al. Promoter-intrinsic and local chromatin features determine gene repression in LADs. Cell 177, 852–864.e14 (2019) .
Vazquez, B. N. et al. SIRT7 mediates L1 elements transcriptional repression and their association with the nuclear lamina. Nucleic Acids Res. (2019) https://doi.org/10.1093/nar/gkz519.
Hernández, F., García-García, E. & Avila, J. Microtubule depolymerization and tau phosphorylation. J. Alzheimer’s Dis. 37, 507–513 (2013).
Hernandez, F., Lucas, J. J. & Avila, J. GSK3 and tau: two convergence points in Alzheimer’s disease. J. Alzheimers Dis. 33, S141–S144 (2013).
Mungamuri, S. K. et al. p53-mediated heterochromatin reorganization regulates its cell fate decisions. Nat. Struct. Mol. Biol. 19, 478–484, S1 (2012) .
Meuleman, W. et al. Constitutive nuclear lamina-genome interactions are highly conserved and associated with A/T-rich sequence. Genome Res. 23, 270–280 (2013).
Guédin, A., Gros, J., Alberti, P. & Mergny, J. L. How long is too long? Effects of loop size on G-quadruplex stability. Nucleic Acids Res. (2010) https://doi.org/10.1093/nar/gkq639.
Varizhuk, A. et al. The expanding repertoire of G4 DNA structures. Biochimie (2017) https://doi.org/10.1016/j.biochi.2017.01.003.
Piazza, A. et al. Non-canonical G-quadruplexes cause the hCEB1 minisatellite instability in Saccharomyces cerevisiae. Elife (2017) https://doi.org/10.7554/elife.26884.
Kikin, O., D’Antonio, L. & Bagga, P. S. QGRS Mapper: a web-based server for predicting G-quadruplexes in nucleotide sequences. Nucleic Acids Res. 34, W676–W682 (2006).
D’Antonio, L. & Bagga, P. Computational methods for predicting intramolecular G-quadruplexes in nucleotide sequences. In Proceedings—2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004 590–591 (2004). https://doi.org/10.1109/csb.2004.1332508.
Fiszbein, A., Krick, K. S., Begg, B. E. & Burge, C. B. Exon-mediated activation of transcription starts. Cell 179, 1551–1565.e17 (2019) .
Levy, M. A., Kernohan, K. D., Jiang, Y. & Bérubé, N. G. ATRX promotes gene expression by facilitating transcriptional elongation through guanine-rich coding regions. Hum. Mol. Genet. 24, 1–12 (2014).
Murat, P. & Balasubramanian, S. Existence and consequences of G-quadruplex structures in DNA. Curr. Opin. Genet. Dev. (2014) https://doi.org/10.1016/j.gde.2013.10.012.
Chambers, V. S. et al. High-throughput sequencing of DNA G-quadruplex structures in the human genome. Nat. Biotechnol. (2015) https://doi.org/10.1038/nbt.3295.
van Steensel, B. & Belmont, A. S. Lamina-associated domains: links with chromosome architecture, heterochromatin, and gene repression. Cell 169, 780–791 (2017).
Lander, E. S. et al. Initial sequencing and analysis of the human genome. Nature 409, 860–921 (2001).
Brouha, B. et al. Hot L1s account for the bulk of retrotransposition in the human population. Proc. Natl Acad. Sci. USA 100, 5280–5285 (2003).
Flasch, D. A. et al. Genome-wide de novo L1 retrotransposition connects endonuclease activity with replication. Cell 177, 837–851.e28 (2019) .
Howell, R. & Usdin, K. The ability to form intrastrand tetraplexes is an evolutionarily conserved feature of the 3’ end of L1 retrotransposons. Mol. Biol. Evol. 14, 144–155 (1997).
Raj, T. et al. Integrative transcriptome analyses of the aging brain implicate altered splicing in Alzheimer’s disease susceptibility. Nat. Genet. 50, 1584–1592 (2018).
Llorens-MartÃn, M. et al. Selective alterations of neurons and circuits related to early memory loss in Alzheimer’s disease. Front. Neuroanat. (2014) https://doi.org/10.3389/fnana.2014.00038.
Arendt, T. et al. Plastic neuronal remodeling is impaired in patients with Alzheimer’s disease carrying apolipoprotein ε4 allele. J. Neurosci. (2018) https://doi.org/10.1523/jneurosci.17-02-00516.1997.
Skaper, S. D., Facci, L., Zusso, M. & Giusti, P. Synaptic plasticity, dementia and Alzheimer disease. CNS Neurol. Disord. Drug Targets (2017) https://doi.org/10.2174/1871527316666170113120853.
Uchida, Y. et al. Semaphorin3A signalling is mediated via sequential Cdk5 and GSK3β phosphorylation of CRMP2: implication of common phosphorylating mechanism underlying axon guidance and Alzheimer’s disease. Genes Cells (2005) https://doi.org/10.1111/j.1365-2443.2005.00827.x.
Good, P. F. et al. A role for semaphorin 3A signaling in the degeneration of hippocampal neurons during Alzheimer’s disease. J. Neurochem. (2004) https://doi.org/10.1111/j.1471-4159.2004.02766.x.
Van Battum, E. Y., Brignani, S. & Pasterkamp, R. J. Axon guidance proteins in neurological disorders. Lancet Neurol. (2015) https://doi.org/10.1016/S1474-4422(14)70257-1.
Lerdkrai, C. et al. Intracellular Ca 2+ stores control in vivo neuronal hyperactivity in a mouse model of Alzheimer’s disease. Proc. Natl. Acad. Sci. USA (2018) https://doi.org/10.1073/pnas.1714409115.
Toni, N. & Sultan, S. Synapse formation on adult-born hippocampal neurons. Eur. J. Neurosci. (2011) https://doi.org/10.1111/j.1460-9568.2011.07604.x.
Zhang, Y.-J. et al. Heterochromatin anomalies and double-stranded RNA accumulation underlie C9orf72 poly(PR) toxicity. Science 363, eaav2606 (2019).
Liu, E. Y. et al. Loss of nuclear TDP-43 is associated with decondensation of LINE retrotransposons. Cell Rep. 27, 1409–1421.e6 (2019) .
Belzil, V. V. et al. Reduced C9orf72 gene expression in c9FTD/ALS is caused by histone trimethylation, an epigenetic event detectable in blood. Acta Neuropathol. 126, 895–905 (2013).
Berson, A. et al. TDP-43 promotes neurodegeneration by impairing chromatin remodeling. Curr. Biol. 27, 3579–3590.e6 (2017) .
Krug, L. et al. Retrotransposon activation contributes to neurodegeneration in a Drosophila TDP-43 model of ALS. PLoS Genet. 13, e1006635 (2017).
Israel, M. A. et al. Probing sporadic and familial Alzheimer’s disease using induced pluripotent stem cells. Nature 482, 216–220 (2012).
Meng, S. et al. Identification and characterization of Bmi-1-responding element within the human p16 promoter. J. Biol. Chem. (2010) https://doi.org/10.1074/jbc.M110.133686.
Lerdrup, M., Johansen, J. V., Agrawal-Singh, S. & Hansen, K. An interactive environment for agile analysis and visualization of ChIP-sequencing data. Nat. Struct. Mol. Biol. (2016) https://doi.org/10.1038/nsmb.3180.
Gel, B. et al. RegioneR: an R/Bioconductor package for the association analysis of genomic regions based on permutation tests. Bioinformatics (2016) https://doi.org/10.1093/bioinformatics/btv562.
Karolchik, D. et al. The UCSC Genome Browser Database. Nucleic Acids Res. (2003) https://doi.org/10.1093/nar/gkg129.
Bailey, T. L. et al. MEME Suite: tools for motif discovery and searching. Nucleic Acids Res. (2009) https://doi.org/10.1093/nar/gkp335.
Machanick, P. & Bailey, T. L. MEME-ChIP: motif analysis of large DNA datasets. Bioinformatics (2011) https://doi.org/10.1093/bioinformatics/btr189.
Ashburner, M. et al. Gene Ontology: tool for the unification of biology. Nat. Genet. (2000) https://doi.org/10.1038/75556.
Carbon, S. et al. The Gene Ontology Resource: 20 years and still GOing strong. Nucleic Acids Res. 47, D330–D338 (2019).
Mi, H. et al. PANTHER version 11: expanded annotation data from Gene Ontology and Reactome pathways, and data analysis tool enhancements. Nucleic Acids Res. (2017) https://doi.org/10.1093/nar/gkw1138.
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Team, R. C. The R project for statistical computing. http://www.R-Project.Org/ 1–12 https://www.r-project.org/ (2013).
Gray, R. D., Trent, J. O. & Chaires, J. B. Folding and unfolding pathways of the human telomeric G-quadruplex. J. Mol. Biol. (2014) https://doi.org/10.1016/j.jmb.2014.01.009.
Zizza, P. et al. Intragenic G-quadruplex structure formed in the human CD133 and its biological and translational relevance. Nucleic Acids Res. 44, 1579–1590 (2016).
Acknowledgements
This work was supported by grants from the National Science and Engineering Research Council of Canada (NSERC), Canadian Institutes of Health Research (CIHR), Maisonneuve-Rosemont Hospital Foundation, and Pierre Theroux Family Foundation for Alzheimer’s Disease Research. A. F., R. H., and A. B. were supported by fellowships from the Molecular Biology Program of Université de Montréal.
Author information
Authors and Affiliations
Contributions
G. B. and R. H.: conceived and designed; R. H., A. F., and A. B.: performed the experiments; G. B., R. H., A. F., and A. B.: analyzed the data; G. B., R. H., A. F., and A. B.: wrote the paper.
Corresponding author
Ethics declarations
Competing interests
A. F. and G. B. are co-founders and shareholders of StemAxonTM. The corporation was however not involved in this study. R. H. and A. B. declare no competing interests.
Additional information
Peer review information Nature Communications thanks David Brocks, Jason Huse, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hanna, R., Flamier, A., Barabino, A. et al. G-quadruplexes originating from evolutionary conserved L1 elements interfere with neuronal gene expression in Alzheimer’s disease. Nat Commun 12, 1828 (2021). https://doi.org/10.1038/s41467-021-22129-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-021-22129-9
This article is cited by
-
Werner helicase interacting protein 1 contributes to G-quadruplex processing in human cells
Scientific Reports (2024)
-
Blockade of STING activation alleviates microglial dysfunction and a broad spectrum of Alzheimer’s disease pathologies
Experimental & Molecular Medicine (2024)
-
A tensor decomposition-based integrated analysis applicable to multiple gene expression profiles without sample matching
Scientific Reports (2022)
-
Promoter G-quadruplex folding precedes transcription and is controlled by chromatin
Genome Biology (2021)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.