Introduction

Herpesviruses are ubiquitous double-stranded DNA (dsDNA) viruses. Alphaherpesvirinae, Betaherpesvirinae, and Gammaherpesvirinae are subfamilies of Herpesviridae that are delineated on the basis of cellular tropism, efficiency of viral replication and manifestation of viral disease1,2,3. Herpesviruses differ from other viruses in their ability to establish latency in the infected host4,5,6. During the course of evolution, herpesviruses have captured several host genes that facilitate their survival in the host7,8,9,10,11,12. Once acquired, the virus homologues of human genes perform a wide array of functions including evading host immune responses, regulating apoptosis in virus infected cells, and regulating the host transcription and translation machinery13,14,15,16,17,18. Four human gene homologues (i.e., DNA helicase, DNA-dependent DNA polymerase, Ribonucleotide reductase large subunit and, Uracil-DNA glycosylase) are known to be present in all human herpesviruses. A sizable proportion (~ 13%) of proteins encoded by human herpesviruses are homologues of human proteins19. About a third of all proteins encoded by Kaposi’s sarcoma associated herpesvirus (KSHV; Human Herpesvirus 8) share homology with human proteins20,21,22.

We hypothesized that while capturing host genes, human herpesviruses have also captured the regulatory regions of the host genes. In other words, we wanted to understand if virus homologues of cellular proteins encoded by human herpesviruses also mimic the regulatory regions of their human counterparts. In particular, we wanted to study the role of G-quadruplexes which are nucleic acid secondary structures in regulatory regions of virus homologues and their human counterparts.

G-quadruplexes are non-canonical nucleic acid secondary structures facilitated by hoogsteen hydrogen bonding. These structures are composed of three or more G-tetrads stacked upon each other. G-quadruplexes are enriched in promoters of human genes and they modulate the promoter activity of several proto-oncogenes including c-myc, c-kit1, c-kit2, vegf, k-ras, met, hif1α and bcl-223,24,25,26,27,28,29,30,31,32.

Genomes of human herpesviruses are enriched for G-quadruplexes33,34,35,36. Studies have revealed the antiviral effect of G-quadruplex specific ligands37,38,39,40. These ligands were also found to interact with G-quadruplexes found in herpesvirus genomes and significantly alter virus life cycle41,42,43. G-quadruplexes in herpesvirus gene promoters modulate promoter activity and regulate gene expression44,45,46,47. Viral Bcl-2 (vBcl-2) represents one of the most well studied herpesvirus homologues of human proteins. Gammaherpesviruses infecting humans i.e., Kaposi’s sarcoma associated herpesvirus (KSHV/HHV8) and Epstein Barr Virus (EBV/HHV4) are known to encode for viral Bcl-2 homologues namely, KSHV KS-Bcl-2 protein (encoded by orf16 gene) and EBV BHRF1 protein (encoded by the bhrf1 gene)16,48,49,50. These viral Bcl-2 homologues are known to share homology with each other and also with their cellular counterpart51. The vBcl-2 proteins are expressed early during lytic replication and are known to regulate apoptosis of the infected host cells to promote release of viral progeny48,51,52,53.

We performed systematic screening to identify G-quadruplexes in virus homologues and their human counterparts. We identified putative quadruplex sequence (PQS) motifs in the regulatory regions (promoters) of virus homologues and their human counterparts. These PQS formed stable G-quadruplexes in vitro. To understand the biological role of these promoter G-quadruplexes in virus homologues we chose to study vBcl-2. We performed extensive functional characterization of G-quadruplexes in the promoters of KSHV KS-Bcl-2 and EBV BHRF1. We found that the G-quadruplex in both the promoters of KS-Bcl-2 and BHRF1 enhance viral Bcl-2 expression. On the contrary, the G-quadruplexes in human bcl-2 gene promoter supress gene expression. Studies with G-quadruplex binding ligands and mutations disrupting the G-quadruplex in the KS-Bcl-2 and BHRF1 promoters confirm a positive regulatory role for these secondary structures. Further, we also demonstrate a role for these vBcl-2 promoter G-quadruplexes in facilitating virus transcription factor mediated enhancement of vBcl-2 expression. We found that human herpesvirus homologues often mimic regulatory regions of their human counterparts. Herpesviruses exploit these secondary structures in the promoters of virus homologues to their advantage through virus-encoded transcription factors.

Materials and methods

Retrieval of sequences

The upstream regulatory region (-5000 bp to + 100 bp) with respect to the transcription start site of human genes (n = 28) that are known to have homologues in human herpesviruses were retrieved from Eukaryotic Promoter Database (EPD) freely accessible at https://epd.epfl.ch//index.php54. Although only a small proportion of herpesvirus promoters have been experimentally validated, previous reports in literature suggest that herpesvirus promoters are typically found within the 1 kb upstream region of the genes44,55,56,57,58,59. Taking this into consideration, the 1 kb upstream regulatory region of viral homologues of human herpesviruses (HHV1, HHV2, HHV3, HHV4, HHV5, HHV6A, HHV7, HHV8) along with human adenoviruses (Classes A, B1, B2, C, D, E, F, G), African swine fever virus (ASFV), Alcelaphine herpesvirus 1 (AHV1), Bovine herpesvirus 4 (BoHV4), Murine gammaherpesvirus 68 (GH68), Meleagrid herpesvirus 1 (MHV1), Lymphocystis disease virus (LDV), Frog virus 3 (FV3), Singapore grouper iridovirus (SGIV), Fowlpox virus (FPV) and ORF virus (ORFV) were obtained from ViPR (http://www.viprbrc.org) database60 and NCBI GenBank (http://www.ncbi.nlm.nih.gov) for PQS mining. The 1 kb upstream regulatory regions are henceforth referred to as putative promoter regions of homologous genes. Accession number of virus strains used in the study is listed in Supplementary File 2: Table S2T1 and Table S2T2.

PQS mapping and randomization

The retrieved upstream sequences (human and viral) were analysed to identify PQS motifs with parameters (minimum G-tetrad = 3 and loop length = 1–15) using Quadparser61. PQS density was defined as the total number of non-overlapping PQS predicted per kilo base of the sequence analysed. The selected sequences were then shuffled while preserving their individual dinucleotide frequencies to determine whether the presence of PQS motifs is a random event. A python program script62 based on the freely available ‘uShuffle’ program script63 was used for dinucleotide shuffling of the 1 kb upstream regulatory sequences of human herpesvirus Bcl-2 homologues (i.e., KS-Bcl-2 and BHRF1). Average PQS densities were calculated in the randomized sequences generated and were compared to that in the native sequences.

PQS conservation analyses

Upstream regulatory regions with at least 1 PQS motif were analysed for conservation. Accession numbers of all sequences analysed are provided in Supplementary File 2; Table S2T2. A conserved PQS motif does not necessarily possess identical core guanine (G) residues. PQS motifs with adjacent Gs capable of forming a quadruplex were also deemed as conserved.

CD spectroscopy and CD melting analyses

CD spectroscopy was performed using oligonucleotides corresponding to the randomly selected PQS in the promoter regions of human herpesvirus homologues (n = 15) and their cellular counterparts (n = 9). The sequences of the selected PQS oligonucleotides are provided in Supplementary File 2; Table S2T3 and Table S2T4. All synthetic oligonucleotides were purchased from Integrated DNA technologies (IDT) for spectroscopic analyses. Oligonucleotide samples were mixed at a concentration of 10 µM in 10 mM sodium cacodylate buffer (pH -7.5) containing 100 mM KCl. The prepared samples were heated at 95 °C for 5 min and then gradually cooled to room temperature. PQS oligonucleotides from upstream regulatory regions of KSHV KS-Bcl-2 and EBV BHRF1 genes (Wt-KSHV-GQ and Wt-EBV-GQ oligonucleotides along with their corresponding mutants Mut-KSHV-GQ and Mut-EBV-GQ) were prepared in a similar manner (Sequences are provided in Supplementary File 2, Table S2T5). In addition, CD melting was performed using Wt-KSHV-GQ and Wt-EBV-GQ PQS oligonucleotides in the presence and absence of G-quadruplex ligand pyridostatin (PDS; 5 µM). Data were recorded at a ramp rate of 1 °C/minute over a range of 20–93 °C at a fixed wavelength of 262 nm. Tm was calculated using the first derivative method.

UV melting analyses

UV melting experiments were performed using Cary 100 Bio UV–Vis double-beam spectrophotometer (Agilent Technologies) equipped with a multi-cell holder attached to a Peltier controller. The Wt-KSHV-GQ and Wt-EBV-GQ oligonucleotides (4 µM) were dissolved in 10 mM sodium cacodylate buffer (pH -7.5) and 100 mM KCl. The samples were then heated to 95 °C for 5 min and gradually cooled to room temperature. Pyridostatin at a concentration of 5 µM was used for melting analysis. The melting curves were recorded at 295 nm between 20 °C and 95 °C with a ramp rate of 1 °C/min. Origin 9.0 (Origin Lab Corp.) was used to analyse and plot melting curves. Tm was calculated using the first derivative method.

Polyacrylamide gel electrophoresis

Samples were prepared using PQS oligonucleotides (Wt-KSHV-GQ and Wt-EBV-GQ and their corresponding mutants with length matched controls; Supplementary File 2; Table S2T5) at 10 µM concentration in Tris–EDTA buffer (pH -7.0) and 100 mM KCl. The prepared samples were subjected to heating at 95 °C for 5 min followed by slow cooling to room temperature before loading. Native and denaturing polyacrylamide gels were prepared in 1 × Tris–borate EDTA (TBE) buffer. The denaturing polyacrylamide gel was supplemented with 7 M urea as a denaturant. Both gels were run in 0.5 × TBE with 50 mM KCl. Full length images are available in Supplementary File 3.

NMR spectroscopy

Samples for NMR spectroscopy were prepared using PQS oligonucleotides (Wt-KSHV-GQ and Wt-EBV-GQ and their corresponding mutants; Supplementary File 2; Table S2T5) at a concentration of 300 μM in 20 mM phosphate buffer (pH -7.0) supplemented with 100 mM KCl and 10% D2O (v/v). The samples were then heated to 95 °C and allowed to cool slowly to room temperature. 1D 1H NMR spectra were recorded at a constant temperature of 20 °C on a Bruker Avance III spectrometer equipped with cryogenic 5 mm TCI triple-resonance probe, operating at a field strength of 500 MHz. Topspin 3.5 (Bruker AG) was used for data acquisition. Data processing and plotting of spectra were done using Topspin 4.6 software (Bruker AG).

Dimethyl sulphate (DMS) footprinting

Synthetic 5’ FAM labelled PQS oligonucleotides were purchased from Integrated DNA Technologies (IDT) for footprinting analyses (Supplementary File 2; Table S2T6). The oligonucleotides were prepared in Tris–EDTA (10 mM) with and without KCl, heated to 95 °C followed by gradual cooling to room temperature to allow G-quadruplex formation. These oligonucleotides were further treated with 0.5% DMS for 2 min to allow methylation of guanine residues. This reaction was stopped using 1 μg of calf thymus DNA. The reactions were run on a 20% native polyacrylamide gel. The desired samples were gel extracted and subjected to ethanol precipitation. The extracted samples were then cleaved using 1 M piperidine at 95 °C. Cleaved products were dried using a SpeedVac machine, and further re-suspended in alkaline sequencing dye. The final products were then resolved on a 20% denaturing polyacrylamide gel. Full length images are available in Supplementary File 3.

Cloning of KSHV KS-Bcl-2 and EBV BHRF1 promoter regions

The PQS containing 1 kb promoter region of KSHV KS-Bcl-2 was amplified using HHV8 DNA (JSC-1; courtesy Dr. Tathagata Choudhuri, Visva Bharati University, Bolpur, India) and the PQS containing 1 kb promoter region of EBV BHRF1 was commercially synthesized by Life Technologies Corp. The amplified products were cloned into a promoter less firefly luciferase vector (pGL3 basic vector; Promega).

Cloning of KSHV KS-Bcl-2 and EBV BHRF1 coding regions with their native promoters

For mRNA quantification studies, the KSHV KS-Bcl-2 protein coding region was cloned along with its 1 kb promoter in ∆luc pGL3 basic vector. Similarly, the EBV BHRF1 protein coding region was also subcloned from the MSCV-N BHRF1 construct {MSCV-N BHRF1 was a gift from Dr. Karl Munger (Addgene plasmid #37,937; http://n2t.net/addgene:37937;RRID:Addgene_37937)64 along with its 1 kb promoter in ∆luc pGL3 basic vector. Mutations were incorporated into PQS containing regions to disrupt G-quadruplex formation. Sequences of mutant oligonucleotides are provided in Fig. 3a. The primers used for amplification and cloning (wild type and mutants) are provided in Supplementary File 2; Table S2T7. The plasmid constructs were purified using QIAprep Spin Miniprep Kit (Qiagen) and confirmed by sequencing. The human bcl-2 gene promoter luciferase construct LB322 (Bcl-2 from ATG to -3934) was a gift from Dr. Linda Boxer (Addgene plasmid #15,381; http://n2t.net/addgene:15381;RRID:Addgene_1538)65.

Cell lines

HEK293T cells were procured from NCCS, Pune, India and were maintained in Dulbecco’s modified medium (Invitrogen) supplemented with 10% fetal bovine serum (Invitrogen) and incubated at 37 °C with 5% CO2.

Luciferase reporter assay

For promoter activity analyses, HEK293T cells were seeded in a 24-well plate at a concentration of 5 × 104 cells and co-transfected with luciferase constructs containing KS-Bcl-2 and BHRF1 promoters (wild type or mutant) along with pRL-TK vector (a renilla luciferase reporter construct with a thymidine kinase promoter) at the concentration ratio of 25:1 (i.e., 500 ng for pGL3 constructs and 20 ng for pRL-TK) using PEI (polyethylenimine). To determine the effect of ligand on promoter activity, 5 µM pyridostatin (PDS) was added to the samples 2 h after transfection62. Additionally, we co-transfected KSHV-RTA (Plasmid pcDNA-RTA;100 ng) and EBV-RTA (Plasmid pSG5-BRLF1;100 ng) with the KSHV KS-Bcl-2 and EBV BHRF1 promoter constructs, respectively, to understand the role of viral RTA in regulating the promoter activity of these virus homologues. Plasmid pcDNA-RTA was a kind gift from Dr. Erle S. Robertson (University of Pennsylvania, Philadelphia, PA, USA)66. Plasmid pSG5-BRLF1 was a gift from Dr. S. Diane Hayward (Addgene plasmid #72635; http://n2t.net/addgene:72635;RRID:Addgene_72635)67. Cell lysates were obtained at 24 h post transfection. Luciferase assay was carried out using a dual luciferase reporter assay system (Promega) according to the manufacturer’s protocol. Luminescence values for both firefly and renilla were recorded using a MicroBeta2 Microplate Scintillation Counter (Perkin Elmer). Firefly luciferase activity was normalized to Renilla luciferase activity. Three independent experiments were done in triplicates.

Quantitative real time PCR

Total RNA was extracted from transfected HEK293T cells using TRIzol reagent (ThermoFisher scientific) according to the manufacturer's protocol. The extracted RNA (500 ng) was treated with 1 μg of DNase I (New England Biolabs) to remove residual DNA. cDNA was synthesized using iScript cDNA synthesis kit (Biorad). Real time PCR was performed using the FastStart essential DNA green master mix (Roche) for quantifying viral Bcl-2 mRNA levels i.e., KSHV KS-Bcl-2 and EBV BHRF1 mRNA levels. The appropriate primers used for qPCR studies are listed in Supplementary File 2; Table S2T8. Glyceraldehyde 3-phosphate dehydrogenase (GAPDH) was used as an internal control for quantification. Fold change in expression was calculated using the 2-∆∆Ct method68.

Data analyses, graphical representation, and statistics

Microsoft Excel was used for analysis of data and plotting of bar graphs unless mentioned otherwise. Origin 9.0 was used for plotting CD curves. Figure 7 was created using Microsoft PowerPoint. Student’s t-test was used to determine statistical significance. P values less than 0.05 were considered significant.

Results

Abundance of PQS motifs in promoters of herpesvirus homologues and their human counterparts

We analysed human genes (n = 28) that are reported to share sequence homology with several herpesvirus genes19, for the presence of PQS motifs in their upstream regulatory regions (-5000 bp to + 100 bp)69 using Quadparser61. It was observed that 79% of these human genes possess at least 1 PQS motif in their upstream regulatory region (coding strand and/or template strand) (Fig. 1a). Additional details such as Ensemble ID, length, orientation of the human genes used for analyses is provided in Supplementary File 1, Table S1T1. Correspondingly, we also analysed the 1 kb upstream regulatory regions of the homologous genes encoded by human herpesviruses for PQS motifs. Although, minimal promoter regions (< 1 kb upstream region) have been described for some herpesvirus genes55,70,70,72, we performed PQS mining using a consensus sequence for the complete 1 kb putative promoter regions or the upstream regulatory regions. Over 40% of human herpesvirus homologues were found to possess at least 1 PQS motif in their upstream regulatory regions (Fig. 1a). The promoters of homologues encoded by human herpesvirus 1 (HHV1/HSV-1; Herpes simplex virus 1), human herpesvirus 2 (HHV2/HSV-2; Herpes simplex virus 2), human herpesvirus 4 (HHV4/EBV; Epstein Barr Virus) and human herpesvirus 8 (HHV8/KSHV; Kaposi’s sarcoma-associated herpesvirus) were particularly enriched for PQS motifs. In contrast, majority of promoters of homologues encoded by human herpesvirus 3 (HHV3/VZV; Varicella zoster virus) and betaherpesviruses [human herpesvirus 5 (HHV5/HCMV; Human cytomegalovirus), human herpesvirus 6A (HHV-6A) and human herpesvirus 7 (HHV-7)] did not contain PQS motifs (Fig. 1b).

Figure 1
figure 1

A significant proportion of promoters of human genes captured by herpesviruses and the corresponding virus homologues contain at least one PQS motif. (a) Pie charts show the proportion of promoters of human genes (~ 79%) captured by herpesviruses and that of corresponding herpesvirus homologues (~ 42%) with at least 1 PQS motif. (b) Bar graph shows the percentage of viral homologues from human herpesviruses with at least 1 PQS motif in their upstream regulatory regions. Promoters of virus homologues from HHV1, HHV2 and gammaherpes viruses (HHV4 and HHV8) were enriched for PQS motifs. N denotes number of virus homologues.

Notably, all alpha and gamma human herpesviruses (HHV1, HHV2, HHV3, HHV4, HHV8) encode for corresponding viral homologues of five human genes in particular (with PQS containing promoters) i.e., DNA polymerase (POLA), DNA helicase (HFM1), Uracil DNA glycosylase (UNG1), Ribonucleotide reductase large subunit (RRM1) and Ribonucleotide reductase small subunit (RRM2)19. However, it was noted that only homologues encoded by HHV1 and HHV2 for all five genes (POLA, HFM1, UNG1, RRM1 and RRM2) contained at least 1 PQS in their upstream 1 kb regulatory regions (Table 1). Overall, it was observed that the majority of PQS containing human gene promoters (16 out of 22) do have corresponding human herpesvirus homologues (n = 29) with each possessing at least 1 PQS motif in their promoter (Supplementary File 1; Table S 1T1 and Figure S1F1). The PQS motifs found in herpesvirus putative promoter regions are comprehensively described in Supplementary File 2; Table S2T9.

Table 1 PQS motifs in promoters of h uman herpesvirus homologues belonging to different functional classes.

PQS motifs form G-qua druplex structures in vitro

CD s pectroscopic ana lyses was performed to determine the topology of randomly sele cted PQS motifs (n = 24) from promoters of human genes and their herpesvirus homologues. Details of the PQS oligonucleotides are provided in Supplementary File 2; Table S2T3 and S 2T4. These randomly selected PQS motifs were selected from the group of PQS containing promoters of human genes that match with their PQS cont aining promot ers of herpesvirus homologues (Supplementary File 1; Figure S1F1). Parallel G-quadruplexes exhibit a characteristic positive peak at 260 nm and a characteristic negative peak at 240 nm while antiparallel G-quadruplexes show a characteristic positive peak at 290 nm and a characteristic negative peak at 260 nm during CD spectroscopic analyses. Hybrid or mix-type G-quadruplexes exhibit distinct positive peaks, one each at 260 nm and 290 nm, and a negative peak at 240 nm. The CD spectra of randomly selected PQS motifs found in human gene prom oters (n = 9) and in the promoters of virus homologues (n = 15) show formation of parallel or hybrid G-quadruplexes (Supplementary File 1; Figure S1F2 and S1F3).

PQS motifs in upstream regulatory re gions of viral Bcl-2 homologues are conserved

Viral Bcl-2 (vBcl-2) is among the most extensively studied homologues encoded by herpesviruses. Therefore, to investigate the functional role of G-quadruplex in promoters of virus homologues we chose to study viral bcl-2 gene promoters. The human bcl-2 gene is a well-studied example of a proto-oncogene reported to contain stable G-quadruplexes that supress Bcl-2 expression (26, 27, 32, 72, 73). The human Bcl-2 is a critical mitochondrial membrane protein that is known to regulate the intrinsic apoptotic pathway by mediating mitochondrial cytochrome c release upon apoptotic induction thereby acting as an apoptotic inhibitor73,74. Apart from herpesviruses, some adenoviruses and iridoviruses also encode for structurally and functionally similar Bcl-2 homologues75,77,78,78. Although this work is primarily focused on human herpesviruses and since we chose to study vBcl-2 in detail, we also wanted to analyse all known viral Bcl-2 homologues (BCL2DB database; http://bcl2db.ibcp.fr/).79 for PQS motifs in their upstream 1 kb regulatory regions. Of the 15 vBcl-2 homologues known till date, 9 of them contained a PQS motif in their 1 kb upstream regulatory region (Supplementary File 1; Table S1T2). For conservation analysis, we used all available full-length sequences from viruses with a PQS motif in their promoter. Our analysis suggests that the PQS motifs in 8 vBcl-2 promoters were highly conserved (Supplementary File 1; Figure S1F4). The presence of highly conserved PQS motifs in the promoters of vBcl-2 from herpesviruses and other viruses suggests a potential functional role for these DNA secondary structures across virus families.

Among human herpesviruses, gammaherpesviruses KSHV orf16 and EBV bhrf1 genes encode for anti-apoptotic viral Bcl-2 proteins (KS-Bcl-2 and BHRF1 respectively) that share sequence and functional homology with the human bcl-2 gene48,51,80,81. Previous studies reported that P1 and P2 promoters (as well as the upstream proximal promoter regions) of human bcl-2 gene are enriched for G-quadruplexes and supress transcription of the human bcl-2 gene26,27,32,82. In this study, we observed that KS-Bcl-2 and BHRF1 genes possess ≥ 1 PQS motifs in their upstream 1 kb regulatory regions. To verify whether the high PQS densities in KSHV and EBV bcl-2 gene promoters is a random event or a consequence of their nucleotide composition, we randomized the upstream regulatory regions five times without changing the dinucleotide content of these sequences as described in “Mater ials and Methods” section. We found that the PQS densities in the native KSHV KS-Bcl-2 and EBV BHRF1 promoters were several fold higher than that of the respective randomized sequences (Fig. 2a); thus, indicating that the observed higher PQS density is not a random occurrence and is independent of the nucleotide composition of the primary sequence. We also observed that the upstream 1 kb regulatory region of KS-Bcl-2 gene contains multiple PQS species. As reported in previous genome-wide studies, GQs are predominant near the TSS in promoter regions of the human genome as well as viral genomes45,69,84,85,86,87,88. A large proportion of promoter GQs proximal to the TSS have strong implications in gene regulatory functions and are proven to be biologically relevant89,90,91. Hence, the PQS motif closest to the TSS of KS-Bcl-2 gene was chosen for further biophysical and functional analyses; this PQS motif is henceforth referred to as KSHV-GQ (Fig. 2). The EBV BHRF1 promoter contained only 1 PQS motif and this motif is henceforth referred to as EBV-GQ (Fig. 2). We also noticed that the selected PQS motifs in KS-Bcl-2 and BHRF1 promoters were highly conserved in all available full-length sequences (Fig. 2b).

Figure 2
figure 2

PQS motifs in regulatory regions of viral Bcl-2 homologues encoded by KSHV and EBV. (a) Bar graph shows PQS density in the 1 kb upstream regulatory region of KSHV KS-Bcl-2 and EBV BHRF1 genes. The PQS densities in the native sequences were significantly higher than that in the randomized sequences (shuffled sequences with constant dinucleotide content; detailed in "Materials and methods" section) (p < 0.001). (b) Schematic representation of the G-quadruplex motif closest to the TSS in KS-Bcl-2 and BHRF1 promoters. We refer to this G-quadruplex motif in KS-Bcl-2 and EBV BHRF1 as KSHV-GQ and EBV-GQ. Analysis of all full-length genomes from KSHV and EBV suggest that the KSHV-GQ and the EBV GQ are hi ghly conserved. *KS-Bcl-2 gene promoter (1 kb) was found to contain 3 PQS motifs at -207 nt., -255 nt., and -288 nt. from the TSS. Only the PQS motif closest to the TSS (at − 207 nt.) is represented in the schematic.

Biophysical characterization of GQ motifs upstream of KSHV KS-Bcl-2 and EBV BHRF1 genes

CD spectroscopic analysis using wild type (Wt) GQs i.e., Wt-KSHV-GQ and Wt-EBV-GQ reveal formation of a hybrid (mix-type) and a parallel G-quadruplex structure, respectively. G to A mutations in the Wt-KSHV-GQ and Wt-EBV-GQ oligonucleotides were found to effectively disrupt formation of G-quadruplex structures (Fig. 3a). Native and denaturing polyacrylamide gel mobility assay indicates formation of compact intramolecular structures (under native conditions) as indicated by faster mobility compared to the mutants (Mut-KSHV-GQ and Mut-EBV-GQ) and length matched controls (Fig. 3b). The NMR spectral profiles of Wt-KSHV-GQ and Wt-EBV-GQ indicate that the imino protons linked to guanine residues involved in G-quadruplex formation exhibit a distinct chemical shift value of 10.5–12 ppm (Fig. 3c). The broad NMR resonance peaks are believed to be due to the presence of conformationally diverse G-quadruplexes undergoing conformational exchange at µs-ms timescale92.

Figure 3
figure 3

Biophysical analyses of KSHV-GQ and EBV-GQ reveals formation of stable intramolecular G-quadruplex in vitro. (a) CD spectra of Wt-KSHV-GQ and Wt-EBV-GQ show formation of hybrid (mixed type) and parallel G-quadruplex structures, respectively. Mutations were introduced to disrupt the G-quadruplex (Mut-KSHV-GQ and Mut-EBV-GQ). As expected, the mutant oligonucleotides did not form G-quadruplexes. (b) Native and denaturing polyacrylamide gel electrophoresis experiments reveal formation of intramolecular secondary structures attributed to the higher mobility of wild type oligonucleotides i.e., Wt-KSHV-GQ and Wt-EBV-GQ when compared with that of mutants (Mut-KSHV-GQ and Mut-EBV-GQ) and length match controls that do not form DNA secondary structures. (c) 1D 1H NMR spectra of Wt-KSHV-GQ and Wt-EBV-GQ oligonucleotide show resonance peaks of imino protons between 10.5–12 ppm, confirming the formation of G-quadruplexes. The NMR spectra suggest that Mut-KSHV-GQ and Mut-EBV-GQ do not form G-quadruplexes. (d) DMS footprinting experiments using 5’ FAM labelled oligonucleotides Wt-KSHV-GQ and Wt-EBV-GQ show formation of intramolecular G-quadruplex structures. Footprinting experiments were performed using different KCl concentrations {500 mM, 100 mM, and 0 mM (i.e., no KCl)}. AG represents seq uencing cleavage reactions specific to purine residues. The vertical bars next to the sequence of the oligonucleotide represents the G residues protected from DMS methylation due to formation of G-quadruplex structure. It is to be noted that the ladder (AG cleavage reaction lane) of the footprinting gel of Wt-KSHV-GQ has been snipped off from another part of the same gel and aligned together with the quadruplex sequence cleavage reaction lanes for comparison.

As shown in Fig. 3d, DMS footprinting sequencing analysis of the FAM labelled wild type KSHV-GQ and EBV-GQ oligonucleotides reveals that the guanine repeats involved in G-quadruplex structures were protected from DMS methylation and hence not cleaved by piperidine whereas the guanine residues present in the loops of these GQs are sensitive to DMS methylation. Based on CD spectroscopy, native and denaturing PAGE, NMR spectroscopy and DMS footprinting analyses we conclude that PQS motifs found in KSHV KS-Bcl-2 and EBV BHRF1 promoters form stable intramolecular G-quadruplex structures in vitro.

It is widely known that pyridostatin (PDS) (G-quadruplex ligand) specifically bind to and stabilize GQs via intercalation27,85,93. CD and UV melting studies with Wt-KSHV-GQ and Wt-EBV-GQ with PDS indicate PDS-mediated stabilization of KSHV-GQ and the EBV-GQ attributed to significant differences in Tm (melting temperature) in the presence and absence of PDS (Fig. 4a–d).

Figure 4
figure 4

CD and UV melting studies reveal PDS-mediated stabilization of KSHV-GQ and EBV-GQ oligonucleotides. Melting analyses (20–93 °C) was performed on Wt-KSHV-GQ and Wt-EBV-GQ oligonucleotides with and without PDS (G-qu adruplex binding ligand) (ad). As observed, PDS stabilized both KSHV-GQ and EBV-GQ. ΔTm is defined as the difference between the Tm (melting temperature) of the PQS oligonucleotide in the presence and absence of ligand.

Stabilization of G-quadruplexes in the human bcl-2 gene promoter inhibits promoter activity

Several studies have reported the presence and enrichment of G-quadruplexes within the promoter region of the human bcl-2 gene26,32,83 and their stabilization by quadruplex-binding ligands further inhibits human bcl-2 gene promoter activity27. Though this work is primarily aimed at understanding the role of G-quadruplexes in viral bcl-2 gene promoters, we analysed the activity of human bcl-2 gene promoter under similar conditions specifically used to study viral bcl-2 gene promoter activity. We carried out luciferase-based reporter assays to assess the promoter activity of human bcl-2 gene promoter using a luciferase construct where the human bcl-2 gene regulatory region (-3934 bp to ATG; includes both P1 and P2 promoters) are cloned upstream of firefly luciferase coding gene of pGL3 basic vector (details are provided in “Materials and Methods” section). In agreement with previous reports27, we observed a significant decline in the human bcl-2 gene promoter activity in the presence of PDS (Fig. 5a), reconfirming an inhibitory role for G-quadruplexes in the promoter.

Figure 5
figure 5

Contrasting roles for promoter G-quadruplexes in human Bcl-2 and viral Bcl-2 expression. Bar graphs show promoter activity as estimated by measuring firefly luciferase levels normalized with renilla luciferase levels (transfection control). The relative luciferase units (RLU) values of the mutant constructs (Mut-KSHV-GQ and Mut-EBV-GQ) were normalized to that of the respective wild-type constructs (Wt-KSHV-GQ and Wt-EBV-GQ). (a) Previous studies have established that G-quadruplexes are found in abundance in human bcl-2 gene promoter and hence regulate its expression. Our luciferase reporter assay data is in conformity with this previously known fact. We observed that the human Bcl-2 promoter activity was significantly reduced upon exposure to PDS (5 µM). (b) The Wt-KSHV-GQ construct (with an intact G-quadruplex) has significantly higher promoter activity compared to that of the Mut-KSHV-GQ construct (possesses mutations disrupting G-quadruplex in the KS-Bcl-2 promoter). The addition of PDS (5 μM) led to a significant increase in promoter activity of the Wt-KSHV-GQ promoter. (c) The Wt-EBV-GQ construct (wild-type EBV BHRF1 promoter with an intact G-quadruplex) also exhibit higher promoter activity compared to that of the Mut-EBV-GQ construct (possesses mutations disrupting G-quadruplex in the BHRF1 promoter). We also observed that addition of PDS (5 μM) increased the promoter activity of Wt-EBV-GQ construct significantly. It is also clear from (b,c) that exposure to PDS does not alter the promoter activity the mutant reporter constructs i.e., Mut-KSHV-GQ and Mut-EBV-GQ; both containing G-quadruplex disrupting mutations). Data are depicted as mean ± SD with n = 4 replicates. (d) Our findings are summarized in a pictorial representation highlighting that G-quadruplexes present upstream of viral Bcl-2 homologues (i.e., KS-Bcl-2 and BHRF1) exert a positive regulatory effect on promoter activity. This finding is in contrast to the negative regulatory role for the promoter G-quadruplexes in human Bcl-2 that has been previously reported and reconfirmed under the same conditions we tested the viral Bcl-2 promoters.

G-quadruplexes in the promoters of KS-Bcl-2 and BHRF1 enhance transcription

To assess the role of KSHV-GQ and EBV-GQ on promoter activity, KS-Bcl-2 and BHRF1 promoters were cloned into a promoter-less luciferase vector (pGL3 basic). We transfected these constructs (Wt-KSHV-GQ and Wt-EBV-GQ) into HEK293T cells and with varying concentrations of PDS (added 2 h post transfection). The promoter activity of the Wt-KSHV-GQ and Wt-EBV-GQ constructs (containing an intact GQ) progressively increased with increasing concentrations of PDS (0–5 µM; Supplementary File 1, Figure S1F5) in a dose-dependent manner.

Mutants were constructed to disrupt GQ formation as described in the "Material and methods" section. Luciferase reporter assays were performed using firefly luciferase constructs corresponding to KS-Bcl-2 and BHRF1 promoters (wild type and mutant); Renilla luciferase constructs (pRL-TK vector) were used as transfection controls. We observed a significantly reduced promoter activity for the Mut-KSHV-GQ and Mut-EBV-GQ constructs (with mutations disrupting GQ formation) compared to their respective wildtype promoters (Fig. 5b , c; p < 0.01). This finding suggests that the G-quadruplex motif in the KS-Bcl-2 and the BHRF1 promoter enhances promoter activity. Furthermore, the addition of PDS significantly enhanced the promoter activity of the wild-type viral bcl-2 gene promoters (Wt-KSHV-GQ and Wt-EBV-GQ) (p < 0.01), while the promoter activity of the mutant constructs (Mut-KSHV-GQ and Mut-EBV-GQ) remained unaffected (Fig. 5b, c). This finding clearly suggests that the enhanced promoter activity in the wild-type constructs (Wt-KSHV-GQ and Wt-EBV-GQ) on addition of the ligand is due to secondary structure (G-quadruplex) stabilization and is not linked to the differences in the primary sequence.

Further, we also cloned the coding region of KS-Bcl-2 and BHRF1 genes along with their respective promoter regions (1 kb) and transfected them into HEK293T cells. The addition of PDS resulted in a significant increase in both KS-Bcl-2 and BHRF1 transcript levels as measured by real time quantitative PCR (Supplementary File 1; Figure S1F6); p < 0.01). These results strengthen our findings in reporter assays and confirm a positive regulatory role for the G-quadruplexes in the promoters of KSHV KS-Bcl-2 and EBV BHRF1.

Together, our findings suggest that the G-quadruplexes in human bcl-2 gene promoter supress transcription, while those in viral bcl-2 gene promoters (KS-Bcl-2 and BHRF1) enhance transcription from the respective viral bcl-2 genes. Human herpesviruses not only captured human homologues but seem to have co-evolved to mimic secondary structures present in the promoters of their human counterparts. Interestingly, our results highlight how human herpesviruses exploit the G-quadruplexes in promoters of these homologues to regulate the expression of vBcl-2 in a manner opposite to that of the human bcl-2 gene (Fig. 5a–d).

G-quadruplexes in viral Bcl-2 promoters may enhance RTA-mediated induction of promoter activity

To understand the interplay between virus-encoded transcription factors and the G-quadruplexes in viral bcl-2 gene promoters, we chose to analyse the effect of RTAs (Replication and Transcription Activator). Viral RTAs are involved in enhancing early gene transcription during lytic replication94,95,96,97. KSHV RTA/ORF50 protein encoded by orf50 gene is predominantly expressed during the lytic phase of KSHV infection and is a master regulator of several viral as well as host genes66,94,98. Similarly, EBV brlf1 gene encodes for EBV RTA/BRLF1 protein which is associated with promoting lytic replication in EBV infected B cells96,99. RTA responsive elements (RREs) have also been reported in KSHV KS-Bcl-2 and BHRF1 gene promoters95,100,101,102. Hence, we studied the role of G-quadruplexes in KS-Bcl-2 and BHRF1 promoters in facilitating RTA-mediated regulation of viral Bcl-2 expression. KSHV RTA and EBV RTA expression constructs were co-transfected with wild type (with intact GQ) and mutant (with mutations disrupting the GQ) luciferase promoter constructs of KS-Bcl-2 and BHRF1, respectively. As expected, viral RTA enhanced promoter activity of KS-Bcl-2 and BHRF1 by several fold (Fig. 6a , b). RTA-mediated induction of the KS-Bcl-2 and the BHRF1 promoters occurred in both the wildtype (with an intact GQ) and mutant promoters (with mutations disrupting the GQ). Nonetheless, the extent of induction was more pronounced in wild-type promoters compared to mutant promoters (for both KSHV KS-Bcl-2 and EBV BHRF1; Fig. 6a, b) indicating a role for G-quadruplexes in contributing to RTA-mediated enhancement of viral Bcl-2 expression in human herpesviruses. This finding has implications beyond the regulation of vBcl-2 expression, as it provides the groundwork for further studies on analysing the role of viral RTA in regulating expression of other herpesvirus genes with promoter G-quadruplexes.

Figure 6
figure 6

KSHV-GQ and EBV-GQ contribute to RTA induced expression of viral Bcl-2. Promoter activity of (a) KSHV KS-Bcl-2 and (b) EBV BHRF1 promoters were greatly enhanced upon co-transfection of KSHV RTA and EBV RTA, respectively. While the promoter activity of mutant promoters (with disrupted GQs) also increased significantly in the presence of viral RTAs, the extent (or fold-change) of RTA-mediated enhancement of promoter activity in the wild-type promoters (i.e., KSHV KS-Bcl-2 and EBV BHRF1) were more pronounced. Experiments were performed in triplicates and mean values ± SD are plotted.

Discussion

We identified that a sizable proportion of human herpesvirus homologues in alpha- and gamma- herpesviruses and their human counterparts have one or more G-quadruplexes in their promoters (Fig. 1 and Table 1). We did not observe this phenomenon in most of the betaherpesvirus-encoded homologues, indicating a possible role for cellular tropisms of these viruses in the acquisition of G-quadruplexes in the promoters of these homologues. The evolutionary ability of human herpesviruses to capture host genes particularly involved in critical cellular process during reactivation is well-documented12,103,104,105. Our findings suggest that when human herpesviruses acquire these host genes, they also frequently acquire G-quadruplexes which are critical regulatory elements from host gene promoters. Recent studies have pointed to the positive correlation between PQS frequency and localization in virus genomes and their corresponding hosts34,106. These viral PQS motifs, particularly PQS motifs found in herpesvirus genomes also share similar loop composition as their host107,108. This might occur due to the long-term relationship shared between the host and members of herpesviridae family that cause persistent infections in the host. It is postulated that this phenomenon might markedly influence the presence and conservation of quadruplex forming sequences in herpesvirus genomes. Hence, the resultant evolutionary accumulation of PQS motifs in viral genomes can be attributed to the mimicking of host genome organisation. We wanted to understand the biological role of these G-quadruplexes in the promoters of virus homologues. For this purpose, we chose to study the G-quadruplexes in viral Bcl-2 homologues, as they represent one of the most extensively studied herpesvirus homologues. In addition, the human bcl-2 gene promoter is known to contain highly stable G-quadruplexes that supress transcription27,91. Majority of the viral Bcl-2 homologues (9 of 14 vBcl-2 homologues known till date) have promoters that contain a highly conserved G-quadruplex motif (Supplementary File 1; Figure S1F4), suggesting a potential biological role for these DNA secondary structures in regulating vBcl-2 across virus families.

Functional studies to understand the role of G-quadruplexes in the promoters of viral bcl-2 genes were carried out on KSHV KS-Bcl-2 and EBV BHRF1. Biophysical characterization of the G-quadruplexes in KS-Bcl-2 and BHRF1 promoters using CD spectroscopy, native PAGE, NMR spectroscopy and DMS foot printing confirmed the formation of intramolecular G-quadruplex (Fig. 3). The addition of PDS stabilized both KS-Bcl-2 GQ and the BHRF1 GQ (Fig. 4). The stabilization of G-quadruplexes in the KS-Bcl-2 and BHRF1 promoters with PDS resulted in a significant increase in promoter activity (Fig. 5b,c). Disruption of the promoter G-quadruplex in KS-Bcl-2 and BHRF1 abrogated the G-quadruplex-mediated enhancement of promoter activity, and it also rendered the promoters non-responsive to the addition of PDS (Fig. 5b,c). In addition, to reporter assays, we confirmed the promoter G-quadruplex-mediated enhancement of vBcl-2 transcription in expression constructs KS-Bcl-2 and BHRF1 (with their respective native G-quadruplex containing promoter) using qPCR. Together, these findings confirm that promoter G-quadruplex in KS-Bcl-2 and BHRF1 significantly enhance vBcl-2 expression. We also reconfirmed previous reports that the G-quadruplexes in the human bcl-2 promoters inhibit gene expression.

Altogether, we demonstrate that when human herpesviruses acquire host genes, they may also co-acquire G-quadruplexes as regulatory elements in their promoters. Nonetheless, these viruses may exploit the regulatory role of G-quadruplexes to their advantage as evidenced by diametrically opposite roles for these nucleic acid secondary structures in regulating the expression of human Bcl-2 and vBcl-2. The G-quadruplexes in the human bcl-2 promoter inhibit the expression of this anti-apoptotic protein, thus facilitating apoptosis. It is well known that vBcl-2 homologues function similar to human Bcl-2 to inhibit apoptosis53. It has been reported that the position of the G-quadruplex within the regulatory element and the loop length may impact gene expression (107, 109–111). This may explain at least in part, the contrasting roles of G-quadruplexes in human and viral Bcl-2 promoters.

The promoter G-quadruplex-mediated increase in the expression of vBcl-2 is particularly interesting, as it may inhibit premature death of virus infected host cells, thus facilitating virus replication, and virus survival. Furthermore, RTA, a virus-encoded transcription factor augments promoter G-quadruplex-mediated enhancement of vBcl-2 expression (Fig. 7).

Figure 7
figure 7

G-quadruplex-mediated regulation of viral and human Bcl-2 expression and its potential effect on host cell apoptosis. The G-quadruplexes in human Bcl-2 promoters inhibit gene expression; this may promote apoptosis of the host cell. The G-quadruplexes in viral Bcl-2 (vBcl-2) promoters serve to enhance vBcl-2 expression. Further, the G-q uadruplexes in vBcl-2 promoters contribute to RTA mediated enhancement of promoter activity. Thus, G-quadruplexes in vBcl-2 promoters enhance vBcl-2 expression which can potentially inhibit host cell apoptosis and thereby confer survival advantage to the virus.

In the present study, we identify the presence of G-quadruplexes in human herpesvirus homologues and their human counterparts. We demonstrate how these DNA secondary structures play opposing regulatory roles in modulation of gene expression of a given virus homologue and its human counterpart. This work highlights yet unknown complex roles for G-quadruplexes in the biology of human herpesviruses and advances our current understanding of herpesvirus pathogenesis, survival mechanisms and virus-host interactions.