Epstein-Barr virus from Burkitt Lymphoma biopsies from Africa and South America share novel LMP-1 promoter and gene variations

Lei, Haiyan; Li, Tianwei; Li, Bingjie; Tsai, Shien; Biggar, Robert J.; Nkrumah, Francis; Neequaye, Janet; Gutierrez, Marina; Epelman, Sidnei; Mbulaiteye, Sam M.; Bhatia, Kishor; Lo, Shyh-Ching

doi:10.1038/srep16706

Article
Open access
Published: 23 November 2015

Haiyan Lei¹,
Tianwei Li¹,
Bingjie Li¹,
Shien Tsai¹,
Robert J. Biggar²,
Francis Nkrumah³,
Janet Neequaye⁴,
Marina Gutierrez⁵,
Sidnei Epelman⁶,
Sam M. Mbulaiteye⁷,
Kishor Bhatia⁷ &
…
Shyh-Ching Lo¹

Scientific Reports volume 5, Article number: 16706 (2015)

Abstract

Epstein-Barr virus (EBV) sequence variation is thought to contribute to Burkitt lymphoma (BL), but lack of data from primary BL tumors hampers efforts to test this hypothesis. We directly sequenced EBV from 12 BL biopsies from Ghana, Brazil and Argentina, aligned the obtained reads to the wild-type (WT) EBV reference sequence and compared them with 100 published EBV genomes from normal and diseased people from around the world. The 12 BL EBVs were Type 1. Eleven clustered close to each other and to EBV from the Raji BL cell line, but away from 12 EBVs reported from other BL-derived cell lines and away from EBV from nasopharyngeal carcinoma (NPC) and healthy people from Asia. We discovered 23 shared novel nucleotide-base changes in the latent membrane protein (LMP)-1 promoter and gene (associated with 9 novel amino acid changes in the LMP-1 protein) of the 11 BL EBVs. Alignment of this region for the 112 EBV genomes revealed four distinct patterns, tentatively termed patterns A to D. The distribution of BL EBVs was 48%, 8%, 24% and 20% for patterns A to D, respectively; the NPC EBVs were Pattern B and EBV-WT was Pattern D. Further work is needed to investigate the association of EBV LMP-1 patterns with BL.

Introduction

Epstein-Barr virus (EBV), considered the first human tumor virus, was discovered in Burkitt lymphoma (BL) tumor cells in 1964¹. It was subsequently linked to other lymphoid cancers (Hodgkin lymphoma² and nasal T cell lymphomas³) and to epithelial cancers (nasopharyngeal carcinoma (NPC)^4,5 and gastric cancer^6,7) and it was declared a class 1 carcinogen in 1997⁵. EBV infection was shown to be mostly asymptomatic, particularly in developing countries⁸, and to persist as a lifelong infection in up to 95% of the world’s adult population⁹, while only rarely being detected in cancer. In contrast to its ubiquitous nature¹⁰, the cancers linked to it often have a regional incidence distribution¹¹. For example, BL occurs mainly in African children living in equatorial regions¹², while NPC occurs most commonly in Asian and North African adults. This regional distribution, coupled with differences in ages at clinical presentation for different cancers, suggested that there might be different high-risk EBV genetic variants influencing the observed epidemiological and clinical EBV-associated tumor patterns^13,14. If so, the discovery of high-risk EBV variants might direct public health or clinical strategies to prevent EBV-associated malignancy^15,16,17.

However, no simple correlation between EBV genetic variation and EBV-associated cancers has been presented^{18,19,20,21,22,23,24,25,26,27,28}, although EBV is known to exist as two genetic types (Type 1 or 2)²⁹, which both immortalize cells and harbor significant genetic variability in EBV latent genes²⁹. Technical constraints have limited studies to examining genetic variation in short sequence stretches of single EBV genes rather than entire or multiple genes or the whole EBV genome¹⁸, while lack of primary BL samples from different geographical areas has also limited the ability to study tumors from different areas. Technological advances have enabled whole EBV genome sequencing, first accomplished in 1984 for the wild-type (WT) virus B95-8 (V01555.2)³⁰, obtained from an immortalized cell line infected by EBV from a patient with infectious mononucleosis, and subsequently expanded to include EBV from three BL cell lines (AG876³¹, Akata and Mutu³²), EBV from NPC tumors^33,34,35,36 and BL tumor-derived cell lines³⁶. The resulting EBV genomic library is extensive and provides the potential to discover high-risk carcinogenic variants^35,37,38, but it currently does not include data from primary BL tumor samples and may be biased towards viruses that are well adapted to grow in vitro.

In the present study, we directly sequenced DNA from 12 primary BL biopsy samples from Ghana, Brazil and Argentina using the Illumina MiSeq platform³⁷ and aligned the reads obtained from each BL sample to the WT EBV (NC_007605) reference sequence. Our objective was to find novel putative BL-associated EBV genetic traits. We compared sequences from the 12 BL EBVs with the publicly available, well-annotated EBV sequence data from benign and malignant tumors, mainly using data registered with the National Center for Biotechnology Information (NCBI), to identify genetic commonalities and to gain insights about new ways to categorize EBV variants.

Results

EBV sequences from primary Burkitt lymphoma biopsies were Type 1

Our results increase the number of whole EBV genomes from BL from 13 to 25 in NCBI and are the first EBV sequence results from primary BL tumors. They complement the results from tumor-derived BL cell lines, which might be biased by over-selection of viruses that are better adapted to grow in vitro. Detailed sequencing results are available in supplemental results and supplementary Tables 1 and 2. The median EBV genome size found in the EBV from BL samples was 170,597 bp (range: 163,639 to 171,595 bp) and the average coverage of the genome in each sample was 30 times (range 15 to 70). Consistent with prior results, we found a high median EBV copy number per BL tumor cell (median: 50; range 28–95). Similarly, viral sequences were clonal, showing unremarkable intra-tumor heterogeneity at only one possible position in three of the 12 tumors examined (Supplementary Table 3).

The BL EBVs in our series were all Type 1. By comparison, nine of the previously published EBV genomes from tumor-derived BL cell lines (Asia (2), Kenya (4), Nigeria (1), North Africa (1) and Africa unspecified (1)) were Type 1 and four were Type 2 (from Nigeria, Kenya, Papua New Guinea and Ghana). Our combined results suggest that 84% (21 of 25) of BL is associated with Type 1 EBV and 16% (4 of 25) with Type 2 EBV. All 16 NPC EBVs (from China or Hong Kong) were Type 1.
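The combined Type 1 and Type 2 proportions follow directly from the counts given in this paragraph. A minimal sketch of that arithmetic, with illustrative variable names that are not taken from the paper:

```python
# Counts taken from the paragraph above; variable names are illustrative only.
type1_biopsies = 12    # BL biopsy EBVs sequenced in this study, all Type 1
type1_cell_lines = 9   # previously published BL cell-line EBVs, Type 1
type2_cell_lines = 4   # previously published BL cell-line EBVs, Type 2

total = type1_biopsies + type1_cell_lines + type2_cell_lines
type1_pct = round(100 * (type1_biopsies + type1_cell_lines) / total)
type2_pct = round(100 * type2_cell_lines / total)

print(f"Type 1: {type1_pct}% of {total} genomes; Type 2: {type2_pct}%")
```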

Phylogenetic analysis of 12 BL EBV genomes versus 100 EBV genomes shows distinct patterns

Full-length phylogenetic analysis of our 12 BL EBVs and the 100 public EBV genomes showed that 10 of 12 BL EBVs clustered together, while two BL EBVs, both from Brazil (KP968260-VGO and KR063344-RPF), arrayed far from the first 10 (Fig. 1a). The 10 similar BL EBVs were close to WT-EBV and to EBV sequenced from healthy individuals in the United States and Kenya (e.g., K4123Mi and NA19384)^37,38. Of the two different Brazil BL EBVs, one (KP968260-VGO) was close to EBVs from Asia, including those from 16 NPC tumors from China and Hong Kong, from the Akata BL tumor-derived cell line, which was obtained from a Japanese patient, and from healthy people in Asia. The ethnicity of this Brazil subject (KP968260-VGO) was not recorded. The EBV from this Brazil subject and the EBV genomes reported from Asia arrayed distinctly from the 11 BL EBVs from biopsies (Fig. 1a). The second different BL EBV (KR063344-RPF) clustered away from the EBVs reported from Asia, but it was closer to three EBVs from tumor-derived BL cell lines registered in the NCBI (LN827551-Makau, LN824203-Mak1 and LN827545-Daudi) (Fig. 1a), as well as to one EBV from a healthy individual from Kenya (LN827562).

Phylogenetic analysis of imputed amino acid sequences reveals most variation in EBNA-1 and LMP-1 proteins

Phylogenetic analysis of amino acid sequences imputed for EBV nuclear antigen 1 (EBNA-1) (Fig. 1b) and LMP-1 (Fig. 1c) showed phylogenetic clustering similar to that found using full-length whole EBV genomes, albeit with minor variations. The 10 similar BL EBVs were also close to each other on both EBNA-1 and LMP-1 imputed protein sequences. Within this group, however, two distinct sub-clusters were detected using EBNA-1 (Fig. 1b) that were not observed with LMP-1 amino acid sequences (Fig. 1c). EBNA-1 and LMP-1 protein sequences from the two different Brazil BL EBVs (KP968260-VGO and KR063344-RPF) showed phylogenetic separation as described in the full-length EBV genome analysis, but in different ways. The KP968260-VGO EBV was closer to the Asian NPC and non-NPC EBVs in both EBNA-1 and LMP-1 comparisons, while the KR063344-RPF EBV was closer to the similar BL EBVs. KR063345-FNR was the outlier in EBNA-1 comparisons, clustering closer to LN827545-Daudi (Fig. 1b), but not in the LMP-1 comparisons (Fig. 1c).

Thirteen BL EBVs, all sequenced from BL tumor-derived cell lines, have been previously reported (Akata, Mutu, Raji, AG876, jijoye, Wewak1, P3HR1, c16, Daudi, Cheptages, BL36, BL37 and Makau). One of these BL EBVs (Raji) aligned most closely to our 11 similar BL EBVs for both full-length EBV genomes and LMP-1 protein sequences (Fig. 1a,c). The EBNA-1 sequence of Raji was not included in the comparison (Fig. 1b) because it is not annotated in the NCBI database. Four BL EBVs (jijoye, P3HR1, BL36 and Wewak1), all Type 2 EBV, were different from our BL EBVs in the EBNA-1 and LMP-1 phylogenetic relationships. The remaining 8 BL EBVs from BL tumor-derived cell lines aligned separately, forming potentially three sub-clusters. The Brazil outlier EBV of unknown ethnicity (KP968260-VGO) arrayed close to the Asian Akata EBV in the EBNA-1 and LMP-1 analyses as well (Fig. 1b,c).

Analysis of nucleotide sequences reveals common sequence variations in the 11 of 12 BL EBV genomes

Whole genome sequence alignments revealed extensive nucleotide variation in all of the 12 BL biopsy EBV genomes compared to the WT-EBV reference (Supplemental Figure 2a). The density of variations per genome region was higher when the genomes were compared to Type 1 EBV GD1 associated with NPC (Supplemental Figure 2b) and substantially higher when compared with Type 2 EBV AG876 (Supplemental Figure 2c). Compared to the WT-EBV, the 12 BL biopsy EBVs shared 67 common nucleotide variations overall, and this number increased to 94 shared variations when the outlier KP968260-VGO EBV was excluded. Analysis of the 95 coding DNA sequences (CDS) in the EBV genome revealed 36 shared non-synonymous amino acid variations in 15 EBV genes (Fig. 2). Some of the variations were consistent with a geographical association rather than a BL association. For example, a variation in BALF3 was found only in the 7 South America BL-EBVs, while several variations in different genes were found only in the 5 West Africa BL-EBVs (Fig. 2).

The analysis of imputed common amino acid changes shared by BL EBVs in all CDS of the EBV genome revealed hypervariable regions mostly in EBNA-1 and LMP-1 (Fig. 2). However, most of the shared amino acid changes in EBNA-1 in the BL EBVs were also found in EBV from non-diseased individuals, while those in LMP-1, particularly in the N-terminus, appeared to be novel and unique to BL EBVs. We therefore focused our detailed comparisons on the LMP-1 promoter and the N-terminal region of the gene.

Sequence analysis of 12 BL EBVs reveals novel changes in the LMP-1 promoter and coding region

Analysis of the 2.1 kb sequence stretch covering the LMP-1 promoter and the N-terminus of the coding sequence revealed a total of 51 common nucleotide variations in our 12 new BL EBVs as compared with WT-EBV: 19 were in the promoter region and 32 in the coding region. Importantly, 23 novel common nucleotide variations (12 in the promoter region and 11 in the coding region) were shared by the 11 similar BL-EBVs (Fig. 3) and not by the outlier KP968260-VGO or any of the NPC-EBVs or non-NPC EBVs from Asia (Fig. 3). We separately confirmed these novel sequences in our 11 BL biopsy EBVs using Sanger sequencing of PCR products amplified from the target region. Of the 11 coding-region nucleotide changes, 10 encoded 9 novel amino acid changes in the N-terminal region of LMP-1 and one was located in the second intron (Fig. 3). The 9 novel N-terminus amino acid changes were not seen in the 12 BL EBV genomes from the BL tumor-derived cell lines; however, 7 of the 9 imputed amino acid variations were found in the EBV sequenced from the Raji cell line³⁶ (Fig. 3 and Table 2).

Table 1 Characteristics of the BL tumor samples that were sequenced for Epstein-Barr virus.

Table 2 EBV genomes with the same or a highly similar pattern of nucleotide variations or mutations and imputed amino acid changes in the LMP-1 promoter and the coding regions.

The 12 novel nucleotide variations in the LMP-1 promoter were: G-426A, T-412G, C-410A, G-376A, A-354G, G-227A, A-184T, T-172C, T-50A, A-39C, G-12A and T+18G. All 12 nucleotide variations imputed in the LMP-1 promoter were found in the EBV from the Raji cell line³⁶. Four of the 12 variations were located in the LMP-1 regulatory elements AML1 (G-227A), LBF2 (A-184T), LBF4 (T-172C) and CREB (A-39C) (Figs 4 and 5, Table 2), hinting at a possible role in altering LMP-1 promoter function. The 9 N-terminal amino acid changes were located in the cytoplasmic domain (2 amino acids), intramembrane domain (5 amino acids) and C-terminal activation region (CTAR) 1 (2 amino acids) (Fig. 6), but their functional significance was not evaluated in our study and remains unknown.
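The variation labels above follow a compact notation: reference base, signed position, variant base. A minimal sketch of how such labels could be parsed and matched to the regulatory elements named in the text is given below. The function and constant names are hypothetical, and the text does not state the coordinate origin of the signed positions, so the code treats them only as signed integers.

```python
import re

# The 12 promoter variations listed in the text, in the paper's notation.
PROMOTER_VARIANTS = [
    "G-426A", "T-412G", "C-410A", "G-376A", "A-354G", "G-227A",
    "A-184T", "T-172C", "T-50A", "A-39C", "G-12A", "T+18G",
]

# Regulatory elements named in the text for four of the variations.
REGULATORY_ELEMENTS = {
    "G-227A": "AML1",
    "A-184T": "LBF2",
    "T-172C": "LBF4",
    "A-39C": "CREB",
}

# Hypothetical pattern: one base, a signed integer position, one base.
VARIANT_RE = re.compile(r"^([ACGT])([+-]\d+)([ACGT])$")


def parse_variant(label):
    """Split a label such as 'G-426A' into (ref_base, position, alt_base)."""
    match = VARIANT_RE.match(label)
    if match is None:
        raise ValueError(f"unrecognized variant label: {label!r}")
    ref, pos, alt = match.groups()
    return ref, int(pos), alt


parsed = [parse_variant(v) for v in PROMOTER_VARIANTS]
in_elements = [v for v in PROMOTER_VARIANTS if v in REGULATORY_ELEMENTS]

for label in in_elements:
    print(label, "->", REGULATORY_ELEMENTS[label])
```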

Alignment of 112 EBV genomes in the LMP-1 promoter region reveals four novel clustering patterns

When we aligned the LMP-1 promoter and coding region for the 12 BL EBVs and the 100 other published EBV genomes registered in NCBI (Supplementary Table 4), including 75 genomes recently reported by Palser et al.³⁶, we observed four strikingly distinct patterns of nucleotide variations in this region, designated A to D (Figs 3, 4, 5).

Pattern A: Characterized by the 23 novel nucleotide variations in the promoter region and the coding region of LMP-1 that were shared by the 11 similar BL-biopsy EBVs, as noted above. An identical or highly similar pattern of 12 common nucleotide variations in the promoter region and 9 amino acid changes in the N-terminus of LMP-1 was found in Raji-EBV and in 7 EBVs from other lymphoid conditions that have been recently published³⁶ (Table 2). Four of the 7 non-BL EBVs were from patients with post-transplant lymphoproliferative disease (PTLD) in the US and Australia. The other 3 were Type 2 EBVs. This pattern was not observed in the outlier Brazil BL (KP968260-VGO) EBV, or in any EBV from NPC or from healthy individuals from Asia. Overall, Pattern A was observed in 19/112 (17%) EBV genomes reported to NCBI, but it was found in about half of BL cases (12 of 25, 48%, including all but one of our new cases). In contrast, it was less common in established lymphoblastoid cell line (LCL) EBVs (1 of 4, 25%), PTLD EBVs from the USA and Australia (4 of 19, 21%) and spontaneous lymphoblastoid cell line (sLCL) EBVs from Kenya (2 of 30, 6%). Notably, three Pattern A EBVs were Type 2 EBVs (two Kenyan sLCLs and one LCL of unknown origin).

Pattern B: Characterized by 13 common nucleotide variations at positions G-372A, C-356A, C-329T, G-328A, C-315T, A-286G, G-284T, G-240A, A-238G, G-234T, G-233A, C-207T and C-199T. Pattern B was observed in 28/112 (25%) EBVs and appears to be an Asian-type EBV: it was shared by all NPC-EBVs from China and Hong Kong, by 2 of 25 (8%) BL EBVs, Akata (Japan) and KP968260-VGO (Brazil), which clustered with the NPC EBVs in phylogenetic analyses, by EBV from the saliva of a healthy person presumed to be Asian³⁶ and by 5 sLCLs from Asia.

Pattern C: Characterized by a novel E2D amino acid change in the LMP-1 coding region plus G-44T and G+41C nucleotide changes in the LMP-1 promoter and other isolated variations. Pattern C shared the E2D amino acid change and the G+41C nucleotide change with Pattern A, but lacked the other characteristic Pattern A variations. In addition, Pattern C had a unique common variation at position G-44T within the regulatory CRE element of the LMP-1 promoter. Pattern C was present in 8 of 112 (7%) EBVs, including those from 7 BL tumor-derived cell lines (P3HR1, jijoye, Daudi, Makau, Mak1, BL36 and Wewak1) and one sLCL from Kenya. Pattern C was not present in EBV from NPC or healthy people from Asia.

Pattern D: Similar to the reference WT EBV. It was observed in 58 of 112 (52%) of the analyzed EBV genomes. The majority of Pattern D EBVs were from sLCLs (30 of 58, 52%), but the pattern also occurred in diverse lymphoid conditions: 13 of 58 (22%) were from PTLD, 7 of 58 (12%) from Hodgkin lymphoma and 6 of 58 (10%) from BL. Pattern D included both Type 1 and Type 2 EBVs.
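The four patterns are described by their marker variations, so assigning a genome to a pattern can be pictured as a set comparison. The sketch below is an illustration under stated assumptions, not the procedure used in the study: the function name, the order of the checks and the 0.8 overlap threshold (standing in for "identical or highly similar") are all hypothetical. Only promoter markers listed in the text are used.

```python
# Marker sets copied from the text; everything else here is an assumption.
PATTERN_A = frozenset({
    "G-426A", "T-412G", "C-410A", "G-376A", "A-354G", "G-227A",
    "A-184T", "T-172C", "T-50A", "A-39C", "G-12A", "T+18G",
})
PATTERN_B = frozenset({
    "G-372A", "C-356A", "C-329T", "G-328A", "C-315T", "A-286G", "G-284T",
    "G-240A", "A-238G", "G-234T", "G-233A", "C-207T", "C-199T",
})
# G-44T is described as unique to Pattern C; G+41C is shared with Pattern A.
PATTERN_C_UNIQUE = "G-44T"


def assign_pattern(observed, min_fraction=0.8):
    """Assign a hypothetical pattern label from a set of promoter variations.

    min_fraction is an illustrative threshold for 'highly similar';
    the paper does not define a numeric cutoff.
    """
    observed = set(observed)
    if len(observed & PATTERN_A) >= min_fraction * len(PATTERN_A):
        return "A"
    if len(observed & PATTERN_B) >= min_fraction * len(PATTERN_B):
        return "B"
    if PATTERN_C_UNIQUE in observed:
        return "C"
    return "D"  # closest to the WT reference


print(assign_pattern(PATTERN_A | {"G+41C"}))
print(assign_pattern({"G-44T", "G+41C"}))
```

Checking Pattern A before Pattern C matters in this sketch because the two share G+41C; only G-44T separates them among the promoter markers given in the text.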

Discussion

Our study nearly doubles the number of published EBV genomes from BL, from 13 to 25, and presents the first set of results obtained by directly sequencing DNA from primary BL biopsies. The study of primary tumors fills the main gap in the picture of EBV diversity found in BL, which has hitherto relied on tumor-derived BL cell lines and carried the risk of over-selecting for viruses that are well adapted to grow in vitro. We showed that BL EBVs from Ghana and South America, with the exception of one, phylogenetically clustered together, near WT-EBV and EBV sequenced from healthy and diseased individuals in the United States and Africa, but distant from EBV from NPC and healthy people reported from Asia. We discovered a signature of 23 novel nucleotide base substitutions in the LMP-1 promoter and coding region (associated with 9 amino acid changes in the LMP-1 protein) that was shared by 11 of the 12 BL EBVs from Ghana and South America. Importantly, highly similar or identical changes also occurred in one EBV sequenced from a tumor-derived BL cell line (Raji) from Nigeria, in four Australian/American PTLDs and in three LCLs of Type 2 EBVs, including two from Kenya. These results suggest that the novel signature is not unique to BL, but it is most prevalent in BL EBVs, occurring in 48% of BL EBVs compared to 7% of 87 other analyzed EBV genomes. If validated in large, well-selected series, this signature may prove useful as an EBV genetic marker for BL.

Our detailed analysis of the LMP-1 promoter and coding sequences for 112 EBV genomes in NCBI revealed four striking patterns of nucleotide substitutions in the analyzed EBV genomes, tentatively designated Patterns A to D. These patterns were independent of variations in EBNA2 that are used to classify EBV into subtypes²⁹ and different from the similarly designated patterns proposed by Sandvej et al.²⁷, based on the LMP-1 Xho I polymorphism, the 30-bp deletion and a limited but different set of LMP-1 promoter base substitutions. While Sandvej’s patterns do not appear to be particularly useful as genetic markers of EBV variants associated with specific EBV-related malignancies²⁷, our finding of genetic patterns with a variable distribution in some EBV-associated malignancies is intriguing. Notably, the distribution in the 25 BL cases was 48%, 8%, 24% and 20% for Patterns A through D, respectively. Of the 19 EBVs with Pattern A LMP-1, 12 (63%) were from BL samples from different continents, while the other 7 Pattern A EBVs included 4 PTLDs (US/Australia) and 3 Type 2 sLCL/LCL (2 from Kenya; one of unknown origin) from different geographical areas. In comparison, Pattern B was found almost only in Asia and in both healthy and disease samples. In this context, it is important to note that 18 EBV genomes marked as NPC EBVs (AB850643 to AB850660) in the NCBI database, which were excluded from our full-length EBV genome analysis because they lacked references of publication, detailed annotation and description of origin, clearly showed Pattern B in the LMP-1 analysis. Whether Pattern A is associated with BL and Pattern B with NPC cannot be determined from our analysis, but disease associations will become clearer when case-control studies with representative controls are done.

Our finding of a novel signature in the LMP-1 promoter and gene was unexpected. The sequence variations in the LMP-1 promoter and/or coding sequences may play a role in immune regulation, may affect LMP-1 signaling through interacting proteins in BL tumors or may act as a strain marker. LMP-1 is a viral oncogene and it is expressed in some EBV-associated malignancies, such as Hodgkin lymphoma and NPC, but not in BL³⁹. Thus, our finding might be a clue that LMP-1 plays an important role in BL carcinogenicity as well. There is some evidence that mutations in LMP-1 regulatory sites could reduce the responsiveness of the LMP-1 promoter to transcription factors, which might favor survival and promotion of carcinogenesis by mutated variants⁴⁰. For example, Jansson et al.⁴¹ reported that a single base substitution (G-44T) within the CRE element of the LMP-1 promoter of EBV from the P3HR1 cell line, a tumor-derived BL cell line of African origin, altered the factor-binding properties of the LMP-1 promoter sequence (LRS) and reduced activation of the LMP-1 promoter compared with the corresponding B95-8 sites, which supports this reasoning. Our finding of 2 nucleotide variations (A-39C and G-44C, the latter also observed in essentially all NPC-EBVs), albeit different, located within and potentially disrupting the LRS CRE of the LMP-1 promoter (Fig. 3B) in the 11 similar BL EBVs is consistent with the hypothesis that substitutions in regulatory sites may be a feature of carcinogenic EBV variants. However, our analysis of flanking regions that may be controlled by the LMP-1 promoter, such as LMP-2A, is incomplete; hence, alternative functional explanations are possible. The LMP-2A gene, which is expressed from the episomal viral genome, such as is found in BL⁴², modulates lytic viral activation in vitro, and its non-expression has been correlated with reduced transforming ability of EBV⁴².

LMP-1 Pattern A apparently existed before the evolutionary divergence of Type 1 and Type 2 EBV, based on its presence in both EBV types. Since this genetic pattern has apparently been preserved in individuals living in many different geographical areas over such a long period, it is likely to be functionally important. However, its role in BL carcinogenicity could be indirect because Pattern A was not seen in Type 2-associated BL EBVs, despite its high frequency in Type 1 BL EBVs.

Strengths of our approach include the use of primary BL tumor samples from different geographical areas to sequence whole EBV genomes, which improves on and complements previous efforts that were limited to studying variation in short sequence stretches of single EBV genes from tumor-derived BL cell lines^11,18,43. The use of primary tumor samples reduces the risk of bias towards viruses adapted to grow in vitro that arises when tumor-derived BL cell lines are used. The main limitation of the study is the lack of representative control samples to more critically evaluate disease-specific associations. Instead, we used whole EBV genome sequence data from the NCBI, which includes healthy and diseased populations from all continents, although not from exactly the same areas.

To summarize, we present the first set of EBV genomes sequenced from primary BL samples from different geographical areas. We showed that BL EBVs were closer to each other and distant from NPC EBVs and we discovered novel LMP-1 promoter and gene changes that may prove useful for classifying EBVs into four different groups. Our findings justify case-control studies to validate the novel LMP-1 variants and measure disease-specific associations with BL and other EBV-associated cancers.

Note: During the review of the paper, we sequenced 2 additional BL biopsies (VA and SG) obtained from Argentina in South America, thereby increasing the number of EBV whole genomes in NCBI to 27. Both EBV genomes showed Pattern A in the LMP-1 analysis, with the characteristic 23 nucleotide changes in the promoter and coding region; thus, the Pattern A EBV genotype was observed in 13 out of 14 EBVs sequenced directly from BL tumors. The full-length sequences of these 2 EBV genomes have been submitted to NCBI (accession numbers: VA KT001102; SG KT001103).

Methods

Study population

The BL samples were fresh-frozen biopsies obtained mostly from the abdomen of children with BL aged less than 15 years in Ghana (N = 5)^44,45, Brazil (N = 6) and Argentina (N = 1)⁴⁶ (Table 1) enrolled in historical studies performed by investigators at the National Cancer Institute. All diagnoses of tumor biopsies were confirmed histologically.

Ethics Review

The current study was carried out in accordance with the approved guidelines. The historical studies were conducted after ethical approval from the local institutions (Korle Bu University, the Hospital AC Camargo, Sao Paulo, Brazil and CIIH Domingos Boldrini, Campinas, Brazil and Hospital Nacional de Pediatria “Juan Garrahan,” both in Buenos Aires, Argentina) and subjects gave informed consent to participate. The current study received exemption from the Office of Human Subjects Research at the National Institutes of Health to use de-identified samples. The sequencing study of previously frozen DNA samples from BL biopsies was conducted under FDA Research Involving Human Subjects Committee (RIHSC) protocol #10-008B entitled "Detection of Infectious Agents in Previously Frozen Blood Samples from Patients with Various Illnesses and Healthy Blood Donors".

Sample preparation and EBV genome sequencing

DNA was extracted from tumor samples for molecular studies as previously described⁴⁶. DNA was directly sequenced by Illumina MiSeq as previously described³⁷. Briefly, approximately 50 ng of DNA extracted from each of the BL tumor samples was subjected to DNA library construction using the Nextera DNA Sample Prep Kit (Illumina) through tagmentation and 5-cycle polymerase chain reaction amplification according to the manufacturer’s protocol. The DNA libraries had insert sizes ranging from 250 to 1000 base pairs (bp), with a peak around 500 bp. Sequencing was conducted using the Illumina MiSeq Reagent Kit V2 (500 cycles for 2 × 250 bp paired-end sequencing) and the raw reads were processed following the previously described workflow (Supplementary Figure 1)³⁷.

EBV genome assembly, alignment and phylogenetic analysis

The sequences from each of the 12 samples were filtered at a Q30 phred score and trimmed to remove low quality base reads (with read error probability score >0.05, <2 ambiguities in the reads or read-lengths of less than 15 bp) using CLC Genomics Workbench (version 7.0, Qiagen). The filtered raw reads from each BL sample were aligned to the WT-EBV (NC_007605) sequence using the same software. Default parameters of mismatch cost of 2, insertion cost of 3, deletion cost of 3, length fraction of 1 and similarity fraction of 0.9 were used to obtain high-quality sequence alignment. The basic detection function was used to call nucleotide variations (single-nucleotide variations (SNVs) or multiple-nucleotide variations (MNVs)), insertions and deletions in the reads when there were at least 5 reads at a particular base and the variant sequence appeared in at least 35% of the reads at that base. SNVs were categorized as synonymous or non-synonymous variations, depending on whether the variant coded for a different amino acid. Variation in the BL EBV genomes relative to the WT EBV reference genome was quantified by dividing the number of variations in the particular genome by the total number of bases sequenced in that genome. Variations in the internal and terminal repeat regions of the EBV genome were disregarded.
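The calling thresholds, the synonymous versus non-synonymous split and the variation density described above can be summarized in a few lines. This is a minimal sketch of the stated rules, not the CLC Genomics Workbench implementation; all function names are hypothetical, and the example codons are illustrative rather than taken from the EBV genome.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def passes_call_thresholds(variant_reads, total_reads,
                           min_reads=5, min_fraction=0.35):
    """Apply the two thresholds quoted in the text: at least 5 reads at the
    position and the variant present in at least 35% of those reads."""
    if total_reads < min_reads:
        return False
    return variant_reads / total_reads >= min_fraction


def classify_snv(ref_codon, alt_codon):
    """Label a single-codon change by whether the encoded amino acid changes."""
    same = CODON_TABLE[ref_codon] == CODON_TABLE[alt_codon]
    return "synonymous" if same else "non-synonymous"


def variation_density(n_variations, bases_sequenced):
    """Number of variations divided by the number of bases sequenced."""
    return n_variations / bases_sequenced


# Illustrative values only (not data from the study).
print(passes_call_thresholds(variant_reads=8, total_reads=20))
print(passes_call_thresholds(variant_reads=3, total_reads=4))
print(classify_snv("GAA", "GAC"))  # Glu -> Asp, as in an E-to-D change
print(classify_snv("GAA", "GAG"))  # Glu -> Glu
```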

Multiple sequence alignments of the 12 BL whole EBV genomes and 100 published EBV genomes (97 registered in the NCBI and 3 published from the 1000 Genomes Project)³⁸, including 13 from tumor-derived BL cell lines, were done using the Kalign program (http://www.ebi.ac.uk/Tools/msa/kalign) installed on the NIH Helix supercomputer (https://helix.nih.gov) to facilitate phylogenetic analysis. An additional 18 EBV genomes marked as NPC-EBVs (AB850643 to AB850660) present in the NCBI database were not included in the whole genome analysis because they lacked complete references of publication, details about origin, or annotation. Individual gene alignments for LMP-1, EBNA-1 and BZLF1 proteins were generated using the Clustal Omega program at EBI (http://www.ebi.ac.uk/Tools/msa/clustalo). The alignments were used to generate phylogenetic trees using Molecular Evolutionary Genetic Analysis (MEGA) software, version 5.0⁴⁷, with a neighbor-joining algorithm.
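Neighbor-joining builds a tree from a matrix of pairwise distances between aligned sequences. The text does not state which distance model was selected in MEGA, so the sketch below assumes the simplest one, the p-distance (the proportion of differing sites), and uses toy sequences that are not EBV data. It illustrates only the distance-matrix step, not the tree construction itself.

```python
def p_distance(seq_a, seq_b, gap="-"):
    """Proportion of differing sites among aligned columns where neither
    sequence has a gap (pairwise deletion of gapped columns)."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must come from the same alignment")
    compared = 0
    differing = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            continue
        compared += 1
        if a != b:
            differing += 1
    return differing / compared if compared else 0.0


def distance_matrix(alignment):
    """Return {(name_x, name_y): p-distance} for every ordered pair."""
    names = list(alignment)
    return {
        (x, y): p_distance(alignment[x], alignment[y])
        for x in names
        for y in names
    }


# Toy aligned sequences (hypothetical, for illustration only).
toy_alignment = {
    "ref": "ATGGAACACG",
    "s1":  "ATGGACCACG",
    "s2":  "ATG-ACCATG",
}

matrix = distance_matrix(toy_alignment)
for (x, y), d in matrix.items():
    if x < y:
        print(f"{x} vs {y}: {d:.3f}")
```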

Data access

The full-length sequences of the 12 assembled BL EBV genomes (supplemental Table 1), CCH, MP, SCL, VGO, RPF, CV-ARG, FNR, HU11393, H018436D, H058015C, H002213 and H03753A, were annotated using information derived from the reference genome WT-EBV (NC_007605.1) sequence. Results were submitted to the GenBank database with the accession numbers KP968257, KP968258, KP968259, KP968260, KR063344, KR063343, KR063345, KP968261, KP968262, KP968263, KP968264 and KR063342, respectively.
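Because the accession numbers are listed "respectively", each sample name pairs with the accession in the same position. A small sketch of that pairing, with a hypothetical dictionary name, makes lookups by sample straightforward:

```python
# Sample names and GenBank accessions in the order given in the text.
samples = [
    "CCH", "MP", "SCL", "VGO", "RPF", "CV-ARG",
    "FNR", "HU11393", "H018436D", "H058015C", "H002213", "H03753A",
]
accessions = [
    "KP968257", "KP968258", "KP968259", "KP968260", "KR063344", "KR063343",
    "KR063345", "KP968261", "KP968262", "KP968263", "KP968264", "KR063342",
]

# Pair each sample with the accession at the same position.
accession_by_sample = dict(zip(samples, accessions))

print(accession_by_sample["VGO"])
```

The pairings for VGO, RPF and FNR agree with the combined labels used in the Results (KP968260-VGO, KR063344-RPF and KR063345-FNR).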

Additional Information

How to cite this article: Lei, H. et al. Epstein-Barr virus from Burkitt Lymphoma biopsies from Africa and South America share novel LMP-1 promoter and gene variations. Sci. Rep. 5, 16706; doi: 10.1038/srep16706 (2015).

References

Epstein, M. A., Achong, B. G. & Barr, Y. M. Virus particles in cultured lymphoblasts from Burkitt's lymphoma. Lancet 1, 702–703 (1964).
Article CAS Google Scholar
Hjalgrim, H. & Engels, E. A. Infectious aetiology of Hodgkin and non-Hodgkin lymphomas: A review of the epidemiological evidence. J Intern Med 264, 537–548 (2008).
Article CAS Google Scholar
Harabuchi, Y. et al. Nasal t-cell lymphoma causally associated with Epstein-Barr virus: Clinicopathologic, phenotypic and genotypic studies. Cancer 77, 2137–2149 (1996).
Article CAS Google Scholar
de The, G., Ablashi, D. V., Liabeuf, A. & Mourali, N. Nasopharyngeal carcinoma (npc). Vi. Presence of an EBV nuclear antigen in fresh tumour biopsies. Preliminary results. Biomedicine 19, 349–352 (1973).
CAS PubMed Google Scholar
Proceedings of the IARC working group on the evaluation of carcinogenic risks to humans. Epstein-Barr virus and Kaposi's sarcoma herpesvirus/human herpesvirus 8. Lyon, france, 17-24 june 1997. IARC Monogr Eval Carcinog Risks Hum 70, 1–492 (1997).
Pittaluga, S., Loke, S. L., So, K. C., Cheung, K. N. & Ma, L. Clonal epstein-barr virus in lymphoepithelioma-like carcinoma of the stomach: Demonstration of viral genome by in situ hybridization and southern blot analysis. Mod Pathol 5, 661–664 (1992).
CAS PubMed Google Scholar
Shibata, D., Hawes, D., Stemmermann, G. N. & Weiss, L. M. Epstein-Barr virus-associated gastric adenocarcinoma among japanese americans in Hawaii. Cancer Epidemiol Biomarkers Prev 2, 213–217 (1993).
CAS PubMed Google Scholar
Biggar, R. J. et al. Primary Epstein-Barr virus infections in African infants. II. Clinical and serological observations during seroconversion. Int J Cancer 22, 244–250 (1978).
Mbulaiteye, S. M. et al. High levels of Epstein-Barr virus DNA in saliva and peripheral blood from Ugandan mother-child pairs. J Infect Dis 193, 422–426 (2006).
Thorley-Lawson, D. A. & Gross, A. Persistence of the Epstein-Barr virus and the origins of associated lymphomas. N Engl J Med 350, 1328–1337 (2004).
Bhatia, K. et al. Variation in the sequence of Epstein Barr virus nuclear antigen 1 in normal peripheral blood lymphocytes and in Burkitt's lymphomas. Oncogene 13, 177–181 (1996).
Ziegler, J. L. Burkitt's lymphoma. N Engl J Med 305, 735–745 (1981).
de-The, G. et al. Epidemiological evidence for causal relationship between Epstein-Barr virus and Burkitt's lymphoma from Ugandan prospective study. Nature 274, 756–761 (1978).
de-The, G. The Epstein-Barr Virus (EBV): A Rosetta Stone for understanding the role of viruses in immunopathological disorders and in human carcinogenesis. Biomed Pharmacother 39, 49–51 (1985).
Cohen, J. I., Mocarski, E. S., Raab-Traub, N., Corey, L. & Nabel, G. J. The need and challenges for development of an Epstein-Barr virus vaccine. Vaccine 31 Suppl 2, B194–196 (2013).
What we could do with an EBV vaccine. Lancet 1, 759–761 (1981).
Cohen, J. I., Fauci, A. S., Varmus, H. & Nabel, G. J. Epstein-Barr virus: An important vaccine target for cancer prevention. Sci Transl Med 3, 107 (2011).
Chang, C. M., Yu, K. J., Mbulaiteye, S. M., Hildesheim, A. & Bhatia, K. The extent of genetic diversity of Epstein-Barr virus and its geographic and disease patterns: A need for reappraisal. Virus Res 143, 209–221 (2009).
Goldschmidts, W. L. et al. Epstein-Barr virus genotypes in AIDS-associated lymphomas are similar to those in endemic Burkitt's lymphomas. Leukemia 6, 875–878 (1992).
Gutierrez, M. I. et al. Discrete alterations in the BZLF1 promoter in tumor and non-tumor-associated Epstein-Barr virus. JNCI 94, 1757–1763 (2002).
Lorenzetti, M. A. et al. Distinctive Epstein-Barr virus variants associated with benign and malignant pediatric pathologies: LMP1 sequence characterization and linkage with other viral gene polymorphisms. J Clin Microbiol 50, 609–618 (2012).
Yang, Y. et al. Sequence analysis of EBV immediate-early gene BZLF1 and BRLF1 in lymphomas. J Med Virol 86, 1788–1795 (2014).
Zhang, X. S. et al. V-val subtype of Epstein-Barr virus nuclear antigen 1 preferentially exists in biopsies of nasopharyngeal carcinoma. Cancer Lett 211, 11–18 (2004).
Chao, M., Wang, H. N., Lu, Y. J., Chang, Y. S. & Yu, J. S. The v-val subtype Epstein-Barr virus nuclear antigen 1 promotes cell survival after serum withdrawal. Oncol Rep 33, 958–966 (2015).
Sawada, A. et al. Epstein-Barr virus latent gene sequences as geographical markers of viral origin: Unique EBNA3 gene signatures identify Japanese viruses as distinct members of the Asian virus family. J Gen Virol 92, 1032–1043 (2011).
Sandvej, K., Andresen, B. S., Zhou, X. G., Gregersen, N. & Hamilton-Dutoit, S. Analysis of the Epstein-Barr virus (EBV) latent membrane protein 1 (LMP-1) gene and promoter in Hodgkin's disease isolates: Selection against EBV variants with mutations in the LMP-1 promoter ATF-1/CREB-1 binding site. Mol Path 53, 280–288 (2000).
Sandvej, K. et al. Sequence analysis of the Epstein-Barr virus (EBV) latent membrane protein-1 gene and promoter region: Identification of four variants among wild-type EBV isolates. Blood 90, 323–330 (1997).
Martini, M. et al. Characterization of variants in the promoter of EBV gene BZLF1 in normal donors, HIV-positive patients and in AIDS-related lymphomas. J Infect 54, 298–306 (2007).
Sample, J. et al. Epstein-Barr virus types 1 and 2 differ in their EBNA-3a, EBNA-3b and EBNA-3c genes. J Virol 64, 4084–4092 (1990).
Baer, R. et al. DNA sequence and expression of the B95-8 Epstein-Barr virus genome. Nature 310, 207–211 (1984).
Dolan, A., Addison, C., Gatherer, D., Davison, A. J. & McGeoch, D. J. The genome of Epstein-Barr virus type 2 strain AG876. Virology 350, 164–170 (2006).
Lin, Z. et al. Whole-genome sequencing of the Akata and Mutu Epstein-Barr virus strains. J Virol 87, 1172–1182 (2013).
Kwok, H. et al. Genomic sequencing and comparative analysis of Epstein-Barr virus genome isolated from primary nasopharyngeal carcinoma biopsy. PLoS One 7, e36939 (2012).
Kwok, H. et al. Genomic diversity of Epstein-Barr virus genomes isolated from primary nasopharyngeal carcinoma biopsy samples. J Virol 88, 10662–10672 (2014).
Liu, P. et al. Direct sequencing and characterization of a clinical isolate of Epstein-Barr virus from nasopharyngeal carcinoma tissue by using next-generation sequencing technology. J Virol 85, 11291–11299 (2011).
Palser, A. L. et al. Genome diversity of Epstein-Barr virus from multiple tumour types and normal infection. J Virol 89, 5222–5237 (2015).
Lei, H. et al. Identification and characterization of EBV genomes in spontaneously immortalized human peripheral blood B lymphocytes by NGS technology. BMC Genomics 14, 804 (2013).
Santpere, G. et al. Genome-wide analysis of wild-type Epstein-Barr virus genomes derived from healthy individuals of the 1000 genomes project. Genome Biol Evol 6, 846–860 (2014).
Thorley-Lawson, D. A. & Allday, M. J. The curious case of the tumour virus: 50 years of Burkitt's lymphoma. Nat Rev Microbiol 6, 913–924 (2008).
Chen, M. L., Wu, R. C., Liu, S. T. & Chang, Y. S. Characterization of 5'-upstream sequence of the latent membrane protein 1 (LMP-1) gene of an Epstein-Barr virus identified in nasopharyngeal carcinoma tissues. Virus Res 37, 75–84 (1995).
Jansson, A., Johansson, P., Li, S. & Rymo, L. Activity of the LMP1 gene promoter in Epstein-Barr virus-transformed cell lines is modulated by sequence variations in the promoter-proximal CRE site. J Gen Virol 88, 1887–1894 (2007).
Busson, P., Edwards, R. H., Tursz, T. & Raab-Traub, N. Sequence polymorphism in the Epstein-Barr virus latent membrane protein (LMP)-2 gene. J Gen Virol 76(Pt 1), 139–145 (1995).
Walling, D. M. et al. The molecular epidemiology and evolution of Epstein-Barr virus: Sequence variation and genetic recombination in the latent membrane protein-1 gene. J Infect Dis 179, 763–774 (1999).
Nkrumah, F. et al. Burkitt's lymphoma: Its clinical course in relation to immunologic reactivities to Epstein-Barr virus and tumor-related antigens. JNCI 57, 1051–1056 (1976).
Nkrumah, F. K. Changes in the presentation of Burkitt's lymphoma in Ghana over a 15-year period (1969-1982). IARC Scient Pub, 665–674 (1984).
Gutierrez, M. I. et al. Molecular epidemiology of Burkitt's lymphoma from South America: Differences in breakpoint location and Epstein-Barr virus association from tumors in other world regions. Blood 79, 3261–3266 (1992).
Tamura, K. et al. MEGA5: Molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance and maximum parsimony methods. Mol Biol Evol 28, 2731–2739 (2011).

Acknowledgements

We would like to thank Dr. James J. Goedert and Charles Rabkin at the Infections and Immunoepidemiology Branch at the National Cancer Institute (Bethesda, Maryland) for editorial comments. We thank Drs. Hsiao-Mei Liao and Pengfei Guo at the Center for Biologics Evaluation and Research, Food and Drug Administration (Silver Spring, Maryland) for preparing the final Figures and Tables for publication. Support: The study was funded in part by the Intramural Research Program of the Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Department of Health and Human Services; Grant number: N01-CO-12400 and in part by a Food and Drug Administration in-house Modernizing Science Fund. This study utilized the high-performance computational capabilities of the Biowulf Linux cluster at the National Institutes of Health, Bethesda, MD (http://biowulf.nih.gov).

Author information

Authors and Affiliations

Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, Maryland
Haiyan Lei, Tianwei Li, Bingjie Li, Shien Tsai & Shyh-Ching Lo
formerly, NCI, Silver Spring, Maryland
Robert J. Biggar
Noguchi Memorial Institute, Accra, Ghana
Francis Nkrumah
Department of Child Health, University of Ghana, Accra, Ghana
Janet Neequaye
Laboratorio Stamboulian, Buenos Aires, Argentina
Marina Gutierrez
Department of Pediatric Oncology, St Marcelina Hospital, São Paulo, Brazil
Sidnei Epelman
Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, Maryland
Sam M. Mbulaiteye & Kishor Bhatia

Contributions

T.L. and B.L. conducted sequencing experiments by MiSeq and PCR/Sanger. H.L. conducted bioinformatics analyses and drafted the manuscript. S.L. and S.T. supervised laboratory sequencing experiments and bioinformatics analyses. F.N., J.N., R.J.B., M.G. and S.E. conducted field work. K.B. and S.M.M. conceived the idea; K.B., S.M.M. and S.L. designed the experiments, guided data analysis, interpreted data, edited the manuscript and shared senior authorship. All authors had access to data, commented on and contributed to the final draft of the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

About this article

Received: 01 July 2015
Accepted: 19 October 2015
Published: 23 November 2015
DOI: https://doi.org/10.1038/srep16706
