Main

Epigenetic aberrations are important mechanisms in human carcinogenesis.1 A number of tumor suppressor genes are silenced by promoter methylation in colorectal cancers.1, 2 A subset of colorectal cancers have been shown to exhibit widespread promoter methylation, which is referred to as the CpG island methylator phenotype (CIMP).1, 3 CIMP-high colorectal tumors have a distinct clinical, pathologic and molecular profile, such as associations with proximal tumor location, female sex, poor tumor differentiation, inactive WNT/β-catenin (CTNNB1) and high BRAF and low TP53 mutation rates,4, 5, 6 independent of microsatellite instability status.7, 8, 9, 10 Although controversial, CIMP may have prognostic implications in colorectal cancer.11, 12, 13, 14

In contrast to CIMP-high in colorectal cancer, the concept of CIMP-low (with less widespread CIMP-specific promoter methylation) is still emerging.15, 16 While CIMP-high colorectal cancer is associated with female sex and BRAF mutation, CIMP-low is associated with KRAS mutation.17 Thus, CIMP-low cannot be explained by a simple mixture of CIMP-high and CIMP-0 (CIMP-negative).17 We have also demonstrated a possible link between CIMP-low, microsatellite instability-low, MGMT methylation and KRAS mutation in colorectal cancer.18 18q loss of heterozygosity is associated positively with CIMP-0 and inversely with CIMP-low.19 These data collectively support molecular differences between CIMP-low, CIMP-high and CIMP-0 in colorectal cancer. In light of molecular differences between CIMP-low and CIMP-high, we hypothesized that density of methylation in individual CpG islands might be different between CIMP-low and CIMP-high tumors. Examining DNA methylation levels in individual CpG islands may provide additional evidence for underlying molecular differences between CIMP-low and CIMP-high tumors.

In this study, using quantitative methylation analysis (MethyLight technology20) and a large number of population-based colorectal cancer samples, we have examined the prevalences of low-level methylation in 14 CpG islands among CIMP-low tumors, in comparison to CIMP-high and CIMP-0 tumors. MethyLight assays (quantitative methylation-specific PCR) are robust and reproducible in quantifying methylation in DNA from paraffin-embedded tumor tissue.21, 22, 23, 24, 25, 26, 27 Low index of methylation in MethyLight analysis is not associated with gene silencing,24 indicating that low methylation index in MethyLight indeed reflects low-level DNA methylation.

Materials and methods

Study Group

We utilized the databases of two large prospective cohort studies; the Nurses’ Health Study (N=121 700 women followed since 1976),28 and the Health Professional Follow-up Study (N=51 500 men followed since 1986).29 Informed consent was obtained from all participants prior to inclusion in the cohorts. A subset of the cohort participants developed colorectal cancers during prospective follow-up. Thus, these colorectal cancers represented population-based, relatively unbiased samples (compared to retrospective or single-hospital-based samples). Previous studies on Nurses’ Health Study and Health Professionals Follow-up Study have described baseline characteristics of cohort participants and incident colorectal cancer cases, and confirmed that our colorectal cancer cases were well representative as a population-based sample.28, 29 We collected paraffin-embedded tissue blocks from hospitals where cohort participants had undergone resections of primary colorectal cancers. We obtained specimens (hematoxylin and eosin (H&E)-stained slides, unstained slides and/or paraffin blocks) from 648 cases (retrieval rate 76%) in the Health Professionals Follow-up Study and 662 cases (retrieval rate 58%) in the Nurses’ Health Study. We excluded cases if adequate paraffin-embedded tumor tissue with more than 40% tumor cellularity was not available, or if tumors were preoperatively treated by chemotherapy or radiation. On the basis of the availability of adequate tissue specimens, a total of 889 colorectal cancers (402 from the men's cohort and 487 from the women's cohort) were included, while 421 tumors (246 men and 175 women) were excluded because of lack of adequate materials. Baseline characteristics among cohort participants with colorectal cancer who had or did not have available tumor tissue specimens were very similar.30 Our tumor samples have been previously characterized for status of CIMP and methylation in various CpG islands.10, 17, 18 However, we have not examined CHFR (checkpoint with forkhead and ring finger domains), HIC1 (hypermethylated in cancer 1), IGFBP3 (insulin-like growth factor binding protein 3), MINT31 (methylated in tumor 31) and WRN (Werner syndrome). Moreover, in our previous studies,10, 17, 18 we utilized percentage of methylated reference (methylation index) values only as the binary discriminator of methylation status (positive vs negative). Our current novel analysis of raw methylation index (percentage of methylated reference) values has given us new evidence for molecular differences between CIMP-low and CIMP-high as shown below. Tissue collection and analyses were approved by the Dana-Farber Cancer Institute and Brigham and Women's Hospital Institutional Review Boards.

Histologic Evaluation of Tumor Cellularity

Tumor cellularity was visually estimated by a pathologist (SO) who examined H&E-stained slides under a light microscope. We excluded cases if adequate paraffin-embedded tumor tissue with 40% or more tumor cellularity was not available, or if tumors were previously treated by chemotherapy or radiation. Thus, tumor cellularity by itself should not have caused low-level of methylation in this study. We compared average tumor cellularities between CIMP-high, CIMP-low and CIMP-0 tumors, and confirmed there were no significant differences between these tumors (see the Results section).

Sequencing of KRAS and BRAF

Genomic DNA extraction and whole genome amplification were performed as described.31 PCR and Pyrosequencing targeted for KRAS codons 12 and 13, and BRAF codon 600 were performed using the PSQ96 HS System (Biotage, Uppsala, Sweden).17, 31

Bisulfite Conversion and Real-Time PCR (MethyLight) for Quantitative DNA Methylation Analysis

Sodium bisulfite treatment on tumor tissue samples and genomic DNA was performed as described previously.24 We did not use laser capture microdissection because ample amounts of DNA were critical for reliable quantification of DNA methylation.24 Utilizing MethyLight technology (real-time PCR)20 and ABI 7300 (Applied Biosystems, Foster City, CA, USA), we quantified methylation in 8 promoters (CACNA1G (calcium channel, voltage-dependent, T-type α-1G subunit), CDKN2A (cyclin-dependent kinase inhibitor 2A (p16/INK4A)), CRABP1 (cellular retinoic acid binding protein 1), IGF2 (insulin-like growth factor 2), MLH1, NEUROG1 (neurogenin 1), RUNX3 (runt-related transcription factor 3) and SOCS1 (suppressor of cytokine signaling 1)), all of which have been selected based on screening of 195 CpG islands.6, 8 We have previously shown that these eight markers are sensitive and specific for the detection of CIMP-high.10 COL2A1 (the collagen 2A1 gene) was used to normalize for the amount of input bisulfite-converted DNA.24 We also examined six other CpG islands (CHFR, HIC1, IGFBP3, MGMT, MINT31 and WRN) which were not in the eight-marker CIMP panel. PCR primers and probes for CHFR, HIC1 and MINT31 were (bisulfite-converted nucleotides are highlighted by bold face and italics): CHFR-F, 5′-CGGGAGTTTTTATGGGCGT-3′ (GenBank no. AC127070, nucleotide nos. 62442–62460); CHFR-R, 5′-AACCGTCCCCAAAACTACGAC-3′ (nucleotide nos. 62525–62545); CHFR-probe, 6FAM-5′-CCTCGAACCGCTCCATCGAAATTCA-3′-BHQ-1 (nucleotide nos. 62482–62505); HIC1-F, 5′-TTCGTTACGGTAGTCGTTGTTTTC-3′ (GenBank no. L41919, nucleotide nos. 43–66); HIC1-R, 5′-GAAAACTATCAACCCTCGATCGA-3′ (nucleotide nos. 94–116); HIC1-probe, 6FAM-5′-TCGCGCGGTCGTCGTTCG-3′-BHQ-1 (nucleotide nos. 72–89); MINT31-F, 5′-GTCGTCGGCGTTATTTTAGAAAGTT-3′ (GenBank no. AC021491, nucleotide nos. 50060–50130); MINT31-R, 5′-CACCGACGCCCAACACA-3′ (nucleotide nos. 50113–50130); MINT31-probe, 6FAM-5′-ACGCTCCGCTCCCGAATACCCA-3′-BHQ-1 (nucleotide nos. 50087–50107). Other primers and probes were described previously6 (IGFBP3 and WRN13). PCR conditions were 95°C for 10 min, followed by 45 cycles of 95°C for 15 s and 60°C for 1 min. A standard curve was made for each PCR plate by duplicated PCR amplifications for COL2A1 on bisulfite-converted human genomic DNA at four different concentrations (in a five-fold dilution series).

The percentage of methylated reference (referred to as ‘methylation index’) at a specific locus was calculated by dividing the GENE:COL2A1 ratio of template amounts in a sample by the GENE:COL2A1 ratio of template amounts in SssI-treated human genomic DNA (presumably fully methylated) and multiplying this value by 100 (or 100 × 2/3 for CRABP1, IGF2, HIC1, IGFBP3, MINT31 and WRN).20 A methylation index cutoff of 4 (methylation positive vs negative) was based on previously validated data.20, 24 There was no tumor with methylation index=4, 10 or 20, exactly, at any locus. Precision and performance characteristics of bisulfite conversion and subsequent MethyLight assays have been previously evaluated and the assays have been validated.24

CIMP-high was defined as the presence of ≥6 methylated markers among the eight CIMP panel promoters (CACNA1G, CDKN2A, CRABP1, IGF2, MLH1, NEUROG1, RUNX3 and SOCS1), CIMP-low as the presence of 1/8 to 5/8 methylated markers, and CIMP-0 as 0/8 methylated markers, according to the previously validated criteria.10

Statistical Analysis

For categorical data, we performed the χ2-test (or the Fisher's exact test when the number in any category is less than 10), using SAS program (Version 9.1, SAS Institute, Cary, NC, USA). The Wilcoxon's rank-sum test was used to compare the average frequencies of low-level methylation in all eight CpG islands between two tumor subtypes (CIMP-0 vs CIMP-low; or CIMP-low vs CIMP-high). The two sample t-test assuming unequal variances was used to compare average tumor cellularities between CIMP-high, CIMP-low and CIMP-0 tumors. All P-values were two-sided, and statistical significance was set at P≤0.05.

Results

Overall Distribution of Methylation Index Values in Each Locus

Utilizing MethyLight technology, we quantified DNA methylation in a panel of eight promoters (CACNA1G, CDKN2A, CRABP1, IGF2, MLH1, NEUROG1, RUNX3 and SOCS1).6, 10 All of the eight markers have been shown to be sensitive and specific for CIMP-high, and the use of the eight markers as a CIMP diagnostic panel has been validated.10 Figure 1 shows a distribution of methylation index values at each locus in the 889 colorectal cancers. Only rare cases showed methylation index values between 3 and 5 in any locus, indicating that methylation status (either negative, methylation index <4 or positive, methylation index >4) at each locus was unequivocally determined in a vast majority of cases. There was no tumor with methylation index=4, 10 or 20, exactly, at any locus.

Figure 1
figure 1

Distribution of methylation index (percentage of methylated reference) values in 889 colorectal cancers.

Classification of CIMP-High, CIMP-Low and CIMP-0 Tumors

The CpG island methylator phenotype-high (CIMP-high) was defined as the presence of ≥6/8 methylated (positive) loci, CIMP-low as the presence of 1/8 to 5/8 methylated loci, and CIMP-0 as the absence (0/8) of methylated loci based on the data that CIMP-high and CIMP-low are associated with BRAF mutation and KRAS mutation, respectively.10, 17 Among the 889 colorectal cancers, there were 133 (15%) CIMP-high tumors, 340 (38%) CIMP-low tumors and 416 (47%) CIMP-0 tumors.

We selected tumors with at least 40% tumor cellularity for this study, and also assessed the average tumor cellularity in each CIMP category, to exclude the possibility that differences in methylation index values might be simply caused by differences in tumor cellularity. The average tumor cellularities were 54.8% in CIMP-high, 56.2% in CIMP-low and 57.7% in CIMP-0 tumors, and there were no significant differences (CIMP-high vs CIMP-low, P=0.45; CIMP-low vs CIMP-0, P=0.27).

Low-Level Methylation at an Individual Promoter was Common in CIMP-Low Tumors

In addition to methylation index cutoff of 4 that has been repeatedly validated as a cutoff for biologically significant methylation,10, 24, 32, 33 we assigned additional arbitrary cutoffs of low-level methylation, methylation index <10 and <20. We tested these two additional cutoffs (instead of one additional cutoff), because these cutoffs were arbitrary. First, we examined the frequency of methylation index >0 and <20 (also written as methylation index 0–20) in each promoter among CIMP-high tumors, CIMP-low tumors and CIMP-0 tumors (Table 1). The overall frequency of methylation index 0–20 in each promoter was almost always higher in CIMP-low tumors than in CIMP-high and CIMP-0 tumors (with the only exception of a higher frequency of MLH1 methylation index 0–20 in CIMP-high tumors than CIMP-low tumors).

Table 1 Frequency of low methylation index at each CIMP-high-specific promoter among CIMP-high, CIMP-low and CIMP-0 colorectal cancers

Considering biological significance of the methylation index cutoff of 4 (ie methylation positive vs negative), we next analyzed the frequencies of methylation index >0 and <4 (between 0 and 4) (also written as methylation index 0–4) and methylation index 4–20, separately. To compare the frequencies of methylation index 0–4 in CACNA1G, for example, we limited the denominator to tumors with CACNA1G methylation index <4 (methylation negative) (Table 1). This was because, compared to CIMP-0 tumors, CIMP-high and CIMP-low tumors had a higher proportion of tumors positive for CACNA1G methylation (methylation index >4), which, if included in the denominator, would have by itself decreased the frequency of methylation index 0–4. Compared to CIMP-0 tumors, CIMP-low tumors consistently showed higher frequencies of methylation index 0–4 in all eight loci (P=0.004), and the differences were highly significant in CRABP1, MLH1, NEUROG1 and RUNX3 (all P<0.001).

To compare the frequencies of methylation index 4–10 (or 4–20) in CACNA1G, for example, we limited the denominator to tumors with CACNA1G methylation index >4 (methylation positive) (Table 1). This was because, compared to CIMP-high tumors, CIMP-low tumors had a higher proportion of tumors negative for CACNA1G methylation (methylation index <4), which, if included in the denominator, would have by itself decreased the frequency of methylation index 4–10 (or 4–20). Compared to CIMP-high tumors, CIMP-low tumors consistently showed higher frequencies of methylation index 4–20 in all eight loci (P=0.004) and methylation index 4–10 in all eight but one locus (MLH1) (P=0.03) (Table 1). The differences were statistically significant in many individual loci (CACNA1G, CRABP1, IGF2, NEUROG1, RUNX3 and SOCS1, all P≤0.03) regardless of the cutoff used (methylation index 4–10 or 4–20).

DNA Methylation Levels at CpG Islands not Included in the CIMP Panel

In addition to the eight promoter CpG islands in the CIMP panel, we examined DNA methylation levels in six other CpG islands (CHFR, HIC1, IGFBP3, MGMT, MINT31 and WRN), which were not in the CIMP panel (Table 2). Among 888 tumors, sensitivity and specificity of each marker for the CIMP-high diagnosis were as follows: CHFR, 90 and 65%; HIC1, 89 and 55%; IGFBP3, 57 and 78%; MGMT, 50 and 65%; MINT31, 97 and 78%; and WRN, 94 and 73%, respectively. Sensitivities or specificities of these six markers for CIMP-high were considerably lower than those of the eight CIMP panel markers.10

Table 2 Frequency of low methylation index at each non-CIMP-specific locus among CIMP-high, CIMP-low and CIMP-0 tumors

With regard to WRN methylation, low methylation index was significantly more common in CIMP-low tumors than in CIMP-high and CIMP-0 tumors (Table 2). However, for the other five markers, the relationship between low methylation index and CIMP-low was not persistent. Our results suggest that the relationship between CIMP-low tumors and low methylation index is mainly limited to CpG islands that are highly correlated with CIMP-high.

Prevalences (Frequencies) of Methylation Index 0–4, 4–10 and 10–20 Across 8 CIMP-Specific Promoters in Each Tumor

We examined an overall prevalence of low-level methylation in the eight CIMP panel markers (in addition to low-level methylation in each marker as in Table 1). Table 3 illustrates methods to assess overall prevalence (frequency) of methylation index 0–4 (highlighted by blue) in each tumor, using examples of tumors. The frequency of loci with methylation index 0–4 in each tumor was calculated as ‘(the number of loci with methylation index 0–4)/(the number of all negative loci with methylation index <4).’ We limited the denominator to negative loci with methylation index <4; otherwise, the presence of more positive loci by itself would have made methylation index 0–4 less prevalent. When there were more positive loci (methylation index>4) in a given tumor (eg case ID 6–9 in Table 3), there were fewer negative loci with methylation index <4.

Table 3 Example of methylation index values and methods to calculate the frequency of low methylation index (>0 and <20) in an individual tumor

The frequency of loci with methylation index 4–10 (highlighted by yellow in Table 3) in each tumor was calculated as (the number of loci with methylation index 4–10)/(the number of all positive loci with methylation index >4). Likewise, the frequency of loci with methylation index 10–20 (highlighted by red in Table 3) in each tumor was calculated as ‘(the number of loci with methylation index 10–20)/(the number of all positive loci with methylation index >4).’ We limited the denominator to positive loci with methylation index >4; otherwise, the presence of fewer positive loci by itself would have made methylation index 4–10 (or methylation index 10–20) less prevalent.

Low Methylation Index 4–10 or 4–20 is More Common in CIMP-Low Tumors than in CIMP-High Tumors

Table 4 shows a distribution of the frequencies of methylation index 4–10, according to the number of positive loci. For tumors with a given number of positive loci, an average of the frequencies of methylation index 4–10 was calculated as ‘∑(frequency of methylation index 4–10)/(the number of cases).’ The average reflected how often a tumor with a given number of positive loci showed methylation index 4–10 among all positive loci (methylation index>4). The average for CIMP-low tumors (19.4%) was much higher than that for CIMP-high tumors (4.2%, P=0.002) (Table 4), indicating that low-level methylation (methylation index 4–10) was far more common in CIMP-low tumors than CIMP-high tumors.

Table 4 Overall frequency of low methylation index 4–10 among all positive loci (methylation index>4) according to the number of methylation-positive loci

Similarly, Table 5 shows a distribution of the frequencies of methylation index 4–20, according to the number of positive loci. For tumors with a given number of positive loci, an average of the frequencies of methylation index 4–20 was calculated as ‘∑(frequency of methylation index 4–20)/(the number of cases).’ The average for CIMP-low tumors (36.8%) was much higher than that for CIMP-high tumors (9.4%, P<0.0001) (Table 5), indicating that methylation index 4–20 was far more common in CIMP-low tumors than in CIMP-high tumors.

Table 5 Overall frequency of low methylation index 4–20 among all positive loci (with methylation index >4) according to the number of methylation-positive loci

We also calculated a weighted average of the frequencies of low-level methylation as ‘∑{(frequency of methylation index 4–10)+(frequency of methylation index 10–20)/2}/(number of cases),’ to put twice more weights on methylation index 4–10 than methylation index 10–20 (Table 5). The weighted average for CIMP-low tumors (28.1%) was much higher than that for CIMP-high tumors (6.8%, P<0.0001), further supporting that low-level methylation was more common in CIMP-low tumors than in CIMP-high tumors.

Low Methylation Index 0–4 is More Common in CIMP-Low Tumors than in CIMP-0 Tumors

Table 6 shows a distribution of the frequencies of methylation index 0–4, according to the number of negative loci. For tumors with a given number of negative loci, an average of the frequencies of methylation index 0–4 was calculated as ‘∑(frequency of methylation index 0–4)/(the number of cases)’. The average reflected how often a tumor with a given number of negative loci showed methylation index 0–4 among all negative loci (methylation index <4). The average for CIMP-low tumors (18.8%) was significantly higher than that for CIMP-0 tumors (13.9%, P<0.0001) (Table 6), indicating that low-level CpG island methylation (methylation index 0–4) was more common in CIMP-low tumors than CIMP-0 tumors.

Table 6 Overall frequency of low methylation index 0–4 among all negative loci (methylation index <4) according to the number of methylation-positive loci

The frequencies of methylation index 0–4 in CIMP-high tumors were quite unevenly distributed, because of fewer (only 1–2) negative loci (methylation index <4) in CIMP-high tumors (Table 6). Therefore, it was difficult to draw conclusion from a comparison of the frequencies of methylation index 0–4 between CIMP-high and CIMP-low.

Discussion

The purpose of this study is to examine density of CpG island methylation in CIMP-low and CIMP-high colorectal cancers, using a large population-based sample (N=889). We analyzed 14 CpG islands including 8 promoters (CACNA1G, CDKN2A (p16), CRABP1, IGF2, MLH1, NEUROG1, RUNX3 and SOCS1), all of which have been shown to be specifically methylated in CIMP-high tumors.6, 10 The other six CpG islands (CHFR, HIC1, IGFBP3, MGMT, MINT31 and WRN) were not as sensitive or specific for CIMP-high. We showed that low-level methylation at the eight CIMP-high-specific loci were more common in CIMP-low tumors than in CIMP-high and CIMP-0 tumors, while there were no consistent differences in methylation levels at the other six non-CIMP-specific loci between CIMP-low, CIMP-high and CIMP-0. Our findings may provide evidence for molecular differences between CIMP-low, CIMP-high and CIMP-0 tumors. CIMP-low tumors tend to methylate CIMP-specific CpG islands at low levels, and some of methylated CpG islands may cross the threshold of gene silencing. We have previously shown that low-level methylation (methylation index<4) is not associated with gene silencing, whereas methylation index>4 is associated with gene silencing.18, 24 In contrast, CIMP-high tumors tend to methylate most CIMP-specific CpG islands at high levels. Our data may provide evidence for a difference in pathogenesis of DNA methylation between CIMP-low and CIMP-high tumors.

Transcriptional inactivation of tumor suppressor genes by promoter CpG island methylation is an important mechanism in human carcinogenesis.2, 34, 35, 36, 37, 38 To measure the degree of DNA methylation, we used quantitative PCR assays (MethyLight),20 which is essential to reproducibly differentiate low-level methylation from high-level methylation.21, 24 Our resource of a large number of samples of colorectal cancer (relatively unbiased samples compared to retrospective or single-hospital-based samples), derived from two large prospective cohorts, has enabled us to precisely estimate the frequency of low-level CpG island methylation in colorectal cancers at a population level. Our data also provide a rationale to use quantitative DNA methylation analysis in research and clinical settings. If we had interpreted low-level methylation (methylation index 0–4) as methylation positive, which might occur in sensitive nonquantitative methylation-specific PCR assays, we might have misdiagnose a considerable number of CIMP-low tumors as CIMP-high tumors. Thus, quantitative assessment of DNA methylation is very important.24, 25, 27, 39, 40

Our findings provide additional supporting evidence for molecular features of CIMP-low tumors different from CIMP-high and CIMP-0 tumors. CIMP-low tumors generally appear to methylate at multiple CIMP-specific CpG islands at low levels, whereas CIMP-0 tumors tend to show more loci with no evidence of methylation and CIMP-high tumors tend to show more loci with high-level methylation. We have previously shown that CIMP-low is associated with male sex and KRAS mutations, while CIMP-high is associated with female sex and BRAF mutations and CIMP-0 is associated with wild-type KRAS/BRAF genes and shows no sex predilection.17 We have recently shown a possible link between CIMP-low, microsatellite instability-low, MGMT methylation and KRAS mutation, supporting unique molecular features of CIMP-low.18 Together with our current data, CIMP-low appears to be a different phenotype from CIMP-high and CIMP-0. Further studies are necessary to identify the best set of markers for the diagnosis of CIMP-low, because the eight markers we used are sensitive and specific for CIMP-high,6, 10 but not intended to separate CIMP-low from CIMP-0.

Pathogenesis of CIMP-high and CIMP-low tumors are still elusive.16 Unlike microsatellite instability in which we have known a role of the mismatch repair system, the etiologies of CIMP as well as high and low-level methylation may not be as straightforward. Considering our current data, we can speculate that CIMP-high tumors may be caused by a fundamental defect in the methyl-group metabolism, resulting in high-level methylation. In contrast, CIMP-low tumors may have either a more subtle defect, or may be a deviation of the CIMP-0 pathway. Further studies are necessary to elucidate pathogenesis of various forms of CIMP in colorectal cancer.

In conclusion, CIMP-low tumors show not only few methylated CIMP-specific CpG islands, but also frequent low-level DNA methylation in individual CpG islands, in contrast to CIMP-high and CIMP-0 tumors. Our data may raise a possibility for a difference in pathogenesis of DNA methylation between CIMP-low and CIMP-high tumors. Further studies are necessary to elucidate the mechanisms of low-level and high-level methylation in CIMP-low and CIMP-high tumors, respectively.