Longitudinal analysis of colon crypt stem cell dynamics in sulindac treated Familial Adenomatous Polyposis patients

Article metrics


The non-steroidal anti-inflammatory drug sulindac decreases size and number of adenomas after 4-6 months of treatment for familial adenomatous polyposis (FAP) patients. However, the underlying mechanism remains unknown. As stem cells are thought to be the tumor precursor cells, visualizing their behavior is crucial for monitoring tumor progression. Increased tag diversity in inactive genes is indicative of a protracted clonal evolution and consequently, increased risk for tumor formation. Therefore, the effect of sulindac on stem cell dynamics was studied. Normal appearing single crypts were laser microdissected in placebo- and sulindac- treated FAP patient tissue after which the methylation patterns were visualized by Next Generation Sequencing. A significant difference in tag diversity over time was found in the sulindac group compared to the placebo group (*p = 0.018), indicative of a shortened clonal evolution treated sulindac. The rate of change in tag diversity over time was correlated with polyp number change over time. No significant difference over time was observed in the percent methylation when comparing placebo vs sulindac. In conclusion, daily sulindac administration in FAP patients significantly altered colorectal stem cell dynamics, which might explain the chemopreventive action of this drug indicating that tag diversity may be used as a predictive biomarker.


In the intestine, differentiated cells are replaced every 5 days while stem cells can escape this fate1. For this reason, stem cells are believed to be the only cells that survive long enough to accumulate enough mutations to initiate tumorigenesis. Among stem cells, individual stem cell lineages compete for survival and replace other stem cell lineages, eventually resulting in a completely renewed stem cell pool. This continuous process in which a crypt is reconstituted from one single stem cell is called niche succession2,3. Since these stem cells are the tumor precursor cells, studying their behavior is important. However, no specific stem cell markers are available for routine diagnostics in human intestinal tissue and the tracing methods used in mouse models cannot be applied to humans. This led various scientists to investigate stem cell dynamics by measurement of the diversity of methylation patterns in the intestinal crypt2,4,5. Since methylation patterns are inherited through cell divisions during which methylation errors occur randomly, methylation patterns in inactive and therefore unregulated genes, can diverge over time. Using this method, Graham et al. found that less unique methylation patterns in a crypt indicated that this crypt had undergone a niche succession more recently compared to a crypt with a higher number of unique methylation patterns6. Baker et al. found a decreased methylation pattern diversity in adenomatous versus normal crypts in patients with familial adenomatous polyposis, indicating an increased stem cell loss/replacement rate in adenomas5. Pre-tumor progression is the phase during which there are no visible changes in the colorectal epithelium, but the cells nevertheless have already acquired early mutations for the initiation of tumor progression. The above methylation assay also allows for studying stem cell dynamics during pre-tumor progression7.

Familial adenomatous polyposis (FAP), a disorder characterized by the development of numerous colorectal adenomatous polyps between the age of 10 and 20 years8, has been used as an important model to study the transit from pre-tumor alterations to tumor progression9. FAP is an autosomal dominant hereditary disorder caused by a germline mutation in the Adenomatous Polyposis Coli (APC) gene. Additional mutations lead to a virtually 100% lifetime risk of colorectal carcinoma (CRC) development at a mean age of 39 years, only prevented by a prophylactic colectomy10. Other treatment strategies such as administrations of non-steroidal anti-inflammatory drugs (NSAIDs), namely sulindac and celecoxib, are effective in reducing both adenoma number and size11,12. In one study, sulindac intake of 300–400 mg per day resulted in a 71% reduction in adenoma formation compared to placebo after 4–6 months, a stronger treatment response than the average adenoma decrease of 23% in patients treated with 800 mg celecoxib a day for 6 months12. The number and size of adenomas increased immediately upon withdrawal of sulindac13,14. However, presently, the underlying mechanism of sulindac chemoprevention remains unclear.

A study on methylation pattern diversity revealed that stem cell niche succession occurs every 8.2 years in colorectal crypts of patients without FAP, whereas in FAP patients, stem cell lineage persists for an average of 32 years15. Our previous work on FAP patients also showed a significant increase in methylation pattern diversity in FAP patients compared to controls16. These results indicate that stem cells play a key role in the clinical course and complications of FAP. As sulindac treatment results in a decreased polyp size and number, we questioned whether sulindac treatment could influence stem cell dynamics, and, thereby, underlie the prevention or delay of polyp growth in FAP patients. Our previous study on stem cell dynamics used Sanger sequencing16 which is time consuming and therefore not suitable in a diagnostic is setting. Meanwhile, Next Generation Sequencing (NGS) has facilitated biomedical research and clinical diagnostics by allowing germline sequencing17,18, identification of novel genetic alterations19, identification of individual targeted therapy options20,21, methylation22,23 and stem cell dynamics22,23,24.

In the current study methylation patterns in FAP patients treated with sulindac were compared to placebo treated patients to study whether chemoprevention with sulindac is associated with changes in stem cell dynamics. Furthermore, methylation patterns were visualized with NGS and Sanger sequencing that was used as a technical control to validate the NGS workflow.


Diversity among stem cell lineages in the intestinal crypt can be investigated by analyzing changes in methylation patterns in the CpG island of the CSX gene. This gene is not expressed in the colorectum and changes in the methylation status are expected not to be regulated, i.e. they occur by chance. During cell division, CpG islands can randomly accumulate methylation errors25. Therefore, with ageing, methylation pattern diversity increases and drifts within and between individual crypts, which can be used to recapitulate crypt histories4.

To study stem cell dynamics, the diversity of unique methylation patterns per crypt and the percent methylation within a crypt were used16. The diversity of methylation patterns is defined as the total number of unique methylation patterns within a single crypt. The diversity reflects the genetic clonal diversity and time since all the stem cells went through a clonal homogenization event; i.e. duration of niche succession. Finding fewer patterns indicates a lower diversity; therefore the crypt is thought to have undergone a niche succession more recently. The percent methylation is defined as the number of methylated CpG sites compared to the total number of CpG sites. It reflects the mitotic age which is the total number of divisions since the zygote stage. Since methylation events accumulate randomly in unregulated genes during cell division, a higher percent methylation is indicative of a cell that has gone through more cell divisions.

Since the amount of methylation events increases with age, we carefully age matched four FAP patients treated with sulindac and four FAP patients treated with placebo (Table 1). All these patients were genotypically affected with familial adenomatous polyposis but phenotypically unaffected so they were quite young at the beginning of the study and no polyps were observed at the start of the study14. The polyp number increased with time in the placebo group and sulindac patients develop few polyps (Table 1, data from Giardiello et al.)14.

Table 1 Patient information.

DNA from sulindac and placebo treated FAP patients at three different time points during treatment (t = 0, t = 4 m and t = 2 y) was laser microdissected from 10 normal appearing crypts for each patient. After sodium bisulfite treatment, PCR and sequencing, methylation patterns were determined. The workflow is shown in Fig. 1. The sequencing can be performed either by NGS or by Sanger sequencing, where with NGS more DNA molecules will be analyzed compared to Sanger sequencing. Furthermore, the Sanger approach contains an extra cloning step, which makes the NGS approach a higher throughput approach and therefore more convenient to study stem cell dynamics.

Figure 1

Workflow of stem cell dynamics assay. The assay is performed on archival paraffin embedded (FFPE) material from normal-looking tissues. Single crypts are laser microdissected, and DNA is isolated. Next, the extracted genomic DNA is treated with sodium bisulphite to convert unmethylated cytosines into uracil; methylated cytosines are protected from this turnover. A nested PCR is performed to target the CSX sequence. The PCR products are extended with Iontorrent PGM sequencing adapters and sequenced following the manufacturers protocol. The various methylation patterns per crypt can then be clustered by adapters. Different numbers of reads are found for each pattern including PCR errors which occurred below 1% count. Meanwhile, as a technical control, the same PCR products are cloned into bacterial vectors and enumerated after a sequence analysis of cloned inserts. 10 clones were selected for sequencing so for each crypt, the total number of counts are 10. CSX, cardiac-specific homeobox. Different colors represent different methylation patterns.

Using NGS, a great number of sequences ranging from 4 to 49360 reads (mean: 13656) were obtained per crypt (Supplementary Table S1). However, the vast majority of methylation patterns only appeared in a minority of the total number of sequencing reads. It was therefore assumed that these patterns were the result of PCR or sequencing errors, and, therefore, did not represent stem cell lineages within the crypts. A suitable threshold (cut-off point) was described for diagnostics for NGS data of FFPE tissue, where variants with an allele frequency lower than 1–5% were omitted from data analysis26. Therefore, we applied a 1% cut-off value for the final data analysis.

The effect of treatment on stem cell dynamics at the different time points was compared between the sulindac and the placebo group. Because the number of methylation patterns shows right skewness, we used mixed effect negative binomial regression that can handle count data and is robust for under- or overdispersion. In the placebo group a significant increase in the mean number of unique methylation patterns was observed over time (2.14 at t = 0, 2.40 at t = 4 m and 3.69 at t = 2 y; 4 m vs 0, p = 0.591; 2 y vs 0, *p = 0.008; 2 y vs 4 m, *p = 0.034) resulting in an increased rate in the mean number of methylation patterns per year of 1.31 (95% CI 1.09–1.57; *p = 0.004) when the trend with time was analyzed continuously. Contrarily, in the sulindac group no change was detected (4.46 at t = 0, 3.24 at t = 4 m and 3.60 at t = 2 y; 4 m vs 0, p = 0.118; 2 y vs 0, p = 0.294; 2 y vs 4 m, p = 0.583) resulting in a rate in the mean number of methylation patterns per year of 0.96. (Table 2; Fig. 2a,c). A test for interaction showed that the trend with time in the number of methylation patterns significantly differed between the placebo and sulindac groups (*p = 0.018; Table 2). The placebo group showed an increasing trend whereas the sulindac group remained constant with time, which is consistent with polyp development (Fig. 3a,c).

Table 2 Multilevel mixed-effect regression analysis of differences in the number of methylation patterns and percent methylation for sulindac and placebo treated FAP patients with time using NGS.
Figure 2

Stem cell dynamics analysis using NGS. (a,b) Methylation patterns and percent methylation of placebo group (red diamond) including time point 0, 4 months and 2 years; (c,d) Methylation patterns and percent methylation of sulindac group (black diamond) including time point 0, 4 months and 2 years; The trend in time per group (both placebo and sulindac) was indicated as p-values where other statistical values are mentioned in Table 2. (e,f) Methylation patterns and percent methylation for all four patients in placebo group; (g,h) Methylation patterns and percent methylation for all four patients in sulindac group. Each block represents one patient at indicated time point.

Figure 3

Multilevel mixed-effect regression analysis showing the interaction of treatment and time with methylation patterns and percent methylation together with polyp numbers. (a,b and c) Results are shown for methylation patterns (a), percent methylation (b) and number of polyps (c); the solid lines are the results from linear trend analyses; 95% confidence intervals are shown by error bars and the shadows surrounding the trend lines. (df) The relation between change per year in number of patterns (d) and percent methylation (e) with number of polyps, and each other (f) on an individual patient basis; lines show linear regression lines through data-points. Red: Placebo; black: Sulindac.

Percent methylation was normally distributed and for that outcome we used mixed effect linear regression models to investigate the effect of treatment and time. The percent methylation in the placebo group remained constant over time (22.3% at t = 0, 18.4% at t = 4 m and 21.3% at t = 2 y; 4 m vs 0, p = 0.366; 2 y vs 0, p = 0.820; 2 y vs 4 m, p = 0.500), with a mean change in percent methylation per year of 0.2% (95% CI −3.7 to 4.2%; p = 0.909). For the sulindac group the percent methylation decreased from 25.6% at start of the study to 19.7% after 4 months and 15.3% after 2 years 4 m vs 0, p = 0.192; 2 y vs 0, p = 0.023; 2 y vs 4 m, p = 0.306; Table 2; Fig. 2b,d) with a mean change per year of −4.2% (95% CI −2.8 to −0.1; *p = 0.043). A test for interaction showed that the trend with time in percent methylation was not significantly different between both groups (p = 0.125; Table 2). The placebo group remained constant whereas the sulindac group slightly decreased trend with time (Fig. 3b).

Although individual results per patient for the sulindac and placebo groups showed no age related differences for the individual patients (Fig. 2e–h), there is a slight baseline difference in age between the placebo and the sulindac group (11.5 vs 9.75 years) which cannot be explained by above mentioned results (Table 2 and Fig. 3a). Therefore, we analyzed the data for treatment and time in relation to methylation patterns and percent methylation which remained unaffected after adjustment for age in the mixed-effects models (Supplementary Table S2).

We also estimated per patient the rate of change in mean number of methylation patterns, the change in mean percent methylation and the change in number of polyps over time, and then related these individual estimates of stem cell dynamics with polyp development and with each other (Fig. 3d–f). These results show a positive correlation between the three phenomena, indicating that patients in whom the number of methylation patterns (Fig. 3d) or the percent methylation (Fig. 3e) increases more in time also develop more polyps, and that in patients in whom the number of methylation patterns increase more in time also show more increase in percent methylation in time (Fig. 3f), and vice versa.

As sequencing depth might influence the results, the data were re-analyzed using only samples with ≥1000x sequencing. This sensitivity analysis, where 194 of the 232 crypts remained for the analysis, also showed similar results (Supplementary Table S3). As a technical validation, the same DNA samples were also analyzed using sanger sequencing (where 10 clones were sequenced per crypt), resulting in a concordance rate of at least 96% (Supplementary information; Supplementary Table S4 and Supplementary Fig. S1).


Sulindac treatment effectively induced colorectal polyp regression in FAP patients after 3–6 months, although it did not prevent the development of new polyps in pre-symptomatic FAP patients14. This short-lived effect of sulindac on polyp regression might be due to increased prostaglandin levels14, retained nuclear accumulation of β-catenin and reduced epithelial COX-2 expression27,28 or the occurrence of mutations such as in KRAS 12,29. Here we studied whether the short beneficial effect of sulindac treatment is associated with changes in stem cell behavior. Stem cells can acquire somatic mutations, which are either silent or genetically advantageous. These mutations are passed on to daughter cells and may provide a selection force to a stem cell lineage, leading to a stronger dominance and a subsequently higher longevity of the stem cell lineage30. If a stem cell lineage persists longer, the time to visible tumor progression decreases, because neoplastic mutations can accumulate in the stem cells2,31. Also, in the case of FAP, a heterozygous mutation in APC may enhance the probability of a symmetrical stem cell division, leading to a larger stem cell pool32. As mentioned, niche succession in FAP takes considerably longer than in the normal colon and consequently the risk of an increased mutational load is higher. This has also been suggested by Kim et al., who found an increased stem cell survival in FAP patients compared to controls15. An increased number of stem cells in the colorectal crypt is associated with an expansion of the proliferative compartment towards the epithelial surface, which further enhances the risk of relevant carcinogenic alterations33.

The first study on the role of methylation patterns in SC dynamics used Sanger sequencing, the “gold standard” sequencing technique. Despite its accuracy, this method is labor intensive, relatively slow, and has a low throughput34. Furthermore, to identify methylation patterns for individual stem cells, a cloning approach is necessary, making the assay particularly laborious. Recently, NGS has emerged as a high throughput technique that provides reliable sequence data even from sub-optimal starting material like formalin fixed paraffin embedded (FFPE) tissues35. A clonal amplification of each sequencing read is performed as an intrinsic component of the NGS process such that additional cloning steps to identify individual stem cells are bypassed. We detected 96% similarity in methylation events comparing NGS to Sanger sequencing. As expected, due to the higher sensitivity of NGS, more sequences and thereby more methylation patterns were found using NGS analysis. Although errors occur in NGS, a methylation pattern frequency cutoff of 1% adjusted for this. Hence, NGS can not only substitute for Sanger sequencing in studying stem cell dynamics, but also provide additional data.

When comparing the unique number of methylation patterns in the placebo versus the sulindac-treated FAP patients a significant increase is visible after 2 years of treatment in the placebo group. In contrast, a slight decrease was found in the sulindac group, especially after 4 months of treatment. Four months coincides with the time when polyp number is still repressed and resistance only starts to develop after 6 months13. There is a difference between the sulindac and placebo group for the number of methylation patterns at baseline. This occurred by chance and is unfortunate, but the difference in trend between the two groups is striking and is consistent with diminished stem cell lineage longevity due to sulindac (Table 2). It decreases the chance of accumulated mutations and corresponds with the lower number of polyps in the treatment group14,36,37.

Also a significant difference was found in sulindac treated patients with regard to mitotic age, represented by the percent methylation25. A higher mitotic age corresponds to a higher number of cell divisions that a stem cell has undergone and is associated with an increased risk of accumulating mutations. We found a slight decrease in the percent methylation between the start of the study and after 4 months of sulindac treatment, and a significant decrease in percent methylation between the start of the study and 2 years of sulindac administration, whereas the placebo group remained at a similar level. The lower percent methylation after 4 months of sulindac treatment corresponds with the lower polyp count observed in previous studies12,13,14. The effect, however, seems temporary14, and apparently other pathways can be used to circumvent the growth inhibition27,29,38. In the sulindac group, a borderline decrease was observed per year. However, there was no significant difference after treatment over time which might be a reason why this effect is temporary.

In the current study archival FFPE tissue was used which results in crosslinking of the DNA and artefacts like deamination of cytosines39. Although the deamination might influence the methylation patterns described in this study, it will affect both study groups equally. It will result in changes from a cytosine to uracil, thereby underestimating the amount of methylation. When comparing our numbers to studies that used fresh frozen tissue this seems indeed the case6,15,40. The sulindac and placebo treated patients were age matched in our study and a standardized biopsy site was used, thereby minimizing any bias. This precluded incorporation of a control group of normal subjects in our study. Location in the colorectum and also age are determinants of methylation and therefore confounders in a study like this. We also realize that the numbers are small and that conclusions need to be drawn with caution. However, it is felt that our study suggests that stem cell behavior may indeed be influenced by sulindac.

In summary, we provide evidence that NGS can be applied accurately and efficiently in studying stem cell dynamics. Furthermore, daily sulindac administration significantly altered colorectal stem cell dynamics over time, consistent with decreased stem cell lineage longevity. This effect may contribute to the observed inhibition of polyp numbers in sulindac treated FAP patients.

Material and Methods


Biopsy materials from eight FAP patients, ranging from the age of 7 to 15 years (Table 1), were selected from the study population of Giardiello et al.14. All materials are archival tissues of anonymized patients from Johns Hopkins Polyposis Registry that were used. Written informed consent was obtained from all subjects or their parents, and consent was obtained from subjects under 18 years of age. The protocol was approved by the Johns Hopkins Joint Committee on Clinical Investigation (the institutional review board). All patients had no detectable polyps at the start of the study and received 75 mg (body weight between 20 and 44 kg) or 150 mg (body weight more than 44 kg) sulindac or placebo orally twice a day. Before sulindac or placebo administration, colorectal biopsies were taken from the normal appearing mucosa (t = 0), and also after 4 months (t = 4 m) and 2 years (t = 2 y) after the start of treatment. The biopsy sites were standardized and taken from the rectum-flat mucosa. Biopsies were formalin-fixed and embedded in paraffin. The research was carried out in accordance with the ethical guidelines of the research review committee of our institution.

DNA isolation from single crypt

Paraffin embedded tissue was cut into 10 μm P.A.L.M. slides which were especially for laser micro-dissection microscopy. The slides were deparaffinized and counterstained with haematoxylin. 10 unique, normal appearing longitudinal crypts were isolated using a P.A.L.M. (Zeiss) system, and incubated with 20 μL PicoPure extraction solution (PicoPure DNA Extraction Kit, MDS Analytical Technologies) according to the manufacturer’s instructions.

Sodium bisulphite conversion and nested PCR

DNA was treated with sodium bisulphite (EpiTect Bisulphite Kit, Qiagen) according to the manufacturer’s instructions. Sodium bisulphite converted DNA was used in a nested PCR to amplify a CpG island containing 8 CpG sites in the CSX gene. This gene is not expressed in the colon. The first PCR reaction mix had a total volume of 24 µL including 12 µL KAPA Hifi Uracil+ Ready mix (Kapa biosystems), 2 µL of each primer at 2 µM (For 5′-GGGGAGAAGGGGTTTTTAATAT-3′ and Rev 5′-AAAAACACTCCTAAAAAAACAACTAA-3′), and 8 µL of template DNA. PCR1 program consisted of an initial denaturation at 95 °C for 2 minutes, followed by 40 cycles of denaturation at 98 °C for 20 seconds, primer annealing at 61 °C for 15 seconds, extension at 72 °C for 30 seconds, and a final extension step at 72 °C for 1 minute. The second PCR reaction mix had a total volume of 20 µL comprising of 4 µL KapaHifi Buffer (5X), 0.4 µL MgCl2 (25 mM), 0.6 µL dNTPs (10 mM), 0.5 µL of each respective primer at 10 mM (For 5′-GTAAAACGACGGCCAGGGAGATTTAGGAATTTTTTTTGTTTT-3′ and Rev 5′-CAGGAAACAGCTATGACACACCAAACTACAAAATCACTCATTA-3′), 0.2 µL KAPA Hifi Hotstart (1 U/µL), and 1 µl of DNA template taken from PCR 1 (diluted 1:100). PCR2 program consisted of an initial denaturation at 98 °C for 45 seconds, followed by 34 cycles of denaturation at 98 °C for 20 seconds, primer annealing at 58 °C for 15 seconds, extension at 72 °C for 30 seconds, and a final extension step at 72 °C for 1 minute.

Next Generation Sequencing – Ion torrent

The other half of the volume of the nested PCR products was used for NGS. To sequence these amplified segments of the CSX gene specialized sequencing adapters need to be attached to the amplicon, which was accomplished by a fusion PCR. Furthermore, barcodes were attached to each individual sample so that they can be identified after sequencing (see Supplementary Table S5 for primer sequences). The fusion PCR was carried out in a total volume of 20 µL comprising of 4 µL KAPA Hifi Buffer (5X), 0.4 µL MgCl2 (25 mM), 0.6 µL dNTPs (10 mM), 5 µL of each respective primer at 10 µM, 0.2 µL KAPA Hifi Hotstart (1 U/µL), and 1 µl of DNA template taken from PCR 2 (diluted 1:100). The fusion PCR reaction program: initial denaturation at 98 °C for 45 seconds, followed by 34 cycles of denaturation at 98 °C for 20 seconds, primer annealing at 58 °C for 15 seconds, extension at 72 °C for 30 seconds, and a final extension step at 72 °C for 1 minute. Next, the DNA was purified using the Agencourt® Ampure® XP kit according to the manufacturer’s instructions. All the samples were pooled at a final concentration of 0.8 pg/µL and sequenced on the Ion Torrent PGM (Thermo Fisher) using the Ion PGM™ 200 bp kit and the Ion 318™ Chip v2.

Sanger sequencing

Half of the volume of the nested PCR products was purified with the Illustra GFX PCR DNA and Gel Band Purification Kit (GE Healthcare) according to the manufacturer’s instructions. PCR products were eluted in 30 μL Elution Buffer and incubated for 10 min at 72 °C with NEB DNA polymerase to create an A-overhang. This solution contained 0.2 mM dNTPs, 2.0 mM MgCl2, 0.75 U NEB DNA polymerase (New England Biolabs), with an end concentration of 1x NEB DNA polymerase PCR buffer (New England Biolabs). PCR products were then cloned using the pGEM-T Easy Vector System (Promega) into competent E. coli Top10 cells. Successfully cloned inserts were amplified in a colony PCR. PCR products were incubated with the ABI Big Dye Terminator Mix (Applied Biosystems) and run on an ABI 3730 genetic analyzer (Applied Biosystems) for sequencing. Sequences were analyzed with DNASTAR lasergene 12.2, and sequences with incomplete bisulphite conversions were excluded from analysis.

Data analysis

For each patient, 10 normal appearing crypts were isolated and for each crypt, 10 sequences were used for Sanger analysis, each containing methylation information of 8 CpG sites. IonTorrent reads were assigned to samples by custom scripts reading the internal barcoding tags. Since the reads were from a single amplicon, they were aligned using BWA41 using default parameters to an in-silico converted reference sequence resembling a fully methylated, bisulphite converted region. Only reads were kept if they fully covered the amplicon. CpG positions (2, 4, 6, 18, 22, 37, 40 and 51) and control positions (14, 20, 14) were genotyped within the context of a single read with a custom walker for GATK42. A custom per script was used to collect all reads with a perfect conversion at the control sites and to score all the various combinations and percent methylation at the CpG sites. Scripts are available upon request. The number of unique methylation patterns was determined for each crypt, and the percent methylation was determined by comparing the number of methylated CpG sites to the total number of CpG sites per crypt6,16.

We used multilevel mixed-effects regression models to study the relation between time and treatment with stem cell dynamics, using the individual crypt derived data-points as the unit of analysis while taking within-patient clustering into account with random intercepts. For the number of methylation patterns – i.e. count data – we used mixed-effects negative binomial regression, with which we estimated the mean number of patterns per treatment group per time-point if time was analyzed categorically, and the rate of change in mean number of patterns per year for each treatment group if time was analyzed continuously (i.e. as a trend). For the percent methylation – i.e. a normally distributed continuous outcome – we used mixed-effects linear regression, with which we estimated the mean percent methylation per treatment group per time-point if time was analyzed categorically, and the mean change in percent methylation per year for each treatment group if time was analyzed continuously (i.e. as a trend). All these models included treatment and time as main effects as well as their interaction. Model assumptions were checked and were not violated. All estimates are reported with their corresponding 95% CIs. To assess the relation between changes in percent methylation, methylation patterns, and polyp development with time per patient, we estimated per patient the mean change in percent methylation by linear regression, the rate of change in methylation patterns by negative binomial regression, and the change in number of polyps by linear regression over time (all linearly), and then related these individual estimates of methylation and polyp development dynamics with each other by scatterplots, linear regression, and estimating Pearson correlation coefficients.

R version 3.2.1 was used for statistical analysis (especially using lme4) and figures were made in GraphPad Prism 6. Statistics method was indicated in the legends and applied for analysis and statistical significance was defined as *p < 0.05. The means ± SEM were displayed in the figures where applicable.

Data Availability

The data concerning the number of polyps are derived from the previous paper by Giardiello, F. M. et al.14. The remaining data generated during this study are included in this article (and its supplementary information files). The original NGS data is included as Supplementary Table S6 and BAM files are available upon request.


  1. 1.

    Wright, N. A. & Irwin, M. The kinetics of villus cell populations in the mouse small intestine. I. Normal villi: the steady state requirement. Cell Tissue Kinet 15, 595–609 (1982).

  2. 2.

    Kim, K. M. & Shibata, D. Methylation reveals a niche: stem cell succession in human colon crypts. Oncogene 21, 5441–5449, https://doi.org/10.1038/sj.onc.1205604 (2002).

  3. 3.

    Ro, S. & Rannala, B. Methylation patterns and mathematical models reveal dynamics of stem cell turnover in the human colon. Proc Natl Acad Sci USA 98, 10519–10521, https://doi.org/10.1073/pnas.201405498 (2001).

  4. 4.

    Yatabe, Y., Tavare, S. & Shibata, D. Investigating stem cells in human colon by using methylation patterns. Proc Natl Acad Sci USA 98, 10839–10844, https://doi.org/10.1073/pnas.191225998 (2001).

  5. 5.

    Baker, A. M. et al. Quantification of crypt and stem cell evolution in the normal and neoplastic human colon. Cell Rep 8, 940–947, https://doi.org/10.1016/j.celrep.2014.07.019 (2014).

  6. 6.

    Graham, T. A. et al. Use of methylation patterns to determine expansion of stem cell clones in human colon tissue. Gastroenterology 140(1241–1250), e1241–1249, https://doi.org/10.1053/j.gastro.2010.12.036 (2011).

  7. 7.

    Calabrese, P., Tavare, S. & Shibata, D. Pretumor progression: clonal evolution of human stem cell populations. Am J Pathol 164, 1337–1346, https://doi.org/10.1016/S0002-9440(10)63220-8 (2004).

  8. 8.

    Bussey, H. Familial polyposis coli: family studies, histopathology, differential diagnosis, and results of treatment. (Johns Hopkins University Press Baltimore, 1975).

  9. 9.

    Fodde, R. & Smits, R. Disease model: familial adenomatous polyposis. Trends Mol Med 7, 369–373 (2001).

  10. 10.

    Cruz-Correa, M. & Giardiello, F. M. Diagnosis and management of hereditary colon cancer. Gastroenterol Clin North Am 31, 537–549, x (2002).

  11. 11.

    Steinbach, G. et al. The effect of celecoxib, a cyclooxygenase-2 inhibitor, in familial adenomatous polyposis. N Engl J Med 342, 1946–1952, https://doi.org/10.1056/NEJM200006293422603 (2000).

  12. 12.

    Keller, J. J. & Giardiello, F. M. Chemoprevention strategies using NSAIDs and COX-2 inhibitors. Cancer Biol Ther 2, S140–149 (2003).

  13. 13.

    Giardiello, F. M. et al. Treatment of colonic and rectal adenomas with sulindac in familial adenomatous polyposis. N Engl J Med 328, 1313–1316, https://doi.org/10.1056/NEJM199305063281805 (1993).

  14. 14.

    Giardiello, F. M. et al. Primary chemoprevention of familial adenomatous polyposis with sulindac. N Engl J Med 346, 1054–1059, https://doi.org/10.1056/NEJMoa012015 (2002).

  15. 15.

    Kim, K. M., Calabrese, P., Tavare, S. & Shibata, D. Enhanced stem cell survival in familial adenomatous polyposis. Am J Pathol 164, 1369–1377, https://doi.org/10.1016/S0002-9440(10)63223-3 (2004).

  16. 16.

    Langeveld, D. et al. Aberrant intestinal stem cell lineage dynamics in Peutz-Jeghers syndrome and familial adenomatous polyposis consistent with protracted clonal evolution in the crypt. Gut 61, 839–846, https://doi.org/10.1136/gutjnl-2011-300622 (2012).

  17. 17.

    Yamaguchi, K. et al. Detection of APC mosaicism by next-generation sequencing in an FAP patient. J Hum Genet, doi:https://doi.org/10.1038/jhg.2015.14 (2015).

  18. 18.

    Pritchard, C. C. et al. ColoSeq provides comprehensive lynch and polyposis syndrome mutational analysis using massively parallel sequencing. J Mol Diagn 14, 357–366, https://doi.org/10.1016/j.jmoldx.2012.03.002 (2012).

  19. 19.

    Meyerson, M., Gabriel, S. & Getz, G. Advances in understanding cancer genomes through second-generation sequencing. Nat Rev Genet 11, 685–696, https://doi.org/10.1038/nrg2841 (2010).

  20. 20.

    Desai, A. N. & Jere, A. Next-generation sequencing: ready for the clinics? Clin Genet 81, 503–510, https://doi.org/10.1111/j.1399-0004.2012.01865.x (2012).

  21. 21.

    Uhlig, H. H. et al. The diagnostic approach to monogenic very early onset inflammatory bowel disease. Gastroenterology 147, 990–1007 e1003, https://doi.org/10.1053/j.gastro.2014.07.023 (2014).

  22. 22.

    Sottoriva, A., Spiteri, I., Shibata, D., Curtis, C. & Tavare, S. Single-molecule genomic data delineate patient-specific tumor profiles and cancer stem cell organization. Cancer Res 73, 41–49, https://doi.org/10.1158/0008-5472.CAN-12-2273 (2013).

  23. 23.

    Lister, R. et al. Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell 133, 523–536, https://doi.org/10.1016/j.cell.2008.03.029 (2008).

  24. 24.

    Korshunova, Y. et al. Massively parallel bisulphite pyrosequencing reveals the molecular complexity of breast cancer-associated cytosine-methylation patterns obtained from tissue and serum DNA. Genome Res 18, 19–29, https://doi.org/10.1101/gr.6883307 (2008).

  25. 25.

    Chu, M. W. et al. Lack of increases in methylation at three CpG-rich genomic loci in non-mitotic adult tissues during aging. BMC Med Genet 8, 50, https://doi.org/10.1186/1471-2350-8-50 (2007).

  26. 26.

    Fox, E. J., Reid-Bayliss, K. S., Emond, M. J. & Loeb, L. A. Accuracy of Next Generation Sequencing Platforms. Next Gener Seq Appl 1, doi:https://doi.org/10.4172/jngsa.1000106 (2014).

  27. 27.

    Boon, E. M. et al. Sulindac targets nuclear beta-catenin accumulation and Wnt signalling in adenomas of patients with familial adenomatous polyposis and in human colorectal cancer cell lines. Br J Cancer 90, 224–229, https://doi.org/10.1038/sj.bjc.6601505 (2004).

  28. 28.

    Koornstra, J. J. et al. Sulindac inhibits beta-catenin expression in normal-appearing colon of hereditary nonpolyposis colorectal cancer and familial adenomatous polyposis patients. Cancer Epidemiol Biomarkers Prev 14, 1608–1612, https://doi.org/10.1158/1055-9965.EPI-05-0112 (2005).

  29. 29.

    Keller, J. J. et al. Molecular analysis of sulindac-resistant adenomas in familial adenomatous polyposis. Clin Cancer Res 7, 4000–4007 (2001).

  30. 30.

    Vermeulen, L. & Snippert, H. J. Stem cell dynamics in homeostasis and cancer of the intestine. Nat Rev Cancer 14, 468–480, https://doi.org/10.1038/nrc3744 (2014).

  31. 31.

    Shibata, D. Inferring human stem cell behaviour from epigenetic drift. J Pathol 217, 199–205, https://doi.org/10.1002/path.2461 (2009).

  32. 32.

    Sasikumar, R., Rejitha, J. R., Binumon, P. K. & Manoj, M. Role of heterozygous APC mutation in niche succession and initiation of colorectal cancer–a computational study. PLoS One 6, e22720, https://doi.org/10.1371/journal.pone.0022720 (2011).

  33. 33.

    Boman, B. M., Fields, J. Z., Cavanaugh, K. L., Guetter, A. & Runquist, O. A. How dysregulated colonic crypt dynamics cause stem cell overpopulation and initiate colon cancer. Cancer Res 68, 3304–3313, https://doi.org/10.1158/0008-5472.CAN-07-2061 (2008).

  34. 34.

    Schuster, S. C. Next-generation sequencing transforms today’s biology. Nat Methods 5, 16–18, https://doi.org/10.1038/nmeth1156 (2008).

  35. 35.

    de Leng, W. W. et al. Targeted Next Generation Sequencing as a Reliable Diagnostic Assay for the Detection of Somatic Mutations in Tumours Using Minimal DNA Amounts from Formalin Fixed Paraffin Embedded Material. PLoS One 11, e0149405, https://doi.org/10.1371/journal.pone.0149405 (2016).

  36. 36.

    Mahmoud, N. N. et al. Genotype-phenotype correlation in murine Apc mutation: differences in enterocyte migration and response to sulindac. Cancer Res 59, 353–359 (1999).

  37. 37.

    Nagase, H. et al. Correlation between the location of germ-line mutations in the APC gene and the number of colorectal polyps in familial adenomatous polyposis patients. Cancer Res 52, 4055–4057 (1992).

  38. 38.

    Rice, P. L. et al. Sulindac metabolites induce caspase- and proteasome-dependent degradation of beta-catenin protein in human colon cancer cells. Mol Cancer Ther 2, 885–892 (2003).

  39. 39.

    Srinivasan, M., Sedmak, D. & Jewell, S. Effect of fixatives and tissue processing on the content and integrity of nucleic acids. Am J Pathol 161, 1961–1971, https://doi.org/10.1016/S0002-9440(10)64472-0 (2002).

  40. 40.

    Trumpi, K. et al. Mice lacking functional CD95-ligand display reduced proliferation of the intestinal epithelium without gross homeostatic alterations. Med Mol Morphol 49, 110–118, https://doi.org/10.1007/s00795-015-0129-9 (2016).

  41. 41.

    Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760, https://doi.org/10.1093/bioinformatics/btp324 (2009).

  42. 42.

    McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20, 1297–1303, https://doi.org/10.1101/gr.107524.110 (2010).

Download references


The authors thank Eric Strengman for help with the Next Generation Sequencing. We furthermore thank Danielle Langeveld, Jinyi Su, Rieneke van Laar, Laszlo Groh and Hemse Al-Khamisi for help with obtaining the methylation patterns.

Author information

H.M., L.A.A.B., G.J.A.O. and W.W.J.d.L. drafted the concept and design of the study; H.M., F.H.M., I.J.N., L.M.H. and E.A.M. were involved in the acquisition of data; H.M., L.A.A.B., S.G.E., G.J.A.O. and W.W.J.d.L. contributed to the analysis and interpretation of data; S.G.E. did the statistical analysis of the study. H.M., L.A.A.B., G.J.A.O. and W.W.J.d.L. were involved in the drafting of the manuscript; G.J.A.O., F.M.G. and W.W.J.d.L. contributed to the critical revision of the manuscript for important intellectual content; G.J.A.O., F.M.G. and W.W.J.d.L. were involved in obtaining funding.

Correspondence to Wendy W. J. de Leng.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary information

Supplementary Table S6

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark


By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.