Whole genome DNA methylation sequencing of the chicken retina, cornea and brain

Lee, Isac; Rasoul, Bejan A.; Holub, Ashton S.; Lejeune, Alannah; Enke, Raymond A.; Timp, Winston

doi:10.1038/sdata.2017.148

Download PDF

Data Descriptor
Open access
Published: 10 October 2017

Whole genome DNA methylation sequencing of the chicken retina, cornea and brain

Isac Lee¹,
Bejan A. Rasoul²,
Ashton S. Holub²,
Alannah Lejeune¹,
Raymond A. Enke^2,3 &
…
Winston Timp ORCID: orcid.org/0000-0003-2083-6027¹

Scientific Data volume 4, Article number: 170148 (2017) Cite this article

4769 Accesses
18 Citations
8 Altmetric
Metrics details

Subjects

Abstract

Whole genome bisulfite sequencing (WGBS) analysis of DNA methylation uses massively parallel next generation sequencing technology to characterize global epigenetic patterns and fluctuations throughout a range of tissue samples. Development of the vertebrate retina is thought to involve extensive epigenetic reprogramming during embryogenesis. The chicken embryo (Gallus gallus) is a classic model system for studying developmental biology and retinogenesis, however, there are currently no publicly available data sets describing the developing chicken retinal methylome. Here we used Illumina WGBS analysis to characterize genome-wide patterns of DNA methylation in the developing chicken retina as well as cornea and brain in an effort to further our understanding of retina-specific epigenetic regulation. These data will be valuable to the vision research community for correlating global changes in DNA methylation to differential gene expression between ocular and neural tissues during critical developmental time points of retinogenesis in the chicken retina.

Design Type(s)	parallel group design • organism part comparison design
Measurement Type(s)	DNA methylation profiling assay
Technology Type(s)	DNA sequencing
Factor Type(s)	animal body part • life cycle stage
Sample Characteristic(s)	Gallus gallus • brain • cornea • retina

Machine-accessible metadata file describing the reported data (ISA-Tab format)

QTL mapping of human retina DNA methylation identifies 87 gene-epigenome interactions in age-related macular degeneration

Article Open access 04 March 2024

Extensive DNA methylome rearrangement during early lamprey embryogenesis

Article Open access 04 March 2024

Mapping the chromatin accessibility landscape of zebrafish embryogenesis at single-cell resolution by SPATAC-seq

Article 08 July 2024

Background & Summary

Advances in next generation sequencing (NGS) technology have substantially increased the number of species with completed high quality genome assemblies. These advances have also opened new doors to studying the functionality of complex genomes¹. The epigenetics community has benefited greatly from NGS experiments mapping genome-wide profiles of specific histone tail and genomic DNA modifications. Consortium projects such as the ENCODE Project and the Roadmap Epigenomics Project have deepened our understanding of numerous specialized human cell and tissue types and have paved the way for similar experiments in model systems of human development and disease^2,3.

The experiment described here is part of a larger ongoing project within the James Madison University’s Center for Genome and Metagenome Studies (CGEMS) investigating transcriptional regulation in the developing vertebrate retina. The retina, a stratified layer of sensory neurons that lines the posterior portion of the eye, contains rod and cone photoreceptors that absorb focused light photons and convert their energy into electrochemical signals transmitted to the brain and processed into what we perceive as the visual world. Within the developing retina, complex transcriptional networks regulate proper differentiation of specialized subclasses of retinal neurons⁴. Characterizing epigenetic regulation of these transcriptional networks is critical for further understanding molecular mechanisms driving retinal development as well as for crafting novel strategies to combat blinding human diseases that affect the retina.

The chicken (Gallus gallus) embryo is a reliable and practical model system for studying vertebrate retinogenesis⁵. Chicken embryo development is rapid, completing its entire program from blastula to hatchling in 21 days⁶. Recent genomic efforts to improve the quality of the chicken genome assembly combined with newly developed molecular tools for genetic manipulation of this model system have contributed to a renaissance of using the chicken embryo as a robust model to study retinal development⁵. During chick development, the immature E8 retina is composed of multipotent precursor cells, which begin to terminally differentiate into specialized retinal neurons in subsequent days of development. By E18, the retina is nearly mature with photoreceptor (PR), bipolar, amacrine, horizontal, and ganglion cell neurons as well as Muller glial cells having differentiated from these multipotent precursors (Fig. 1)⁵. Progenitor cells yet to exit the cell cycle, as well as each of these specialized retinal cell types are known to express developmental and cell type-specific genes⁷. RNA-sequencing transcriptome data have recently become available to dissect global changes in gene expression during chick retinal development^8,9. Currently, however, there are no publicly available data sets characterizing epigenetic modifications of the genome in the developing chicken retina. Epigenome studies will complement transcriptome data and contribute to a deeper understanding of vertebrate retinal development.

**Figure 1: Overview of the chicken embryo, eye, and retinal development.**

The focus of this project is to characterize global patterns of DNA methylation, an epigenetic modifier of vertebrate genomes, in the developing chicken retina as well as in non-retinal tissues using whole genome bisulfite sequencing (WGBS) NGS technology. These experiments will be critical for downstream characterization of developmental and cell-specific epigenetic regulatory mechanisms during retinal development. The developmental points chosen for analysis were E8 (Fig. 1a–c) and E18 (Fig. 1d–f), which provides epigenetic information for early and late retinal development respectively. E18 whole cornea (Fig. 1e) and brain (not shown) were also included in this analysis as non-retinal ocular and non-retinal neuronal reference tissues respectively. These analyses were conducted using Illumina WGBS in tandem with a standard bioinformatics pipeline to ensure quality of the raw and trimmed sequencing data (Fig. 2) as well as a customized bioinformatics pipeline for robust eukaryotic DNA methylome analysis (Fig. 3).

**Figure 2: Quality Assessment of FASTQ sequencing data for 125 bp paired end reads.**

**Figure 3: Experimental overview and assessment of read mapping, read length and sample variance.**

Methods

Embryos

All embryo experiments were conducted with the approval of the James Madison University Institutional Animal Care and Use Committee and in accordance with the National Institutes of Health guide for the care and use of laboratory animals. Fertilized pathogen free commercial Cobb/Hubbard F1 hybrid eggs were obtained from George’s Hatchery (Harrisonburg, VA) and incubated in a rocking chamber held at 38 °C and 50–60% humidity until specified incubation days.

Tissue processing, histology & imaging:

Chicken embryos were harvested and euthanized at specified days incubated as previously described⁹. Briefly, embryos were decapitated and eyes were enucleated. Whole embryos and whole eyes were imaged prior to eyecups preparation. Isolated corneas and whole brain extracts were saved for subsequent DNA extraction. Histological preparation of eyecups was conducted as previously described⁹. Briefly, eyecups were fixed in 4% paraformaldehyde in 1× PBS for 25 min on ice, equilibrated in 25% sucrose in 1× PBS, and transferred into a 2:1 mixture of 25% sucrose:OCT compound (Electron Microscopy Sciences) on ice for 30 min prior to flash freezing in the same solution in Tissue-Tek Cryomold (Sakura Finetek) and stored in −80 °C. 10 μm thick frozen serial sections were prepared using a CM3050 S Research Cryostat (Leica) with the object and chamber temperatures set to −22 °C and −28 °C respectively. Frozen sections were thawed, H&E stained, and imaged using an EclipseTE2000 inverted microscope (Nikon) and processed with NIS Elements software (Nikon). For retinal dissection, eyecups were incubated for 20 min in HBSS modified media without calcium or magnesium (HBSS -Ca,-Mg;HyClone) at 37 °C to dissociate the retinal pigment epithelium (RPE) layer from the outermost layer of the retina. Retinas were then isolated by tearing away the sclera and gently peeling away the RPE layer. Isolated retinas and corneas were briefly rinsed in cold HBSS -Ca, -Mg. Retinas were immediately transferred to RLT+ lysis buffer (Qiagen; AllPrep kit) containing 2-Mercaptoethanol (Sigma) and vortexed vigorously to dissociate the tissue. Corneas and brain were separately flash frozen and ground into a fine powder using a mortar and pestle prior to being transferred to RLT+/BME lysis buffer solution and vortexed. Samples were stored long term in lysis buffer at −80 °C.

Genomic DNA isolation

Genomic DNA was collected from twelve embryonic chicken tissues (Table 1). Whole retinas were harvested from E8 (Fig. 1b) and E18 (Fig. 1h) developing chicken embryos as well as whole corneas from E18 embryos (Fig. 1e) and whole brain collected from E18 embryos (not shown). Triplicate samples were obtained for each time point and genomic DNA was extracted from samples using a Qiagen AllPrep Mini Kit per the manufacturer’s instructions. Isolated DNAs were eluted in TE buffer, validated for quality and quantity using UV spectrophotometry, and stored long term at −80 °C. DNAs with an OD260/280 ratio between 1.75 and 1.85 were deemed high quality.

Table 1 BS-seq profiling to evaluate developmental and tissue-specific retinal DNA methylation.

Full size table

DNA preparation, bisulfite conversion, and sequencing

Genomic DNA samples were sheared to 200–300 bp fragments using Bioruptor Pico (Diagenode), with 9 cycles of 30 s on and 30 s off. With these samples, sequencing libraries were prepared using NEBNext Ultra II library preparation kit (New England BioLabs) with bisulfite conversion using EZ DNA Methylation-Lightning kit (Zymo) before PCR amplification of adaptor-tagged libraries. Library fragments were assessed using an Agilent Bioanalyzer to plot distribution of DNA fragment peaks. High quality libraries, having a distribution of DNA fragments centered around 300 bps were used for sequencing analysis using the Illumina HiSeq 2,500 sequencing platform yielding 32.8–60.2 million 125 bp paired end sequence reads per sample (Fig. 3b).

Quality validation and read mapping:

Between 32.8–60.2 million paired end sequence reads were obtained per sample from the Johns Hopkins School of Medicine Genetic Resource Core Facility. Quality of individual sequences within FASTQ files were evaluated using custom quality control analysis (see Code Availability), including per cycle quality analysis which plots the Phred quality score distribution on the y axis for each cycle of the sequencing by synthesis reaction plotted on the x axis (Fig. 2a, Supplementary Fig. 1) as well as per sequence quality analysis which plots mean Phred quality scores on the x axis against the overall number of reads corresponding to that Phred score on the y axis (Fig. 2b, Supplementary Fig. 2). Figure 2 demonstrates a representative sample FASTQ sequencing files from each tissue used in this analysis. Each FASTQ file had an average per base Phred score > 28 as well as the vast majority of sequencing reads with a mean Phred score > 28, both conventional thresholds denoting high quality base calls.

To correct for bias of methylation percentage on the ends of the reads, several bases were trimmed off prior to subsequent analysis using Trim Galore! (see Code Availability 1). The number of bases trimmed was determined empirically by taking a subset of the reads through the bioinformatics pipeline and observing the methylation bias. In addition, adaptor sequences and bases below the Phred score of 20 from the 3′ end were removed further increasing the average per base Phred score or reads used in downstream analysis (Supplementary Fig. 4). Trimming altogether removed 5.9 to 13% of the sequenced base pairs (Table 2). Using trimmed reads, no significant bias in methylation was observed (Fig. 2c,Supplementary Fig. 3). Figure 3a demonstrates our experimental overview including the bioinformatics pipeline employed following quality validation of sequence reads. High quality sequence reads were aligned to the UCSC Gallus gallus reference genome (galGal5) preprocessed for bisulfite sequencing using bismark software¹⁰ (See Code Availability 2), yielding a range of 65 to 80% uniquely aligned reads (Fig. 3b and Table 3).

Table 2 BS-seq read and filtering statistics.

Full size table

Table 3 BS-seq mapping statistics.

Full size table

Methylation data analysis

Aligned bisulfite sequence reads were processed using the Bioconductor package bsseq¹¹. Local-likelihood smoothing was performed on the datasets to improve the precision of methylation frequencies. Smoothing parameters were adjusted to fit the downstream analysis, i.e. differentially methylated region finding versus large block finding. Gene and CpG island annotations for Gallus gallus v5 chicken genome were obtained from the UCSC genome annotation database, and global methylation was measured for each sample group using bsseq smoothed methylation data (Supplementary Fig. 5). Global methylation showed a dip near the TSS, as is typically expected. In the gene bodies, methylation varied widely, showing little difference to the methylation across the whole genome. Finally, CpG islands were mostly unmethylated, with a median of 66% of CpG islands having an average methylation below 20%.

Code availability

The following software and versions were used for quality control and data analysis as described in the main text:

1
FastQC, version 0.11.4 was used for quality analysis of raw FASTQ sequencing data: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/
2
Trim Galore!, version 0.4.1 was used for adaptor and end-trimming of raw FASTQ sequencing data: http://www.bioinformatics.babraham.ac.uk/projects/trim_galore/
3
Bismark, version 0.16.3 was used for bisulfite-sequencing-specific alignment of raw FASTQ sequencing data: http://www.bioinformatics.babraham.ac.uk/projects/bismark/

All code used for quality assessment and data analysis in this study is available at: https://github.com/isaclee/chicken

Data Records

Raw FASTQ files for the whole genome bisulfite-seq libraries were deposited to the NCBI Sequence Read Archive (SRA) (Data Citation 1), and have been assigned BioProject accession PRJNA389197 (Table 1; Data Citation 1).

Technical Validation

Quality control-DNA library integrity

Quality of the bisulfite library was assessed using an Agilent Bioanalyzer to plot distribution of DNA fragment peaks. High quality libraries, having a distribution of DNA fragments centered around 300 bps without adaptor/primer dimer peaks, were used for sequencing analysis.

Bisulfite sequencing data quality

Mean Phred quality scores of the sequenced reads fall in the high quality range, as shown by per base (Fig. 2a, Supplementary Figs 1 and 4) and per sequence (Fig. 2b, Supplementary Fig. 2) quality analysis. 25.0 to 45.5 million reads were mapped to the chicken reference galGal5 genome assembly (Table 3). No significant bias in DNA methylation percentage was observed with respect to the sequence position along the read (Fig. 2c, Supplementary Fig. 3).

Data reproducibility

Using smoothed methylation values at all available CpG loci, we performed principal component analysis and hierarchical clustering analysis to test the reproducibility of the methylation data (Fig. 3c,d). The resulting within-group and between-group Pearson correlations were calculated to report numerical evidence of the conclusion (Fig. 3d and Supplementary Fig. 6). As expected, each triplicate group shared similar variances in the first two principal components and grouped into distinct clusters. Likewise, highest within-group correlations were observed within the triplicates, and the highest between-group correlation was observed between the two developmental stages of retinal samples.

Usage Notes

The bioinformatics pipeline applied to our data set outlined in Fig. 3a was achieved using a collection of freely available, open access tools. However, these analyses are interchangeable with many other currently available tools. Our raw FASTQ data can be aligned to any available chicken reference genome, including the most recent 2015 galGal5 assembly, using a variety of freely available bisulfite-converted sequence aligners. In this study we used the Bismark methylation aligner¹⁰, however, we expect that similar alignment results can be achieved using the bsmooth pipeline¹¹. Alignment of the FASTQ data in the form of bam files can be viewed using popular genome browsers such as the UCSC Genome Browser¹², the Ensembl Browser¹³, or the Broad Institute’s Integrative Genome View¹⁴. Subsequent differential methylation analysis using these data can be carried out using the R/bioconductor package bsseq^11,15 or other publicly available packages such as methylkit¹⁶ may also be used for this analysis.

Our data set will be useful for a variety of studies investigating developmental and tissue-specific changes in DNA methylation in the vertebrate retina. There are, however, several considerations that must be taken into account when using these data for downstream analysis. First, DNAs were extracted from whole retina, whole brain, and whole cornea without any enrichment for cell type. Therefore, resulting DNA methylation patterns are representative of heterogeneous mixtures of different cell types within these tissues. Second, the quantity of sequenced and mapped reads per sample in this study (Fig. 3b) is sufficient for robust differentially methylated region and block analysis, but is below the suggested threshold for small or single nucleotide methylation analysis. We chose this coverage to maximize the number of biological replicates, following work by Ziller et al. who demonstrated that 5X-15X coverage was sufficient¹⁷. Finally, due to the decreased nucleotide complexity after bisulfite conversion, the alignment of the sequence reads and hence the methylation measurements may vary depending on the reference genome used for alignment¹⁸. Taking these considerations into account, these data will be a useful resource for the vision research community to thoroughly investigate critical changes in DNA methylation that take place during the complex process of vertebrate retinal development as well as differences in DNA methylation between other ocular and neuronal tissues. These data can also be used in conjunction with transcriptome data to characterize epigenetically regulated transcriptional networks critical for retinal development.

Additional Information

How to cite this article: Lee, I. et al. Whole genome DNA methylation sequencing of the chicken retina, cornea and brain. Sci. Data 4:170148 doi: 10.1038/sdata.2017.148 (2017).

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Henson, J., Tischler, G. & Ning, Z. Next-generation sequencing and large genome assemblies. Pharmacogenomics 13, 901–915 (2014).
Article Google Scholar
The ENCODE Project Consortium. A User’ s Guide to the Encyclopedia of DNA Elements ( ENCODE). PLoS Biol. 9 (2011).
Chadwick, L. H. The NIH Roadmap Epigenomics Program data resource. Epigenomics 4, 317–324 (2013).
Article Google Scholar
Lamb, T. D., Collin, S. P. & Pugh, E. N. Jr Evolution of the vertebrate eye: opsins, photoreceptors, retina and eye cup. Nat Rev Neurosci 8, 960–976 (2011).
Article Google Scholar
Vergara, M. N. & Canto-Soler, M. V. Rediscovering the chick embryo as a model to study retinal development. Neural Dev 7, 22–40 (2012).
Article Google Scholar
V. Hamburger, H. L. H. A series of normal stages in the development of the chick embryo. Dev. Dyn. 88, 49–92 (1951).
Google Scholar
Doh, S. T. et al. Analysis of retinal cell development in chick embryo by immunohistochemistry and in ovo electroporation techniques. BMC Dev. Biol. 10, 8 (2010).
Article Google Scholar
Enright, J. M., Lawrence, K. A., Hadzic, T. & Corbo, J. C. Transcriptome profiling of developing photoreceptor subtypes reveals candidate genes involved in avian photoreceptor diversification. J. Comp. Neurol. 523, 649–668 (2016).
Article Google Scholar
Langouet-Astrie, C. J., Meinsen, A. L., Grunwald, E. R., Turner, S. D. & Enke, R. A. RNA sequencing analysis of the developing chicken retina. Sci. Data 3, 160117 (2016).
Article CAS Google Scholar
Krueger, F. & Andrews, S. R. Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics 27, 1571–1572 (2011).
Article CAS Google Scholar
Hansen, K. D. et al. BSmooth: from whole genome bisulfite sequencing reads to differentially methylated regions. Genome Biol. 13, R83 (2012).
Article Google Scholar
Tyner, C. et al. The UCSC Genome Browser database: 2017 update. Nucleic Acids Res. 45, 626–634 (2017).
Google Scholar
Yates, A. et al. Ensembl 2016. Nucleic Acids Res. 44, 710–716 (2016).
Article Google Scholar
Robinson, J. T. et al. Integrative genomics viewer. Nat. Biotechnol. 29, 24–26 (2011).
Article CAS Google Scholar
Li, P. et al. An Integrated Workflow for DNA Methylation Analysis. J. Genet. Genomics 40, 249–260 (2013).
Article Google Scholar
Akalin, A. et al. methylKit: a comprehensive R package for the analysis of genome-wide DNA methylation profiles. Genome Biol. 3 (2012).
Ziller, M. J., Hansen, K. D., Meissner, A. & Aryee, M. J. Coverage recommendations for methylation analysis by whole genome bisulfite sequencing. Nat. Methods 12, 230–232 (2015).
Article CAS Google Scholar
Wulfridge, P., Langmead, B., Feinberg, A. P. & Hansen, K. D. Choice of reference genome can introduce massive bias in bisulfite sequencing data. 1–31 (2016).

Data Citations

NCBI Sequence Read Archive SRP108572 (2017)

Download references

Acknowledgements

The authors would like to thank George’s Hatchery in Harrisonburg, VA for providing fertilized eggs used in this study. We also thank RAE’s Spring 2017 Bio 480 Advanced Molecular Biology class in the JMU Department of Biology as well as the Cold Spring Harbor Laboratory DNA Learning Center for contributions to data analysis. This work was supported by Commonwealth Health Research Board grant #216-05-15 awarded to RAE and WT, a JMU 4-VA Collaborative Research Grant awarded to RAE, and Burroughs Wellcome Fund Grant #1017506 awarded to RAE.

Author information

Authors and Affiliations

Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA,
Isac Lee, Alannah Lejeune & Winston Timp
Department of Biology, James Madison University, Harrisonburg, VA 22807, USA,
Bejan A. Rasoul, Ashton S. Holub & Raymond A. Enke
Center for Genome & Metagenome Studies, James Madison University, Harrisonburg, VA 22807, USA,
Raymond A. Enke

Authors

Isac Lee
View author publications
You can also search for this author in PubMed Google Scholar
Bejan A. Rasoul
View author publications
You can also search for this author in PubMed Google Scholar
Ashton S. Holub
View author publications
You can also search for this author in PubMed Google Scholar
Alannah Lejeune
View author publications
You can also search for this author in PubMed Google Scholar
Raymond A. Enke
View author publications
You can also search for this author in PubMed Google Scholar
Winston Timp
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

I.L. conducted library preparation, bioinformatics analysis and writing of the manuscript. B.R. and A.H. assisted with bioinformatics analysis and writing of the manuscript. A.L. assisted with library preparation. W.T. and R.A.E. conceived and secured funding for the project as well as supervised all aspects of the project.

Corresponding authors

Correspondence to Raymond A. Enke or Winston Timp.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

ISA-Tab metadata

Supplementary information

Supplementary Figures (PDF 14426 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files made available in this article.

Reprints and permissions

About this article

Cite this article

Lee, I., Rasoul, B., Holub, A. et al. Whole genome DNA methylation sequencing of the chicken retina, cornea and brain. Sci Data 4, 170148 (2017). https://doi.org/10.1038/sdata.2017.148

Download citation

Received: 20 June 2017
Accepted: 22 August 2017
Published: 10 October 2017
DOI: https://doi.org/10.1038/sdata.2017.148

This article is cited by

DNA methylation in poultry: a review
- Xing Ju
- Zhijun Wang
- Qinghua Nie
Journal of Animal Science and Biotechnology (2023)
A chicken DNA methylation clock for the prediction of broiler health
- Günter Raddatz
- Ryan J. Arsenault
- Frank Lyko
Communications Biology (2021)
RNA sequencing analysis of the human retina and associated ocular tissues
- Scott T. Schumacker
- Krista R. Coppage
- Ray A. Enke
Scientific Data (2020)

Subjects

Abstract

Similar content being viewed by others

Background & Summary

Methods

Embryos

Tissue processing, histology & imaging:

Genomic DNA isolation

DNA preparation, bisulfite conversion, and sequencing

Quality validation and read mapping:

Methylation data analysis

Code availability

Data Records

Technical Validation

Quality control-DNA library integrity

Bisulfite sequencing data quality

Data reproducibility

Usage Notes

Additional Information

References

References

Data Citations

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

ISA-Tab metadata

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links