RNA sequencing analysis of the developing chicken retina

Langouet-Astrie, Christophe J.; Meinsen, Annamarie L.; Grunwald, Emily R.; Turner, Stephen D.; Enke, Raymond A.

doi:10.1038/sdata.2016.117

Data Descriptor
Open access
Published: 20 December 2016

Christophe J. Langouet-Astrie¹,
Annamarie L. Meinsen¹,
Emily R. Grunwald¹,
Stephen D. Turner² &
Raymond A. Enke¹,³

Scientific Data volume 3, Article number: 160117 (2016)

Abstract

RNA sequencing transcriptome analysis using massively parallel next generation sequencing technology provides the capability to understand global changes in gene expression across a range of tissue samples. Development of the vertebrate retina requires complex temporal orchestration of transcriptional activation and repression. The chicken embryo (Gallus gallus) is a classic model system for studying developmental biology and retinogenesis. Existing retinal transcriptome projects have been critical to the vision research community for studying aspects of murine and human retinogenesis; however, there are currently no publicly available data sets describing the developing chicken retinal transcriptome. Here we used Illumina RNA sequencing (RNA-seq) analysis to characterize the mRNA transcriptome of the developing chicken retina in an effort to identify genes critical for retinal development in this important model organism. These data will be valuable to the vision research community for characterizing global changes in gene expression between ocular tissues and critical developmental time points during retinogenesis in the chicken retina.

Design Type(s)	organism development design • transcription profiling by high throughput sequencing design
Measurement Type(s)	transcription profiling assay
Technology Type(s)	RNA sequencing
Factor Type(s)	life cycle stage
Sample Characteristic(s)	Gallus gallus • retina • cornea

Machine-accessible metadata file describing the reported data (ISA-Tab format)

Background & Summary

Next-generation sequencing technology has allowed for extensive transcriptome analysis of a wide range of species both with and without reference genome assemblies¹,². These RNA sequencing (RNA-seq) analyses have become the gold standard for in-depth characterization of global changes in gene expression as well as the accumulation of novel transcript isoforms. Once made publicly available, genome-wide experiments also provide the research community with valuable data that can be subsequently mined to further scientific knowledge. Detailed curation of these datasets is paramount for accurate interpretation, widespread dissemination, and repurposing of data.

The experiment described here is part of a larger ongoing project within James Madison University’s Center for Genome & Metagenome Studies (CGEMS) investigating transcriptional regulation in the developing vertebrate retina. Within the developing retina, precise transcriptional regulation is critical for proper differentiation of specialized subclasses of retinal neurons and glial support cells³. Insights into these complex regulatory networks are critical for further understanding molecular mechanisms that drive human retinal development as well as for crafting novel strategies to combat blinding human diseases that affect the retina. Because developmental processes are highly conserved among vertebrate species, the chicken (Gallus gallus) embryo is a reliable and practical model system for studying organogenesis. Developmental staging of the chicken embryo has been characterized in meticulous detail for 65 years⁴. Embryo development is rapid, completing its entire program from blastula to hatchling in 21 days. Chick embryos are particularly useful for studying ocular development compared to other model organisms, as their developing eyes are easily accessible for experimentation due to their relatively large size. Recent resequencing and improvement of the chicken genome assembly combined with newly developed molecular tools for genetic manipulation of this model system have contributed to a renaissance of using the chicken embryo as a robust model to study retinal development⁵.

During chick retinal development, embryonic day 8 (E8) and embryonic day 18 (E18) represent early and late retinal developmental time points with respect to cellular differentiation of retinal neurons. The early E8 retina is packed with multipotent precursor cells, while the E18 retina is nearly mature, with photoreceptor (PR), bipolar, amacrine, horizontal, and ganglion cell neurons as well as Müller glial cells having differentiated from these multipotent precursors (Fig. 1) (ref. 5). Precursor cells yet to exit the cell cycle as well as each of these specialized retinal cell types are known to express developmental and cell type-specific genes; however, the full detail of these specific expression patterns is yet to be defined⁶. Only recently have transcriptome experiments begun to be employed as a tool to dissect global changes in gene expression during chick retinal development. A recent study using an elegant reporter system followed by cell capture and RNA-seq analysis characterized differential gene expression of rod and cone photoreceptor differentiation during chick retinal development⁷. Focusing solely on this one important class of retinal neuron, Enright and colleagues were able to identify hundreds of differentially expressed genes involved in diverse cellular processes contributing to the birth of chicken photoreceptors⁷. Currently, there are no publicly available data sets characterizing the whole chicken retina transcriptome during development.

**Figure 1: Overview of Gallus gallus embryo, eye, and retinal development.**

The focus of this project is to characterize the mRNA transcriptome of the developing chicken retina in an effort to identify genes critical for differentiation of the six major retinal cell types as well as their derivative cell subtypes in this important model organism. The developmental time points chosen were E8 (Fig. 1a–c), E16 (Fig. 1d–f), and E18 (Fig. 1g–j), which provide transcriptional information for early, middle/late, and late retinal development, respectively. E18 whole cornea (Fig. 1h) was also included in this analysis as a non-retinal reference tissue. These analyses were conducted using Illumina RNA-seq in tandem with a bioinformatics pipeline to ensure sequence quality (Fig. 2) and for robust eukaryotic transcriptome analysis (Fig. 3).

**Figure 2: Quality assessment of raw FASTQ sequence data for 150 bp paired end left (1) and right (2) reads.**

**Figure 3: Experimental overview and assessment of read mapping and sample variance.**

Methods

Embryos

All embryo experiments were conducted with the approval of the James Madison University Institutional Animal Care and Use Committee and in accordance with the National Institutes of Health guide for the care and use of laboratory animals. Fertilized pathogen free commercial Cobb/Hubbard F1 hybrid eggs were obtained from George’s Hatchery (Harrisonburg, VA) and incubated in a rocking chamber held at 38 °C and 50–60% humidity until specified incubation days.

Tissue processing, histology & imaging

Chicken embryos were harvested at the specified incubation days and euthanized by decapitation. Intact eyes were enucleated and placed in cold PBS. Whole embryos were imaged using an iPhone 6 and whole eyes were imaged using a Stemi SV 6 stereo microscope (Zeiss) equipped with an 18.2 Color Mosaic camera (SPOT). Eyecups were prepared by piercing and cutting around the limbus to dissect away the anterior segment, lens, and vitreous. Isolated corneas were saved for subsequent RNA extraction. For histology, eyecups were fixed in 4% paraformaldehyde in 1× PBS for 25 min on ice and then equilibrated for 2–24 hours in 25% sucrose in 1× PBS. Equilibrated eyecups were transferred into a 2:1 mixture of 25% sucrose:OCT compound (Electron Microscopy Sciences) on ice for 30 min and then flash frozen in the same solution in a Tissue-Tek Cryomold (Sakura Finetek) and stored at −80 °C. Frozen serial sections (10 μm thick) were prepared using a CM3050 S Research Cryostat (Leica) with the object and chamber temperatures set to −22 °C and −28 °C, respectively. Frozen sections were thawed, H&E stained, and imaged using an Eclipse TE2000 inverted microscope (Nikon) and processed with NIS Elements software (Nikon). For retinal dissection, eyecups were incubated for 20 min in HBSS modified media without calcium or magnesium (HBSS -Ca, -Mg; HyClone) at 37 °C to dissociate the retinal pigment epithelium (RPE) layer from the outermost layer of the retina. Retinas were then isolated by tearing away the sclera and gently peeling away the RPE layer. Isolated retinas and corneas were briefly rinsed in cold HBSS -Ca, -Mg. Retinas were immediately transferred to RLT+ lysis buffer (Qiagen; AllPrep kit) containing 2-mercaptoethanol (BME; Sigma) and vortexed vigorously to dissociate the tissue. Corneas were flash frozen and ground into a fine powder using a mortar and pestle prior to being transferred to RLT+/BME lysis buffer solution and vortexed. Samples were stored long term in lysis buffer at −80 °C.

Total RNA isolation

Total RNA was collected from eight embryonic chicken ocular tissues (Table 1). Whole retinas were harvested from E8 (Fig. 1a–c), E16 (Fig. 1d–f), and E18 (Fig. 1g–j) developing chicken embryos as well as whole corneas collected from E18 embryos (Fig. 1h). Duplicates were obtained for each time point and total RNA was extracted from samples using a Qiagen AllPrep Mini Kit with an on-column DNase I treatment step per the manufacturer’s instructions. Isolated RNAs were eluted in nuclease-free water, validated for quality and quantity using UV spectrophotometry, and stored long term at −80 °C. RNAs with an OD260/280 ratio between 1.9 and 2.1 were deemed high quality.

Table 1 RNA-seq profiling to evaluate developmental and tissue-specific retinal gene expression.

RNA preparation and sequencing

Total RNA samples chosen for characterization of global mRNA expression were submitted to the Cold Spring Harbor Laboratory DNA Sequencing Center for Bioanalyzer quality control analysis (Agilent) and Illumina Next Generation Sequencing. All submitted samples had an RNA integrity number (RIN) >8. Stranded TruSeq libraries with poly dT enrichment were prepared from total RNA from each of the four samples in biological duplicate according to the manufacturer’s protocol. The resulting average size of the cDNA libraries was approximately 300 bp. Libraries for the 8 cDNA samples were sequenced using the Illumina NextSeq 500 sequencing platform yielding 28.6–72.2 million 125 bp paired end sequence reads per sample (Table 2; Fig. 3b).

Table 2 RNA-seq read statistics.

Quality validation and read mapping

Between 28.7 and 72.2 million paired end sequence reads were obtained per sample from the Cold Spring Harbor Laboratory DNA Sequencing Center (Table 2). Quality of individual sequences was evaluated using FastQC analysis (see Code Availability 1), including per base sequence quality analysis, which plots the distribution of Phred quality scores (y axis) across all reads in a sample at each base position (x axis) (Fig. 2). Figure 2 demonstrates that all 16 FASTQ sequencing files have an average per base Phred score >28, a conventional threshold denoting high quality base calls. Figure 3a shows our experimental overview, including the bioinformatics pipeline employed following quality validation of sequence reads. High quality sequence reads were aligned to the UCSC Gallus gallus reference genome (galGal4) using STAR¹² for ultrafast read alignment (see Code Availability 2). The percentage of uniquely mapped reads ranged from 73 to 85% (Table 2; Fig. 3b). Further quality validation of the data set was performed using mapped reads from each sample.
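The per base quality summary described above can be illustrated with a minimal sketch. This is not the FastQC implementation; it only assumes the common Phred+33 encoding, and the function names and toy quality strings are hypothetical.

```python
from statistics import mean


def per_base_mean_phred(quality_lines, offset=33):
    """Return the mean Phred score at each read position.

    quality_lines: iterable of FASTQ quality strings (the 4th line of each record).
    offset: ASCII offset of the encoding (33 is assumed here).
    """
    columns = []  # columns[i] collects the scores observed at read position i
    for line in quality_lines:
        for i, char in enumerate(line.strip()):
            if i == len(columns):
                columns.append([])
            columns[i].append(ord(char) - offset)
    return [mean(col) for col in columns]


def error_probability(q):
    """Phred definition: Q = -10 * log10(P), so P = 10 ** (-Q / 10)."""
    return 10 ** (-q / 10)


# Toy quality strings (hypothetical): 'I' encodes Q40, '?' encodes Q30, '5' encodes Q20
toy = ["III5", "I?I5", "III"]
means = per_base_mean_phred(toy)

# Flag positions whose mean falls at or below the Q28 threshold used in the text
low_positions = [i for i, m in enumerate(means) if m <= 28]
```

In this toy input only the last position falls below the threshold, which mirrors how a drop in quality toward the 3' end of reads would show up in a per base plot.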

Data transformation and downstream analysis

Gene expression was quantified at the gene level using the read summarization program featureCounts (see Code Availability 3)⁸. From this point on, all data analysis was conducted using the R programming language and related packages. The output matrix from featureCounts⁸ was input into the Bioconductor package DESeq2 (see Code Availability 4)⁹. This package was used to normalize the count data under a negative binomial model, and values were log₂ transformed. Principal component analysis (PCA) and distance matrix plots were generated with the same package to assess variance between sample groups and sample replicates (Fig. 3c,d). Ensembl gene IDs mapping to predicted genes were excluded and the Benjamini-Hochberg False Discovery Rate¹⁰ procedure was used to re-estimate the adjusted P-values for Ensembl gene IDs¹¹ mapping to known genes.
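To make the normalization step concrete, the sketch below shows median-of-ratios size factor estimation, the library size correction described in the DESeq2 publication⁹, followed by a simple log₂ transform. It is a pure Python illustration, not the DESeq2 code: the `log2(x + 1)` transform is an assumption standing in for whichever transform the package applied, and the toy count matrix is hypothetical.

```python
import math
from statistics import median


def size_factors(counts):
    """Median-of-ratios size factors.

    counts: list of rows (genes), each a list of raw counts per sample.
    """
    n_samples = len(counts[0])
    log_ratios = [[] for _ in range(n_samples)]
    for row in counts:
        if min(row) <= 0:
            continue  # a gene with any zero count has no usable geometric mean
        log_geo_mean = sum(math.log(c) for c in row) / n_samples
        for j, c in enumerate(row):
            log_ratios[j].append(math.log(c) - log_geo_mean)
    # Each sample's factor is the median ratio to the per-gene geometric mean
    return [math.exp(median(r)) for r in log_ratios]


def log2_normalized(counts, pseudocount=1.0):
    """Divide each sample by its size factor, then take log2(x + pseudocount)."""
    factors = size_factors(counts)
    return [
        [math.log2(c / f + pseudocount) for c, f in zip(row, factors)]
        for row in counts
    ]


# Toy matrix (hypothetical): sample 2 was sequenced exactly twice as deeply
toy_counts = [
    [10, 20],
    [100, 200],
    [50, 100],
    [0, 3],
]
factors = size_factors(toy_counts)
log_expr = log2_normalized(toy_counts)
```

Because the second toy sample is a uniform two-fold scaling of the first, the size factors differ by a factor of two and the first three genes end up with identical normalized values in both samples.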

Code availability

The following software and versions were used for quality control and data analysis as described in the main text:

1. FastQC, version 0.11.4, was used for quality analysis of raw FASTQ sequencing data: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/
2. STAR, version 2.5.1b, was used for mapping of sequence reads to the chicken galGal4 genome assembly: https://github.com/alexdobin/STAR/releases
3. featureCounts, version 1.5.0, was used for gene-level read summarization prior to differential gene expression analysis: http://bioinf.wehi.edu.au/featureCounts/
4. DESeq2, version 1.6.2, was used for normalization and visualization of differential gene expression analysis output: http://bioconductor.org/packages/DESeq2

All code used for quality assessment and data analysis in this study is available at: https://github.com/stephenturner/langouetastrie-scidata2016-chicken-rnaseq-retina-code.
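For orientation, the three command-line steps listed above can be sketched as follows. This is a hedged illustration and is not copied from the repository linked above: the sample name, file paths, thread count, and output directory are hypothetical, and only widely documented options of each tool are used. The function builds the command lines without executing them.

```python
import shlex


def build_pipeline(sample, r1, r2, genome_dir, gtf, threads=4):
    """Return FastQC, STAR, and featureCounts command lines for one paired-end sample."""
    prefix = f"{sample}."
    # STAR names coordinate-sorted BAM output <prefix>Aligned.sortedByCoord.out.bam
    bam = f"{prefix}Aligned.sortedByCoord.out.bam"

    # The output directory "qc" is a hypothetical name and must exist beforehand
    fastqc = ["fastqc", "--outdir", "qc", r1, r2]
    star = [
        "STAR",
        "--runThreadN", str(threads),
        "--genomeDir", genome_dir,          # pre-built STAR index (assumed to exist)
        "--readFilesIn", r1, r2,
        "--readFilesCommand", "zcat",       # reads are assumed to be gzip-compressed
        "--outSAMtype", "BAM", "SortedByCoordinate",
        "--outFileNamePrefix", prefix,
    ]
    feature_counts = [
        "featureCounts",
        "-p",                               # paired-end input
        "-T", str(threads),
        "-a", gtf,                          # gene annotation in GTF format
        "-o", f"{sample}.counts.txt",
        bam,
    ]
    return [fastqc, star, feature_counts]


# Hypothetical sample and paths, printed as shell-safe strings for inspection
commands = build_pipeline(
    "E8_rep1", "E8_rep1_1.fastq.gz", "E8_rep1_2.fastq.gz", "galGal4_index", "genes.gtf"
)
for cmd in commands:
    print(" ".join(shlex.quote(arg) for arg in cmd))
```

The strandedness option of featureCounts is deliberately left at its default here; for stranded libraries it would need to be set to match the library preparation.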

Data Records

Raw FASTQ files for the RNA-seq libraries were deposited to the NCBI Sequence Read Archive (SRA) (Data Citation 1), and have been assigned BioProject accession PRJNA275440. Output files from the analysis were deposited to the NCBI Gene Expression Omnibus (GEO) with accession number GSE65938, providing access to all the data files (Table 1; Data Citation 2). Supplementary file GSE65938_GEOENKE-RNASEQ-normcounts.csv.gz contains the genome, counts, and normalized counts used to generate the statistical plots (Data Citation 2).

Technical Validation

Quality control-RNA integrity

Quality of total RNA fractions was assessed using an Agilent Bioanalyzer to calculate an RNA Integrity Number (RIN). The RIN algorithm scores the RNA quality of each sample, with the highest quality having a score of 10. As is conventional for NGS analysis, only RNA samples with a RIN >8 were used for sequencing analysis.

RNA-Seq raw data quality

FastQC per base sequence quality analysis demonstrates that mean Phred quality scores are well within the acceptable range for downstream analysis (Fig. 2). Between 21.0 and 61.5 million reads were mapped to the chicken reference galGal4 genome assembly (Table 2). The PCA plot and distance matrix confirm the similarity between biological replicates and the variability between developmental time points, respectively. In addition, the PCA plot showed that sample type (PC1) and embryonic developmental day (PC2) together account for 98.8% of the variability in gene expression (Fig. 3c,d).
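The replicate check behind a sample distance matrix can be sketched in a few lines. The example below computes pairwise Euclidean distances between samples from log-transformed expression values; the sample names and values are hypothetical toy data, not values from this data set, and the choice of Euclidean distance is an assumption about how such a matrix is typically built.

```python
import math


def sample_distances(log_expr):
    """Pairwise Euclidean distances between samples.

    log_expr: dict mapping sample name -> list of log2 expression values,
              with genes in the same order for every sample.
    """
    names = list(log_expr)
    return {
        (a, b): math.sqrt(sum((x - y) ** 2 for x, y in zip(log_expr[a], log_expr[b])))
        for a in names
        for b in names
    }


# Toy values (hypothetical): two replicates at each of two time points
toy = {
    "E8_rep1": [8.0, 2.0, 5.0],
    "E8_rep2": [8.2, 2.1, 4.9],
    "E18_rep1": [3.0, 9.0, 5.1],
    "E18_rep2": [3.1, 8.8, 5.0],
}
dist = sample_distances(toy)
within = dist[("E8_rep1", "E8_rep2")]    # replicate-to-replicate distance
between = dist[("E8_rep1", "E18_rep1")]  # distance across time points
```

A data set with consistent replicates should show the pattern in this toy example, with within-group distances much smaller than between-group distances.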

Usage Notes

The bioinformatics pipeline applied to our data set outlined in Fig. 3a was achieved using a collection of freely available, open access tools. These analyses, however, are interchangeable with many other currently available tools. Our raw FASTQ data can be aligned to any available chicken reference genome, including the most recent 2011 galGal4 assembly, using a variety of freely available aligners. In this study we used the STAR genome aligner¹²; however, similar analysis may be achieved using the ‘new tuxedo’ pipeline¹³. Alignments of the FASTQ data in the form of BAM files can be viewed using popular genome browsers such as the UCSC Genome Browser¹⁴, the Ensembl Browser¹¹ or the Broad Institute’s Integrative Genomics Viewer (IGV)¹⁵,¹⁶. Here our differential gene expression analysis was carried out using DESeq2 (refs 9,17); however, other publicly available packages such as edgeR¹⁸ or CuffDiff¹ may also be used for this analysis. An alternative to using genome aligners is to employ an alignment-free transcript quantification¹⁹,²⁰ step followed by gene-level summation²¹ and then differential expression analysis with a package such as DESeq2 (refs 9,17). An alignment-free pipeline reduces the time of analysis as well as the required computing power, which may be beneficial for some users¹⁹,²⁰.
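The gene-level summation step of an alignment-free workflow can be illustrated with a minimal sketch. It simply adds transcript-level estimated counts for each gene and does not reproduce the length-aware corrections described in ref. 21; the transcript and gene identifiers are hypothetical.

```python
from collections import defaultdict


def summarize_to_genes(transcript_counts, tx2gene):
    """Sum transcript-level estimated counts into gene-level counts.

    transcript_counts: dict mapping transcript ID -> estimated count.
    tx2gene: dict mapping transcript ID -> gene ID.
    """
    gene_counts = defaultdict(float)
    for tx, count in transcript_counts.items():
        gene = tx2gene.get(tx)
        if gene is None:
            continue  # transcripts without a gene assignment are skipped
        gene_counts[gene] += count
    return dict(gene_counts)


# Hypothetical identifiers and estimates for illustration only
tx2gene = {"tx1": "geneA", "tx2": "geneA", "tx3": "geneB"}
estimates = {"tx1": 10.5, "tx2": 20.0, "tx3": 7.0, "tx4": 3.0}
gene_level = summarize_to_genes(estimates, tx2gene)
```

The resulting gene-level table has the same shape as a featureCounts matrix for one sample, so it could feed the same downstream normalization and testing steps.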

Our data set will be useful for a variety of studies investigating developmental and tissue-specific changes in gene expression in the vertebrate retina. There are, however, several considerations that must be taken into account when using these data for downstream analysis. First, RNAs were extracted from whole retina or whole cornea without any enrichment for cell type. Therefore, resulting downstream analysis will be representative of heterogeneous mixtures of differing cell types within these tissues. Second, cDNA libraries were prepared using a poly dT primer; thus the data set is representative of only polyadenylated mRNA transcripts and does not represent non-coding RNA or other non-polyadenylated cellular transcripts. Additionally, usage of poly dT priming introduces a bias towards overrepresentation of the 3’ end of transcripts, particularly in the case of large transcripts. Finally, the quantity of sequenced and mapped reads per sample in this study (Table 2; Fig. 3b) is sufficient for robust differential gene expression analysis but is below the conventional threshold for thorough differential isoform analysis²². Taking these considerations into account, these data will be a useful resource for the vision research community to thoroughly investigate critical changes in gene expression that take place during the complex process of vertebrate retinal development. Additionally, these data will be available to explore important tissue-specific comparisons in gene expression patterns between the retina and the cornea, two clinically relevant ocular tissues.

Additional information

How to cite this article: Langouet-Astrie, C. J. et al. RNA sequencing analysis of the developing chicken retina. Sci. Data 3:160117 doi: 10.1038/sdata.2016.117 (2016).

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1. Trapnell, C. et al. Differential analysis of gene regulation at transcript resolution with RNA-seq. Nat. Biotechnol. 31, 46–53 (2013).
2. Koboldt, D. C., Steinberg, K. M., Larson, D. E., Wilson, R. K. & Mardis, E. R. The next-generation sequencing revolution and its impact on genomics. Cell 155, 27–38 (2013).
3. Lamb, T. D., Collin, S. P. & Pugh, E. N. Evolution of the vertebrate eye: opsins, photoreceptors, retina and eye cup. Nat. Rev. Neurosci. 8, 960–976 (2007).
4. Hamburger, V. & Hamilton, H. L. A series of normal stages in the development of the chick embryo. 1951. Dev. Dyn. 195, 231–272 (1992).
5. Vergara, M. N. & Canto-Soler, M. V. Rediscovering the chick embryo as a model to study retinal development. Neural Dev. 7, 22 (2012).
6. Doh, S. T. et al. Analysis of retinal cell development in chick embryo by immunohistochemistry and in ovo electroporation techniques. BMC Dev. Biol. 10, 8 (2010).
7. Enright, J. M., Lawrence, K. A., Hadzic, T. & Corbo, J. C. Transcriptome profiling of developing photoreceptor subtypes reveals candidate genes involved in avian photoreceptor diversification. J. Comp. Neurol. 523, 649–668 (2015).
8. Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
9. Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
10. Benjamini, Y. & Hochberg, Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society. Series B (Methodological) 57, 289–300 (1995).
11. Flicek, P. et al. Ensembl 2014. Nucleic Acids Res. 42, D749–D755 (2014).
12. Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
13. Pertea, M., Kim, D., Pertea, G. M., Leek, J. T. & Salzberg, S. L. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat. Protoc. 11, 1650 (2016).
14. Kent, W. J. et al. The Human Genome Browser at UCSC. Genome Res. 12, 996–1006 (2002).
15. Thorvaldsdottir, H., Robinson, J. T. & Mesirov, J. P. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief. Bioinform. 14, 178–192 (2013).
16. Robinson, J. T. et al. Integrative genomics viewer. Nat. Biotechnol. 29, 24–26 (2011).
17. Anders, S. & Huber, W. Differential expression analysis for sequence count data. Genome Biol. 11, R106 (2010).
18. Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2010).
19. Bray, N. L., Pimentel, H., Melsted, P. & Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525–527 (2016).
20. Patro, R., Mount, S. M. & Kingsford, C. Sailfish enables alignment-free isoform quantification from RNA-seq reads using lightweight algorithms. Nat. Biotechnol. 32, 462–464 (2014).
21. Soneson, C., Love, M. I. & Robinson, M. D. Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences. F1000Research 4, 1521 (2015).
22. Conesa, A. et al. A survey of best practices for RNA-seq data analysis. Genome Biol. 17, 13 (2016).

Data Citations

1. NCBI Sequence Read Archive SRP055037 (2016)
2. Enke, R., Turner, S., Grunwald, E., Brown, S., & DNA Learning Center NCBI Gene Expression Omnibus GSE65938 (2016)

Acknowledgements

The authors would like to thank George’s Hatchery in Harrisonburg VA for providing fertilized eggs used in this study as well as the Cold Spring Harbor Laboratory DNA Learning Center for providing sequencing services as part of a project funded by the National Science Foundation (DUE: 1323522). We also thank Sophia L. Brown and R.A.E.’s Fall 2015 Bio 481 Genomics and Spring Bio 480 Advanced Molecular Biology classes in the JMU Department of Biology for contributions to data analysis. This work was supported by Commonwealth Health Research Board grant #216-05-15 awarded to R.A.E., and a JMU/UVA 4-VA Collaborative Research Grant awarded to R.A.E. and S.D.T.

Author information

Authors and Affiliations

Department of Biology, James Madison University, Harrisonburg, 22807, Virginia, USA
Christophe J. Langouet-Astrie, Annamarie L. Meinsen, Emily R. Grunwald & Raymond A. Enke
Department of Public Health Sciences University of Virginia, Charlottesville, 22908, Virginia, USA
Stephen D. Turner
Center for Genome & Metagenome Studies, James Madison University, Harrisonburg, 22807, Virginia, USA
Raymond A. Enke

Contributions

E.R.G. assisted with nucleic acids extraction and analysis. C.L.A. and A.L.M. assisted with writing manuscript and bioinformatics analysis of dataset. S.D.T. assisted with securing funding for project, writing manuscript, and bioinformatics analysis of dataset. R.A.E. conceived and secured funding for the project as well as supervised all aspects of the project.

Corresponding author

Correspondence to Raymond A. Enke.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0 Metadata associated with this Data Descriptor is available at http://www.nature.com/sdata/ and is released under the CC0 waiver to maximize reuse.

About this article

Received: 18 August 2016
Accepted: 08 November 2016
Published: 20 December 2016
DOI: https://doi.org/10.1038/sdata.2016.117