Skip to main content

Thank you for visiting You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Transcriptome dataset of human corneal endothelium based on ribosomal RNA-depleted RNA-Seq data


The corneal endothelium maintains corneal transparency; consequently, damage to this endothelium by a number of pathological conditions results in severe vision loss. Publicly available expression databases of human tissues are useful for investigating the pathogenesis of diseases and for developing new therapeutic modalities; however, databases for ocular tissues, and especially the corneal endothelium, are poor. Here, we have generated a transcriptome dataset from the ribosomal RNA-depleted total RNA from the corneal endothelium of eyes from seven Caucasians without ocular diseases. The results of principal component analysis and correlation coefficients (ranged from 0.87 to 0.96) suggested high homogeneity of our RNA-Seq dataset among the samples, as well as sufficient amount and quality. The expression profile of tissue-specific marker genes indicated only limited, if any, contamination by other layers of the cornea, while the Smirnov-Grubbs test confirmed the absence of outlier samples. The dataset presented here should be useful for investigating the function/dysfunction of the cornea, as well as for extended transcriptome analyses integrated with expression data for non-coding RNAs.

Measurement(s) RNA
Technology Type(s) RNA sequencing
Factor Type(s) sex
Sample Characteristic - Organism Homo sapiens

Machine-accessible metadata file describing the reported data:

Background & Summary

The cornea is a transparent tissue located at the outermost surface of the eyeball, where it refracts light as a lens. It is comprised of five different layers: the epithelium, Bowman’s layer, stroma, Descemet’s membrane, and endothelium1. The corneal endothelium maintains corneal transparency by regulating the inflow and outflow of the aqueous humor to the stroma via a pump-and-barrier function. The cells of the corneal endothelium have limited proliferative capacity2,3; therefore, damage to the corneal endothelium by several pathological conditions, such as Fuchs endothelial corneal dystrophy (FECD), invasive cataract surgery, glaucoma surgery, and endotheliitis, can induce a loss of corneal transparency and result in severe visual disturbance4. The only therapeutic remedy has been transplantation of a donor cornea, and more than 50% of the indications for corneal transplantation involve corneal endothelial decompensation5,6.

Cell therapy for treating corneal endothelial decompensation has been investigated by many research groups7. Indeed, a clinical trial involving injection of cultured corneal endothelial cells into the anterior chamber of the eye has been initiated in Japan8. The induction of corneal endothelial cells from human pluripotent stem cells (iPSCs) has been reported, and the transplantation of those cells has been proposed as a potential decompensation therapy9. Along this line, many researchers have devoted their efforts to determining selective markers for the corneal endothelium to validate cultured cells for clinical use10,11,12,13,14,15. In addition, a candidate causative gene for FECD has been identified16,17 that affects ~4% of the population over the age of 40 in the United States18,19,20 and is responsible for most common corneal dystrophies. This identification of a causative gene, in turn, has led to the elucidation of the pathophysiology and to the proposal of multiple potential therapeutic modalities21,22,23. In line with this recent accelerating research in the field of corneal endothelial decompensation, the establishment of a rigid gene expression database is anticipated.

Gene expression data derived from multiple species and tissues are useful for revealing the molecular pathogenesis of diseases and are now increasingly available from public databases. However, gene expression data for human ocular tissues in particular, such as corneal endothelial cells, remains limited. For example, no ocular data are listed on the GTEx Portal (, while only summarized data derived from whole eye, retina, and some ocular cell lines are listed on the ENCODE project ( In terms of the corneal endothelium, two records are found in the “Publication” of “Data Type” deposited in ENCODE, and one article describes mRNA expression24 by referring to other published RNA-Seq data25. On the other hand, there are some published articles describing the results of RNA-Seq data derived from corneal endothelium (Supplementary Table 1), although all of the data were based on the sequencing libraries generated from poly-A selected mRNA and sequenced by single-end 50-bp reads25,26,27,28,29. In addition, some of the data were produced from cell lines or after performing ex vivo/primary culture, which could induce artificial gene expression. Recently, RNA-Seq analysis of the human ocular tissues, including cornea, has been reported as a result of pair-end 150-bp sequencing, although they used the whole cornea tissues without separating into the different layers30. Consequently, our RNA-Seq dataset based on the pair-end 100-bp sequencing using the ribosomal RNA-depleted total RNA should be useful for analyzing unbiased expression of not only coding genes but also non-coding RNAs of human corneal endothelial cells.

Therefore, in this data descriptor, our goal was to provide a transcriptome dataset based on ribosomal RNA-depleted RNA-Seq data derived from the corneal endothelium of seven normal Caucasian donors to serve as a gold standard for expression analyses. To this end, we paid strict attention to the selection criteria of the human donors (i.e., a narrow range of age distribution and an almost equal number of each gender), as well as to the qualities of the extracted total RNA and the generated RNA-Seq data, through several quality-control (QC) procedures. Consequently, the dataset established here should be a useful, reliable, and robust tool for understanding the cellular characteristics of the cornea endothelium, as well as a reference control for revealing the molecular mechanisms of disease pathogenesis in corneal dystrophies.


Ethics statement

The human tissue used in this study was handled in accordance with the tenets set forth in the Declaration of Helsinki. Informed written consent to utilize donor corneal tissue for eye research was obtained from the next of kin of all deceased donors. All corneal tissue was recovered under the tenets of the Uniform Anatomical Gift Act (UAGA) of the particular state in which the donor consent was obtained and the tissue was recovered.

Corneal endothelial tissues

Normal human donor corneas were obtained from CorneaGenTM (, Seattle, WA). All corneas had been stored at 4 °C in storage medium (Optisol-GS; Bausch & Lomb, Rochester, US-NY) for less than 14 days before use for experiments. Corneas derived from 7 donors (3 males and 4 females of Caucasian descent; age range: 48–69 years old) were used in the study. Descemet’s membrane, including the corneal endothelium, was stripped from the donor corneas. The corneal endothelium was then lysed in 700 μL of QIAzol lysis reagent (Qiagen, Valencia, CA), homogenized with a vortex mixer for 30 seconds, and stored at −80 °C until used for experiments.

Total RNA preparation

Total RNA was extracted from each corneal endothelium with an RNeasy Mini Kit (Qiagen). Briefly, the lysed corneal endothelium in QIAzol lysis reagent was thawed at 37 °C and incubated for 5 minutes at room temperature. Chloroform (140 μL) was added and the samples were mixed thoroughly, followed by centrifugation at 12,000 × g for 15 minutes at 4 °C. The upper layer was collected, mixed with an equal volume of 70% ethanol, and concentrated using spin columns. The final concentration of total RNA was measured with an Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA) and an RNA 6000 Pico Kit (Agilent Technologies). The quality of the total RNA was assessed by calculating the RNA integrity number (RIN) with the Agilent 2100 Expert Software (Agilent Technologies) (Table 1). The total RNA samples were snap-frozen in liquid nitrogen and stored at −80 °C until use (the mean ± SD storage period was 612.6 ± 27.3 days).

Table 1 Sample information.

RNA-Seq library preparation and sequencing

The RNA-Seq libraries for next-generation sequencing (NGS) were generated with a SMARTer Stranded Total RNA-Seq Kit v2 - Pico Input Mammalian (Takara Bio Inc., Shiga, Japan), according to the manufacturer’s instruction. The stored total RNA samples were thawed and their total RNA concentrations were determined with a NanoDrop ND-1000 (Thermo Fisher Scientific, Waltham, MA) and/or NanoPhotometer NP-80 (Implen GmbH, Munich, Germany) instrument. The cDNA derived from rRNA was depleted by ZapR v2 & R-Probe v2 contained in SMARTer Kit during the procedure. The quality and quantity of each RNA-Seq library was confirmed by the following three different methods: i) use of a high sensitivity DNA kit using an Agilent 2100 Bioanalyzer, ii) quantitative PCR (QPCR) with a Stratagene Mx3005P real-time QPCR system (Agilent Technologies) and the SYBR FAST ROX Low qPCR Master Mix of the KAPA Library Quantification Kit Illumina Platform (Roche Sequencing Solutions Inc., Pleasanton, CA), and iii) measurement with a Qubit 2.0 Fluorometer (Thermo Fisher). The generated libraries were then subjected to cluster generation in a flow cell using a TruSeq PE Cluster Kit v3 (Illumina Inc., San Diego, CA) and sequenced by using a paired-end 100-bp read protocol on a HiScanSQ System (Illumina) with a TruSeq SBS Kit v3 (Illumina). The sequencing and the subsequent data processing were carried out at the NGS Core Facility of the Kyoto Prefectural University of Medicine.

RNA-Seq data analyses

As for the NGS data QC and mapping processes, base calling was performed by bcl2fastq version 2.20 (Illumina). Generated fastq files were applied to the FastQC version 0.11.9. As the fastp 0.20.1 program simultaneously performs both the QC analysis for input RNA-Seq data and QC filtering to remove the low quality/too short sequences, we used the fastq files as the QC filtered reads generated by fastp with default parameter setting. After the QC, adaptor sequences of illumina TruSeq contained in the remained reads were trimmed by Trimmomatic-0.39 program, and configured the following options referring to the web site: ILLUMINACLIP: TruSeq. 3-PE-2.fa:2:30:10:2:keepBothReads LEADING:3 TRAILING:3 MINLEN:36.

The reads that passed these filters were aligned to the human reference genome sequence (GRCh38, for details see the Usage Notes section) by using STAR version 2.7.3a program. The alignment process by STAR was performed with the options of ‘–runMode alignReads’, ‘–outSAMtype BAM SortedByCoordinate’, and ‘–quantMode TranscriptomeSAM’. The gene expression analyses were performed by ‘rsem-calculate-expression’ program from RSEM version 1.3.3 after generated indexes for reference genome and GTF files by ‘rsem-prepare-reference’.

All of the following statistical analyses were conducted by using R program version 3.6.3. Principal component analysis (PCA) and regression analysis were performed by using ‘prcomp’ and ‘cor’ function in the default packages of R, respectively. Transcripts Per Million (TPM) data from RSEM results was extracted for 7,887 genes with TPM ≥ 1.0 in all seven samples from all assessed 60,164 genes and transformed to common logarithm values for these tests. The correlation among seven samples was tested by the Spearman’s rank correlation coefficient test, and the resulting correlation matrix was drawn with the ‘ggcorrplot’ library of R. In the heatmap analysis, the values were normalized with ‘zFPKM’ and heatmap analysis was performed with ‘pheatmap’ (both the libraries were obtained from Bioconductor). Marker genes reported to show specific expression in different corneal tissues, such as the epithelium, stroma, and endothelium, were selected as follows: PAX6 and WNT7 were referred from a representative study that performed a functional analysis of the corneal epithelium31; ALDH3A1, CHST6, KERA, and PTGDS were extracted from expression markers of the corneal stroma or keratocytes commonly reported in four articles32,33,34,35; and ATP1A1, TJP1, COL8A1, and SLC4A11 were selected from highly ranked investigated genes related to the corneal endothelium in a comprehensive review article36. The expression data of each gene used in the heatmap were evaluated for the existence of outlier samples by the Smirnov-Grubbs test with the ‘outliers’ library of R.

Data Records

All raw fastq files produced by RNA-Seq were deposited in the DNA Data Bank of Japan (DDBJ) Sequence Read Archive (DRA)37. The expression data set of genes obtained by STAR and RSEM was deposited in the DDBJ Genomic Expression Archive (GEA) with Experiment Accession ID E-GEAD-399. The E-GEAD-399 files contain the information of the Ensembl gene ID and the TPM value of each gene derived from all the samples.

Technical Validation

Quality assessment of total RNA and RNA-Seq data

We obtained sufficient yield (>10 ng) and a RIN value (≥7.0) satisfying the requirements for library preparation (Table 1 and Supplementary Figure 1). The RNA-Seq yielded a number of raw reads derived from seven samples that fell within the range between 38.97 and 59.89 M reads. Regression analysis showed no significant correlation of RNA yield (Supplementary Figure 2a), RIN (Supplementary Figure 2b), or storage period (Supplementary Figure 2c) to the number of raw reads, suggesting that the quality of the produced reads was not affected by the starting RNA and/or the storage condition. All the paired-end reads showed sufficient quality scores after the QC processes (Fig. 1a and Supplementary Figure 3). The filtered reads were mapped to the reference genome within a range between 24.12 and 45.29 M read, relatively less amount of the reads than those of the typical standard read depth for RNA-Seq (~50 M reads/sample), which might be due to the condition(s) of total RNA obtained from the preserved tissues in the eye bank and/or RNA fragmentation resulted in the small library size appeared in some samples (Supplementary Table 2). Overall, the variations in the mapped reads were reduced compared to the raw reads, suggesting that the homogeneity of the RNA-Seq reads among the samples had been improved through the QC processes (Fig. 1b).

Fig. 1

Quality control (QC) results of RNA-Seq data. (a) The distribution of the Phred quality score per base sequence based on FastQC for each of the seven samples (green line) generated by multiQC. The original FastQC plot of each sample is shown as Supplementary Figure 3. The different colors of plot area indicate the ranges of Phred quality score as red (<20), orange (20–28), and green (28<). All the post-QC reads were distributed on the green area showing sufficient quality. (b) Number of fastq reads (1) without filtering (black), (2) surpassing the QC filters (dark gray), and (3) successful in mapping (light gray).

We assessed the homogeneity among the samples by analyzing the correlation of the TPM values of the genes filtered from the RNA-Seq dataset (Supplementary Figure 4 and Supplementary Table 2). The result of PCA showed that all samples were distributed within the narrow range of the first component, where the contribution (92.84%) indicated much higher than that of the second component (2.86%) (Fig. 2a). In addition, the correlation coefficients distributed from 0.87 to 0.96 (Fig. 2b). These results suggested a high correlation of the gene expression pattern among the samples.

Fig. 2

Homogeneity of RNA-Seq data among the samples. PCA (a) and the analysis of correlation (b) among the RNA-Seq data were performed by TPM values from the selected genes. (a) X- and Y-axis shows the principal component 1 (PC1) and PC2 with each contribution rate, respectively. (b) The values shown within the correlation matrix indicate the Spearman’s rank correlation coefficient.

Expression analysis of tissue-specific genes

We evaluated the expression profile of our current RNA-Seq dataset using genes reportedly expressed in each layer of the cornea (i.e., the corneal epithelium, stroma, and endothelium) (Fig. 3). PAX6 and WNT7A, which are expressed in the corneal epithelium31, showed low expression in the endothelium. Most genes with known expression in the stroma32,33,34,35 showed low expression levels in the endothelium, although the Prostaglandin D2 synthase (PTGDS) gene was highly expressed. However, this high expression of PTGDS is consistent with a report indicating that PTGDS is expressed in both the stroma and the endothelium38. By contrast, expression of genes often used as endothelial markers36 was generally high. The expression of Tight Junction Protein 1 (TJP1, also known as ZO-1) was relatively low when compared with other endothelial markers, but this lower expression level was consistent with a previous description of lower expression of TJP1 mRNA compared to TJP1 protein26. The Smirnov-Grubbs test confirmed the absence of outlier samples based on the evaluation of the TPM values from each gene used in the heatmap.

Fig. 3

Expression profile of marker genes selected from each layer of corneal tissue. The heatmap indicates the normalized TPM values for each gene and sample. The expression levels of PAX6 and WNT7A were low in the corneal endothelium. The expression levels of ALDH3A1, CHST6, and KERA were low, while the PTGDS was highly expressed in the endothelium. The expression levels were high for the genes commonly used as endothelial markers.

Taken together, the expression data of this corneal endothelium RNA-Seq dataset is considered to be reliable. It should be useful as a reference database of healthy corneal endothelium tissue for the various expression analyses, as well as for future transcriptome analyses including the expression data of non-coding RNAs.

Usage Notes

The human reference genome sequence (GRCh38) used in STAR alignment process was obtained from Ensembl ( Before aligned, the reference genome was indexed by STAR with ‘--sjdbGTFfile’ option for GTF gene annotation file provided in the same release ( Note that this GTF file contains the annotation of 60,683 genes, and 60,164 genes were applied to analyses in this study after removing the annotations of tRNA and rRNA referred to ‘RepeatMasker’ track from UCSC Genome Browser ( on Human Dec. 2013 (GRCh38/hg38).

Code availability

As described in the Methods section, all of the analyses in this study were performed with the following open-access programs:

QC checking for RNA-Seq data was performed by FastQC version 0.11.9. (

QC results were summarized by multiQC version 1.9. (

QC filtering was performed by using fastp 0.20.1 program with default setting. (

After QC, adaptor sequences were trimmed by Trimmomatic-0.39 program. ( = trimmomatic).

All reads were aligned to the human reference genome sequence by STAR version 2.7.3a program. (

Gene expression analyses were performed by RSEM version 1.3.3. (


  1. 1.

    Weiss, J. S. et al. IC3D Classification of Corneal Dystrophies—Edition 2. Cornea. 34, 117–159 (2015).

    Article  Google Scholar 

  2. 2.

    Joyce, N. C. Proliferative capacity of the corneal endothelium. Prog Retin Eye Res. 22, 359–389 (2003).

    CAS  Article  Google Scholar 

  3. 3.

    Joyce, N. C. Proliferative capacity of corneal endothelial cells. Exp Eye Res. 95, 16–23 (2012).

    CAS  Article  Google Scholar 

  4. 4.

    Tan, D. T., Dart, J. K., Holland, E. J. & Kinoshita, S. Corneal transplantation. Lancet. 379, 1749–1761 (2012).

    Article  Google Scholar 

  5. 5.

    Gain, P. et al. Global Survey of Corneal Transplantation and Eye Banking. JAMA Ophthalmol. 134, 167–173 (2016).

    Article  Google Scholar 

  6. 6.

    Eye Bank Association of America. 2014 Eye Banking Statistical Report (2015).

  7. 7.

    Okumura, N. & Koizumi, N. Regeneration of the Corneal Endothelium. Curr Eye Res. 45, 303–312 (2020).

    Article  Google Scholar 

  8. 8.

    Kinoshita, S. et al. Injection of cultured cells with a ROCK inhibitor for bullous keratopathy. N Engl J Med. 378, 995–1003 (2018).

    CAS  Article  Google Scholar 

  9. 9.

    Hatou, S. & Shimmura, S. Review: corneal endothelial cell derivation methods from ES/iPS cells. Inflamm Regen. 39, 19 (2019).

    Article  Google Scholar 

  10. 10.

    Cheong, Y. K. et al. Identification of cell surface markers glypican-4 and CD200 that differentiate human corneal endothelium from stromal fibroblasts. Invest Ophthalmol Vis Sci. 54, 4538–4547 (2013).

    CAS  Article  Google Scholar 

  11. 11.

    Chng, Z. et al. High throughput gene expression analysis identifies reliable expression markers of human corneal endothelial cells. PLoS One. 8, e67546 (2013).

    ADS  CAS  Article  Google Scholar 

  12. 12.

    Okumura, N. et al. Cell surface markers of functional phenotypic corneal endothelial cells. Invest Ophthalmol Vis Sci. 55, 7610–7618 (2014).

    CAS  Article  Google Scholar 

  13. 13.

    Ueno, M. et al. Gene Signature-Based Development of ELISA Assays for Reproducible Qualification of Cultured Human Corneal Endothelial Cells. Invest Ophthalmol Vis Sci. 57, 4295–4305 (2016).

    CAS  Article  Google Scholar 

  14. 14.

    Ueno, M. et al. Concomitant Evaluation of a Panel of Exosome Proteins and MiRs for Qualification of Cultured Human Corneal Endothelial Cells. Invest Ophthalmol Vis Sci. 57, 4393–4402 (2016).

    CAS  Article  Google Scholar 

  15. 15.

    Yamamoto, A. et al. A physical biomarker of the quality of cultured corneal endothelial cells and of the long-term prognosis of corneal restoration in patients. Nat Biomed Eng. 3, 953–960 (2019).

    CAS  Article  Google Scholar 

  16. 16.

    Baratz, K. H. et al. E2-2 protein and Fuchs’s corneal dystrophy. N Engl J Med. 363, 1016–1024 (2010).

    CAS  Article  Google Scholar 

  17. 17.

    Wieben, E. D. et al. A common trinucleotide repeat expansion within the transcription factor 4 (TCF4, E2-2) gene predicts Fuchs corneal dystrophy. PLoS One. 7, e49083 (2012).

    ADS  CAS  Article  Google Scholar 

  18. 18.

    Hamill, C. E., Schmedt, T. & Jurkunas, U. Fuchs Endothelial Cornea Dystrophy: A Review of the Genetics Behind Disease Development. Semin Ophthalmol. 28, 281–286 (2013).

    Article  Google Scholar 

  19. 19.

    Lorenzetti, D. W. C., Uotila, M. H., Parikh, N. & Kaufman, H. E. Central Cornea Guttata. Am J Ophthalmol. 64, 1155–1158 (1967).

    CAS  Article  Google Scholar 

  20. 20.

    Musch, D. C., Niziol, L. M., Stein, J. D., Kamyar, R. M. & Sugar, A. Prevalence of Corneal Dystrophies in the United States: Estimates from Claims Data. Invest Ophthalmol Vis Sci. 52, 6959–6963 (2011).

    Article  Google Scholar 

  21. 21.

    Du, J. et al. RNA toxicity and missplicing in the common eye disease fuchs endothelial corneal dystrophy. J Biol Chem. 290, 5979–5990 (2015).

    CAS  Article  Google Scholar 

  22. 22.

    Wieben, E. D. et al. Trinucleotide Repeat Expansion in the Transcription Factor 4 (TCF4) Gene Leads to Widespread mRNA Splicing Changes in Fuchs’ Endothelial Corneal Dystrophy. Invest Ophthalmol Vis Sci. 58, 343–352 (2017).

    Article  Google Scholar 

  23. 23.

    Soragni, E. et al. Repeat-Associated Non-ATG (RAN) Translation in Fuchs’ Endothelial Corneal Dystrophy. Invest Ophthalmol Vis Sci. 59, 1888–1896 (2018).

    CAS  Article  Google Scholar 

  24. 24.

    Yoshihara, M. et al. Discovery of Molecular Markers to Discriminate Corneal Endothelial Cells in the Human Body. PLoS One. 10, e0117581 (2015).

    Article  Google Scholar 

  25. 25.

    Chen, Y. et al. Identification of novel molecular markers through transcriptomic analysis in human fetal and adult corneal endothelial cells. Hum Mol Genet. 22, 1271–1279 (2013).

    CAS  Article  Google Scholar 

  26. 26.

    Frausto, R. F., Le, D. J. & Aldave, A. J. Transcriptomic Analysis of Cultured Corneal Endothelial Cells as a Validation for Their Use in Cell Replacement Therapy. Cell Transplant. 25, 1159–1176 (2016).

    Article  Google Scholar 

  27. 27.

    Chung, D. D. et al. Transcriptomic Profiling of Posterior Polymorphous Corneal Dystrophy. Invest Ophthalmol Vis Sci. 58, 3202–3214 (2017).

    CAS  Article  Google Scholar 

  28. 28.

    Chung, D. D. et al. Alterations in GRHL2-OVOL2-ZEB1 axis and aberrant activation of Wnt signaling lead to altered gene transcription in posterior polymorphous corneal dystrophy. Exp Eye Res. 188, 107696 (2019).

    CAS  Article  Google Scholar 

  29. 29.

    Frausto, R. F. et al. Phenotypic and functional characterization of corneal endothelial cells during in vitro expansion. Sci Rep. 10, 7402 (2020).

    ADS  CAS  Article  Google Scholar 

  30. 30.

    Schumacker, S. T., Coppage, K. R. & Enke, R. A. RNA sequencing analysis of the human retina and associated ocular tissues. Sci Data. 7, 199 (2020).

    CAS  Article  Google Scholar 

  31. 31.

    Ouyang, H. et al. WNT7A and PAX6 define corneal epithelium homeostasis and pathogenesis. Nature. 511, 358–361 (2014).

    ADS  CAS  Article  Google Scholar 

  32. 32.

    Du, Y. et al. Secretion and Organization of a Cornea-like Tissue In Vitro by Stem Cells from Human Corneal Stroma. Invest Ophthalmol Vis Sci. 48, 5038–5045 (2007).

    Article  Google Scholar 

  33. 33.

    Chan, A. A. et al. Differentiation of Human Embryonic Stem Cells into Cells with Corneal Keratocyte Phenotype. PLoS One. 8, e56831 (2013).

    ADS  CAS  Article  Google Scholar 

  34. 34.

    Wu, J., Du, Y., Watkins, S. C., Funderburgh, J. L. & Wagner, W. R. The engineering of organized human corneal tissue through the spatial guidance of corneal stromal stem cells. Biomaterials. 33, 1343–1352 (2012).

    CAS  Article  Google Scholar 

  35. 35.

    Che, X. et al. A Novel Tissue-Engineered Corneal Stromal Equivalent Based on Amniotic Membrane and Keratocytes. Invest Ophthalmol Vis Sci. 60, 517–527 (2019).

    CAS  Article  Google Scholar 

  36. 36.

    Van den Bogerd, B. et al. Corneal Endothelial Cells Over the Past Decade: Are We Missing the Mark(er)? Transl Vis Sci Technol. 8, 13 (2019).

    Article  Google Scholar 

  37. 37.

    DNA Data Bank of Japan (2020).

  38. 38.

    Sakai, R. et al. Construction of Human Corneal Endothelial cDNA Library and Identification of Novel Active Genes. Invest Ophthalmol Vis Sci. 43, 1749–1756 (2002).

    PubMed  Google Scholar 

Download references


This work was supported by the Program for the Strategic Research Foundation at Private Universities from MEXT (Koizumi N and Okumura N).

Author information




Y.K. and N.H. prepared samples. Y.T., Y.K., N.H. and M.N. generated RNA-Seq libraries and sequenced at NGS Core Facility of KPUM. Y.T., Y.K. and N.H. performed bioinformatics analyses. Y.T., N.O., M.N. and N.K. wrote the manuscript. N.O., K.T., M.N., and N.K. designed and secured funding for research project. All authors read and approved the final version of the manuscript.

Corresponding author

Correspondence to Masakazu Nakano.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit

The Creative Commons Public Domain Dedication waiver applies to the metadata files associated with this article.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Tokuda, Y., Okumura, N., Komori, Y. et al. Transcriptome dataset of human corneal endothelium based on ribosomal RNA-depleted RNA-Seq data. Sci Data 7, 407 (2020).

Download citation


Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing