Introduction

Developmental dyslexia (DD), also known as reading disability, is characterized by unexpected difficulties in reading and spelling without apparent intellectual and neurological impairment.1 The prevalence of reading disability among children and adolescents ranges from 5 to 17.5%.2 Notably, it was more prevalent among children with nonsyndromic cleft lip with or without cleft palate (NSCL/P), a common birth defect characterized by isolated orofacial cleft with unknown but complex etiology.3 The rate of moderate and severe degree of reading disability were reported to be 35 and 17%, respectively, in a group of NSCL/P children.4

Although DD and NSCL/P are apparently different, they might have potential common etiologic pathways. Compared with age-matched non cleft controls, children with NSCL/P scored significantly lower in a series of reading measurements.5, 6 It has been suggested that DD and NSCL/P shared similar neurological endophenotypes. Short-term verbal memory is the ability to keep verbal information in conscious awareness for brief periods of time.7 Deficiency in short-term verbal memory was known to be an important endophenotype of DD.7 Among children with NSCL/P, reading disability were also featured by short-term verbal memory defects which fitted into a classic model of DD.8, 9 Therefore, there might be some common etiologies underlying DD and NSCL/P. Neuroimaging studies indicated a neurological basis of DD could be explained by aberrant structure and function of reading and language networks throughout the left hemisphere.10 Consistent with this view, the most severely affected brain region among NSCL/P patients was reported to be the left temporal lobe in which observed structural abnormalities were directly related to cognitive dysfunctions.11

In addition to neurological findings, DD and NSCL/P might have similar etiologies at genetic level as well. Both DD and NSCL/P are complex developmental disorders with genetic and non-genetic components to their perspective etiologies. Current genetic understanding of DD is mainly based on linkage analysis and candidate gene association study. Linkage studies of DD identified nine DD susceptibility loci termed DYX1 to DYX9. Further refinement of these loci reported a number of candidate genes associated with DD. The well recognized candidate genes of DD includes DYX1C1 in DYX1, DCDC2 and KIAA0319 in DYX2 and ROBO1 in DYX5.12, 13, 14, 15 In recent years, some of these associated genes have been replicated in different populations through association studies.16 Meanwhile, genetic research of NSCL/P has recently identified multiple genes associated with disease susceptibility through genome-wide association study (GWAS). Previous GWAS in western populations identified a number of representative single nucleotide polymorphisms (SNPs) strongly identified as NSCL/P susceptibility loci.17, 18, 19, 20 Given substantial differences in genetic backgrounds, their exact contribution to risk in Chinese populations remains unknown. Recently, the first GWAS of NSCL/P in Chinese reported several risk SNPs of NSCL/P.21 In particular, rs8049367 located in 16p13.3 was found to be a risk variant exclusive to Chinese.21 We also confirmed variants within previously reported susceptibility loci including rs4791774 located in 17p13.1 and rs2235371 located in 1q32.2.21

Given the substantial genetic components underlying DD and NSCL/P, we aimed to explore observed linkage between these two disorders from genetic perspective. In the present study, we selected NSCL/P susceptibility SNPs from our recent GWAS report and evaluated their contribution to DD in a Chinese case–control study. Our study provided additional evidences to justify underlying genetic etiologies shared by DD and NSCL/P.

Materials and methods

Subjects

The criteria for participants with DD and healthy individuals were described elsewhere previously.22 Briefly, DD screening underwent the two-stage procedures. First, primary school students aged between 7 and 13 from Shandong province of China were subjected to a Chinese reading test consisting of character-, word- and sentence-level questions. The participants whose reading scores were above 87th percentile or below the 13th percentile among all students in the same grade were chosen for further evaluation. These participants were subjected to a character reading test composed of 300 Chinese characters individually for the assessment of reading ability. Then the Raven’s standard test was performed to exclude individuals with intelligence deficiency. Finally, a total of 631 children, 288 dyslexic participants and 343 controls, were selected for subsequent analysis. This study was approved by the ethical committee of Tsinghua University School of Medicine. All participants were informed with written consent.

Candidate SNPs genotyping

Three SNPs reported to be strongly associated with NSCL/P, rs2235371, rs8049367 and rs4791774, were genotyped on Sequenom MassARRAY platform (Sequenom, San Diego, CA, USA) at CapitalBio Corporation (Beijing, China). Genomic DNA samples were extracted from saliva samples using Oragene DNA self-collection kit (DNA Genotek, Ottawa, Ontario, Canada) and DNA quantity was determined by Nanodrop spectrophotometry (Nanodrop 1000 Spectrophotometer, Thermo Scientific, Wilmington, DE, USA). A PCR reaction based on a locus-specific primer extension reaction was designed using the MassARRAY Assay Design software package (v3.1, Sequenom, San Diego, CA, USA). MALDI-TOF mass spectrometer (Sequenom) and MassARRAY Type 4.0 software (Sequenom) were used for mass determination and data acquisition.

Data analysis

Statistical analysis was undertaken using PLINK software version 1.9 (http://pngu.mgh.harvard.edu/~purcell/plink/), which is an open-source whole-genome association analysis toolset and is commonly used to perform a range of basic, large-scale analyses.23 Hardy–Weinberg equilibrium tests were undertaken for each SNP, and association tests were performed using additive, dominant or recessive genetic models under a logistic regression model with covariates of age and sex. Bonferroni corrections for multiple testing were used to determine statistical significance.

Results

Basic characteristics of research participants are presented in Table 1. There were no differences of age distribution between DD cases and controls. The proportion of male in DD case group was higher than control groups. The effects of gender on DD were indicated in Supplementary Table S1. Three SNPs were genotyped in current study. In our sample, the allele T of rs2235371 was less frequent in cases than that in controls. The allele T of rs8049367 and the allele G of rs4791774 were more frequent in cases than that in controls.

Table 1 Basic characteristics of participants

SNP rs2235371 was not significantly associated with DD under allelic (P=0.1905, odd ratio (OR)=0.8569), additive (P=0.1780, OR=0.8487) and recessive (P=0.5807, OR=1.1470) models, except dominant model (P=0.0293, OR=0.6998). However, with the adjustment of age and sex, the association under dominant model became nonsignificant (P=0.1659). SNP rs8049367 was significantly associated with DD under additive (P=0.0085, OR=1.4100), dominant (P=0.0256, OR=1.4960) and recessive (P=0.0374, OR=1.7280) model, after adjustment for age and sex. SNP rs4791774 was not significantly associated with DD under all models (Table 2).

Table 2 Association analysis of NSCL/P SNPs with DD

After the Bonferonni correction for multiple comparisons, only SNP rs8049367 (OR=1.4100, 95% CI=1.0920–1.8220) on chromosome 17 was significantly associated with DD under additive models, indicating rs8049367 is a potential SNP marker for DD.

Discussion

In total, three susceptibility SNPs of NSCL/P were evaluated for their contribution to risk to DD. These SNPs were initially reported in a recent GWAS report as novel risk variants for NSCL/P specific to Chinese populations.21 In the present study, the T allele of rs8049367 showed 41% increased risk of DD, therefore rs8049367 appears to be susceptibility SNP for DD as well. However, the observed association between rs8049367 and DD should be interpreted with caution as this SNP is an intergenic variant located 50 kb upstream of CREBBP and 32 kb downstream of ADCY9 with unknown function. The CREBBP gene encodes a nuclear protein that binds to cAMP response element binding protein (CREB) and acts a transcriptional coactivator with histone acetyltransferase activity.24, 25 Mutations in CREBBP gene are known to cause Rubinstein–Taybi syndrome, which is a genetic disorder characterized by developmental delay and abnormalities as well as intellectual disabilities.26 In addition, the CREBBP gene was implicated in memory consolidation.27, 28 Knock-out of CREBBP in mice resulted in permanent impairments on both long- and short-term memory.29 Meanwhile, upregulation of CREBBP-mediated transcriptional activity improved both long- and short-term memory.30 Short-term verbal memory has a critical role in phonological processing, comprehension, and other reading-related processes. Previous studies of DD frequently observed impaired short-term verbal memory among dyslexics. In a recent study, Chinese dyslexics showed reduced short-term verbal memory during recall of Chinese characters than age-matched normal readers.31 Given reported implication of CREBBP in memory impairment as well as developmental and intellectual abnormalities, understanding functional effects of rs8049367 on CREBBP in future studies might explain its contribution to DD.

There were considerable amounts of NSCL/P patients showed reduced reading abilities.4 Previous observed reading disabilities among NSCL/P were also featured by impaired short-term verbal memory which suggested underlying etiologies shared by DD and NSCL/P.8 As rs8049367 has been associated with NSCL/P in the recent GWAS, our preliminary results indicated rs8049367 as a shared risk factor for both disorders. During palate development, expression of CREBBP was constant. The CREBBP-mediated transcription is likely regulated by changes in its phosphorylation rather than by changes in its expression.32 Meanwhile, expression of ADCY9 was upregulated in the dental pulp stem cultures of NSCL/P patients.21 Therefore, rs8049367 might contribute to DD and NSCL/P through its pleiotropic effects on CREBBP and ADCY9, respectively. Indeed, rs8049367 was in strong LD with an exon variant of a long non-coding RNA (lncRNA), namely RP11–462G12.2, which was predicted to interact with both CREBBP and ADCY9.21 Thus, future investigation on lncRNA RP11–462G12.2 might contribute to explain the assumed pleiotropic effects of rs8049367.

In conclusion, we performed association study of three NSCL/P susceptibility SNPs with DD in a Chinese population and observed significant association of rs8049367. This intergenic variant might have pleiotropic effects on its two nearby genes CERBBP and ADCY9. Further functional investigations are warranted to validate the assumed pleiotropic effects which might explain the shared etiology underlying DD and NSCL/P.