Introduction

SHOX (NM_000451.3) is located in the short arm pseudoautosomal region of the sex chromosomes (PAR1), and encodes a homeobox-containing transcription factor that plays a critical role in skeletal growth.1 SHOX haploinsufficiency leads to idiopathic short stature (ISS; OMIM #300582) without skeletal malformations and Leri–Weill dyschondrosteosis (LWD; OMIM #127300) characterized by Madelung deformity.1, 2, 3 Madelung deformity is a cluster of anatomical changes in the wrist including bowing of the radius and dorsal dislocation of the distal ulna, which can result in wrist pain, deformation and/or limited joint mobility.2, 3 Genetic defects underlying SHOX haploinsufficiency include several mutations in the coding region and various copy-number variations (CNVs) in PAR1.2, 4, 5, 6, 7, 8, 9, 10, 11 The ISS/LWD-associated CNVs are predicted to involve SHOX exons and/or cis-acting enhancers. Although the precise positions of the SHOX enhancers remain to be determined, they likely reside within the highly evolutionarily conserved non-coding DNA elements (CNEs) around the gene.10, 11, 12, 13, 14 Previous studies have identified seven CNEs (CNE-5, CNE-3, CNE-2, CNE4, CNE5, evolutionarily conserved region (ECR) 1 and evolutionarily conserved sequence (ECS) 4/CNE9) that exert in vitro or in vivo cis-regulatory activity.10, 11, 12, 13, 14 In vitro assays confirmed physical interaction between these CNEs and the SHOX promoter.10, 15

So far, mutation screening and copy-number analyses of SHOX have been performed mostly for patients of European origin.4, 5, 6, 7, 8, 9, 10, 16, 17, 18, 19, 20 Previous studies indicated that SHOX haploinsufficiency is primarily caused by submicroscopic CNVs in PAR1, and accounts for 2–17% of ISS cases and 35–100% of LWD cases.4, 5, 6, 7, 8, 9, 10, 16, 17, 18, 19, 20 However, the applicability of these findings to non-European populations remains unclear. For example, although a ~47 kb deletion in the SHOX downstream region was frequently identified in European patients,10, 19, 20 it is unknown whether this CNV is shared by patients of other ethnicities. More importantly, there is no report of sequence analysis of the CNEs in the patients with ISS or LWD. Thus, the clinical importance of point mutations in the known CNEs has yet to be studied. To address these unsolved issues, we performed systematic molecular analysis of SHOX in a large cohort of Japanese patients.

Materials and methods

Subjects

The study was approved by the Institutional Review Board Committee at the National Center for Child and Development, and performed after obtaining informed consent from the participants or their parents.

The study group consisted of 328 Japanese patients with short stature (164 males and 164 females; aged 0.5–17.9 years). The patients satisfied the following conditions: (i) referred to our pediatric endocrinology clinics between March 2013 and November 2015 for evaluation of short stature; (ii) short stature with standard deviation scores of <−2.0; (iii) no chronic diseases, such as growth hormone deficiency, congenital heart disease, achondroplasia or thyroid disease that may affect growth; and (iv) lack of cytogenetically detectable chromosomal abnormalities. The patients underwent radiological examinations of the hand. We examined the presence or absence of Madelung deformity-compatible features including narrowing of the ulnar portion of the distal radial physis, anterior bowing of the radial shaft and dorsal subluxation of the ulnar head.3 Of the 328 patients, 16 with radiologically recognizable Madelung deformity were diagnosed with LWD, whereas the remaining 312 were diagnosed with ISS. Four patients were included from our previous study.21 As controls, we used genomic DNA samples obtained from 100 healthy Japanese adults with normal height (50 males and 50 females).

Copy-number analysis

All patients were subjected to copy-number analysis. Genomic DNA was extracted from peripheral leukocytes. CNVs in PAR1 were analyzed by multiplex ligation-dependent probe amplification using a commercially available kit (SALSA P018-G1, MRC-Holland, Amsterdam, Netherlands) and further characterized by array-based comparative genomic hybridization using a custom-made microarray (8 × 60 k format, Agilent Technologies, Santa Clara, CA, USA). To exclude non-pathogenic variations, we referred to the Database of Genomic Variants (http://projects.tcag.ca/variation/).

Sequence analysis

Of the 312 patients with no pathogenic CNVs, 309 (300 ISS and 9 LWD) were subjected to sequence analysis of SHOX exons; 3 patients were excluded from this analysis because of insufficient amounts of DNA samples. We also carried out sequence analysis of the 7 CNEs for 83 patients (76 ISS and 7 LWD) whose DNA samples were sufficient for this experiment.

DNA fragments corresponding to the target regions were amplified by multiplex-PCR or by using the Haloplex system (Agilent Technologies, Santa Clara, CA, USA) and sequenced on a MiSeq next-generation sequencer (Illumina, San Diego, CA, USA). The highly polymorphic oligonucleotide repeats in the CNEs were excluded from sequence analysis. Nucleotide changes indicated by next-generation sequencing were confirmed by the Sanger method. Primer sequences are available on request. Nucleotide substitutions whose allelic frequency in the databases (dbSNP, http://www.ncbi.nlm.nih.gov/snp/; or 1000 Genome Browser, http://www.ncbi.nlm.nih.gov/variation/tools/1000genomes/) was more than 0.05 were excluded as common polymorphisms.

The functional outcomes of missense substitutions were predicted by in silico analysis using PolyPhen-2 (http://genetics.bwh.harvard.edu/pph2/) and SIFT (http://sift.jcvi.org/). To examine the population frequency of exonic substitutions, we referred to the Exome Aggregation Consortium Browser (http://exac.broadinstitute.org/) and the Human Genetic Variation Browser (http://www.genome.med.kyoto-u.ac.jp/SnpDB). To evaluate the population frequency of novel substitutions, we analyzed DNA samples of control individuals.

To assess the functional significance of nucleotide changes in the CNEs, we examined whether these substitutions were located within the putative enhancer sequences. We referred to ENCODE Broad Chromatin State Segmentation in the UCSC browser (http://genome.ucsc.edu/). The evolutionary conservation of the affected nucleotides was analyzed using the UCSC browser.

Results

Copy-number analysis

Submicroscopic CNVs in PAR1 were identified in 16 cases (cases 1–16; Figure 1 and Supplementary Figures 1 and 2). These CNVs consisted of 11 deletions and 5 duplications. The deletion in case 1 extended beyond PAR1, whereas CNVs in the remaining cases were located within PAR1. The deletion in case 9 affected a genomic interval downstream of the known CNEs. The ~47 kb deletion common in European patients,10, 19, 20 was absent in our cases. The breakpoints of the CNVs in cases 2, 3, 5, 12 and 14 were located within or close to repetitive sequences (Supplementary Figure 2).

Figure 1
figure 1

Copy-number variations (CNVs) identified in cases 1–16. The upper panel shows the names of multiplex ligation-dependent probe amplification (MLPA) probes. Probes of the SHOX exons are shaded in purple, and those of the putative enhancer regions, that is, highly evolutionarily conserved non-coding DNA elements (CNEs), evolutionally conserved region (ECR), or evolutionally conserved sequence (ECS), are shaded in blue or yellow. Probes in the pseudoautosomal region 1 are indicated with an arrow. The green and red lines depict copy-number loss and gain, respectively. The dotted lines indicate the copy-number unknown regions. p-ter: short arm telomere; cen, centromere.

Sequence analysis

Sequence analysis of SHOX identified eight intragenic substitutions in eight cases (cases 17–24; Table 1 and Figure 2). Of these, five were missense substitutions, and three were silent (synonymous) changes. These eight substitutions were absent or extremely rare in the general population. Four of the five missense substitutions were assessed as ‘probably damaging’, whereas p.Ser46Phe in case 21 was scored as ‘benign’. The substitutions in cases 17–19 were located within the homeobox.

Table 1 Nucleotide alterations identified in this study
Figure 2
figure 2

Nucleotide substitutions identified in cases 17–39. (a) Genomic structure of SHOX and its flanking region. Positions refer to the Human Genome (hg 19; NCBI Build 37). The white and purple boxes indicate the untranslated and translated exons, respectively. The striped boxes depict the homeobox. The yellow and light blue boxes denote the putative upstream and downstream enhancer regions, respectively. Probably damaging mutations and benign/silent substitutions are indicated by dark blue and green arrows, respectively. Substitutions in the putative enhancer regions are depicted by light blue arrows. (b) Electrochromatographs of the substitutions. Mutated nucleotides are indicated by arrows. Substitutions indicated by a, b, and c were identified in multiple patients. p-ter: short arm telomere; cen, centromere.

Sequence analysis of the known CNEs detected 11 substitutions in 15 cases (cases 25–39; Table 1 and Figure 2). Of these, five were rare polymorphisms, and the others were first identified herein. Most of the novel substitutions were shared by the control individuals. Substitutions in CNE-5 and CNE5 resided within putative enhancer sequences (Supplementary Figure 3). Most of the altered nucleotides were not highly conserved among species (Supplementary Figure 4).

Clinical features of patients with SHOX abnormalities

Cases 1–16 with CNVs comprised ten ISS and six LWD patients, whereas cases 17–20 with probably damaging mutations included two ISS and two LWD patients (Table 2). Thus, apparent SHOX abnormalities accounted for 12 of 312 (3.8%) ISS patients and eight of 16 (50%) LWD patients. Cases with benign or silent substitutions (cases 21–24) and CNE-linked substitutions (cases 25–39) were all ISS.

Table 2 Clinical and molecular findings of patients with SHOX abnormalities

Discussion

Systematic molecular analysis identified probably pathogenic SHOX abnormalities in 3.8 and 50% of Japanese ISS and LWD cases, respectively. These results indicate that the frequency of SHOX haploinsufficiency is comparable between Japanese and European patients. On the other hand, none of our patients carried the well-known ~47 kb downstream deletion,10, 19, 20 indicating that this CNV is a European-specific founder mutation. The lack of SHOX abnormalities in seven LWD patients provides further evidence for the allelic heterogeneity of LWD. As mutations in NPR2, a causative gene of Maroteaux-type acromesomelic dysplasia, have recently been identified in patients with LWD-compatible clinical features,22 such mutations may be present in some of our patients with normal SHOX.

Submicroscopic CNVs in PAR1 were identified in cases 1–16. Our results highlight the importance of PAR1-linked CNVs as a cause of SHOX haploinsufficiency. Furthermore, the results for case 9 support the hypothesis that a hitherto unidentified SHOX enhancer resides within the ~500 kb region ~300 kb downstream of the exons.23, 24 Likewise, the results of case 16 are consistent with the recently proposed notion that the duplications in the downstream region of the known CNEs can underlie ISS and LWD.25 The high frequency and heterogeneity of CNVs in cases 1–16 may reflect the genomic instability of PAR1; the recombination rate of PAR1 during male meiosis is ~17-times higher than the genomic average.26 Indeed, the breakpoints of cases 2, 3, 5, 12 and 14 were located close to or within repetitive sequences, which provide substrates for non-allelic homologous recombination.27 While the breakpoints of cases 1–16 were widely distributed in PAR1 and relatively frequent in the genomic regions adjacent to SHOX, this is consistent with the occurrence of male-specific crossover throughout PAR1 with hotspots in the ~0.8 Mb genomic interval around SHOX.26

Probably damaging intragenic mutations were detected in cases 17–20. These data confirm the results of the European studies that the pathogenic point mutations in SHOX are widely distributed in the coding region and relatively common in the homeobox.2, 28 On the other hand, the pathogenicity of four benign/silent substitutions in SHOX exons and ten substitutions in the CNEs remains unknown. As three of the ten substitutions in the CNEs resided within putative enhancer sequences, these substitutions may disturb SHOX expression. However, most of the affected nucleotides were not highly conserved among species, raising a question of the functional significance of the substitutions. Furthermore, most of these substitutions were shared by control individuals. Our data imply that the point mutations in the known CNEs account for only a minor fraction of the etiology of SHOX haploinsufficiency, if any. As additional SHOX enhancers are likely to reside in PAR1,23, 24 sequence analysis of these novel enhancer regions is needed.

In summary, our results indicate that the frequency and mutation spectrum of SHOX abnormalities are comparable between Asian and European patients, with the exception of a European-specific downstream deletion. Furthermore, this study highlights the clinical significance and genetic heterogeneity of PAR1-linked CNVs, and implies a limited role of CNE-linked mutations in the etiology of ISS and LWD.