Main

Height is a highly variable polygenic trait with an estimated 80–90% of the variation explained by a genetic component (1,2). The variability has a normal distribution in the population with the extremes that go far beyond the normal variation, considering “normal” a height comprised between −2 and +2 SD. Short stature is the condition characterized by a height lower than 2 SD below the mean for a given age, sex, and population group and represents the most common reason for pediatric consultation. Idiopathic short stature (ISS) describes a heterogeneous group of phenotypes consisting of unidentified causes of short stature, presenting the following common criteria: normal size for gestational age at birth, normal body proportions, no evidence of pituitary hormone deficiency, no evidence of chronic organic disease, no psychiatric disease or severe emotional disturbance and normal food intake. According to this definition, it is estimated that approximately 80% of all children referred for short stature will be labeled as ISS at the end of the diagnostic work-up (3,4).

Mutations and deletions in the short stature homeobox gene (SHOX) located within the pseudoautosomal region 1 (PAR1) and representing the best known genetic cause of ISS have been detected in the 7–15% of ISS patients (5,6). However in only a minority of the ISS patients, a genetic defect has been identified.

A contribution to the understanding of the genetic etiology of short stature come from genome-wide association studies (GWAS). In the past years, several GWAS identified hundreds of common variants contributing to height determination in the general population. The most recent and comprehensive GWAS was reported by Visscher et al. (7) that analyzed the combined results of 79 GWAS using data from 253,288 individuals. They showed that less than 10,000 single-nucleotide polymorphisms (SNPs) can account for 36% of height and identified 697 SNPs clustered in 423 loci reaching a genome wide significance. Chan et al. (8) observed that the lower extreme of the phenotype distribution was shorter than what expected based on the polygenic model. Different models might explain this observation. For example, deprivation, malnutrition, comorbidity with other disorders or other environmental conditions might disturb the polygenic inheritance at this extreme of the curve. Moreover high penetrance rare variants might account for at least a portion of the extreme phenotypes whereas another part of them might be determined by a combination of quite common variants in susceptibility genes.

Loci that have been replicated across several GWAS in different populations likely represent the major genetic determinants influencing height. Common variants within and close the 3’UTR of the HMGA2 (NM_003483) have been repeatedly associated with childhood and adult height determination in multiple populations from different geographical areas showing the strongest association (<10−10) in several studies (9,10,11,12). These polymorphisms might also play a role in the susceptibility to ISS in patients that do not carry high penetrance mutations.

A role of HMGA2 in ISS is strongly supported by the observation that rare microdeletions on 12q14 including HMGA2 in the critical region have been detected in patients with short stature as a common feature (13,14). Interestingly, a submicroscopic intragenic deletion disrupting only HMGA2 has been identified in a boy with idiopathic proportionate short stature, without any other abnormalities (15). Moreover, loss-of-function hmga2 mutations in mice resulted in the pygmy phenotype, characterized by extremely decreased fat levels and pre- and postnatal growth failure (16)..

The high-mobility group A (HMGA) family comprises several proteins: HMGA1a, HMGA1b, HMGA1c produced by the alternative splicing of the HMGA1 gene, and HMGA2 encoded by the HMGA2 gene. HMGAs, expressed mainly during embryonic development, act exclusively as architectural proteins by binding the minor groove of AT-rich DNA sequences. They have no transcriptional activity per se, but they are able to alter the chromatin structure by interacting with the transcription machinery, and thus they can negatively or positively regulate the transcriptional activity of several genes (17).

In order to assess the role of HGMA2 in the determination of ISS, we compared the frequency of three HGMA2 SNPs in a group of patients and normal stature controls of Italian origin. We also performed a mutation analysis of the HMGA2 gene to identify rare variants with high effect that might be related to monogenic forms of short stature.

Results

Association Study

The SNPs rs7968682, rs7968902, and rs1042725 were genotyped in the 155 ISS patients and 318 controls. These SNPs were selected as their contribution to height determination was replicated across different GWAS (9,10,11,12). Allele and genotype frequencies were consistent with those expected from Hardy-Weinberg equilibrium.

The frequency of the rs7968682T allele was significantly higher in ISS patients than in controls (P = 0.012, OR: 1.44; Table 1 , allelic model). A similar result was observed when adding sex as covariate to the logistic regression analysis (P = 0.017, OR = 1.41). Considering a recessive model, an OR value >1 was observed for genotypes rs7968682 TT (OR = 1.72, P = 0.00622, Table 1 ) and rs1042725 TT (OR = 1.51, P = 0.037).

Table 1 Association analysis on the three SNPs (n = 155 ISS, 318 HC)

Stratifying the analysis for gender both rs7968682 and rs1042725 were significantly associated in the female group (rs7968682 TT: OR = 2.01, P = 0.0163, T: OR = 1.71, P = 0.013; rs1042725: TT OR 1.77 P = 0.05 T: OR = 1.65; P = 0.020) whereas in the males only a nonsignificant trend was observed (rs7968682 TT: OR = 1.48 T: OR = 1.24; rs1042725 TT OR 1.31 T: OR = 1.0.7; all these values were not significant).

Due to linkage disequilibrium (D’ ranging from 0.82 to 0.98 and r2 from 0.57 to 0.78), the three SNPs generated only six different haplotypes ( Table 2 ). Only two rare combinations, namely CGG and TTT, showed a different distribution in patients and controls. The common haplotype carrying the two associated variations (namely TGT) although present with higher frequency in patients, showed only a borderline statistical significant difference (P = 0.055) between patients and controls. Thus, none of the allele combinations allowed to better define an associated haplotype and the rs7968682T remains the most associated variant.

Table 2 Haplotype association

Mutation Screening

All the exons and the splicing junctions of HMGA2 were sequenced in the 155 short stature patients in order to find causative mutations. No mutation was identified. Already known polymorphisms, all located outside the coding regions, were detected with a frequency similar to that reported in public database (data not shown). In one patient, a synonymous heterozygous exon variation, namely Ser105Ser (exon 5, rs201860923) was identified, already reported in the Exome Variant Server, with a frequency of 0.07% (Broad Institute of MIT and Harvard, Cambridge, MA). In order to see whether it fell in unknown putative splicing regulatory elements, an in silico analysis was performed by using ESEfinder 3.0 (Cold Spring Harbor Laboratory, NY) and Spliceview (CNR-ITB Milan, Italy). This variation did not alter the score of putative exonic splicing enhancer sequences compared to the threshold value and did not create any novel splice site.

The presence of deletions and duplications of the single exons and the whole HMGA2 gene region was investigated through multiplex ligation-dependent probe amplification. None of the patients carried copy number variations at this locus.

Discussion

HMGA2 polymorphisms at the 3’UTR have been found to influence variation in human height. Although the SNPs were not always the same, the effect of the associated alleles was replicated in the same direction in different populations. In Japanese, for example, Takeshita et al. (18) reported the association with rs7968902: homozygotes for the T-allele were on average shorter (about 6 ± 1.2 cm) than homozygotes for the G-allele. In Caucasians and in Chinese, the association with rs1042725 was confirmed (Yang et al. (19)) in the same direction with each copy of the rs1042725C allele estimated to be associated with an increase in height of 0.6 cm. An association study conducted in tall individuals (20) showed that the rs1042725C allele was significantly associated with tall stature: individuals carrying this variant yield a significantly increased odd of being tall (OR = 1.53, 95% CI: 1.02–2.28; P = 0.03).

Altogether these studies identified rs1042725C, rs7968902T, and rs7968682G as “height increasing” alleles and rs1042725T, rs7968902G, and rs7968682T as “height decreasing” alleles in the general population. These findings encouraged us to assess whether these SNPs influenced the susceptibility to ISS, by comparing their frequency in short stature patients and controls. The increased OR of 1.72 and 1.51 observed for ISS subjects carrying the rs7968682TT and rs1042725TT genotypes, respectively, suggested a recessive effect of these polymorphisms ( Table 1 ). Although the effect of these variants is stronger and significant in the females, there is no sufficient statistic power to assess that these SNPs confer a gender-specific risk).

It is difficult to assess which SNP is directly responsible for the association because of the strong linkage disequilibrium between the two SNPs (0.57 < r2 < 0.79).

The 3 HMGA2 SNPs are located within and close to the HMGA2 3′-UTR that contains multiple binding sites for the tumor suppressor microRNA let-7 (21,22). None of the SNP falls within a predicted miRNA binding site but it has been recently shown that the sequence context around the miRNA site is also important, as RNA secondary structure and RNA binding protein can affect miRNA function (23). The rs1042725 (localized at chr12:66,358,347), located in the proximity (13 bp apart) of a let-7 putative binding site (predicted by Targetscan 6.2, Whitehead Institute for Biomedical Research, Cambridge, MA) might directly or indirectly influence the binding of a miRNA although this hypothesis can be demonstrated only through a functional test.

Deletions in disorders characterized by growth failure and evidence in mice (16) strongly suggested an involvement of HMGA2 high-penetrance mutations in ISS. In a previous study conducted on isolated growth hormone deficiency patients, only an intron variation in one patient out of 105 was identified (24). The present work is the first reported mutation screening performed on HMGA2 in ISS and contrarily to our expectation neither mutations nor copy number variation were identified thus suggesting that HMGA2 has not a significant impact on the monogenic form of short stature.

In conclusion, our findings strengthen the role of the HMGA2 polymorphisms in the determination of height and show that specific genotypes (rs7968682TT and/or rs1042725TT) contribute to the multifactorial forms of ISS.

Methods

ISS Subjects

One hundred and fifty-five unrelated individuals (82 males and 73 females) with a median age at diagnosis of 10.9 y (5.1–17.8) were recruited by the Unit of Paediatrics of the Department of Health Sciences of Novara over a period of 6 y and were included in the genetic study. The subjects were referred to the clinical center because they had a height ≤ −2 SD score (SDS) or a height ≤1.8 SDS in combination with a height velocity over 1 y < −2 SDS using the criteria of Tanner-Whitehouse (25). The mean height was −2.43 ± 0.51 SDS ranging from −4.6 and −1.8 SDS. The growth velocity was −1.4 ± 2.1 SDS.

Skeletal maturation was estimated as bone age (radius, ulna, and short bone) with the TW2 method by a pediatric endocrinologist (26). The mean delay in bone age relative to chronological age was 2.57 ± 1.86 y.

They were all evaluated for GH serum levels after two provocative tests (with arginine or clonidine or insulin or glucagon or with GHRH+arginine). None of the patients included in this study showed GH deficiency. In all the subjects, deletions/duplications and point mutations in the SHOX gene were previously excluded as the cause of short stature during routine genetic testing.

Three hundred and eighteen normal stature healthy pediatric individuals (174 males and 144 females) matched for age and sex not tested for GH secretion levels were included in the association study as controls. A written informed consent was obtained from the patient’s and control’s parents, as they were all aged less than 18. The study was approved by the Institutional Review Board (Review Board of “Azienda Ospedaliera Universitaria Maggiore della Carità”).

Genotyping

Genomic DNA of short stature patients and controls was extracted from the blood samples by standard salting-out method. SNP genotyping was performed with Taqman technology and StepOnePlus Real-Time PCR System (Applied Biosystems, Foster City, CA) to study the three SNPs rs1042725, rs7968682, and rs7968902, in ISS patients and normal stature matched controls.

Sequence Analysis

The DNA was amplified by polymerase chain reaction (PCR) using primers designed to specifically amplify the coding region and the intron/exon boundaries of the gene (Supplementary Table S1 online), using the software Primer3 Input (Version 0.4.0, Whitehead Institute for Biomedical Research). The primer pair specificity was controlled with the Basic local Alignment Search Tool (National Centre for Biotechnology Information, Bethesda, MD). The PCR reaction was carried out with the GoTaq Flexi DNA polymerase (Promega Corporation Fitchburg, Wisconsin) in a 15-µl reaction volume, at an annealing temperature of 58 °C for exons 2 and 4 and 62 °C for exons 1,3, and 5.

The PCR products were visualized on a 2% agarose gel and purified using ExoSAP-IT (Affymetrix, Santa Clara, CA) enzymatic PCR clean up system. The purified products were then sequenced with the Big Dye Terminator kit (Applied Biosystems) and the automatic sequencer ABI PRISM 3100 Genetic Analyzer (Applied Biosystems).

Multiplex Ligation-Dependent Probe Amplification

Multiplex ligation-dependent probe amplification for all exons of HMGA2 gene was performed by means of a commercial kit according to the manufacturer’s protocol (SALSA MLPA P323 CDK4-HMGA2-MDM2, MRC Holland, Amsterdam, The Netherlands).

Statistical Analysis

The analysis set consisted of the mentioned 155 patients and 318 healthy controls all fully genotyped for the three SNPs and successfully sequenced for the 5 exons of the gene. Basic association analysis with no covariates added was performed for allelic, dominant, and recessive models with PLINK 1.9 software (Center for Human Genetic Research, Massachusetts General Hospital, and the Broad Institute of Harvard & MIT). Dominant and recessive models were calculated on the basis of genotypic counts: homozygous for the risk allele vs. the other two classes pooled for the recessive model, homozygous for the risk allele plus heterozygous vs. homozygous for the protective allele for the dominant model.

Linkage disequilibrium between the three SNPs was calculated on the whole Italian sample set (both patients and controls). Linkage disequilibrium calculation and haplotype frequencies determination was performed with the Haploview software (Center for Human Genetic Research, Massachusetts General Hospital, and the Broad Institute of Harvard & MIT). Basic association analysis at the haplotype level was performed with Haploview.

Statement of Financial Support

This work was supported by Cariplo Foundation, Milano Italy. I.F, D.B., and R.M. were PhD fellow of Dottorato di Ricerca in Medicina Molecolare supported by University of Eastern Piedmont, Novara, Italy

Disclosure

None of the authors declare any conflict of interest.