INTRODUCTION

Recent progress in elucidating the genetic contributions to disease and molecular etiology of clinical phenotypes presents opportunities to further subclassify human disease traits such as scoliosis according to their underlying genetic etiologies. Such genetic/genomic subclassification can result in a virtuous cycle from bench to bedside to better understand the biological perturbations underlying disease traits and pathophysiological bases of disease.1,2 To further increase the specificity of molecular diagnosis and the clinical application of genetic testing and clinical genomics, precise correlations between genotype and phenotype, allelic series, and the clinical consequences of combinations of biallelic variants at a locus need to be established.3,4 The recent report of a compound inheritance model in congenital scoliosis (CS) provides a genetic biomarker with which to potentially clinically subclassify scoliosis and define a specific clinical disease entity (TACS, TBX6-associated CS) that is associated with a particular combination of variant alleles—biallelic variants consisting of a loss-of-function (LoF) TBX6 lesion and a common risk hypomorphic allele in trans—as an underlying genetic etiology and further explore the hypothesized TBX6 gene dosage effect.5

Congenital scoliosis is a form of spinal curvature that can be caused by vertebral malformations potentially resulting from defects of formation, defects of segmentation, or a combination of the two.6 The prevalence of CS is approximately 0.5–1 per 1000 live births.7 As a major contributor to childhood and adolescent disability, CS affects patients’ lives and activities of daily living both physically and psychologically.8 CS can arise from developmental spine defects that result from perturbations in somitogenesis.7 Successful somitogenesis requires the spatial and temporal regulation of a complicated gene interaction network in which TBX6 plays an essential role. Our previous work elucidated that the TBX6 compound inheritance model accounts for approximately 7.9–10.6% of sporadic CS in the Chinese population.5 Similar analyses of independent Japanese and European CS cohorts, in multiple patients from different world populations and genetic backgrounds, supported the proposed TBX6 compound inheritance model.9,10

We previously showed the CS phenotype could result from compound inheritance,5 consisting of null variant alleles at the TBX6 locus, either a deletion copy-number variant (CNV) of the 16p11.2 region11,12 or LoF induced by single-nucleotide variants (SNVs) and indels in TBX6, which maps within the 16p11.2 region.13 With the delineation of the hypothesis of the compound inheritance and TBX6 gene dosage model,5 we were motivated to more precisely characterize the genotype-phenotype relationships of the specific genotypic combination of alleles at the TBX6 locus observed in CS cases with TBX6 variant alleles (i.e., the compound inheritance model) and the clinically observed endophenotypes characterizing the CS. In addition, we sought to explore the compound inheritance and gene dosage hypothesis in both humans and mice. Moreover, further investigations are required to test the potential utility of molecular diagnosis and clinical genomics in the precision medicine–directed clinical evaluation and management of CS.

MATERIALS AND METHODS

Participants

We recruited two independent cohorts and a multicenter case series from China, Japan, and the United States. The discovery set (cohort 1) consists of unrelated sporadic CS patients of Chinese Han descent in the DISCO (Deciphering disorders Involving Scoliosis and COmorbidities, http://discostudy.org/) study from Peking Union Medical College Hospital (PUMCH) between October 2010 and January 2016, in which clinical diagnoses of CS were confirmed by radiological imaging.5 The first replication cohort (cohort 2) comprised Japanese patients with CS recruited from Japan, in compliance with the selection criteria as reported previously.10 The second replication case series (cohort 3) comprised patients with 16p11.2 deletion/TBX6 LoF variants collected from the United States. Genomic DNA was extracted from the peripheral blood or saliva samples. Informed consent was obtained from each participant or the corresponding guardian. The study was approved by the institutional review boards of PUMCH, RIKEN, Baylor College of Medicine, and the other participating hospitals.

Genetic analyses

A genome-wide CNV analysis was performed in 20 sporadic CS patients in cohort 15. Quantitative polymerase chain reaction (qPCR) analysis was conducted to screen for 16p11.2 deletions in the remaining 325 patients from cohort 1 and in the 142 patients from cohort 2. An orthogonal experimental approach to measure copy-number alteration, customized comparative genomic hybridization (CGH) microarrays or digital droplet PCR (ddPCR), was used to independently confirm the deletion CNV candidates. The 16p11.2 deletion case series (cohort 3) was identified by chromosomal microarray analysis (CMA) (Table S1).12,14,15

The entire TBX6 gene and its upstream region were amplified and analyzed by Sanger sequencing in all patients from cohort 15 and 121 patients from cohort 210. TBX6 variants were also detected by exome sequencing (ES) and verified by Sanger sequencing in the remaining 21 patients from cohort 2 and 1 patient (BH8084) from cohort 3 (Table S1). Based on our previous findings,5 we analyzed the hypomorphic allele in patients with 16p11.2 deletions and TBX6 LoF variants.

Phenotype evaluation

We conducted detailed phenotypic analyses of the spine, ribs, and intraspinal anomalies in cohort 1 and cohort 2. The age of onset indicated the age at which an individual first came to medical attention, exhibiting or presenting with features such as asymmetric shoulder height or shoulder blade prominence. Morphologically, the vertebral malformations were classified as defects of formation, defects of segmentation, or mixed malformations.16 Variations in the number of vertebral bodies were also described. The incidence of rib defects, along with the rib structure and the number of alterations, were also analyzed. The rib defect subtypes were classified as simple or complex.17 Intraspinal anomalies were defined as any defect involving the spinal cord, such as a tethered cord or syringomyelia.

Generation and evaluation of the Tbx6 gene-edited mice

Zygotes from FVB mice were edited by CRISPR/Cas918 to generate a Tbx6 frameshift allele and in independent experiments a Tbx6 hypomorphic allele.19 Matings of strains with different Tbx6 genotypes were performed to derive the Tbx6 locus biallelic variant mouse model with a Tbx6 LoF variant in exon 2 in trans with the hypomorphic haplotype in the T binding site in the promoter of Tbx6 on the second allele (Tbx6wt/- X Tbx6wt/mild-hypomorphic(mh) to derive Tbx6-/mh).19 Multiple adult animals (N = 92, 35–45 days) were evaluated by micro computed tomography (micro-CT) (Bruker, Belgium) to assess the phenotypes; this included 52 with compound inheritance, Tbx6-/mh. No randomization was used and no blinding was done in the animal study.

Development of the TACS predictive model

We developed a multivariate model of the risk score (TACScore) to clinically predict TACS. The TACScore was derived from cohort 1 (training data set) by binary logistic regression,20 in which all reliable variables that were significantly associated (< 0.05) with TACS were entered into a multivariate model to discover the optimal predictors for distinguishing TACS from all the enrolled subjects with CS. A scoring system was developed for making these complex statistical models useful to clinicians and to evaluate the clinical utility of the TACScore, in which the point of each predictor was assigned according to the product of the corresponding β coefficient and value of the predictors with point totals corresponding to the risk estimate.20 The receiver operating characteristic (ROC) was used to assess the effectiveness. Cutoff points for the TACScore were determined with the Youden index [J=maxc(sensitivity+specificity–1)]21 to define the maximum potential effectiveness of the predictive models. Then the TACScore was validated independently in cohort 2 (testing data set).

Statistical analysis

The Mann–Whitney U test was used to analyze the ages, and the numbers of vertebral and rib malformations. The gender association and prevalence of different types of vertebral malformations were compared using Pearson's χ2 test or Fisher's exact test. The odds ratio (OR) with 95% confidence interval (CI) was used to assess the influences of TBX6 LoF variants on vertebral, rib, and intraspinal malformations. Statistical analyses were performed with SPSS version 15.0 (SPSS, USA). A value of P < 0.05 was considered statistically significant. The variance was similar between the groups that are statistically compared. (Methodological details are provided in the Supplementary Materials.)

RESULTS

We enrolled 345 Chinese patients with sporadic CS as cohort 1 (156 males and 189 females, 12 years [7–15] as the median age at enrollment [the interquartile range, IQR], 3 years [1–9] at onset, Figure S1), including 237 previously reported5 and 108 new CS patients (Fig. 1). For cohort 2, 142 Japanese patients with CS were recruited in Japan (79 previously reported10 and 63 new patients, Fig. 1). Ten patients with evidence for compound inheritance, 16p11.2 deletion/TBX6 LoF variants, who were systematically evaluated for vertebral phenotypes were enrolled in cohort 3 (Fig. 1 and Table 1). Collectively, 26 patients with proximal 16p11.2 deletions, 5 patients with frameshift variants (c.1250_1251insT, c.266_267insC, c.704_705insG, c.1169_1170insC, and c.1179_1180delAG), and 2 patients with stop-gain variants (c.844C>T [p.R282*], c.933C>A [p.C311*]) in the TBX6 gene were identified in cohort 1 (Table 1, Fig. 1, Figure S24, and Table S2). Similarly, ten 16p11.2/TBX6 deletions, two TBX6 frameshift variants (c.156delG and c.935_936insGA), one nonsense variant (c.699G>A [p.W233*]), one splice-site variant (c.119-1G>A), and one novel missense variant (c.333G>T [p.M111I]), which had been confirmed as a LoF variant,10 were identified in cohort 2 (Fig. 1, Table 1, Figure S4B, and Table S2). Further haplotype analysis showed that all patients with TBX6 LoF variants had the T-C-A risk haplotype (the co-occurrence of three common single-nucleotide polymorphisms [SNPs], namely, rs2289292, rs3809624, and rs38096275,22) in trans on the opposite allele in cohort 1 and cohort 2 (Table 1). Thus, the percentage of new CS patients explained by the TBX6 compound inheritance model (10/108 [9.3%] in cohort 1 and 15/142 [10.6%] in cohort 2) was consistent with our previous findings (23/237 [9.7%])5. In cohort 3, ten patients with TBX6 deleterious variants (nine 16p11.2 deletions and one frameshift variant [c.469_470insCGGC, p.R157fs], Figure S4C, Table 1) were identified. Furthermore, we identified 13 patients with 14 TBX6 missense variants and 1 patient with an in-frame insertion variant from cohort 1; these 14 subjects were excluded from the in-depth systematic phenotypic and clinical endophenotypic data analyses described below as the functional effects of these variants were uncertain.

Fig. 1
figure 1

Workflow for TBX6-associated congenital scoliosis (TACS) study participants in the multicenter cohorts and the animal model genotype-phenotype analyses. The discovery set (cohort 1) consisted of 345 unrelated Chinese patients with sporadic CS. The status of the TBX6 variants was screened, and the clinical characteristics were reviewed and compared according to the TBX6 genotypes, including vertebral, rib, and intraspinal anomalies. To recapitulate and investigate the phenotypic consequences of the compound inheritance and TBX6/Tbx6 gene dosage genetic model, a mouse strain with a specific combination of Tbx6 alleles was constructed. Tbx6 alleles were individually engineered and a mouse strain for the compound inheritance model was constructed by mating, introducing a truncated allele in trans with a mild hypomorphic (mh) allele, Tbx6-/mh. Furthermore, the TBX6-associated CS score (TACScore) was developed to guide and increase the efficiency of diagnosing TACS from clinically observed endophenotypes. CS congenital scoliosis, LoF loss-of-function, Non-TACS non-TBX6-associated CS, TACS TBX6-associated CS, VUS variants of unknown significance. aCohort 1 comprised the 237 sporadic CS patients enrolled between October 2010 and June 2014 in our previous study.5 bCohort 2 comprised 79 Japanese CS patients from 94 previously described CS patients,10 after excluding 15 patients without complete clinical data. cNo information regarding whether the risk haplotype was in trans or in cis with the TBX6 frameshift variant.

Table 1 Genotype and phenotype information of TACS patients from multiple centers worldwide

Distinct endophenotype of patients with TACS

We divided patients in cohort 1 into two groups based on the TBX6 genotype. The 33 CS patients containing TBX6 LoF variants and the in trans T-C-A risk haplotype were classified as those with compound inheritance at TBX6 and their form of CS as TACS, and the remaining 298 patients were classified as non-TACS (Fig. 1).

The TACS patients were significantly younger than the non-TACS at the age of onset (TACS, 2 years [1–3]; non-TACS, 3 years [1–9]; P = 0.02 by the Mann–Whitney U test; Table 2). A higher proportion of male than female was found in the TACS group in cohort 1 (TACS, 21/33 [63.6%]; non-TACS, 128/298 [43.0%]; P = 0.03 by the Pearson χ2 test; Table 2). Overall, the vertebral anomalies were less complex in the TACS group as defined by the number of vertebral malformations (TACS, 2 [1–2]; non-TACS, 4 [2–6]; P = 2.8 × 10‒9; Table 2 and Fig. 2). Remarkably, all spinal deformities in the TACS group originated from the defect of vertebral formation, in which 29 (87.9%) had a simple type (P = 9.2 × 10‒14, Table 2 and Fig. 2). Specifically, all TACS patients exhibited hemivertebrae or butterfly vertebrae (TACS, 33/33 [100%]; non-TACS, 171/298 [57.4%]; P = 4.9 × 10‒8; OR 1.7; 95% CI 1.6–1.9; Table 2 and Figure S6A). More segmented hemivertebrae/butterfly vertebrae (TACS, 1 [1–1.5]; non-TACS, 0 [0–1]; P = 1.2 × 10‒12; Table 2 and Fig. 2) and fewer block vertebrae (TACS, 0 [0–0]; non-TACS, 0 [2–4]; P = 8.8 × 10‒10; Table 2 and Fig. 2) were noted in the TACS group.

Table 2 Comparison of clinical characteristics between TACS and non-TACS patients in cohort 1
Fig. 2
figure 2

Comparison of the distribution regarding abnormal vertebrae in non-TACS and TACS patients and in Tbx6-/mh mouse compound inheritance and gene dosage model. The x-axis shows the frequency of each malformation in each vertebra and the y-axis shows the vertebral distribution in the spine. The vertebral malformations in the non-TACS group (N = 298) were normally distributed. There were 58 TACS patients in total in three worldwide cohorts (cohort 1 from Peking Union Medical College Hospital [PUMCH] in China, N = 33; cohort 2 from the multiple centers in Japan, N = 15; cohort 3 from multiple centers in the USA, N = 10). The Tbx6-/mh engineered mice (N = 52) exhibited a distinct phenotype, namely, a defect of formation involving the lower part of the spine. In addition, malformations in the upper and middle thoracic spine were significantly less involved (P = 7.3 × 10‒5 and 3.0 × 10‒6, respectively) in the TACS group than in the non-TACS group. C cervical vertebra, CS congenital scoliosis, L lumbar vertebra, mh mild hypomorphic, Non-TACS non-TBX6-associated CS, S sacral vertebra, T thoracic vertebra, TACS TBX6-associated CS.

Of note, the vertebral segment location and spine level distribution regarding the type of abnormal vertebrae in the TACS group were distinct from that of the non-TACS group (Fig. 2). In the TACS group, the lower part of the spine (T8–S5) was more frequently involved (TACS, 33/33 [100%]; non-TACS, 247/298 [82.9%]; P = 4.4 × 10‒3; Table 2 and Fig. 2). While the incidence of rib anomalies was comparable between the two groups, more simplified rib anomalies (e.g., a localized fusion of two ribs or an absence of one rib) versus complex abnormalities (e.g., multiple extensive rib fusions or adjacent large chest wall defects) were more frequent in the TACS patients (TACS, 25/33 [75.8%]; non-TACS, 88/298 [29.5%]; P = 3.1 × 10‒7; Table 2 and Figure S6B). Furthermore, fewer intraspinal malformations (e.g., tethered cords or syringomyelia) were observed in the TACS group (TACS, 1/33 [3.0%]; non-TACS, 131/298 [44.0%]; P = 7.0 × 10‒7; Table 2).

Tbx6 gene-edited mouse model for compound inheritance

We engineered the Tbx6 LoF variant (Tbx6) by introducing a 1-bp insertion in exon 2. The Tbx6mh allele was generated as a functional equivalent to the human mild hypomorphic allele (Figure S7)19. Gene expression in vitro was downregulated by the Tbx6mh mutant to approximately 65% of the wild-type gene, which was close to the 70% dosage level of the human TBX6 mild hypomorphic allele (Figure S7C)19. These engineered Tbx6 alleles were used to construct strains with different genotypic combinations.

As predicted, and consistent with literature observations, no homozygotes for the LoF variant were identified in liveborn animals and expected Mendelian ratios of particular genotypic combinations were distorted in liveborns,19 consistent with embryonic lethality of the Tbx6-/- null animals.23 We acquired through genetic matings and phenotypically assessed mice with five specific genotypes: Tbx6wt/wt (N = 10; wt, wild-type), Tbx6wt/- (N = 10), Tbx6wt/mh (N = 10), Tbx6mh/mh (N = 10), and Tbx6-/mh (N = 52). Consistently, only the Tbx6-/mh mice exhibited vertebral malformations (48/52 [92.3%], P = 9.3 × 10‒9 [OR, 13.0; 95% CI, 5.1–33.3], Figure S8 and Table S3). Intriguingly, as observed in human TACS patients, all Tbx6-/mh mice had vertebral malformations involving the lower part of the spine (Fig. 2 and Table S3). Defects of vertebral column formation were present in most of the Tbx6-/mh mice (36/52 [69.2%], P = 4.9 × 10‒5 [OR, 3.3; 95% CI, 2.2–4.9], Fig. 2 and S8 and Table S3). The recapitulation of the type, extent, and distribution of vertebral malformations in the engineered compound inheritance model in mice (Fig. 2) further supports the compound inheritance and gene dosage model for TACS and implicates biological perturbations in vertebral column malformations in this type of CS.

Worldwide multicenter replications

For the first replication cohort (cohort 2), we recruited 142 CS patients from Japan and identified 15 TACS patients (Fig. 1, Table 1, and Table S2). As expected, all TACS patients had one or more hemivertebrae/butterfly vertebrae (TACS in cohort 2, 15/15 [100%]; non-TACS, 99/127 [78.0%]; P = 0.04 [OR, 1.3; 95% CI, 1.2–1.4]). All TACS patients in cohort 2 exhibited malformed vertebrae at the lower part of the spine (T8–S5), which is consistent with the patterns observed in cohort 1 (Table 1, Table S2, and Fig. 2). This pattern, of vertebral malformations and distribution of defects in the vertebral column, was observed only in the mice with compound inheritance and gene dosage perturbations below that of haploinsufficiency (i.e., Tbx6wt/-), but not equivalent to the Tbx6-/- combination of alleles.

Furthermore, we collected another replication case series (cohort 3) containing ten TACS patients in the United States. Intriguingly, hemivertebrae/butterfly vertebrae involving the lower half of the spine were again observed. The replication of our observations for this distinct genotypic combination in two additional independent cohorts from distinct world populations provides substantial evidence in support of the compound inheritance and TBX6 gene dosage model and speaks to the universality of TACS and the compound inheritance model in human and medical genetics.

TACScore to clinically predict TACS

We developed a model to predict TACS from the phenotypic data, and clinically measurable endophenotypes, obtained from cohort 1 by logistic regression.20 The final multivariate risk model, the TACScore, integrated variables including (1) segmented hemivertebrae/butterfly vertebrae involving the lower half of the spine (T8–S5), (2) the number of vertebral malformations, (3) the presence of intraspinal defects, and (4) the type of rib defect (Fig. 3a and Table S4). The calculated score of each variant parameter was linearly correlated with the corresponding risk in the regression model and defined from –8 to 4 (Fig. 3b). The cutoff point was selected as ≥3 to achieve the highest Youden index (J = 84.8%), with sensitivity, specificity, and accuracy levels higher than 90% (Table S5). The area under the curve (AUC) of the ROC curve was 0.9 (P = 1.6 × 10‒15; 95% CI, 0.9–1.0; Fig. 3c). Importantly, we validated the TACScore in cohort 2 with an AUC of 0.8 (P = 1.5 × 10‒4; 95% CI, 0.7–0.9; Fig. 3c and Table S6). Therefore, we introduced a clinical assessment pipeline for CS to efficiently identify high-risk patients for TACS and potential compound inheritance at the TBX6 locus (Fig. 3d).

Fig. 3
figure 3

Development and validation of the risk prediction model and diagnostic pipeline of TACS. a TACScore: risk predictive model for TACS. The final multivariate risk model was developed through a binary logistic regression analysis of detailed phenotypic data from Cohort 1. b The predictive efficacy of the TACScore. The x-axis shows the spectrum of TACScore and y-axis shows the predicted TACS frequency and the percentage of TACS patients in all CS patients with each calculated score in cohort 1 and cohort 2. The TACScore presented excellent predictive efficacy by comparing the predicted TACS risk with the real TACS frequency. The cutoff point was selected as 3 to achieve the highest accuracy. c Receiver operating characteristic (ROC) curve for the TACScore in cohort 1 and cohort 2. Areas under the curve (AUCs) were 0.9 (P = 1.6 × 10‒15; 95% CI, 0.9–1.0) for the discovery cohort (cohort 1) and 0.8 (P = 1.5 × 10−4; 95% CI, 0.7–0.9) for the validation cohort (cohort 2). d A proposed guideline for predicting and evaluating TACS. The risk of TACS evaluated by TACScore is suggested to perform prior to genetic testing. After the detection of TACS, a systemic evaluation with early interventions and genetic consultation were recommended. CMA chromosomal microarray analysis, CS congenital scoliosis, ES exome sequencing, GS genome sequencing, TACS TBX6-associated CS. aTBX6 compound variant contains a 16p11.2 deletion/TBX6 loss-of-function variant in the compound heterozygous configuration with the risk haplotype providing a hypomorphic variant. bVUS, variants with unknown significance. cEvidence from large-scale case–control studies, pedigree analysis, and functional studies are needed.

DISCUSSION

In this study, we expanded our knowledge of the contribution of the TBX6 compound inheritance to CS by providing international patient cohort studies and animal model–based evidence that (1) 9.6% (33/345) and 10.6% (15/142) of the patients in cohort 1 and cohort 2, respectively, could be parsimoniously explained by the TBX6 compound inheritance gene dosage model (Table 1 and Fig. 1); (2) all patients in the three cohorts had the risk haplotype in trans with a TBX6 LoF variant, with the exception of one patient (BCM01) for whom haplotype information was not available and one patient (BH8084) for whom allelic phasing of the identified variants was not available (Table 1); and (3) only the Tbx6-/mh genotype presented with vertebral malformations in the gene-edited mice (Fig. 2 and S8 and Table S3). Moreover, we elucidated a novel genetically and phenotypically defined disease entity, which carries a TBX6 LoF variant in trans with the hypomorphic allele and manifests a distinguishable constellation of clinical features and CS endophenotypes. Furthermore, we developed the TACScore system and proposed a clinical practice algorithm (Fig. 3) for evaluating CS and predicting TACS. Finally, we show qualitatively the same kind (hemivertebrae/butterfly vertebrae) and quantitatively the same distribution (lower spine) of vertebral malformations with the compound inheritance combination of variant alleles and TBX6/Tbx6 gene dosage model in human and mice.

TBX6 is a member of the T-box family.24 Variants in Tbx6 lead to phenotypes of vertebral and rib defects in mice.23,25 Tbx6 is important for cell fate in the paraxial mesoderm structure26 and formation of the segmental boundary,27 which is the precursor of the vertebral column.28 Therefore, the genetic and developmental malfunction in Tbx6 perturbation of the gene dosage model may parsimoniously explain the hemivertebrae/butterfly vertebrae presenting in all TACS patients (Table 1 and Fig. 2). Interestingly, Tbx6-/- produces an embryonic lethal phenotype with expanded tailbud and lack of posterior somites,23 suggesting that Tbx6 functions to maintain the specification of the posterior paraxial mesoderm.7 In zebrafish, tbx6 is required in the posterior of the embryo during the posterior part of primary neurulation and all of the secondary neurulation.29,30 Therefore, this developmental process optimally explains the observation that all TACS patients and Tbx6-/mh gene-edited mice show vertebral malformations anatomically localized at the lower part of the spine.

TACS does not follow “conventional” rare variant Mendelian inheritance expectations likely reflecting the embryonic lethality of TBX6/Tbx6 homozygous null alleles. Instead, TACS is caused by biallelic variants at a locus, consistent with autosomal recessive trait manifestation, but the presence of a single rare LoF variant in trans with one common hypomorphic allele; a gene dosage that is less than haploinsufficiency but not zero as with homozygous null alleles. This genetic model (compound inheritance and TACS) may provide an explanation for the scenarios in which monoallelic variants and phenotypes are perceived to be following a dominant inheritance pattern but for a disease trait with incomplete penetrance or the disease pattern observed is pseudodominance as found when the carrier state occurs at a high frequency in the population. Because the T-C-A haplotype is common worldwide (44% among Asians and 33% among Europeans, but <1% among Africans31), the haplotype is likely to be parsed and filtered out by the current genomic analytical pipelines. Therefore, for some single-locus genetic models, the functional effects of individual variant alleles, as well as the combination of alleles, should be considered when interpreting the genetic data and specific phenotype observed for the “unsolved” cases. Moreover, the compound inheritance model may be particularly relevant to recurrent rearrangement CNV32 loci with relatively high mutational frequency rates33 wherein deletion CNV can result in haploinsufficiency for many gene loci (e.g., the 0.6-Mb 16p11.2 deletion CNV contains 27 annotated genes).5,15 It is important to know not only the functional effects of individual variant alleles, but also the functional consequences of combinations of variant alleles at a locus.

A significantly small proportion of the TACS patients in cohort 1 were female (Table 2, 12/33 [36.4%] in the TACS patients and 170/298 [57.0%] in the non-TACS patients in cohort 1), which might be due to the interference of the 16p11.2 deletion and TBX6 variants in the formation of the female reproductive system.34 Notably, pleiotropic effects of 16p11.2 deletion CNV include other traits related to obesity,35 cognitive phenotypes including intellectual disability and psychiatric disorders,15,36 birth defects such as CAKUT (congenital anomalies of the kidney and urinary tract),37 and may even be involved in other undiscovered phenotypes or diseases. Thus, TACS patients are recommended to be evaluated systematically with long-term follow-up (Fig. 3d).

More importantly, from an orthopedic surgeon’s clinical perspective, hemivertebrae/butterfly vertebrae are the most remarkable features of TACS (Fig. 2). Surgical intervention and treatment at an early age results in a satisfactory prognosis for the patient and ameliorates scoliosis.38 For example, patient XH101 was diagnosed as having CS with an L2 hemivertebra at 3 years of age, and molecular testing indicated that she had a frameshift variant in TBX6 with the in trans T-C-A risk haplotype. The TACS diagnosis was made by integrating the molecular testing and clinical phenotypes. We performed L2 hemivertebra resection with short segment internal fixation, and removed the instrument at 3 years postoperation when she had a balanced spine. The last follow-up, which was at 9 years postoperation, showed an almost completely normal growth of the spine and normal curvature of the spinal column (Figure S9).

To realize clinical applications from bench to bedside, we developed a TACScore prediction algorithm as an objective adjuvant clinical measure to guide clinical management for patients with CS. The TACScore was validated in two independent cohorts and further integrated into the proposed guidelines for evaluating the risk of TACS (Fig. 3). Considering the possibility of phenotypic expansion,4 the prediction algorithm should include multiple variables associated with TACS comprehensively (i.e., endophenotypes) instead of relying on one particular phenotype (e.g., scoliosis). This clinical phenotypic assessment provides efficient information to physicians and families regarding whether the patient is suitable for 16p11.2/TBX6 variant detection or a genome-wide test. Clinical ES39 or genome-wide analysis are reserved for cases with low-risk TACScores or high-risk TACScores that are negative for 16p11.2/TBX6 pathogenic variants. Furthermore, the cutoff point of the TACScore could be adjusted according to medical practice experience and continued future observations as well as the local medical resources available to achieve the maximum health economic benefits.

Conclusions

We have defined a new subtype of CS, i.e., TACS, containing a TBX6 LoF variant compound with a hypomorphic allele. The TACS is a clinical entity defined by consistent clinically measurable endophenotypes: i.e., younger age at onset, hemivertebrae/butterfly vertebrae involving the lower part of the spine, simple rib anomaly, and fewer vertebrae and intraspinal defects. The TACScore can guide clinical management and genetic and clinical genomic testing. Human and mouse studies further confirm the compound inheritance and gene dosage model, and provide insights into potential biological consequences for spine development of perturbations in TBX6/Tbx6 gene dosage and expression. Such genetic models may be important to other birth defects.