Introduction

In 1866, John Langdon Haydon Down first described the characteristics presented by individuals with DS1. This syndrome has a prevalence that ranges between 1 per 800–1200 live births2. It is caused by partial or complete triplication of chromosome 21 and is the most common genetic developmental disorder3. Among possible risk factors for these cytogenetic disorders is the mother’s age, whereby incidence reaches one in every 12 births in women aged over 49 years4. But there is some controversy regarding paternal age as a risk factor, so that some researchers find no relation5,6, while others suggest that there is a moderate increase in risk when fathers are aged over 40 years7,8.

Individuals with DS may present multiple health problems, including varying degrees of intellectual capacity, as well as delayed speech and learning development. In early adulthood, DS individuals are at considerable risk of developing dementia and Alzheimer disease3. In addition, ophthalmological problems, hypothyroidism, epilepsy, obstructive sleep apnea and transient myeloproliferative disorders are more common in this group2.

Regarding dental status, DS patients often present delays and disorders in the eruption sequence of both deciduous9,10 and permanent dentition11. Dental agenesis is common and is recognized as one of the phenotypic characteristics of DS. The prevalence of agenesis (excluding third molars) is higher than among healthy individuals, present in approximately 54.6–58.5% of subjects with DS12. Microdontia13 and taurodontism14 are also fairly common. Regarding malocclusions, a higher prevalence of Angle Class III malocclusion, posterior cross bite, and anterior open bite has also been observed among DS patients15.

The craniofacial characteristics of individuals with DS have been described in various studies, but to date no systematic review of published research has been conducted. So the objective of the present work was to perform meta-analysis to evaluate the craniofacial cephalometric characteristics of individuals with DS in comparison with healthy populations, allowing us to synthesize available scientific information in the field for the first time, increasing the validity of the conclusion of quality primary studies. The review’s null hypothesis was that there are no differences in cephalometric characteristics derived from the analysis of lateral teleradiographs of the head between individuals with DS and the general population.

Materials and methods

Protocol and registration

This systematic review followed Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines16 and was registered in the PROSPERO database (International Prospective Register of Systematic Reviews) (Reg. No. CRD42018117175) at the University of York (U.K.).

Selection criteria

The population, exposition, comparison, outcome (PECO)17 acronym was applied to answer the following question: “Are the craniofacial characteristics of individuals with DS different from those of the general population?”. PECO was applied as follows: P, individuals with DS; E, clinical or genetic diagnosis of DS; C, individuals without DS; O, craniofacial characteristics of DS evaluated by means of lateral cephalometric measurements derived from the analysis of lateral teleradiographs of the head.

Inclusion criteria were: case–control, cross-sectional and cohort studies describing cephalometric measurements of subjects with DS, compared with healthy control subjects. Animal studies, clinical case reports, pilot studies, bibliographic reviews, systematic reviews, and chapters of books were excluded.

Search strategy

A thorough search was conducted without imposing any limitations on publication date or language in the following databases: Pubmed, Embase, Lilacs, Scopus, Medline, and Web of Science. The search was completed on 27th May 2019. The following search terms were used: “Trisomy 21”; “Down syndrome”; “Mongolism”; and “Trisomy G” combined (using Boolean operator “AND”) with “craniofacial characteristics”; “craniofacial features”; “craniofacial development”; “craniofacial growth”; “craniofacial morphology”; “craniofacial cephalometric”; and “dento-skeletal”.

The electronic search was complemented by a manual search in specialist journals: The Angle Orthodontics, American Journal of Orthodontics and Dentofacial Orthopedics, and the European Journal of Orthodontics.

Article selection

The selection of articles was made in three stages as shown in Fig. 1. Two researchers (A.V and A.L) carried out the selection process independently; any disagreement over the results was resolved by consensus. When agreement could not be reached, a third assessor was consulted (C.S.).

Figure 1
figure 1

Flow diagram of the search strategy.

A manual search was also conducted among the bibliographic references of the articles identified in the initial search.

The degree of agreement between reviewers was assessed by means of Cohen’s kappa coefficient (κ).

Methodological quality assessment

The methodological quality of the articles selected was assessed using the Newcastle–Ottawa Quality Assessment scale (NOS)18 by two examiners working together (A.V and A.L) and a third examiner (C.S) working independently. The degree of agreement between the examiners was analyzed with Cohen’s kappa coefficient (κ).

The NOS gives a maximum score of nine; articles scoring seven or above are considered of good quality. Assessment is divided into three areas: selection, comparability and exposure. The selection criterion consisted of four items: adequate definition of cases, representativity of the cases, selection of controls, and definition of controls. Comparability consists of the comparability of cases and controls derived from study design, considering factors such as age or ethnicity. The last part is exposure, which is divided into three items: evaluation of exposure, equality of methods for cases and controls, and dropout rate.

There is a maximum score of one for each item, with the exception of comparability, which can award a score of two.

Data extraction

The following data were extracted from each of the articles by two researchers (A.V. and C.S): authors, year of publication and for both cases and controls sample size, ethnicity, gender, and cephalometric measurements were registered. Given the wide variety of cephalometric measurements taken in the articles, for meta-analysis it was decided to include only those measurements that were repeated in at least three articles, these being: Anterior cranial base legth (SN, mm), posterior cranial base length (SBa, mm), total cranial base length (BaN, mm), posterior cranial base inclination (SNBa, °), sagittal position of subspinale relative to cranial base (SNA, °), effective length of the maxilla (CoA, mm), sagittal position of supramentale relative to cranial base (SNB, °), relative position of maxilla and mandible to each other (ANB, °), anterior facial height (NMe, mm), posterior facial height (SGo, mm) (Fig. 2; Table 1). The mean value and standard deviation were recorded for each measurement.

Figure 2
figure 2

Cephalometric landmarks (S Sella, N Nasion, Ba Basion, A A point, Co Condilion, B B point, Me Menton, Go Gonion), Cranial base measurements (SN length, SBa length, BaN length, SNBa angle), Maxilar measurements (SNA angle, CoA length), Mandibular measurements (SNB angle), Maxilo-Mandibular relation (ANB angle), Facial heights (NMe length, SGo length).

Table 1 Angular and linear cephalometric measurements and their definitions.

The reliability of the codification process between examiners was analyzed by means of Cohen’s kappa coefficient for qualitative variables and the intra-class correlation coefficient for quantitative variables.

Effect size index

For each outcome the effect size index was the mean difference (MD) between the DS group and the control group. A negative value for the MD indicated that a mean in the DS group was lower than that of the control group, and vice versa.

Statistical analysis

Separate meta-analyses were carried out for each outcome. As different articles varied in the cephalometric measurements taken, the results could only be subjected to meta-analysis when a mean datum was reported in at least three studies. As some evidence of heterogeneity among the individual studies was expected, a random-effects model was assumed. This model implies to weight each effect size by its inverse variance, this being the inverse of the sum of two variances: within-study variance and between-studies variance. Between-studies variance was estimated using the DerSimonian and Laird estimator. For each outcome, a pooled effect size (in terms of mean difference) was calculated, as well as the 95% confidence interval19. Heterogeneity of the effect sizes was checked with the Q statistic and I2 index. A statistically significant result for the Q statistic (p < 0.05) indicated the presence of heterogeneity. In addition, heterogeneity I2 indices around 25%, 50%, and 75% were considered to reflect low, moderate, and large heterogeneity, respectively20. Finally, to assess the potential influence of the two age ranges found in the studies (6–11 years old and 5–22 years old), subgroup analyses were applied with the QB statistic. A statistically significant result for the QB statistic (p < 0.05) provided evidence of significant differences between the average effect sizes of the age ranges. In addition, a double forest plot was constructed for each outcome in order to illustrate the potential differences among the effect sizes as a function of the age range. All statistical analyses were carried out using Comprehensive Meta-analysis 3.3 software21.

Results

Study selection

The search identified 766 articles, 147 in PubMed, 229 in Embase, 2 in Lilacs, 16 in Scopus, 58 in Medline, and 314 in Web of Science. After excluding duplicates, 504 articles remained. A further 208 were rejected on the basis of the title leaving 296 articles, of which 79 were selected after reading the abstracts. Then the full texts were read and a further 69 excluded for failure to meet the review criteria, leaving a total of 9 works (Fig. 1). The Cohen kappa coefficient of inter-examiner reliability was κ = 0.890.

The manual search in journals did not identify any additional works.

Methodological quality assessment

The nine articles selected were assessed with the NOS18, obtaining a Cohen kappa coefficient of κ = 0.832 between evaluators.

Of the nine articles, seven underwent quality assessment; four obtained maximum scores of nine22,23,24,25, two a score of eight26,27, and one of seven28. The two articles considered of low quality9,29 obtained scores of six (Table 2) and they were excluded from the meta-analysis.

Table 2 Methodological quality for all 9 case control studies identified by the search strategy, assessed using the Newcastle–Ottawa-Scale.

Reliability of data extraction process

The degree of inter-examiner agreement was high. For all quantitative variables, except for the item “number of controls,” an intra-class correlation coefficient of one was obtained. Initially the intra-class correlation coefficient for “number of controls” was 0.77 and after finding the mistake in the number of controls between the examiners we corrected it and obtained a κ = 1. Qualitative variables all obtained values of κ = 1.

Study characteristics

Table 3 summarizes details of the seven case–control studies included in meta-analysis. Five articles were published in English22,23,24,25,27, one in French26 and one in Spanish28. The earliest publication date was 198526, while the most recent was published in 201425. Studies were conducted in Canada23, France26, Colombia28, Spain22,24, Brazil27 and Saudi Arabia25. The smallest sample size was 14 individuals28, while the maximum was 6025. The ages of the subjects investigated ranged from 527 to 22 years25. In five articles the individuals were Caucasian22,23,24,26,27, in one Columbian28, and in the other Arab25. All the works coincided in age range and ethnicity between case and control groups. Regarding gender, five works included individuals of both sexes22,23,24,25,26; one included only males27, while another one did not report the sex of the subjects28.

Table 3 Study characteristics of the 7 case–control studies included in the sistematic review and meta-analyses.

Cephalometric characteristic of the cranial base were assessed by means of the measurements SN22,23,25,27,28; SBa22,23,25,27; NBa22,23,27; and SNBa22,23,25,26,27. Measurements relating to the maxilla were SNA23,24,25,27,28 and CoA24,25,27; and to the mandible SNB23,25,27. Anteroposterior relation between maxilla and mandible was determined by means of the ANB angle23,25,27,28; anterior face height by NMe23,27,28; and posterior face height by means of SGo23,27,28.

Meta-analyses

As shown in Table 4, a total of five studies were found to report outcomes. With the exception of SNBa and SNB, lower mean outcomes were found among DS subjects than controls. Moreover, the DS group exhibited significantly lower mean values than the control group for outcomes SN, SBa, NBa, CoA, ANB, NMe, and SGo. Two outcomes (SNBa and SNB) obtained higher mean values for DS subjects than controls but only the difference in SNBa reached statistical significance.

Table 4 Results of the pooled mean difference and heterogeneity statistics for each outcome.

Due to the low number of studies identified, the interpretation of Q statistics must be treated with caution, due to problems of statistical power. The I2 index offers a more adequate means of assessing heterogeneity among effect sizes. As shown in Table 3, moderate-to-high heterogeneity was found for all outcomes, with the exception of NBa and ANB.

The studies were grouped in two categories depending on participants’ age ranges (6–11 and 5–22 years old), the former category being more homogeneous than the latter. To assess the potential influence of age ranges on effect sizes, subgroup analyses were carried out for each outcome. Forest plots (Figs. 3, 4, 5, 6, 7, 8, 9, 10, 11, 12) show the results. In addition, the QB statistic for testing statistical significance in differences between the two average effect sizes is shown in Table 5. Statistically significant differences were found between the average effect sizes of the two age range categories for outcomes SNBa (p = 0.039); SNB (p = 0.004) and NMe (p = 0.035). Regarding the outcome SNBa, both age ranges exhibited a larger mean effect size for DS cases than for controls, but the 5–22 year age range exhibited a larger difference (MD+ = 8.854) than the 6–11 year age range (MD+ = 6.500). For the outcome SNB, the significant difference found between average effect sizes of the two age ranges was due to a larger mean effect in the DS group than the controls for the category 5–22 years (MD+ = 2.191), whereas the inverse was found for the category 6–11 years (MD+ = −2.600). Lastly, for the outcome NMe, lower mean values were found in the DS group than the controls for both age ranges, but the average mean difference was more pronounced in the category 5–22 years (MD+ =  −15.510) than 6–11 years (MD+ =  −10.342).

Figure 3
figure 3

Forest plot for the outcome ‘SN (mm)’ as a function of the age group (1 = 6–11 years old), 2 = 5–22 years old).

Figure 4
figure 4

Forest plot for the outcome ‘SBa (mm)’ as a function of the age group (1 = 6–11 years old), 2 = 5–22 years old).

Figure 5
figure 5

Forest plot for the outcome ‘BaN (mm)’ as a function of the age group (1 = 6–11 years old), 2 = 5–22 years old).

Figure 6
figure 6

Forest plot for the outcome ‘SNBa (°)’ as a function of the age group (1 = 6–11 years old), 2 = 5–22 years old).

Figure 7
figure 7

Forest plot for the outcome ‘SNA (°)’ as a function of the age group (1 = 6–11 years old), 2 = 5–22 years old).

Figure 8
figure 8

Forest plot for the outcome ‘CoA (mm)’ as a function of the age group (1 = 6–11 years old), 2 = 5–22 years old).

Figure 9
figure 9

Forest plot for the outcome ‘SNB (°)’ as a function of the age group (1 = 6–11 years old), 2 = 5–22 years old).

Figure 10
figure 10

Forest plot for the outcome ‘ANB (°)’ as a function of the age group (1 = 6–11 years old), 2 = 5–22 years old).

Figure 11
figure 11

Forest plot for the outcome ‘NMe (mm)’ as a function of the age group (1 = 6–11 years old), 2 = 5–22 years old).

Figure 12
figure 12

Forest plot for the outcome ‘SGo (mm)’ as a function of the age group (1 = 6–11 years old), 2 = 5–22 years old).

Table 5 Results of the subgroup analyses for the two categories of age ranges.

Discussion

The aim of this meta-analysis was to determine whether the craniofacial cephalometric characteristics of individuals with DS differ from those of the general population.

It was decided not to limit the literature search by language or date of publication in order not to miss any article that might provide information relevant to the work’s hypothesis; as a result, articles in three languages were included.

When evaluating the comparability of the cases and controls by means of the NOS, the age and ethnicity of subjects were considered, as cephalometric measurements may be influenced by these variables. Although in the seven works included for meta-analysis, age ranges and ethnicity were the same in case groups and control groups, there was some variability between the articles, especially regarding age, so in order to maintain homogeneity, the works were grouped according to age ranges into two groups: 6–11 years27,28 and 5–22 years22,23,24,25,26.

The seven studies included in meta-analysis used different cephalometric measurements, and so it was decided to analyze only those measurements that were repeated in at least three articles. Nevertheless, the maximum number of works to coincide in any single measurement was five. For this reason, the results of the present meta-analysis should be interpreted with caution.

Cranial base length was evaluated by means of the measurements SN, BaS and BaN. It was found that anterior cranial base length (SN) was significantly shorter in DS subjects than among control subjects. Some authors have suggested that the lesser development of the anterior cranial base is related with smaller brain size27. But according to Enlow the growth of the cranial base is considered rather autonomous, as compared to the cranial vault, because of that, we think that the smaller brain size of DS individuals should not be considered the main cause or the only cause of the short anterior cranial base of these patients. Rather both facts could be the effect of a different primary cause30. One study evaluated cranial base growth in subjects aged 8–18 years, observing that it grew in the same proportion among DS individuals as control subjects, showing that the deficit in size was produced before the age of 8 years22. Some authors have suggested that the deficit is of prenatal origin9, while others state that is develops during the first 4 years of life31.

As for posterior cranial base (BaS), it was also found to be significantly reduced in size in DS groups. A histological study based on autopsies of individuals at different ages determined that growth in this region comes to an end at the age of 18 years in healthy individuals32. Alió et al. observed that, in individuals with DS, the rate of growth decreases gradually up to the age of 15 years, while in control subjects it continues to grow, so that spheno-occipital synchondrosis growth stops earlier in individuals with DS22.

Obviously, the reduced anterior and posterior cranial bases in cases of DS mean that the total cranial base length (BaN) is also significantly smaller than in the general population.

At the same time, the cranial base angle (SNBa) was significantly larger in DS than control subjects. It has been suggested that delay in intra-sphenoidal synchondrosis fusion in postnatal life is essential to the cranial base’s flattening mechanism. Radiographic data obtained from individuals with DS show a delay in fusion, so that synchondrosis remains without fusing between the ages of 1–7 years, while in the general population it is fully obliterated by the end of the first year of life33. The present findings show that the difference in measurement between DS cases and control subjects was more acute in the 5–22 year age range. But the difference in effect size between the 6–11 year and the 5–22 year age group was 2.35°, so even though this difference was statistically significant, it was not clinically significant, as the standard deviation for this angle is around 4°34. In addition, according to Bjork, the cranial base angle is 130.8° ± 4.2° at age 12 and 131.6° ± 4.5° at age 20 years, showing an insignificant change from the former age to the latter34. Almeida et al., in their systematic review showed that SNBa remains constant from 5 to 15 years of age35. Alió et al. observed that in both individuals with DS and the general population it does not vary between the ages of 8–18 years22.

The anteroposterior position of the maxilla was evaluated by means of the SNA angle, which, although smaller in cases of DS, did not show significant differences in comparison with control subjects. The SNA angle relates the maxilla to the cranial base, which, as stated above, is significantly shorter in individuals with DS. Anteroposterior and vertical cephalometric measurements for the maxilla and mandible based on an anomalous cranial base can lead to erroneous cephalometric interpretation. For this reason, Jesuino and Valladares suggest that in these cases cephalometric measurements should take the Frankfort plane as reference27. The reduced anterior cranial base in DS can make the SNA similar to that of control subjects due to a geometric effect, even though the maxilla is smaller23, because the N point will be located at a more posterior location24.

Maxillary length (CoA) was significantly smaller in DS than control groups, corroborating the existence of maxillary hypoplasia in the sagittal plane. Alió et al. showed that the maxilla grows in the same proportion between the ages of 8 and 18 years in DS cases as in healthy subjects24. Klingel et al. observed that at the age of 6–9 months the maxilla is already smaller in all three dimensions in DS infants36.

As for the mandible, no statistically significant differences were observed between DS and control subjects in SNB angle, although values were higher in DS subjects. This measurement is also related to the cranial base; in this case shortening produces larger SNB angles23. At the same time, the present work found that the cranial base angle was significantly larger in DS groups, which would lead to less symphysis projection, while in control subjects the angle was more acute, contributing to a more anterior mandibular position35. Again, it is clear that to assess cases of DS, it is necessary to use a reference plane other than the cranial base to determine the anteroposterior position of the mandible correctly, and to reach clear conclusions regarding the size of the mandible. Meanwhile, in the 6–11 year age range, SNB angle values were lower in DS subjects than controls, while in the 5–22 year range it was larger in DS groups. This could be a result of the fact that the mandible has not undergone full growth in the younger age group. Mandibular growth ends around the age of 17 in females and 19 in males37.

Relating maxillary and mandibular anteroposterior position by means of the ANB angle, it was found that this was significantly smaller among DS subjects, indicating a greater tendency among these individuals to present skeletal Class III malocclusions.

Both anterior (NMe) and posterior face height (SGo) were significantly smaller among individuals with DS. In these cases, maxillary hypoplasia in the vertical plane conditions developmental deficiency in the facial middle third24. For NMe, the difference between DS groups and control subjects was more pronounced in the 5–22 than the 6–11 age range, probably due to the fact that the latter group had still not undergone complete growth.

The tendency toward brachyfacial pattern in individuals with DS was notable, in spite of the muscular hypotonia that such cases usually present, the tendency to keep the mouth half open in repose, and frequent infection of the upper airway. This is perhaps due to mandibular autorotation in maximum intercuspation resulting from maxillary vertical hypoplasia. The meta-analysis by Oliveira et al. found a higher prevalence of anterior open bite among individuals with DS, although they reported a low level of evidence for an association between DS and anterior open bite38.

Although the present work suffered several limitations, it was possible to reject the hypothesis that there would be no differences in craniofacial characteristics between individuals with DS and a healthy population, as clear differences were found. Nevertheless, in order to reach clear conclusions it would be necessary to unify the cephalometric measurements taken in different studies, and to use different reference planes for SN in order to determine vertical and anteroposterior positions of the maxilla and mandible. The results must be interpreted with great caution due to the scarce number of studies that reported each outcome measure. In addition, the small number of studies included in meta-analysis did not make it possible to examine the potential effects of publication bias, as at least 10 studies are needed to apply the typical methods for assessing publication bias.

Conclusions

In spite of the limitations of the present meta-analysis, it may be concluded that individuals with DS present a cranial base of reduced length, which is more flattened than in healthy subjects. Moreover, the maxilla is of reduced size in the sagittal plane and there is a tendency toward skeletal Class III malocclusion. DS is also characterized by reduced facial height in both posterior and anterior regions.