Body mass index is an overlooked confounding factor in existing clustering studies of 3D facial scans of children with autism spectrum disorder

Schwarz, Martin; Geryk, Jan; Havlovicová, Markéta; Mihulová, Michaela; Turnovec, Marek; Ryba, Lukáš; Martinková, Júlia; Macek, Milan; Palmer, Richard; Kočandrlová, Karolína; Velemínská, Jana; Moslerová, Veronika

doi:10.1038/s41598-024-60376-0

Download PDF

Article
Open access
Published: 30 April 2024

Body mass index is an overlooked confounding factor in existing clustering studies of 3D facial scans of children with autism spectrum disorder

Scientific Reports volume 14, Article number: 9873 (2024) Cite this article

306 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

Cluster analyzes of facial models of autistic patients aim to clarify whether it is possible to diagnose autism on the basis of facial features and further to stratify the autism spectrum disorder. We performed a cluster analysis of sets of 3D scans of ASD patients (116) and controls (157) using Euclidean and geodesic distances in order to recapitulate the published results on the Czech population. In the presented work, we show that the major factor determining the clustering structure and consequently also the correlation of resulting clusters with autism severity degree is body mass index corrected for age (BMIFA). After removing the BMIFA effect from the data in two independent ways, both the cluster structure and autism severity correlations disappeared. Despite the fact that the influence of body mass index (BMI) on facial dimensions was studied many times, this is the first time to our knowledge when BMI was incorporated into the faces clustering study and it thereby casts doubt on previous results. We also performed correlation analysis which showed that the only correction used in the existing clustering studies—dividing the facial distance by the average value within the face—is not eliminating correlation between facial distances and BMIFA within the facial cohort.

Investigation of age-related facial variation among Angelman syndrome patients

Article Open access 21 October 2021

Large-scale open-source three-dimensional growth curves for clinical facial assessment and objective description of facial dysmorphism

Article Open access 09 June 2021

The age distribution of facial metrics in two large Korean populations

Article Open access 10 October 2019

Introduction

Autism spectrum disorders (ASD) represent a common group of neurodevelopmental disorders. Their incidence is estimated at 1:59 individuals, and the internationally declared prevalence in medical literature is 1–2%. Males are four times more likely to be affected¹. Complex interactions of genetic, environmental, and epigenetic factors, including the influence of genetic imprinting, are responsible for the development of ASD².

ASD is either a non-syndromic (idiopathic ASD) or a concurrent symptom of a specific genetic syndrome (secondary ASD)³. Inheritance of ASD ranges between monogenic and multifactorial and is influenced by other factors that are thus far unknown or unquantifiable⁴. The clinical presentation of ASD is highly variable and includes three core symptoms: impairment of verbal and nonverbal communication, impaired social interactions, and stereotypical behaviors⁵.

The number of confirmed or suspected ASD-related genes is estimated in the hundreds to thousands range⁶. In some patients, an unambiguous genetic etiology can be demonstrated by detecting a single genetic variant. The genetic architecture of ASD, however, is more complicated and comprises single nucleotide variants (SNV) but also copy number variants (CNV), together with their incomplete penetrance, and variable expression⁷. Some studies suggest a “multi-hit” model for ASD, i.e., two or more “weaker” variants with less impact acting in synergy^8,9.

Severe forms of ASD could be diagnosed as early as 18 months of age, while its milder forms are usually recognized in the preschool period¹⁰. Identification of ASD subgroups using endophenotype is rather difficult considering the substantial intrafamilial and interfamilial variability of ASD behavioural phenotypes². Previous attempts at patient stratification (henceforward “clustering”) were based mostly on cognitive/psychomotor functions^{5,11,12,13,14,15}. Some authors tried to cluster patients using electroencephalography (EEG) or brain magnetic resonance imaging (MRI)^16,17. The two-dimensional analysis of patients´ photographs provided evidence that there is no characteristic “facies” which would allow reliable clinical/visual diagnosis of ASD. Nonetheless, several facial features that are more commonly found in patients with ASD have been described. These include a broader upper face, shorter mid-face, wider eyes, larger mouth, and philtrum¹⁸.

The use of three-dimensional (3D) geometric facial morphometry as an additional tool for the stratification of ASD patients was first reported by Aldridge et al.¹⁹. The authors analyzed 3D facial scans of 65 ASD boys (8–12 years of age). A total of 136 facial distances between defined anthropometric landmarks of each scan were analyzed. Most formed a cluster that overlapped with controls. However, there were two smaller clusters outside the main group representing both “extremes of the ASD clinical spectrum”- one comprised patients with a significantly worse clinical presentation of ASD, while the other had patients with mostly milder symptoms of Asperger’s syndrome.

Subsequently, Obafemi-Ajayi et al.²⁰ ran a more detailed analysis on 62 ASD boys of the same age range whilst 83% of patients were from the same cohort used by Aldridge et al. The team used geodesic distances between defined facial landmarks. Only ASD patients entered clustering and subsequent statistical analyses. Authors showed that three clusters resulting from cluster analysis differ significantly in several cognitive measures related to ASD severity. One cluster exhibited both the higher separation from others in terms of facial distances and also worse clinical presentations of ASD. Furthermore, two other clusters had a significantly higher representation of Asperger’s syndrome patients and milder ASD severity. The controls were used only in PCA analysis where they overlapped with two clusters with milder ASD severity on the PCA plot but not with the third cluster with worst ASD cases.

Here we describe the relationship between the severity of ASD and 3D facial measurements in Czech pediatric patients of both sexes. We elucidate the central role of obesity on the cluster structure and consequently on its association with autism severity. This factor was not taken into account in the above mentioned studies.

Methods

Patient cohort

All patients were clinically diagnosed with ASD according to consensus criteria²¹. The lower age limit was eight years and the upper age limit was < 12 years, with cases ≥ 12 years being excluded from this study. This age limit was chosen so that patient skulls, which are the basis for facial dimensions, have finished the majority of their growth but have not yet started to change with the onset of puberty.

The following exclusion criteria were applied (number of excluded scans in parenthesis):

(1)
Patients with a clear laboratory genetic diagnosis of another underlying genetic disorder, i.e. syndromic ASD patients (31 scans)
(2)
Patients with other clearly defined environmental causes, e.g., fetal alcohol syndrome, fetal valproate syndrome (1 scan)
(3)
Patients of other than European—Caucasian origin (1 scan)
(4)
Patients born before the 35th week of gestation (8 scans)
(5)
Patients with poor quality 3D scans (14 scans)

Altogether, 92 male and 24 female non-syndromic ASD patients with high-quality 3D scans were analyzed. Out of these, seven male and three female patients had their mouths open during the 3D scan. Another three of the included male scans had severe obesity (body mass index for age (BMIFA) > + 2SD). Scans of 106 healthy males and 51 healthy females of the same age range were used as controls. Controls were scanned using the same hardware (see further) and were provided by the Laboratory for 3D Imaging and Analytical Methods, Department of Anthropology and Human Genetics, Faculty of Science, Charles University.

ASD severity score was assessed clinically in all patients based on the extent of intellectual disability, functioning, and other behavioral symptoms²¹. The severity was assessed as mild, moderate, or severe (Table 1).

Table 1 Assessment of patients ASD symptoms´ severity in percentages.

Full size table

3D facial gestalt scanning

Non-invasive optical 3D scanning was carried out using the multi-camera non-contact 3dMDface System scanner (3dMD Ltd., London, United Kingdom; www.3Dmd.com). Each patient was scanned from a frontal view, with the head in a natural position and was instructed to have a neutral facial expression. Age-appropriate cartoons were played on a TV in front of the children to attract their attention and “stabilize” their frontal view towards the 3D camera. Scans were edited using the Cliniface (www.cliniface.org²² or Geomagic Wrap 2017 analytical software (www.3dsystems.com²³. Landmarks were placed automatically by Cliniface, representing defined consensus anthropological points of the face²⁴. Nineteen landmarks were used for the analysis²⁰. Positions of the nineteen landmarks were verified manually in each studied case (Fig. 1).

Data analysis

The .csv files with coordinates of the 19 landmarks were imported to R²⁵, and Euclidean distances for each pair of landmarks in each proband were calculated. The corresponding geodesic distances were calculated using the Fast-Marching Toolbox in the Matlab environment^26,27. One hundred seventy-one Euclidean distances per proband and another set of 171 geodesic distances per proband were obtained.

The following procedure is identical for Euclidean and geodesic distances and was performed in R.

Normalization was performed for each proband separately—all distances associated with individual proband were divided by their mean.

BMI was normalized concerning age by computing z-scores using the following equation: \(BMIFA=\frac{BMI-{\mu }_{age}}{{sd}_{age}},\) where BMIFA is the age-normalized BMI used for all computations in this work, BMI in the equation corresponds to the BMI value of individual probands, μ_age denotes the average BMI value within the age group to which the proband belonged, and sd_age is the standard deviation within the same age group. (By the age group we mean tabularized age intervals in which we assume a constant BMI).

Correlation analysis

The Spearman correlation coefficient was calculated between the BMIFA and the distance associated with the above-mentioned landmark pairs for each proband group and each landmark pair²⁸. The Spearman coefficient was used because the data are not normally distributed. Separated distributions of correlation coefficients for each proband group were obtained this way. Replacing BMIFA with age, analogical distributions were also computed for age; this allowed us to study how landmark distances are affected by age and BMIFA and how this relationship was affected by distance normalization.

Hierarchical clustering procedure

A hierarchical clustering procedure was used to elucidate the cluster structure for the proband groups. Every proband is represented by a vector of distances between all 19 landmarks used in this study and thus has a length of 179. Manhattan distances between all probands/vectors were then computed as an input to hierarchical clustering analysis. Final clustering was performed using the R function—hclust²⁵, with the agglomerative method parameter set to the average.

The Calinski-Harabasz index was computed for every partition generated by the clustering procedure to select the technically best partition to divide into clusters²⁹. The partition with the largest Calinski-Harabasz index was used for subsequent analyses.

We performed multiple instances of cluster analysis with different cohorts. We defined three clustering scenarios (s1, s2, s3) which we applied separately to the male and female patient groups and the linear and geodesic distances. We, therefore, ran the cluster analysis 12 times in total.

s1::: In the first clustering scenario, all ASD probands were used together with all controls. Furthermore all combinatorically possible distances within the set of landmarks on each face²⁰ were used in the clustering analysis.
s2::: The second scenario was defined as s1, in which we filtered out some probands and distances. Specifically, we removed subjects with open mouths and probands identified as outliers using Rosner’s outlier test³⁰. We then removed all inter-landmark distances from the analysis exhibiting a significant correlation with BMIFA after Bonferroni correction³¹ to reduce the effect of confounding factors.
s3::: The third clustering scenario differs from s2 only in the distances used for analysis. As in s2, subjects with open mouths and outliers were removed; however, in s3, we only used distances our anthropologist chose that were not expected to correlate with BMIFA (listed in Supplemental Table 1). Distances were chosen to be minimally influenced by BMIFA.

Statistical testing

Kruskal Wallis test was used to determine if significant differences between mean BMIFA and age existed between clusters^32,. If these tests were significant, the Mann–Whitney test was used to find all significant pairwise differences^33,34. Resulting pairwise p values were corrected by Bonferroni correction in cases when the clustering analysis produced > 2 non-trivial clusters. A hypergeometric test was used to test the significance of the enrichment of individual phenotypic categories within clusters and also for enrichment of autism severity classes within the open-mouth proband group³⁵. P values < 0.05 are considered statistically significant and all statistical testing was performed in R.

Ethical approval and consent to participate

Fiduciaries of underaged patients underwent comprehensive genetic counseling followed by signing of informed consent, including approval of 3D facial scanning. The study was approved by the Internal Ethics Board of the Motol University Hospital and was carried out per the provisions of art. 28–29 of Act 373/2011 Coll, and in line with the World Medical Association the “Declaration of Helsinki” principles.

Results

Linear distances

Two preliminary correlation analyses took place in both sexes. First, we correlated BMIFA with all individual linear distances before and after their normalization. The distribution of Spearman’s coefficient was shifted to the right before normalization and yielded a majority of positive correlation coefficients (Fig. 2a–d). Interestingly, a high proportion of negative correlations appear after normalization (Fig. 2a–d), and a group of highly positive correlation coefficients remains on the right side of the distribution in ASD groups (Fig. 2a,b). These results demonstrated that normalization divided by the mean does not sufficiently eliminate correlations between BMIFA and landmark distances.

We obtained very similar results for age (Fig. 3). Both findings indicate the possible influence of BMIFA and age on the clustering results.

Clustering analysis with all combinatorically possible distances between landmarks

Applying clustering scenario s1 on the male cohort with linear distances resulted in four clusters (Fig. 4).

The biggest cluster, Cluster 2, was significantly overrepresented with controls (74%, pval < 10⁻¹⁰). Cluster 1 (65%, pval < 10⁻⁶) and Cluster 4 (100%, pval = 0.012) were significantly overrepresented with ASD patients. The smallest cluster, Cluster 3, was composed exclusively of ASD patients with open mouths (100%, pval < 10⁻⁶).

Clusters exhibit significant differences in BMIFA (pval < 10⁻⁷), age (pval = 0.0002), and ASD severity (pval = 0.0021) according to the Kruskal Wallis test.

Cluster 1 showed significantly higher BMIFA (pval < 0.004) and age (pval < 0.004) than Cluster 2. Cluster 4 showed significantly higher BMIFA (pval < 0.004) and ASD severity (pval = 0.034) than Cluster 2.

In addition, the smallest cluster, i.e., 3, showed significantly higher ASD severity than cluster 1 and 2 (pval_3-1 = 0.012, pval_3-2 = 0.00228) but not BMIFA. All relevant cluster properties are summarized in Table 2.

Table 2 Male group. Characteristics of clusters.

Full size table

Repeating the previous analysis (clustering scenario—s1) in the female group of patients resulted in three clusters, including one cluster (Cluster 1) with a single case (Fig. 5). Similarly to the previous case, the biggest cluster, Cluster 2, is significantly overrepresented with controls (84%, pval < 10⁻⁵), and Cluster 3 was significantly overrepresented with ASD patients (57%, pval = 0.001).

Cluster 3 showed significantly higher BMIFA (pval < 10⁻⁴) and age (pval = 0.023) than Cluster 2. There was no significant difference in ASD severity between Cluster 2 and 3. All relevant cluster properties are summarized in Table 3.

Table 3 Female group. Characteristics of clusters.

Full size table

Clustering analysis with defined subsets of all distances

Applying clustering scenario s2 to the male cohort resulted in a partition with two clusters (of sizes 180 and 6). The two clusters do not show over-representation by any phenotypic category or significant differences in the examined phenotypic variables. Clustering of all females with the same clustering scenario resulted in 4 clusters, with one of the clusters having a size of one. Only one cluster was significantly overrepresented with ASD cases. In other words, the separation of controls from ASD cases was rather weak compared to scenario s1. The same cluster overrepresented with ASD cases had a significantly higher BMIFA than the remaining non-trivial clusters. No other significant differences between clusters were observed in scenario s2.

Applying clustering scenario s3 to the male cohort resulted in trivial partitions, where each proband corresponds to one cluster. Clustering of the female cohort with the same clustering scenario resulted in 3 clusters, with one cluster having a size of one. The two biggest clusters did not show significant over-representation by any of the examined phenotypic categories, and no significant difference in BMIFA or age existed between them. Interestingly, Cluster 2 showed significantly higher ASD severity than Cluster 1 (pval = 0.0226).

Geodesic distances

Correlation analysis using geodesic distances resulted in qualitatively the same results as with linear distances (Figs. 6 and 7).

Clustering analysis with all combinatorically possible distances between landmarks

The resulting partition of clustering scenario s1 applied to the male cohort was composed of four clusters, with two clusters having a size of one (Fig. 8). The remaining two clusters were approximately the same size. Cluster 3 was significantly overrepresented with controls (70%, pval < 10⁻⁸), and Cluster 4 was significantly overrepresented with ASD patients (66%, pval < 10⁻⁶). Cluster 4 showed a significantly higher BMIFA (pval < 10⁻⁷), age (pval < 0.0008), and ASD severity (pval = 0.01437) than Cluster 3.

Repeating the previous scenario s1 for a female group of patients resulted in two clusters (Fig. 9).

Cluster 1 was significantly overrepresented with controls (84%, pval = 0.0001), and Cluster 2 was significantly overrepresented with ASD patients (50%). Cluster 2 showed significantly higher BMIFA (pval < 10⁻⁴) and age (pval = 0.017) than Cluster 1; however, this was not true for ASD severity.

Clustering analysis with subsets of all distances

Applying clustering scenario s2 to the male cohort resulted in three clusters (of sizes 160, 9, and 2 probands). Clusters do not show significant over-representation by any of the examined phenotypic categories nor significant differences between examined phenotypic variables. Repeating the same clustering scenario s2 with the female group resulted in three clusters (of sizes 52, 8, and 12). In contrast to the previous case, we observed significant differences in BMIFA between the resulting clusters. Cluster 3 showed a significantly higher BMIFA than Cluster 1 and 2 (pval = 0.012 and 0.003, respectively). No significant differences existed between clusters relative to the other examined phenotypic variables.

The clustering scenario s3 in both the male and female cohort resulted in trivial partitions, where every proband corresponded to a single cluster.

Relationship with ASD severity

The above clustering results, specifically the association of clusters with both significantly different mean BMIFA and ASD severity, suggest that there could be a relationship between BMIFA and ASD severity. We found that ASD boys with higher values of ASD severity also had significantly higher BMIFA values implying a positive relationship between BMIFA and ASD severity (Fig. 10). In ASD girls, the relationship was not significant.

Correlating ASD severity with open or closed mouths demonstrated that the number of severe autism cases within the group of probands with open mouth is significantly higher than is expected by chance, pval = 0.005 (Table 1).

Discussion

Our results within scenario s1 agreed with previous observations²⁰ in that the clusters were associated with significantly different clinical courses of ASD in terms of its severity. However, beyond the results of previous studies^19,20, we demonstrated that the observed clustering and consequently its association with ASD severity measure is determined by BMIFA, using both linear and geodesic distances.

This statement was confirmed by the scenario s2 analysis in which, after removing all outliers and all distances significantly correlating with BMIFA, the resulting clusters did not show an over-representation by any phenotypic category except single cluster in the case of female group and euclidean distances. We observed some residual differences between clusters in BMIFA, especially in case of geodesic distances, but this can be explained by imperfect elimination of BMIFA correlated distances when using Bonferroni correction. However, remarkable disappearance of ASD severity correlations and phenotypic category enrichment is obvious in this case. Using the inverted scenario (s3), where the distances were chosen to be as little influenced by BMIFA as possible, we got trivial partitions in all sub-scenarios except female cohort in case of euclidean distances This further suggests that apart from BMIFA, there were no other reasons for distance vectors/probands to cluster.

In the female group, the results were in partial agreement with the male group, i.e., we also observed significant separation between controls and ASD cases and differences in BMIFA and age, however, not in ASD severity. This may be explained by the smaller number of female scans available; thus, the statistical evidence for females was weaker. Therefore, a study using a larger number of female scans is needed to confirm our results.

The fact that removing BMIFA-related distances leads to the absence of clustering or produces clusters that are not correlated with clinical variables suggests that BMIFA strongly influences clustering results. Furthermore we showed that the commonly used normalization, i.e., division of facial distances by the mean value, does not eliminate correlations between individual facial distances and BMIFA. We also reproduced known results³⁶, demonstrating a positive relationship between BMIFA and ASD severity (Fig. 10) in ASD boys. This fact and the BMIFA effect on clustering explain the correlation between clusters and clinical variables related to ASD severity observed in our work and previous works^19,20.

In their retrospective study, Curtin et al.³⁷ found that children with ASD were 40% more likely to be obese than children in the general population. Several factors may be associated with a higher risk of developing obesity in children with ASD, such as psychopharmacological treatments, sleep disorders, problems with engaging in sustained physical activity, and atypical eating habits³⁸. Autistic children are notoriously picky eaters, often consuming only specific foods or foods with a certain consistency and presentation. It is not unusual for a child to “fear” trying new foods and eat relatively few food items³⁹. Our work raises the question of whether all existing studies dealing with autistic faces clustering based on facial distances cannot be simplified to the equation “high BMIFA equals worse phenotype” because no BMIFA-based exclusion/correction processes are mentioned in the original publications^19,20.

It is clear from our work that BMIFA should be considered in 3D facial soft tissue assessment since it can introduce a significant bias. In a study by Tan et al.⁴⁰, BMIFA is considered, but the methodology is not entirely clear. Influence of BMI is examined by calculating facial areas between selected landmarks in ASD children. No significant differences were found. In 2020 Tan et al. examined non-autistic siblings of ASD children in a similar manner¹⁸, no significant differences were found. BMI was discussed marginally as a confounding factor in similar studies; it was not further considered.

We observed another confounding factor that can bias facial clustering studies, and to our knowledge, has never been mentioned before. We suspect that the authors of previous studies did not exclude 3D models of faces with open mouths. This is partially backed up by Fig. 5 in reference²⁰, which includes a child with an open mouth. However, we were not able to establish contact with the authors for further clarification of this issue. Opening the mouth significantly increases the vertical distances of the face, and these patients could form their “own” cluster. On the other hand, an inability to persuade the child to close their mouth could mean the child does not have the necessary mental capacities or social skills to cooperate and thus might have a more severe behavioural phenotype. This could imply that the analysis correctly identified a group of patients with severe ASD phenotypes. However, this approach bears oversimplification by “automatically” associating an open mouth with a “worse” phenotype. This is not an observation that needs a 3D scanner-based validation. Therefore, in our opinion, these patients should be a priori excluded to get less biased results.

We assessed the severity of ASD clinical presentations from mild to severe (see Table 1) and observed that open-mouth patients have a higher percentage of moderate-to-severe phenotypes than closed-mouth patients, although it must be noted that the number of open-mouth patients available to our study was limited.

There are further potential limitations of our study. Although each patient was scanned from the frontal view with their head in a natural position and a neutral facial expression, there was one factor we could not eliminate, i.e., jaw clenching. A clenched jaw produces different vertical distances than a relaxed jaw, even though the mouth is closed. In our study, the children were shown cartoons to help them remain relaxed and not overly focused on the scanning process. This factor was not addressed in previous studies. Furthermore, the quality of 3D scans can vary according to the device and the degree of patient cooperation within the scanning process.

Another step in further objectifying the process and nullifying the influence of BMIFA is to remove the influence of soft tissues by analyzing landmarks on the cranial skeleton of probands. A brain MRI is a standard part of a clinical ASD workup. This approach was not used and is yet to be evaluated within the context of 3D facial scanning.

In summary, 3D facial scanning remains a promising non-invasive examination modality for objective assessment of patients´ facial phenotypes, whether for documenting purposes or automated human phenotype ontology (HPO) terms extraction. But it is very important that BMI is addressed in 3D scan facial studies to differentiate between traits based on skull features from those based on soft tissue changes. There are still many unknowns in ASD patient phenotypes, be it genetic, facial or behavioral. New methods could be valuable for ASD diagnostics, benefit patients and their families and aid the stratification of patients in terms of their severity and application of customized treatments.

Data availability

The datasets of patients used and/or analyzed during this study are available from the corresponding author upon reasonable request.

References

Handbook of Autism and Pervasive Developmental Disorders. 2 (Wiley, 1997).
Hrdlička, M. & Komárek, V. Dětský Autismus: Přehled Současných Poznatků. 2 (Portál, 2014).
Vorstman, J. A. S. et al. Autism genetics: Opportunities and challenges for clinical translation. Nat. Rev. Genet. 18, 362–376 (2017).
Article CAS PubMed Google Scholar
Lai, M.-C., Lombardo, M. V. & Baron-Cohen, S. Autism. Lancet. 383, 896–910 (2014).
Article PubMed Google Scholar
Wing, L. & Gould, J. Severe impairments of social interaction and associated abnormalities in children: Epidemiology and classification. J. Autism Dev. Disord. 9, 11–29 (1979).
Article CAS PubMed Google Scholar
Hua, R., Wei, M. & Zhang, C. The complex genetics in autism spectrum disorders. Sci. China Life Sci. 58, 933–945 (2015).
Article CAS PubMed Google Scholar
Guo, H. et al. Inherited and multiple de novo mutations in autism/developmental delay risk genes suggest a multifactorial model. Mol. Autism https://doi.org/10.1186/s13229-018-0247-z (2018).
Article PubMed PubMed Central Google Scholar
Bensaid, M. et al. Multi-hit autism genomic architecture evidenced from consanguineous families with involvement of FEZF2 and mutations in high-risk genes. Neuroscience https://doi.org/10.1101/759480 (2018).
Article Google Scholar
Centanni, T. M., Green, J. R., Iuzzini-Seigel, J., Bartlett, C. W. & Hogan, T. P. Evidence for the multiple hits genetic theory for inherited language impairment: A case study. Front. Genet. 6, 272 (2015).
Article PubMed PubMed Central Google Scholar
Miller, L. E., Dai, Y. G., Fein, D. A. & Robins, D. L. Characteristics of toddlers with early versus later diagnosis of autism spectrum disorder. Autism. 25, 416–428 (2021).
Article PubMed Google Scholar
Borden, M. C. & Ollendick, T. H. An examination of the validity of social subtypes in autism. J. Autism Dev. Disord. 24, 23–37 (1994).
Article CAS PubMed Google Scholar
Castelloe, P. & Dawson, G. Subclassification of children with autism and pervasive developmental disorder: A questionnaire based on Wing’s subgrouping scheme. J. Autism Dev. Disord. 23, 229–241 (1993).
Article CAS PubMed Google Scholar
Eaves, L. C., Ho, H. H. & Eaves, D. M. Subtypes of autism by cluster analysis. J. Autism Dev. Disord. 24, 3–22 (1994).
Article CAS PubMed Google Scholar
O’Brien, S. K. The validity and reliability of the Wing Subgroups Questionnaire. J. Autism Dev. Disord. 26, 321–335 (1996).
Article PubMed Google Scholar
Volkmar, F. R., Cohen, D. J., Bregman, J. D., Hooks, M. Y. & Stevenson, J. M. An examination of social typologies in autism. J. Am. Acad. Child Adolesc. Psychiatry 28, 82–86 (1989).
Article CAS PubMed Google Scholar
Duffy, F. H. & Als, H. Autism, spectrum or clusters? An EEG coherence study. BMC Neurol. 19, 27 (2019).
Article PubMed PubMed Central Google Scholar
Hrdlicka, M. et al. Subtypes of autism by cluster analysis based on structural MRI data. EuropChild Adolescent Psych. 14, 138–144 (2005).
Article Google Scholar
Tan, D. W. et al. A broad autism phenotype expressed in facial morphology. Transl. Psychiatry 10, 1–9 (2020).
Article Google Scholar
Aldridge, K. et al. Facial phenotypes in subgroups of prepubertal boys with autism spectrum disorders are correlated with clinical phenotypes. Mol. Autism 2, 15 (2011).
Article PubMed PubMed Central Google Scholar
Obafemi-Ajayi, T. et al. Facial structure analysis separates autism spectrum disorders into meaningful clinical subgroups. J. Autism Dev. Disord. 45, 1302–1317 (2015).
Article PubMed Google Scholar
American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders 5th edn. (American Psychiatric Association, UK, 2013).
Book Google Scholar
Palmer, R. L., Helmholz, P. & Baynam, G. Cliniface: Phenotypic visualisation and analysis using non-rigid registration of 3D facial images. Int. Arch. Photogram. Remote Sens. Spat. Inf. Sci. XLIII-B2-2020, 301–308. https://doi.org/10.5194/isprs-archives-XLIII-B2-2020-301-2020 (2020).
Article Google Scholar
3D Systems. Geomagic Wrap 2017. (2017). Available from: https://www.3dsystems.com/
Farkas, L. G. Anthropometry of the Head and Face (Raven Press, 1994).
Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing. (R Foundation for Statistical Computing, Vienna, 2021). Available from: https://www.R-project.org/
The MathWorks, Inc. Matlab version R2022b. (Natick, 2022). Available from: www.mathworks.com
Peyre, G. Toolbox Fast Marching. MATLAB Central File Exchange; (2023). Available from: https://www.mathworks.com/matlabcentral/fileexchange/6110-toolbox-fast-marching
Spearman, C. The proof and measurement of association between two things. Am. J. Psychol. 100, 441–471 (1987).
Article CAS PubMed Google Scholar
Caliński, T. & Harabasz, J. A dendrite method for cluster analysis. Commun. Stat. 3, 1–27 (1974).
MathSciNet Google Scholar
Rosner, B. Percentage points for a generalized ESD many-outlier procedure. Technometrics 25, 165–172 (1983).
Article Google Scholar
Bonferroni, C. E. Teoria statistica delle classi e calcolo delle probabilita. Pubblicazioni del R Istituto Superiore di Scienze Economiche e Commericiali di Firenze 8, 3–62 (1936).
Google Scholar
Kruskal, W. H. & Wallis, W. A. Use of ranks in one-criterion variance analysis. J. Am. Stat. Assoc. 47, 583–621 (1952).
Article Google Scholar
Student. The probable error of a mean. Biometrika. 1–25 (1908).
Wilcoxon, F. Individual comparisons by ranking methods. Biometr. Bull. 1, 80–83 (1945).
Article Google Scholar
Armitage, P., Berry, G. & Matthews, J. N. S. Statistical Methods in Medical Research. 4th edn 760–783 (2008).
Levy, S. E. et al. Relationship of weight outcomes, co-occurring conditions, and severity of autism spectrum disorder in the study to explore early development. J. Pediatr. 205, 202–209 (2019).
Article PubMed Google Scholar
Curtin, C., Anderson, S. E., Must, A. & Bandini, L. The prevalence of obesity in children with autism: A secondary data analysis using nationally representative data from the National Survey of Children’s Health. BMC Pediatr. 10, 11 (2010).
Article PubMed PubMed Central Google Scholar
Curtin, C., Jojic, M. & Bandini, L. G. Obesity in children with autism spectrum disorders. Harvard Rev. Psychiatry 22, 93 (2014).
Article Google Scholar
Baraskewich, J., von Ranson, K. M., McCrimmon, A. & McMorris, C. A. Feeding and eating problems in children and adolescents with autism: A scoping review. Autism. 25, 1505–1519 (2021).
Article PubMed PubMed Central Google Scholar
Tan, D. W. et al. Hypermasculinised facial morphology in boys and girls with Autism Spectrum Disorder and its association with symptomatology. Sci. Rep. 7, 9348 (2017).
Article ADS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was done under the auspices of ERN ITHACA. We would also like to extend our gratitude to our patients and their families for their cooperation.

Funding

This work was supported by a Charles University Grant (GAUK), Number 134121, MZd Czech Republic 00064203, National Center for Medical Genomics Grant LM2018132 and IGA Grant 22-07-00165.

Author information

Authors and Affiliations

Department of Biology and Medical Genetics, 2nd Faculty of Medicine, Charles University in Prague and Motol University Hospital, Prague, Czech Republic
Martin Schwarz, Jan Geryk, Markéta Havlovicová, Michaela Mihulová, Marek Turnovec, Lukáš Ryba, Júlia Martinková, Milan Macek Jr., Karolína Kočandrlová & Veronika Moslerová
PRENET - Laboratoře Lékařské Genetiky s.r.o., Pardubice, Czech Republic
Martin Schwarz
Faculty of Science and Engineering, Curtin University, Perth, Australia
Richard Palmer
Department of Anthropology and Human Genetics, Faculty of Science, Charles University, Prague, Czech Republic
Karolína Kočandrlová & Jana Velemínská

Authors

Martin Schwarz
View author publications
You can also search for this author in PubMed Google Scholar
Jan Geryk
View author publications
You can also search for this author in PubMed Google Scholar
Markéta Havlovicová
View author publications
You can also search for this author in PubMed Google Scholar
Michaela Mihulová
View author publications
You can also search for this author in PubMed Google Scholar
Marek Turnovec
View author publications
You can also search for this author in PubMed Google Scholar
Lukáš Ryba
View author publications
You can also search for this author in PubMed Google Scholar
Júlia Martinková
View author publications
You can also search for this author in PubMed Google Scholar
Milan Macek Jr.
View author publications
You can also search for this author in PubMed Google Scholar
Richard Palmer
View author publications
You can also search for this author in PubMed Google Scholar
Karolína Kočandrlová
View author publications
You can also search for this author in PubMed Google Scholar
Jana Velemínská
View author publications
You can also search for this author in PubMed Google Scholar
Veronika Moslerová
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The primary author, M.S., conceived, designed, drafted the manuscript, and led genetic consultations with the patients. M.S., V.M., K.K., M.T., M.M., L.R. and J.M. performed 3D scanning of patients and helped with their editing. MS and VM edited 3D scans the most substantially. J.G. lead statistical analysis and its interpretation. RP modified the Cliniface software for the needs of this research. M.H., K.K., J.V., M.M. Jr. and V.M. substantively revised the manuscript. All authors have agreed to be personally accountable for their contributions. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Martin Schwarz.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Table 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Schwarz, M., Geryk, J., Havlovicová, M. et al. Body mass index is an overlooked confounding factor in existing clustering studies of 3D facial scans of children with autism spectrum disorder. Sci Rep 14, 9873 (2024). https://doi.org/10.1038/s41598-024-60376-0

Download citation

Received: 16 November 2023
Accepted: 22 April 2024
Published: 30 April 2024
DOI: https://doi.org/10.1038/s41598-024-60376-0

Keywords

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Investigation of age-related facial variation among Angelman syndrome patients

Large-scale open-source three-dimensional growth curves for clinical facial assessment and objective description of facial dysmorphism

The age distribution of facial metrics in two large Korean populations

Introduction

Methods

Patient cohort

3D facial gestalt scanning

Data analysis

Correlation analysis

Hierarchical clustering procedure

Statistical testing

Ethical approval and consent to participate

Results

Linear distances

Clustering analysis with all combinatorically possible distances between landmarks

Clustering analysis with defined subsets of all distances

Geodesic distances

Clustering analysis with all combinatorically possible distances between landmarks

Clustering analysis with subsets of all distances

Relationship with ASD severity

Discussion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Table 1.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Comments

Search

Quick links