Population-specific facial traits and diagnosis accuracy of genetic and rare diseases in an admixed Colombian population

Echeverry-Quiceno, Luis M.; Candelo, Estephania; Gómez, Eidith; Solís, Paula; Ramírez, Diana; Ortiz, Diana; González, Alejandro; Sevillano, Xavier; Cuéllar, Juan Carlos; Pachajoa, Harry; Martínez-Abadías, Neus

doi:10.1038/s41598-023-33374-x

Article
Open access
Published: 27 April 2023

Scientific Reports volume 13, Article number: 6869 (2023)

Abstract

Up to 40% of rare disorders (RD) present facial dysmorphologies, and visual assessment is commonly used for clinical diagnosis. Quantitative approaches are more objective, but mostly rely on European descent populations, disregarding diverse population ancestry. Here, we assessed the facial phenotypes of Down (DS), Morquio (MS), Noonan (NS) and Neurofibromatosis type 1 (NF1) syndromes in a Latino-American population, recording the coordinates of 18 landmarks in 2D images from 79 controls and 51 patients. We quantified facial differences using Euclidean Distance Matrix Analysis, and assessed the diagnostic accuracy of Face2Gene, an automatic deep-learning algorithm. Individuals diagnosed with DS and MS presented severe phenotypes, with 58.2% and 65.4% of significantly different facial traits. The phenotype was milder in NS (47.7%) and non-significant in NF1 (11.4%). Each syndrome presented a characteristic dysmorphology pattern, supporting the diagnostic potential of facial biomarkers. However, population-specific traits were detected in the Colombian population. Diagnostic accuracy was 100% in DS, moderate in NS (66.7%) but lower in comparison to a European population (100%), and below 10% in MS and NF1. Moreover, admixed individuals showed lower facial gestalt similarities. Our results underscore that incorporating populations with Amerindian, African and European ancestry is crucial to improve diagnostic methods of rare disorders.

Introduction

According to the Online Mendelian Inheritance in Man (OMIM) databank, there are more than 10,000 genetic and rare diseases (RD) affecting 7% of the world's population^1,2. This corresponds to approximately 500 million people. Although as a whole genetic and RD are a significant cause of morbidity and mortality in the pediatric population³, individually each disorder affects very few people. Depending on the country, the prevalence threshold for considering a disease rare ranges from 1 affected individual in 50,000 people to 1 in 200,000. This low prevalence has limited research on rare disorders.

Currently, there is limited knowledge on the etiology of these disorders. Only a small percentage of diseases (20%) have a known molecular basis associated with a detailed phenotype description, and treatment is only available for 0.04% of RD³. As orphan diseases, many RD are chronic and incurable, representing severe and debilitating conditions⁴. The diagnosis and management of genetic RD is currently a clinical challenge⁵. Precise and early diagnosis is crucial for individuals and their families to get effective care and to reduce disease progression. However, due to the limited knowledge and complexity of these pathologies, diagnosis may take several years⁶. People often suffer during a long diagnostic odyssey, with delays in their correct treatment and management⁷. For most rare diseases, there are no reliable biomarkers for early diagnosis⁸.

Among the wide constellation of clinical symptoms associated with genetic and rare disorders, craniofacial dysmorphologies emerge as potential biomarkers^9,10. These phenotypes are highly prevalent^2,6 and are commonly used for diagnosis, management and treatment monitoring of genetic and RD⁶. Up to 40% of these disorders present characteristic craniofacial phenotypes, including Down, Morquio, Noonan, Apert, Rett, Fragile X, Williams-Beuren, Treacher-Collins and velocardiofacial syndromes, as well as other conditions such as microcephaly, holoprosencephaly, palate/lip cleft, and some 2,000 other rare genetic disorders^10,11.

The genetic and environmental factors causing these disorders alter the complex process that orchestrates facial morphogenesis during pre- and postnatal development, inducing facial dysmorphologies. Facial development is highly regulated by multiple signaling pathways^12,13,14, including Fibroblast Growth Factor (FGF), Hedgehog (HH), Wingless (WNT), Transforming Growth Factor Beta (TGF-β) and Bone Morphogenetic Proteins (BMPs). Disruptions in the regulation of any of these signaling pathways can lead to facial dysmorphogenesis¹⁵.

The facial patterns associated with each disorder are unique, but vary within and among diagnoses, ranging from subtle facial anomalies to severe malformations¹⁶. In clinical practice, craniofacial dysmorphology is commonly assessed through qualitative visual assessment and basic anthropometric measurements. However, this approach may not capture with optimal precision the anatomical complexity of the facial dysmorphologies associated with these disorders. Qualitative descriptions of facial phenotypes are sometimes based on general terms such as coarse face, large and bulging head; saddle-like, flat bridged nose with broad, fleshy tip; or malformed teeth^17,18,19. Accurate identification of dysmorphic features for diagnosis thus depends on the clinician’s expertise, and only highly trained dysmorphologists are able to recognize the facial “gestalt” characteristic of the rarest disorders¹⁹.

Recent research seeks to incorporate objective and quantitative tools for assessing facial phenotypes into the clinical diagnosis of RD^20,21,22,23,24,25. Automated systems have been developed to improve and accelerate the diagnostic process^9,10,26. Within clinical practice, the most commonly used system is Face2Gene (FDNA Inc., https://www.face2gene.com/), a community-driven phenotyping platform trained on over 17,000 people representing more than 200 syndromes⁹. Face recognition is performed on 2D images that can be collected with any type of digital camera or phone, without previous training. Syndrome classification is achieved using DeepGestalt, a cascade Deep Convolutional Neural Network (DCNN)-based method that achieved an average 91% top-10 accuracy in identifying the correct syndrome⁹.

Other diagnostic approaches based on 3D photogrammetry have been developed more recently^10,20,21. The advantage of 3D facial models is that they are more efficient than 2D images in capturing the complexity of facial phenotypes, but their widespread use is limited because the photographic equipment required for generating 3D models is not commonly available in the clinical practice. Hallgrímsson et al. (2020)¹⁰ analyzed 3D facial models from 7,057 subjects including subjects with 396 different syndromes, relatives and unrelated unaffected subjects (https://www.facebase.org/). Deep phenotyping based on quantitative 3D facial imaging and machine learning presented a balanced accuracy of 73% for syndrome diagnosis²⁰.

Automated methods have thus demonstrated high potential to facilitate the diagnosis of facial dysmorphic syndromes^6,9,10,26. These tools achieve high diagnostic accuracy in European and North American populations, which are the populations in which the machine learning algorithms have been trained and validated. However, these tools have not been thoroughly tested in populations with different ancestries, and it is not well understood how facial phenotypes associated with genetic and RD might be influenced by the complex patterns of population ancestry characterizing human populations.

Population ancestry in facial dysmorphologies: a long-disregarded factor

Facial shape shows wide variation across world-wide human populations²⁷. Facial differences between populations are detected in the shape of the forehead, brow ridges, eyes, nose, cheeks, mouth and jaw²⁸. These facial phenotypes result from the divergent evolutionary and adaptive histories of human populations during the evolution of Homo sapiens over the last 200,000 years. Nowadays, continuous migration and admixture keep shaping the facial phenotypes of human populations. Depending on dominance and epistatic interactions between alleles fixed or predominant in each parental group³⁰, admixed populations can display a variety of craniofacial morphologies, ranging from resemblance to one of the parental groups to a combination of both parental phenotypes and the evolution of novel phenotypes²⁹. Therefore, the evolutionary and population dynamics of human populations result in genetic and phenotypic patterns that act as surrogates of population ancestry^30,31,32, and can modulate the facial phenotypes associated with disease.

Few studies to date have analyzed the craniofacial phenotypes associated with genetic and RD in populations of non-European descent^33,34,35,36, leaving African, Asian and Latin-American populations often disregarded and underrepresented. Unfortunately, there are no reliable representations of facial phenotypes in genetic and rare diseases in populations of non-European descent. However, it is crucial to account for the influence of population ancestry on facial variation to develop quantitative approaches that efficiently diagnose these disorders in populations from all over the world.

To fill this gap, here we assessed the facial dysmorphologies associated with prevalent genetic and RD in a Latin-American population from the Southwest of Colombia. Latin-Americans are fascinating cases of hybrid/admixed populations that evolved over relatively short periods of time^30,37. Peopling of the Americas likely started 12,000–18,000 years ago^38,39 by migration waves coming from North and South East Asia³⁰, following coastal and continental routes⁴¹. Amerindian populations established all over the continent and adapted to a variety of environments over thousands of years. During the last 600 years, admixture with European and African populations further shaped the genetic ancestry of Latin-American populations^42,43. In particular, the population from the region of Cali is the result of diverse migratory processes⁴⁴. Admixture with the indigenous Amerindian population began in the sixteenth century with the arrival of Spanish colonizers. In the eighteenth century, large colonial settlements of slaves brought from Africa were established in Cali for the exploitation of sugar cane, which significantly changed the population structure of Valle del Cauca. Nowadays, the population of Cali is characterized by indigenous and mestizo communities, with Amerindian and African ancestry components predominating over the European ancestry contribution⁴⁴.

In this study, we compared the facial phenotypes associated with four genetic and RD, including Down syndrome (DS), the metabolic disorder Mucopolysaccharidosis type IVA, known as Morquio syndrome (MS), and two types of RASopathies, Noonan syndrome (NS) and Neurofibromatosis type 1 (NF1). The facial phenotype of these syndromes has not been previously characterized in Latin-American populations, and differences between populations with different ancestry backgrounds have not been assessed^34,35,36. Here, we quantitatively assessed the facial phenotypes associated with these syndromes, and compared our results in a Colombian admixed population with those reported in European descent populations. We also assessed the diagnostic accuracy of automatic methods currently used in clinical practice, and detected evidence suggesting that further research is needed to optimize these methods in admixed populations of non-European descent.

Materials and methods

Participant recruitment for photographic sessions

The Colombian sample comprised 130 individuals from Valle del Cauca, a Southwest region in Colombia (Table 1). The cohort included 79 age-matched controls and 51 individuals diagnosed with Down, Morquio, Noonan and Neurofibromatosis type 1 syndromes who were recruited from the clinical genetics consultation at Hospital-Fundación Valle del Lili in Cali (Colombia), a tertiary health reference center for these genetic and rare disorders. In most cases, clinical diagnoses were confirmed by molecular genetic testing.

Table 1 Sample composition by diagnosis. The table provides the number of male (M) and female (F) participants, as well as the total sample size for each syndrome. The age range within each diagnostic group is also provided, where \({\overline{\text{x}}}\) represents the average age.

Down syndrome (DS, OMIM 190685), caused by trisomy of chromosome 21, was selected because it is one of the most common genetic disorders, and previous studies have shown that the clinical manifestations associated with DS vary across ethnicities³⁵. Within RD, we included Morquio syndrome type A (MS, OMIM 253000) because Colombia presents one of the highest prevalences of MS in the world, probably as a result of founder effects⁴⁵. Morquio syndrome is a subtype of Mucopolysaccharidosis disorders caused by more than 180 autosomal recessive mutations in the GALNS gene⁴⁶ that alter the metabolism of the extracellular matrix glycosaminoglycans⁴⁷. Individuals with MS show coarse facies with an excessively rapid growth of the head⁴⁸.

Finally, we also included in the analyses two RASopathies, Noonan syndrome (NS, OMIM 163950) and Neurofibromatosis type 1 (NF1, OMIM 162200), which are prevalent in Valle del Cauca and present altered craniofacial development by genetic mutations that cause Ras/MAPK pathway dysregulation⁴⁹.

To assess the facial phenotypes associated with these disorders, individuals diagnosed with DS, MS, NS and NF1 and age-matched controls were recruited for photographic sessions at educational and research centers in Cali (Colombia) in 2021. The photographic material was collected under the protocol approved by the Human Research Ethics Committee of the Icesi University (Approval Act No. 309). To photograph the participants and to record relevant clinical information, we obtained informed consent from the participants or from their parents or legal guardians in the case of minor children, in accordance with national guidelines and regulations.

Facial image acquisition and anatomical landmark collection

Facial shape was captured from 2D images taken using a professional digital camera (SONY Alpha 58 + 18–55) that was attached to a tripod and placed at a distance of one meter in front of the participants. To capture a natural facial gesture, the images were acquired in an upright position with a neutral facial expression. Participants were asked to sit still, looking towards the front, with open eyes and closed mouth. Although this was challenging in children with Down syndrome, who usually show hyperactivity and tongue protrusion due to hypotonia, several photographs were taken until a neutral facial expression was achieved.

To measure the facial shape of each individual and to detect the traits associated with each disorder, we recorded the 2D coordinates of a set of 18 anatomical facial landmarks (Fig. 1 and Supplementary Table 1). Landmarks were acquired using an automatic facial landmark detection procedure adapted from the open-source software library Dlib⁵⁰. The automatic landmarking process is explained in detail in Supplementary Information. In brief, from the set of 68 landmarks registered by Dlib, 15 landmarks directly matched our configuration of 18 facial landmarks (Fig. 1, Fig. S1, Table S1). Three additional landmarks were approximated through direct computations between the landmark coordinates automatically returned by Dlib: the glabella was computed as the midpoint between the innermost points located in the eyebrows, and the palpebrale inferius landmarks of the right and left eyes were computed as the midpoint between the two central lower eyelid landmarks.
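The three derived landmarks are simple midpoints, which can be sketched as follows. This is only an illustration: the index pairs assume the common 0-based 68-point layout used by Dlib's pretrained shape predictor (inner eyebrow points 21 and 22, lower eyelid points 40/41 and 46/47), which the text does not specify, and the function and key names are hypothetical.

```python
from typing import Dict, List, Tuple

Point = Tuple[float, float]

# Assumed 0-based indices in the common 68-point layout; not given in the text.
INNER_EYEBROW_PAIR = (21, 22)  # innermost point of each eyebrow
LOWER_EYELID_PAIRS = {
    "palpebrale_inferius_eye_36_41": (40, 41),  # eye spanning points 36-41
    "palpebrale_inferius_eye_42_47": (46, 47),  # eye spanning points 42-47
}


def midpoint(p: Point, q: Point) -> Point:
    """Return the point halfway between p and q."""
    return ((p[0] + q[0]) / 2.0, (p[1] + q[1]) / 2.0)


def derived_landmarks(points: List[Point]) -> Dict[str, Point]:
    """Compute the three landmarks that are not returned directly by the detector."""
    if len(points) != 68:
        raise ValueError("expected 68 (x, y) landmarks")
    i, j = INNER_EYEBROW_PAIR
    out = {"glabella": midpoint(points[i], points[j])}
    for name, (a, b) in LOWER_EYELID_PAIRS.items():
        out[name] = midpoint(points[a], points[b])
    return out
```

Which of the two eye keys corresponds to the subject's right or left eye depends on the image orientation convention, so the keys are named by index range rather than by side.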

The validity of the data was assessed by comparing the coordinates of landmarks automatically detected by Dlib with the coordinates of landmarks manually collected by an expert facial morphologist. Manual and automatic measurement differences were assessed for each individual landmark using the root mean square error (RMSE) (Fig. S2). This method was first validated with the 2D facial images of 20 control subjects, and the average RMSE was 1.75 mm. To validate the automatic landmarking method with images of syndromic patients, we manually landmarked 20 patients, including 5 individuals diagnosed with each syndrome represented in our sample. The RMSE for syndromic patients was slightly higher (RMSE = 1.96 mm), but below 2 mm (Fig. S2). Considering that this error threshold is widely accepted in studies of biological anthropology for craniometric measurements⁵¹, the precision of the automatic detection method of anatomical points was validated on both control and syndromic samples.
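As a rough sketch of this validation step, the per-landmark RMSE can be computed from paired manual and automatic coordinates. The exact error definition used by the authors is in the Supplementary Information; here we assume the RMSE of the Euclidean offset across subjects, with coordinates already expressed in millimetres. The function name is hypothetical.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]


def landmark_rmse(manual: List[List[Point]],
                  automatic: List[List[Point]]) -> List[float]:
    """Per-landmark RMSE between two landmark sets.

    manual[s][j] and automatic[s][j] are the (x, y) positions, in mm,
    of landmark j on subject s.
    """
    n_subjects = len(manual)
    n_landmarks = len(manual[0])
    rmse = []
    for j in range(n_landmarks):
        squared_error = 0.0
        for s in range(n_subjects):
            dx = manual[s][j][0] - automatic[s][j][0]
            dy = manual[s][j][1] - automatic[s][j][1]
            squared_error += dx * dx + dy * dy
        rmse.append(math.sqrt(squared_error / n_subjects))
    return rmse
```

Averaging the returned list gives a single summary value comparable in spirit to the 1.75 mm and 1.96 mm figures reported above.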

Quantification of facial phenotypes

We used Euclidean distance matrix analysis (EDMA) to describe the facial phenotype associated with each syndrome. EDMA is a robust morphometric method for assessing local differences between samples⁵² by detecting linear distances that significantly differ between pairwise sample contrasts and comparing patterns of significant differences across samples.

To account for size differences between subjects, the 2D coordinates of the facial landmarks of each subject were scaled by their centroid size, estimated as the square root of the sum of squared distances of all the landmarks from their centroid⁵³. After scaling, as EDMA represents shape as a matrix of linear distances between all possible pairs of landmarks, a total of 153 unique facial measurements were calculated for each individual. Linear distances were compared between each syndrome group (DS, MS, NS and NF1) and control individuals by performing two-tailed, two-sample shape contrasts on all unique inter-landmark linear distances from each sample. Relative differences between patients and controls were computed as (mean distance in controls − mean distance in patients) / mean distance in controls.
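The size correction and the distance matrix can be illustrated with a short sketch. It is not the EDMA software itself: the function names are hypothetical, and dividing each distance by centroid size is used as an equivalent of scaling the coordinates first. With 18 landmarks, the number of unique pairs is 18 × 17 / 2 = 153.

```python
import math
from itertools import combinations
from typing import List, Tuple

Point = Tuple[float, float]


def centroid_size(points: List[Point]) -> float:
    """Square root of the summed squared distances of landmarks to their centroid."""
    cx = sum(x for x, _ in points) / len(points)
    cy = sum(y for _, y in points) / len(points)
    return math.sqrt(sum((x - cx) ** 2 + (y - cy) ** 2 for x, y in points))


def scaled_distances(points: List[Point]) -> List[float]:
    """All unique inter-landmark distances, divided by centroid size."""
    size = centroid_size(points)
    return [math.dist(p, q) / size for p, q in combinations(points, 2)]


def relative_differences(controls: List[List[float]],
                         patients: List[List[float]]) -> List[float]:
    """(mean in controls - mean in patients) / mean in controls, per distance."""
    result = []
    for k in range(len(controls[0])):
        mean_c = sum(row[k] for row in controls) / len(controls)
        mean_p = sum(row[k] for row in patients) / len(patients)
        result.append((mean_c - mean_p) / mean_c)
    return result
```

Each row passed to `relative_differences` is the output of `scaled_distances` for one individual, so all rows share the same ordering of landmark pairs.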

Statistical significance was assessed using a non-parametric bootstrap test with 10,000 resamples. EDMA statistically evaluated the number of significant local linear distances in each two-sample comparison based on confidence interval testing. We used the default α level in EDMA (α = 0.10), and a 90% confidence interval was calculated for each linear distance. The shape differences were sorted in increasing order, and the lowest 5% and the highest 5% of differences were discarded. The resulting minimum and maximum differences were used to set the lower and upper confidence limits for each linear distance. Interlandmark distances were considered non-significantly different between controls and patients when the resulting interval contained the value zero. Otherwise, the null hypothesis of equality was rejected, and we assumed that a significant shape difference existed at the α level⁵⁴. To pinpoint specific local shape differences and to reveal the unique morphological pattern of variation associated with each disorder, the ten largest and the ten smallest significant relative differences were plotted on facial figures.
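The confidence interval logic for a single linear distance can be sketched as a percentile bootstrap. This is a simplified stand-in, not the EDMA implementation: we assume that each group is resampled with replacement and that the statistic is the difference of group means; the function names and the `seed` parameter are hypothetical.

```python
import random
from typing import List, Tuple


def bootstrap_ci(controls: List[float], patients: List[float],
                 n_boot: int = 10000, alpha: float = 0.10,
                 seed: int = 0) -> Tuple[float, float]:
    """Percentile bootstrap interval for (mean controls - mean patients)."""
    rng = random.Random(seed)
    diffs = []
    for _ in range(n_boot):
        # Resample each group with replacement, keeping the group sizes.
        c = [rng.choice(controls) for _ in controls]
        p = [rng.choice(patients) for _ in patients]
        diffs.append(sum(c) / len(c) - sum(p) / len(p))
    diffs.sort()
    # Discard the lowest and highest alpha/2 fraction of the differences.
    k = int(n_boot * alpha / 2)
    return diffs[k], diffs[n_boot - k - 1]


def is_significant(interval: Tuple[float, float]) -> bool:
    """A distance differs significantly when the interval excludes zero."""
    lower, upper = interval
    return not (lower <= 0.0 <= upper)
```

Running this for each of the 153 distances yields one significance flag per distance, which is the input needed to count significantly different traits.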

Facial dysmorphology score

To confirm that the results were not random due to the small sample sizes available in rare diseases, we combined the results from EDMA with an iterative bootstrapping method that further assessed whether the facial dysmorphologies associated with each syndrome were statistically significant⁵⁵. First, we estimated from the EDMA results a facial dysmorphology score (FDS) as the percentage of significantly different distances between patient and control groups. Then, we ran simulations with random samples of controls and patients generated by iterative bootstrapping to assess the statistical significance of the patterns revealed by EDMA. For each disorder, we first created subsamples of N randomly chosen controls (where N is the total number of patients available in the sample). Then, using a subsampling approach, we automatically generated random pseudo-subsamples containing a known number of patients (namely M). This procedure was repeated with increasing numbers of patients and resulted in a series of staggered pseudo-subsamples that contained from M = 0 to M = N patients. A total of 150 simulations were run in each round, and in each of these simulations, we ran an EDMA analysis and computed an FDS.
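A minimal sketch of the score and of one pseudo-subsample draw is given below. The text does not spell out how the pseudo-subsamples are completed, so this sketch assumes that each one has size N and is filled with N − M controls that do not overlap with the N reference controls; both function names are hypothetical.

```python
import random
from typing import List, Sequence, Tuple


def facial_dysmorphology_score(significant: Sequence[bool]) -> float:
    """Percentage of linear distances flagged as significantly different."""
    return 100.0 * sum(significant) / len(significant)


def pseudo_subsample(controls: List, patients: List, m: int,
                     rng: random.Random) -> Tuple[List, List]:
    """Draw N reference controls and a mixed group holding exactly m patients.

    N is the number of available patients. The mixed group is completed
    with N - m further controls (an assumption of this sketch).
    """
    n = len(patients)
    if not 0 <= m <= n:
        raise ValueError("m must lie between 0 and the number of patients")
    drawn = rng.sample(controls, 2 * n - m)  # without replacement
    reference = drawn[:n]
    mixed = rng.sample(patients, m) + drawn[n:]
    return reference, mixed
```

Calling `pseudo_subsample` 150 times for each value of M from 0 to N, and scoring each draw, would reproduce the staggered structure of the simulation rounds described above.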

The results from each round of random groups were separately represented in histograms. The first round of simulations contained no patients (M = 0) and only included control individuals, representing facial differences that can be found randomly in the general population. To assess whether the FDS value obtained using the complete patient dataset was significantly different from the FDS resulting from a random sample, we compared the distribution of random FDS values with the FDS observed in the whole sample. The P-value assessing the statistical significance of the comparison was computed as the number of simulations containing no patients that provided a higher FDS than the observed FDS, divided by the total number of simulations. P-values below 0.05 indicated that the FDS obtained using the real dataset was higher than the FDS obtained randomly in a sample of control subjects.
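The P-value defined here is a plain empirical proportion, as in the following sketch (the function name is hypothetical). With 150 simulations per round, the smallest non-zero P-value this procedure can return is 1/150, or about 0.007.

```python
from typing import Sequence


def empirical_p_value(null_scores: Sequence[float], observed: float) -> float:
    """Fraction of control-only simulations whose FDS exceeds the observed FDS."""
    exceed = sum(1 for score in null_scores if score > observed)
    return exceed / len(null_scores)
```

For example, if 3 of the 150 control-only simulations gave an FDS above the observed value, the result would be 3 / 150 = 0.02.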

Face2Gene diagnostic assessment

To assess the accuracy of automated diagnostic methods in the Colombian sample, we compared the clinical diagnosis based on clinical and genetic testing with the diagnosis estimated from the frontal facial 2D images of the patients using the Face2Gene technology (FDNA Inc., Boston, MA, USA; https://www.face2gene.com). Following Gurovich et al.⁹, we assessed the top-one and top-five accuracies for each disorder, estimated as the percentage of cases where the Face2Gene model predicted the correct syndrome as the first result or within the first five results of the sorted list of probable diagnoses. We also calculated these accuracies after expanding the diagnostic range to the disorder family.
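Top-k accuracy, with and without the expansion to the disorder family, can be sketched as follows. The sketch does not call any Face2Gene interface: it assumes the ranked suggestion lists have been transcribed by hand, and the function name, the `family` mapping and the syndrome labels in the usage note are purely illustrative.

```python
from typing import Dict, List, Optional


def top_k_accuracy(ranked: List[List[str]], truth: List[str], k: int,
                   family: Optional[Dict[str, str]] = None) -> float:
    """Percentage of cases whose true diagnosis is within the first k suggestions.

    If `family` is given, it maps syndrome names to a disorder family and
    a suggestion counts as correct when the families match.
    """
    def label(name: str) -> str:
        return family.get(name, name) if family else name

    hits = 0
    for suggestions, true_name in zip(ranked, truth):
        if label(true_name) in {label(s) for s in suggestions[:k]}:
            hits += 1
    return 100.0 * hits / len(truth)
```

As a toy usage example, with `ranked = [["Noonan", "NF1"], ["Down"]]`, `truth = ["NF1", "Down"]` and `family = {"Noonan": "RASopathy", "NF1": "RASopathy"}`, the exact top-1 accuracy is 50% while the family-level top-1 accuracy is 100%.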

Moreover, we evaluated the similarity between the Colombian patients and the facial gestalt models used by Face2Gene for syndrome classification. For each individual, we selected the first diagnostic prediction that matched their clinical and genetic diagnosis and recorded the gestalt similarity. We classified the level of similarity between the individual and the corresponding gestalt model into seven categories, including “very low”, “low”, “low-medium”, “medium”, “medium–high”, “high”, and “very high” gestalt similarity, using the “gestalt level” barplot provided by Face2Gene.

Finally, to further test the influence of population ancestry on the diagnostic accuracy of Face2Gene, and to directly compare the results with individuals from European descent populations, we performed an extensive search of public image databases to obtain 2D photos of European subjects diagnosed with DS, NS, MS and NF1 syndromes. We collected the images of 45 subjects with DS⁵⁶; and 24 diagnosed with NS⁵⁷. Unfortunately, no 2D images of European individuals diagnosed with MS and NF1 were found publicly available. Using these images, we tested the accuracy of Face2Gene in DS and NS employing the same method previously described for the Colombian population. However, we could not use these publicly available images to perform EDMA and FDS analyses on the European samples, because the pictures were not taken under controlled conditions⁵⁶, and diverse facial expression and head position would lead to bias in results of quantitative shape comparisons.

Results

EDMA analyses showed that each syndrome presented a characteristic facial phenotype.

In individuals with Down syndrome, all facial structures including the eyes, nose and mouth presented significant differences as compared to controls. Overall, DS was associated with wider but shorter facial traits (Fig. 2A).

Results showed a 6.5% increase in the relative distance between the midpoint between the eyebrows (glabella) and the most inferior medial point of the lower right eyelid (palpebrale inferius), and a 7.5% increase between the right palpebrale inferius and the outer commissure of the right eye (exocanthion), indicating hypertelorism. Additionally, in this Colombian sample, people with DS exhibited longer measurements in the buccal portion, with a 6–8% increase of mouth width as measured from the crista philtri to the chelions (Fig. 2A). However, the midfacial and nasal regions were reduced (Fig. 2A). People with DS presented a 6–8% reduction in measurements of midfacial height, with the largest difference detected as a 9.7% reduction of the distance between the tip and the root of the nose (Fig. 2A). The facial dysmorphology score (FDS) indicated that up to 58.2% of facial traits were significantly different in people with DS (Fig. 2B).

The facial pattern associated with Morquio syndrome was also characterized by wider and shorter midfacial traits, as observed in Down syndrome. However, facial dysmorphologies were more abundant and severe in MS than in DS, with 65.4% of facial traits significantly different in diagnosed individuals and higher percentages of relative change (Fig. 3A,B). The most affected regions were the midface and the nose, whereas the mouth was the least affected. Individuals with MS presented hypertelorism, with a 14% increase in the distance between the midpoint between the eyebrows (glabella) and the inner commissures of the left and right eyes (endocanthions). Individuals with MS also showed larger distances in the base of the nose, with a 14–19% increase in the distance from the tip of the nose to the insertion of the right and left alar bases (subalare) as compared to controls. Mouth width was also increased in MS, whereas midfacial heights measuring the distance between the eyes and the nose were significantly reduced, by 10 to 16%, in individuals with MS (Fig. 3A).

In Noonan syndrome, facial dysmorphologies were abundant and concentrated in the orbital and nasal regions. EDMA detected significantly increased distances in the upper face, but decreased distances in the midface (Fig. 4A).

Patients presented a lower position of the eyes, with 9 to 13% increased distances between the glabella or sellion and the landmarks located in the eyes. The mouth also showed a more inferior position, with 8–10% increased relative distance between the tip of the nose and the superior lip, but the shape of the mouth did not show large differences between patients and controls. The reduction of midfacial heights in individuals with NS ranged from 5 to 11%, with a similar magnitude as in DS (Fig. 4A). FDS indicated that 47.7% of facial traits were significantly different in NS (Fig. 4B).

Neurofibromatosis type 1 was associated with minor facial dysmorphologies, which were less abundant and less severe than in the previous syndromes (Fig. 5A). Individuals with NF1 only presented 11.4% of significantly different facial traits as compared to controls, and the percentages of relative change were low, mostly ranging from 1 to 5% (Fig. 5A,B). The largest difference was a 10% increase in facial distance between the glabella and the labiale superius (Fig. 5A). Along with larger distances in the midline of the face, EDMA detected reduced distances on the right and left sides of the face, with shorter distances from the right and left chelion to the eye landmarks, the endocanthion and the palpebrale inferius. Hypertelorism was not present in individuals with NF1 (Fig. 5A). In NF1, the FDS score was not significant (Fig. 5B), indicating that the facial dysmorphology pattern associated with NF1 is so subtle that, overall, it is not larger than the facial differences that could be randomly detected using a sample of control subjects.

For the other syndromes, the simulation tests confirmed that the facial dysmorphologies associated with Down, Morquio and Noonan syndromes were significant and different from random comparisons in control subjects. Few simulations resulted in a higher FDS than the FDS obtained with the complete real sample (Figs. 2B, 3B, 4B, first row and blue line). Moreover, in DS, MS and NS, facial dysmorphology scores increased as larger numbers of diagnosed individuals were included in the simulations (Figs. 2B, 3B, 4B, middle rows), confirming the severity of the facial dysmorphologies associated with these syndromes. Finally, the simulations comparing all recruited diagnosed individuals (last row) with random subsamples of control subjects (first row) indicated that FDS scores can range widely from 10 to 80%, underscoring the biasing effects of small sample sizes.

Face2Gene accuracy in Colombian and European populations

After quantifying the facial dysmorphologies associated with DS, MS, NS and NF1 in the Colombian sample, we tested the accuracy of the diagnosis provided by the automatic diagnostic algorithms of Face2Gene. We assessed the correspondence between the Face2Gene diagnosis estimated from frontal facial 2D images and the diagnosis based on clinical and genetic testing.

Face2Gene estimated Down syndrome diagnosis with a top-1 accuracy of 100%, as DS diagnosis was listed as the first diagnosis in all individuals, with an average gestalt similarity of 6.2 (Table 2, Fig. 6). When comparing the gestalt similarities in Colombian and European populations, a Wilcoxon test did not find a significant difference between the average gestalt similarities (P = 0.4). However, a Levene test detected a significant difference in the variance of gestalt similarity scores (P = 0.01). Whereas in the Colombian population the gestalt similarity in DS ranged from very high to very low, in the European population the range of variation was limited, from very high to medium (Fig. 7).

Table 2 Accuracy of Face2Gene diagnosis based on 2D facial images in Down, Morquio, Noonan and Neurofibromatosis type 1 syndromes in a Colombian population. Percentage of cases matching the genetic diagnosis are provided for each syndrome, as well as gestalt similarity values.

In Morquio syndrome, the top-1 accuracy of Face2Gene was 0%, as the specific diagnosis of mucopolysaccharidosis type IVA (MPSIVA) was never listed as a first prediction (Table 2). Although Face2Gene could not identify the specific type of MS, the automatic diagnostic algorithms associated the facial dysmorphologies with a diagnosis related to mucopolysaccharidosis disorders in 36.4% of cases, with a medium–high average gestalt similarity of 5.6 (Table 2). When the first 5 diagnostic predictions were considered, the top-5 accuracy rose to 45.4% for exact MPSIVA diagnosis and to 100% for mucopolysaccharidosis disorders, but with a low-medium gestalt similarity (Table 2, Fig. 6). In our sample, we detected five genetic variants (p.Gly301Cys, p.Arg386Cys, p.Arg94Cys, p.Gly333Asp, and p.Ser80Leu) that are missense mutations commonly found in the Colombian population⁴⁵ (Table S2). Due to the small sample size and genetic heterogeneity of the patients, it was not possible to test whether different genetic variants were associated with different facial phenotypes. Comparative European samples were not available.

The top-1 accuracy of Face2Gene for Noonan syndrome was 66.7%, with a medium–high average gestalt similarity of 5.2 when considering subjects in whom the diagnosis was successful (Table 2). Top-5 accuracy increased to 77.8% for exact NS diagnosis, and to 88.9% when considering Noonan Syndrome-Like Disorder diagnoses, with a medium gestalt similarity of 4.4 and wide variation among individuals (Table 2, Fig. 6). Although the differences did not reach statistical significance (P = 0.09), probably because of the small sample sizes, the comparison between populations showed that both the diagnostic accuracy and the gestalt similarity were higher in Europe than in Colombia. Using 2D images of patients of European origin, the Face2Gene top-1 accuracy for NS was 100% and the average gestalt similarity was 5.5 (Fig. 7).

Finally, in Neurofibromatosis type 1, Face2Gene presented a top-1 accuracy of 8.3%, associated with a very low gestalt similarity of 1 (Table 2). When diagnoses within the RASopathies disorder family were considered, 5 out of 12 individuals were diagnosed with Noonan syndrome and the top-1 accuracy rose to 50% (Table 2). The top-5 diagnostic accuracy was 66.6% and was associated with low gestalt similarity values of between 1 and 2 in 87.5% of individuals (Table 2, Fig. 6). Comparative European samples were not available for NF1.
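The top-1 and top-5 figures reported above can be read as the share of individuals whose confirmed diagnosis appears among the first one or five ranked predictions, optionally accepting related diagnoses from the same disorder family. A minimal sketch of that calculation follows; the ranked lists, the `accepted` mapping and the function name are hypothetical, and this is not Face2Gene's own scoring code.

```python
def top_k_accuracy(ranked_predictions, true_labels, k, accepted=None):
    """Share of cases whose true diagnosis, or an accepted related diagnosis,
    appears among the first k ranked predictions."""
    accepted = accepted or {}
    hits = 0
    for preds, truth in zip(ranked_predictions, true_labels):
        targets = {truth} | set(accepted.get(truth, ()))
        if targets & set(preds[:k]):
            hits += 1
    return hits / len(true_labels)


# Hypothetical ranked outputs for three individuals with confirmed Noonan syndrome.
cases = [
    ["Noonan syndrome", "Costello syndrome"],
    ["Williams syndrome", "Noonan syndrome"],
    ["Noonan syndrome-like disorder", "Kabuki syndrome"],
]
truth = ["Noonan syndrome"] * 3

# Hypothetical grouping of related diagnoses counted as a family-level match.
family = {"Noonan syndrome": ["Noonan syndrome-like disorder"]}

print(top_k_accuracy(cases, truth, k=1))                   # exact match, first prediction
print(top_k_accuracy(cases, truth, k=5))                   # exact match, first five
print(top_k_accuracy(cases, truth, k=5, accepted=family))  # family-level match
```

Separating the exact match from the family-level match mirrors the distinction drawn in the text between an exact MPSIVA or NS diagnosis and a diagnosis within the broader disorder group.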

Discussion

Our analyses provided an accurate quantitative comparison of facial dysmorphologies in Down, Morquio, Noonan and Neurofibromatosis type 1 syndromes in a Latin-American population from Colombia. An objective and highly detailed description of the facial phenotype is a major improvement over qualitative descriptions of the complex facial dysmorphologies associated with these genetic disorders. We quantified local differences in facial traits between people diagnosed with these disorders and age-matched controls from the same population, localizing the largest statistically significant facial dysmorphologies.
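To make this comparison concrete, the sketch below illustrates the general idea behind Euclidean Distance Matrix Analysis: each face is reduced to the set of all distances between landmark pairs (153 distances for 18 landmarks), patient and control means are compared as ratios, and a bootstrap confidence interval flags the distances whose ratio differs from 1. This is a simplified illustration under stated assumptions (plain arithmetic means, a percentile bootstrap with alpha = 0.10, simulated landmark coordinates). It is not the authors' MATLAB implementation, and all function names are hypothetical.

```python
import math
import random
from itertools import combinations


def form_vector(landmarks):
    """All pairwise Euclidean distances between the 2D landmarks of one face."""
    return [math.dist(p, q) for p, q in combinations(landmarks, 2)]


def mean_form(faces):
    """Arithmetic mean of each inter-landmark distance across a group of faces."""
    vectors = [form_vector(face) for face in faces]
    return [sum(col) / len(col) for col in zip(*vectors)]


def form_difference(patients, controls):
    """Ratio of mean patient distance to mean control distance, per landmark pair."""
    return [p / c for p, c in zip(mean_form(patients), mean_form(controls))]


def significant_share(patients, controls, n_boot=500, alpha=0.10, seed=0):
    """Fraction of distances whose bootstrap interval for the ratio excludes 1."""
    rng = random.Random(seed)
    boots = []
    for _ in range(n_boot):
        # Resample individuals with replacement within each group.
        p = rng.choices(patients, k=len(patients))
        c = rng.choices(controls, k=len(controls))
        boots.append(form_difference(p, c))
    lo_i = int(n_boot * alpha / 2)
    hi_i = int(n_boot * (1 - alpha / 2)) - 1
    flagged = 0
    for ratios in zip(*boots):
        ordered = sorted(ratios)
        if ordered[lo_i] > 1 or ordered[hi_i] < 1:
            flagged += 1
    return flagged / len(boots[0])


# Simulated data (assumption): 18 landmarks per face on a regular grid,
# with patient faces about 10% larger than control faces plus small noise.
rng = random.Random(1)
base = [(float(i % 6), float(i // 6)) for i in range(18)]


def simulate(scale):
    return [(scale * x + rng.gauss(0, 0.02), scale * y + rng.gauss(0, 0.02))
            for x, y in base]


controls = [simulate(1.0) for _ in range(20)]
patients = [simulate(1.1) for _ in range(15)]

print(len(form_vector(base)), "inter-landmark distances per face")
share = significant_share(patients, controls, n_boot=200)
print(f"{share:.1%} of distances differ between groups")
```

Working with distances rather than raw coordinates means the comparison does not depend on how each face is positioned or rotated in the image, which is why no alignment step appears in the sketch.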

Our results indicated differential facial patterns associated with each disorder, with major significant dysmorphologies in DS, MS and NS, and minor facial dysmorphologies associated with NF1. Different types of genetic alterations significantly affected the facial phenotypes, ranging from aneuploidy and overall genetic imbalance in DS to point mutations affecting different processes or signaling pathways, such as the metabolism of mucopolysaccharides in MS and the RAS/MAPK pathway in NS and NF1. These genetic alterations disrupt the signaling pathways regulating normal facial development^16,58, and alter normal morphogenesis and growth during pre- and postnatal development¹⁵ of individuals with genetic and rare disorders.

Population-specific facial traits in Colombian individuals with genetic and rare disorders

Overall, the facial patterns observed in the Colombian Latin-American population coincide with the descriptions reported in the literature for each syndrome^48,59,60,61. However, there are specific local traits that differ, suggesting that facial traits associated with genetic and rare diseases might be modulated by population ancestry, as a result of different evolutionary and adaptive histories of human populations^33,34,35.

Down syndrome

Down syndrome presents a worldwide prevalence of 14 per 10,000 live births, with life expectancy increasing from 25 to 60 years in developing countries^62,63,64,65. In most Latino-American regions, the real incidence of DS remains unknown and is usually underreported. A cross-sectional study in Brazil reported a DS birth rate of 4 cases per 10,000 live births⁶⁶, whereas in Colombia several studies have reported prevalence rates between 1 per 1,000 and 5 per 10,000 live births^67,68. DS is an aneuploidy caused by trisomy of chromosome 21, and is the leading genetic cause of intellectual disability⁶³. Moreover, DS is associated with craniofacial dysmorphologies that impair vital functions such as breathing, eating, and speaking. In the literature, the DS craniofacial phenotype is mostly based on the analysis of European descent populations, and the characteristic traits include brachycephalic heads with maxillary hypoplasia leading to facial flatness; depressed nasal bridge and reduced airway passages⁵⁹; dysplastic ears with lobe absence; eyes with oblique palpebral fissures, epicanthal folds, strabismus and nystagmus^16,69; and oral alterations including open mouth, cleft lip, lingual furrows and protrusion, macroglossia, micrognathia, and narrow palate^70,71.

In the Colombian population, we found facial dysmorphologies that are consistent with the craniofacial patterns reported in the literature. For instance, our analyses detected differences in linear facial measurements that correspond to typical DS traits such as hypertelorism, maxillary hypoplasia, and shorter and wider faces associated with a brachycephalic head^16,72. Results also suggested other characteristic traits of DS, such as midfacial retrusion and depressed nasal bridge⁵⁹. Open mouth and macroglossia^70,71 were also observed in the participants of our study during the photographic sessions.

However, in contrast to European and North American populations⁵⁵, in the Colombian population we detected that the mouth was wider in individuals diagnosed with DS than in euploid controls. This difference could be caused by unnatural facial gestures of the participants when asked to close the mouth during the photo shoot, or by facial differences associated with ancestry. In fact, Kruszka et al.^33,34,35 analyzed individuals diagnosed with DS in diverse populations, and showed craniofacial differences between individuals from different populations (Africans, Asians, and Latin Americans), demonstrating that ancestry is a relevant factor when assessing craniofacial variation associated with rare disorders.

Morquio syndrome

In Morquio syndrome, the worldwide prevalence ranges from 1 case per 75,000 to 1 per 200,000 live births, whereas in Colombia the prevalence reaches 0.68 per 100,000 live births⁴⁵. As a mucopolysaccharidosis syndrome, the typical alterations of MS involve the supporting tissue and the osteoarticular system⁷³. Individuals with MS display abnormalities such as skeletal dysplasia, short stature and trunk, kyphoscoliosis, pectus carinatum, genu valgum, and joint hyperlaxity⁷⁴. Oral diseases often include periodontal disease, malocclusions, caries, and premature tooth loss⁴⁶. Individuals with MS show coarse facies, with an excessively rapid growth of the head⁴⁸. Craniofacial features include a prominent forehead, hypertelorism, prognathism, wide mouth and nose, depressed nasal bridge, plump cheeks, and lips with an oversized tongue⁴⁸. In the Colombian population, the facial dysmorphologies observed were consistent with traits reported in the literature, which included hypertelorism, prognathism, wide nose, and wide mouth^46,48.

In the Colombian sample, Morquio syndrome was associated with the most severe facial dysmorphologies. Considering that keratan and chondroitin sulfate alterations associated with MS cause irreparable damage to leukocytes and fibroblasts, and accumulate over life inducing extreme deformations of the osteoarticular system, facial dysmorphologies associated with MS are expected to increase with age, becoming more severe in adult individuals⁴⁶. Further research is required to test this hypothesis and to assess whether pharmacological treatments can slow down the progression of the disease and reduce the facial dysmorphologies associated with MS. This is especially relevant in Colombia, which has one of the highest prevalences of MS in the world⁴⁵.

Moreover, dysmorphologies associated with MS vary among individuals. Typically, MS patients present severe phenotypes, although less severe forms have been described as mild or attenuated phenotypes⁷³. There is no consistent evidence regarding the genotype–phenotype correlation in MS, or on whether different GALNS mutations are associated with the degree of severity in facial dysmorphology. In our Colombian sample, we detected five genetic variants (p.Gly301Cys, p.Arg386Cys, p.Arg94Cys, p.Gly333Asp, and p.Ser80Leu). Two of these variants, p.Gly301Cys and p.Arg386Cys, are the most frequently reported mutations in cases of Morquio syndrome, specifically in Colombia but also in other American (Brazil, Chile, Argentina, Canada) and European countries (Spain, Portugal, Italy, Poland)^45,75,76,77. The high prevalence of the p.Gly301Cys mutation in the Colombian population could result from founder and migration effects⁴⁵. The p.Arg386Cys variant has also been detected in China and Turkey^75,76,77, whereas the p.Arg94Cys allele has been previously reported in the Middle East, Brazil, and Italy^76,77. Other genetic variants, such as p.Ile113Phe, which are more frequently reported in British and Irish populations^45,75,76,77, were not detected in our Colombian sample. Further tests with larger samples for each genotype are needed to test whether the population-specific genetic variants can be associated with different facial phenotypes in Morquio syndrome.

RASopathies: Noonan and NF1 syndromes

The worldwide prevalence of Noonan syndrome is 1 per 1,000 to 1 per 2,500 live births⁴⁹. NS is the most common type of RASopathy, and is a rare genetically heterogeneous autosomal dominant disorder caused by mutations in either the PTPN11, SOS1, KRAS, BRAF or RAF1 genes. Individuals with NS display facial features such as hypertelorism, epicanthic folds, strabismus, downward slanting palpebral fissures, ptosis, high arched palate, deeply grooved philtrum with high peaks of upper lip vermillion border, midfacial hypoplasia and micrognathia, broad flat nose, low-set posteriorly rotated ears, curly/sparse/coarse hair, and short webbed neck⁶⁰. In the Colombian population, we detected hypertelorism, downward slanting palpebral fissures, and midfacial hypoplasia in cases of NS, as reported in populations of European descent⁶⁰. In addition, our results quantified relative changes in the position of the mouth in Colombian individuals diagnosed with NS that had not been reported before⁷⁸.

In Neurofibromatosis type 1, the worldwide incidence is 1 per 2,500 to 1 per 3,000 individuals⁴⁹. NF1 is an autosomal dominantly inherited neurocutaneous disorder caused by a mutation in the neurofibromin gene. The clinical manifestations of NF1 are variable, and the timing of the onset has a major influence⁴⁹. Regarding craniofacial traits, individuals with NF1 present macrocephaly, facial asymmetry caused by dysplasia of the sphenoid wings⁶¹, as well as bone deformities caused by plexiform neurofibromas, enlarged mandibular canal, retrognathic mandible and maxilla, and short cranial base⁷⁹. The facial pattern associated with NF1 in individuals from Colombia was also compatible with typical traits of NF1, such as midface hypoplasia⁴⁹. However, our results did not detect facial asymmetry or hypertelorism as prominent facial differences between diagnosed individuals and controls in the Colombian population⁴⁹.

Overall, our results support previous evidence demonstrating that rare disorders present distinctive facial traits that are population specific, with clinical features that are significantly different in Africans, Asians, and Latin Americans^34,35,36. However, comparative facial quantitative analyses including subjects from different world regions are not usually available for most genetic and rare disorders, and reference data for diagnosis are mainly based on phenotypes defined in populations of European descent. In fact, almost no images of individuals of Latin American origin are included in reference medical texts¹⁶. Our results underscore the need to extend the analyses to populations from all over the world, both to achieve a more complete and accurate phenotypic representation of genetic and rare disorders and to optimize the diagnostic potential of facial biomarkers in clinical practice.

Variable diagnostic accuracy in a Colombian population with diverse ancestry

Deep learning algorithms such as Face2Gene have shown potential as a reliable and precise tool for genetic diagnosis by image recognition^9,26,80,81. In the Colombian sample analyzed here, Face2Gene diagnosed Down syndrome with 100% accuracy, the same accuracy as in the European sample. This result suggests that in a relatively common genetic disorder such as DS, in which the machine learning algorithm is likely trained on a large sample of individuals with a distinctive and well-represented facial phenotype, Face2Gene shows high diagnostic accuracy, independently of genetic ancestry.

However, we found that this result cannot be extrapolated to other rare disorders. For instance, we detected a lower accuracy in the diagnosis of Noonan syndrome in the Colombian sample than in the European sample. Although Face2Gene correctly identified the disorder in most Colombian subjects, especially when considering the top-5 accuracy within Noonan syndrome-like disorders (88.9%), the top-1 accuracy fell from 100% in the European sample to 66.7% in the Colombian sample. We hypothesize that when machine learning algorithms are trained on a relatively small sample of individuals with homogeneous European ancestry, the accuracy of diagnosing rare disorders might be more sensitive to population ancestry. Individuals from diverse populations may show lower gestalt similarity scores when assessed with predictive models that are trained on a population with different genetic and facial variation, and this may lead to reduced diagnostic accuracy.

Unfortunately, no data were publicly available on European samples to compare the diagnostic accuracy of Face2Gene in Morquio and Neurofibromatosis type 1 syndromes. Our results showed that the top-1 accuracy for exact diagnosis of Mucopolysaccharidosis type IVA was 0% in the Colombian sample, even though Morquio syndrome was associated with the most severe facial dysmorphologies. Only a low percentage of cases (36.4%) were identified as a mucopolysaccharidosis-like syndrome in the first prediction. In the case of NF1, the top-1 accuracy was also very low (8.3%), although the facial dysmorphologies in this disorder were less abundant and severe, and this result could simply reflect the difficulty of diagnosing NF1 from facial traits.

Finally, in the Colombian sample we detected a wide range of variation in gestalt similarity scores for most disorders, even for Down syndrome. In European subjects, the gestalt similarity for DS was high or very high in 95.5% of cases, and only 5% of subjects showed a medium gestalt score, even though the images included in Ferry et al. (2014)⁵⁶ were ordinary photos with uncontrolled lighting, pose, and image quality. In Colombia, 79% of individuals diagnosed with DS were associated with very high gestalt similarity values, but in 21% of subjects the gestalt similarity was lower, and ranged from medium–high to very low values. Specifically, individuals with the lowest scores exhibited traits that suggested an admixed ancestry, a hypothesis that needs further assessment.

The potential of facial biomarkers to diagnose genetic and rare disorders

Qualitative visual assessment of facial dysmorphologies is frequently employed for diagnosis, clinical management and treatment monitoring of RD¹⁶. Experts in dysmorphologies can identify the facial “gestalt” distinctive of many dysmorphic syndromes¹⁶. However, this facial assessment relies on the expertise of the clinician, and is very challenging because there is no clear one-to-one correspondence between disorders and facial dysmorphologies. Different genetic mutations can cause the same syndrome or similar phenotypes, whereas the same mutation can induce different phenotypes^12,82. In addition, within the same rare disease there may be several subtypes, and symptoms may vary even among individuals with the same genetic disorder and within the same family³. This complex biology generates confusion at the time of diagnosis and warrants the development of efficient, objective and reliable diagnostic methods.

Computer-assisted phenotyping can overcome these pitfalls and provide widely accessible technologies for quick syndrome screening⁶. In this automated approach, methods can be based on 2D or 3D images^9,10,26. The advantage of 2D methods is that data collection is easy and can be readily translated into clinical practice, as physicians can take facial images even with simple digital cameras or smartphones. The collection of 3D models is more sophisticated and requires specialized equipment, but provides more accurate phenotype descriptions by incorporating the depth dimension.

To further improve the methods of craniofacial assessment to diagnose individuals with genetic syndromes and RD that exhibit facial dysmorphologies, it is crucial to assess the large morphological variation displayed by human populations in facial phenotypes. Factors such as age, sex and ancestry should be accounted for in diagnostic methods. Clinical manifestations in some genetic disorders usually begin at an early age, with two thirds of patients expressing symptoms before the second year of life³, although in other disorders facial dysmorphologies develop later, during postnatal development. Male and female faces present sexual dimorphism at adulthood⁸³, and diseases can affect the facial phenotype differently depending on sex⁸⁴.

The role of population ancestry in the facial phenotype associated with genetic and rare disorders also needs to be further investigated in future analyses, assessing the reliability and validity of automatic diagnostic tools in admixed populations with diverse contributions of Amerindian, African and European ancestry components. This is critical in rare disorders with heterogeneous clinical presentation and phenotype, where clinical diagnosis is a challenging process^5,6 that may take several years, leading to the so-called diagnostic odyssey⁷.

Accurate and early diagnosis of genetic and rare disorders is crucial for adequate health care and clinical management. Without a diagnosis, individuals and their families must proceed without basic information regarding their health and future developmental outcomes⁶. Even though gene-based technologies have greatly improved diagnostic procedures²⁵, the mutations causing many rare diseases are still not known and access to genetic testing is limited³. Genetic consultations may become a long process, and broad molecular testing such as exome and genome sequencing represents a high expense that is not affordable for all families and health care systems, especially in low- and middle-income countries⁷. In this context, faster, non-invasive and low-cost diagnostic methods based on facial phenotypes emerge as complementary tools for providing earlier and reliable first diagnoses^9,10,25,26.

Therefore, in future research the recruitment of participants must be expanded to include as many individuals with RD as possible, together with large comparative samples of age-matched controls, from both sexes, and from diverse world regions that faithfully represent the complex craniofacial variation and evolutionary histories of human populations. For instance, the population in Southwestern Colombia is characterized by high levels of admixture from people with Native American, African, and European ancestry^44,85. Including the morphological variation of faces from such different ancestry backgrounds is key to pinpointing the facial dysmorphologies associated with diseases in diverse populations worldwide⁸⁶. Our simulation analyses further highlight the importance of maximizing the recruitment of diagnosed and control individuals, as results change considerably depending on the cohort and sample sizes.

Conclusions

Facial phenotypes associated with genetic and rare disorders can be influenced by population ancestry^34,35,36. Our ancestry comparisons highlight that diverse genetic background variation can modulate the phenotypic response to disease, affecting the accuracy of current tools of clinical diagnosis. In the future, deep learning algorithms including a high variety of populations with different ancestry backgrounds will optimize the precision and accuracy of diagnosis in an unbiased approach. Such predictive models will support clinicians in decision-making across the world.

Data availability

Raw phenotype data from the Colombian population cannot be made available due to restrictions imposed by the ethics approval. Images from publicly available sources can be accessed from the original publications^56,57. Anonymized landmark data and MATLAB code for computing the Facial Dysmorphology Score (FDS) are available at https://github.com/xaviersevillano/EDMA_FDS_analysis_2D.

References

Nguengang Wakap, S. et al. Estimating cumulative point prevalence of rare diseases: analysis of the Orphanet database. Eur. J. Hum. Genet. 28, 165–173. https://doi.org/10.1038/s41431-019-0508-0 (2020).
Viteri, J. et al. Enfermedades huérfanas. Arch. Ven. Farm. Terap. 39, 627–636. https://doi.org/10.5281/ZENODO.4263347 (2020).
Suárez-Obando, F. La atención clínica de las enfermedades raras: Un reto para la educación médica. Med. BA 40, 228–241 (2018).
Cortés, F. Las enfermedades raras. Rev. Méd. Clín. Cond. 26, 425–431. https://doi.org/10.1016/j.rmclc.2015.06.020 (2015).
Schieppati, A., Henter, J.-I., Daina, E. & Aperia, A. Why rare diseases are an important medical and social issue. Lancet 371, 2039–2041. https://doi.org/10.1016/S0140-6736(08)60872-7 (2008).
Bannister, J. J. et al. Fully automatic landmarking of syndromic 3D facial surface scans using 2D images. Sensors 20, 3171. https://doi.org/10.3390/s20113171 (2020).
González-Lamuño, D. & García-Fuentes, M. Enfermedades de base genética. An. Sist. San. Nav. 31, 105–126 (2008).
Gülbakan, B. et al. Discovery of biomarkers in rare diseases: innovative approaches by predictive and personalized medicine. EPMA J. 7, 1–6. https://doi.org/10.1186/s13167-016-0074-2 (2016).
Gurovich, Y. et al. Identifying facial phenotypes of genetic disorders using deep learning. Nat. Med. 25, 60–64. https://doi.org/10.1038/s41591-018-0279-0 (2019).
Hallgrímsson, B. et al. Automated syndrome diagnosis by three-dimensional facial imaging. Gen. Med. 22, 1682–1693. https://doi.org/10.1038/s41436-020-0845-y (2020).
Farrera, A. et al. Ontogeny of the facial phenotypic variability in Mexican patients with 22q11.2 deletion syndrome. Hea. Fac. Med. 15, 29. https://doi.org/10.1186/s13005-019-0213-9 (2019).
Martínez-Abadías, N. et al. FGF/FGFR signaling coordinates skull development by modulating magnitude of morphological integration: Evidence from Apert syndrome mouse models. PLoS ONE 6, e26425. https://doi.org/10.1371/journal.pone.0026425 (2011).
Richtsmeier, J. T. & Flaherty, K. Hand in glove: Brain and skull in development and dysmorphogenesis. Act. Neu. 125, 469–489. https://doi.org/10.1007/s00401-013-1104-y (2013).
Hallgrímsson, B. et al. Morphometrics, 3D imaging, and craniofacial development. Curr. Top. Dev. Bio. 115, 561–597. https://doi.org/10.1016/bs.ctdb.2015.09.003 (2015).
Kouskoura, T. et al. The genetic basis of craniofacial and dental abnormalities. Riv. Men. Svi. Odon. Sto. 121, 636–646 (2011).
Jones, K.L., Jones, M.C., & Campo, M. Smith’s recognizable patterns of human malformation (ed. Elsevier Health Sciences) (Amsterdam, 2021).
Aase, J.M. The physical examination in dysmorphology in Diagnostic dysmorphology (ed. Plenum Medical Book Company) 33–42 (New York and London, 1990).
Johannes, M., Clara, V., Hubert, C. & Raoul, H. Phenotypic abnormalities: Terminology and classification. Am. J. Med. Gen. 123A, 211–230. https://doi.org/10.1002/ajmg.a.20249 (2003).
Reardon, W. & Donnai, D. Dysmorphology demystified. Arch. Dis. Child. Fet. Neo. 92, F225–F229. https://doi.org/10.1136/adc.2006.110619 (2007).
Hammond, P. et al. 3D analysis of facial morphology. Am. J. Med. Gen. 126A, 339–348. https://doi.org/10.1002/ajmg.a.20665 (2004).
Hammond, P. The use of 3D face shape modelling in dysmorphology. Arch. Dis. Child. 92, 1120–1126. https://doi.org/10.1136/adc.2006.103507 (2007).
Hammond, P. & Suttie, M. Large-scale objective phenotyping of 3D facial morphology. Hum. Mut. 33, 817–825. https://doi.org/10.1002/humu.22054 (2012).
Hurst, A. C. E. Facial recognition software in clinical dysmorphology. Curr. Op. Ped. 30, 701–706. https://doi.org/10.1097/MOP.0000000000000677 (2018).
Köhler, S. et al. Expansion of the Human Phenotype Ontology (HPO) knowledge base and resources. Nuc. Ac. Res. 47, D1018–D1027. https://doi.org/10.1093/nar/gky1105 (2019).
Agbolade, O., Nazri, A., Yaakob, R., Ghani, A. A. & Cheah, Y. K. Down syndrome face recognition: A review. Symmetry. 12, 1182. https://doi.org/10.3390/sym12071182 (2020).
Hsieh, T. C. et al. GestaltMatcher facilitates rare disease matching using facial phenotype descriptors. Nat. Gen. 54, 349–357. https://doi.org/10.1038/s41588-021-01010-x (2022).
Xiong, Z. et al. Novel genetic loci affecting facial shape variation in humans. Elife 8, e49898. https://doi.org/10.7554/eLife.49898 (2019).
Qiao, L. et al. Genome-wide variants of Eurasian facial shape differentiation and a prospective model of DNA based face prediction. J. Gen. Gen. 45, 419–432. https://doi.org/10.1016/j.jgg.2018.07.009 (2018).
Martínez-Abadías, N. et al. Phenotypic evolution of human craniofacial morphology after admixture: a geometric morphometrics approach. Am. J. Phys. Anth. 129, 387–398. https://doi.org/10.1002/ajpa.20291 (2006).
Quinto-Sánchez, M. et al. Facial asymmetry and genetic ancestry in Latin American admixed populations. Am. J. Phys. Anth. 157, 58–70. https://doi.org/10.1002/ajpa.22688 (2015).
Ruiz-Linares, A. et al. Admixture in Latin America: geographic structure, phenotypic diversity and self-perception of ancestry based on 7,342 individuals. PLoS Gen. 10, e1004572. https://doi.org/10.1371/journal.pgen.1004572 (2014).
Sheehan, M. J. & Nachman, M. W. Morphological and population genomic evidence that human faces have evolved to signal individual identity. Nat. Commun. 5, 4800. https://doi.org/10.1038/ncomms5800 (2014).
Kruszka, P. et al. 22q11.2 deletion syndrome in diverse populations. Am. J. Med. Gen. A 173, 879–888. https://doi.org/10.1002/ajmg.a.38199 (2017).
Kruszka, P. et al. Noonan syndrome in diverse populations. Am. J. Med. Gen Part A. 173, 2323–2334. https://doi.org/10.1002/ajmg.a.38362 (2017).
Kruszka, P. et al. Down syndrome in diverse populations. Am. J. Med. Gen. Part A. 173, 42–53. https://doi.org/10.1002/ajmg.a.38043 (2017).
Dowsett, L. et al. Cornelia de Lange syndrome in diverse populations. Am. J. Med. Gen. A 179, 150–158. https://doi.org/10.1002/ajmg.a.61033 (2019).
Mendoza-Revilla, J. et al. Disentangling signatures of selection before and after European colonization in Latin Americans. Mol. Biol. Ev. 39, msac076. https://doi.org/10.1093/molbev/msac076 (2022).
Ardelean, C. F. et al. Evidence of human occupation in Mexico around the Last Glacial Maximum. Nature 584, 87–92. https://doi.org/10.1038/s41586-020-2509-0 (2020).
Becerra-Valdivia, L. & Higham, T. The timing and effect of the earliest human arrivals in North America. Nature 584, 93–97. https://doi.org/10.1038/s41586-020-2491-6 (2020).
Castro E Silva, M. A., Ferraz, T., Bortolini, M. C., Comas, D. & Hünemeier, T. Deep genetic affinity between coastal Pacific and Amazonian natives evidenced by Australasian ancestry. Proc. Nat. Ac. Sci. USA 118, 1. https://doi.org/10.1073/pnas.2025739118 (2021).
González-José, R. et al. Craniometric evidence for Palaeoamerican survival in Baja California. Nature 425, 62–65. https://doi.org/10.1038/nature01816 (2003).
Salzano, F. M. & Bortolini, M. C. The Evolution and Genetics of Latin American Populations 512 (Cambridge University Press, Cambridge, 2002).
Salzano, F. M. & Sans, M. Interethnic admixture and the evolution of Latin American populations. Gen. Mol. Biol. 37, 151–170. https://doi.org/10.1590/s1415-47572014000200003 (2014).
Urrea-Giraldo, F. & Álvarez, A. F. C. Cali an enlarged region city: an approximation from the ethnic-racial dimension and population flows. Rev. Soc. Ec. UV. 33, 145–174. https://doi.org/10.25100/sye.v0i33.5628 (2017).
Pachajoa, H. et al. Molecular characterization of mucopolysaccharidosis type IVA patients in the Andean region of Colombia. Am. J. Med. Gen. Part C. 187, 388–395. https://doi.org/10.1002/ajmg.c.31936 (2021).
Herrera, L. M. C., Martínez, A. V., López, N. M., Téllez, J. M. & Contreras, X. D. M. Síndrome de Morquio, enfermedad de interés para la odontopediatría. Presentación de un caso. Rev. Ped. Elec. 14, 2–11 (2017).
Sawamoto, K. et al. Mucopolysaccharidosis IVA: Diagnosis, treatment, and management. Int. J. Mol. Sci. 21, 1517. https://doi.org/10.3390/ijms21041517 (2020).
Suárez-Guerrero, J. L., Suárez, A. K. B., Santos, M. C. V. & Contreras-García, G. A. Caracterización clínica, estudios genéticos, y manejo de la Mucopolisacaridosis tipo IV A. Med. UIS. 26, 43–50 (2013).
Hernández-Martín, A. & Torrelo, A. Rasopathies: Developmental disorders that predispose to cancer and skin Manifestations. Act. Dermo-Sifiliográficas. 102, 402–416. https://doi.org/10.1016/j.adengl.2011.02.002 (2011).
King, D. E. Dlib-ml: A machine learning toolkit. J. Mach. Learn. Res. 10, 1755–1758 (2009).
Stull, K. E., Tise, M. L., Ali, Z. & Fowler, D. R. Accuracy and reliability of measurements obtained from computed tomography 3D volume rendered images. Foren. Sci. Int. 238, 133–140. https://doi.org/10.1016/j.forsciint.2014.03.005 (2014).
Lele, S. R. & Richtsmeier, J. T. Euclidean Distance Matrix Analysis: A coordinate-free approach for comparing biological shapes using landmark data. Am. J. Phys. Anth. 86, 415–427 (1991).
Rohlf, F. J. & Slice, D. Extensions of the Procrustes method for the optimal superimposition of landmarks. Syst. Biol. 39, 40–59 (1990).
Lele, S. R. & Cole, T. A new test for shape differences when variance-covariance matrices are unequal. J. Hum. Evo. 31, 193–212 (1996).
Starbuck, J. M. et al. Green tea extracts containing epigallocatechin-3-gallate modulate facial development in Down syndrome. Sci. Rep. 11, 4715. https://doi.org/10.1038/s41598-021-83757-1 (2021).
Ferry, Q. et al. Diagnostically relevant facial gestalt information from ordinary photos. Elife 3, e02020. https://doi.org/10.7554/eLife.02020 (2014).
Allanson, J. E. et al. The face of Noonan syndrome: Does phenotype predict genotype. Am. J. Med. Gen. 152A, 1960–1966. https://doi.org/10.1002/ajmg.a.33518 (2010).
Article Google Scholar
Terrazas, K., Dixon, J., Trainor, P. A. & Dixon, M. J. Rare syndromes of the head and face: mandibulofacial and acrofacial dysostoses. Wiley Interd. Rev. Dev. Biol. 6, 263. https://doi.org/10.1002/wdev.263 (2017).
Article CAS Google Scholar
Starbuck, J. M., Cole, T. M., Reeves, R. H. & Richtsmeier, J. T. Trisomy 21 and facial developmental instability. Am. J. Phys. Anth. 151, 49–57. https://doi.org/10.1002/AJPA.22255 (2013).
Article Google Scholar
Athota, J. P. et al. Molecular and clinical studies in 107 Noonan syndrome affected individuals with PTPN11 mutations. BMC. Med. Gen. 21, 50. https://doi.org/10.1186/s12881-020-0986-5 (2020).
Article CAS Google Scholar
Khosrotehrani, K., Bastuji-Garin, S., Zeller, J., Revuz, J. & Wolkenstein, P. Clinical risk factors for mortality in patients with Neurofibromatosis 1: A cohort study of 378 patients. Arch. Derm. 139, 187–191. https://doi.org/10.1001/archderm.139.2.187 (2003).
Article PubMed Google Scholar
Glasson, E. J. et al. The changing survival profile of people with Down’s syndrome: Implications for genetic counselling. Clin. Gen. 62, 390–393. https://doi.org/10.1034/j.1399-0004.2002.620506.x (2002).
Article CAS Google Scholar
Roper, R. & Reeves, R. Understanding the basis for Down syndrome phenotypes. PLoS Gen. 2, e50. https://doi.org/10.1371/journal.pgen.0020050 (2006).
Article CAS Google Scholar
Patterson, D. Molecular genetic analysis of Down syndrome. Hum. Gen. 126, 195–214. https://doi.org/10.1007/s00439-009-0696-8 (2009).
Article CAS Google Scholar
Aivazidis, S. et al. The burden of trisomy 21 disrupts the proteostasis network in Down syndrome. PLoS ONE 12, e0176307. https://doi.org/10.1371/journal.pone.0176307 (2017).
Article CAS PubMed PubMed Central Google Scholar
Laignier, M. R., Lopes-Júnior, L. C., Santana, R. E., Leite, F. M. C. & Brancato, C. L. Down syndrome in Brazil: Occurrence and associated factors. Int. J. Env. Res. Pub. He. 18, 11954. https://doi.org/10.3390/ijerph182211954 (2021).
Article Google Scholar
Hernández Ramírez, I. & Manrique Hernández, R. D. Prevalencia de síndrome de Down en CEHANI-ESE, San Juan de Pasto Colombia 1998–2003. Nova 4, 50–56. https://doi.org/10.22490/24629448.347 (2006).
Article Google Scholar
Valencia Arana, C. A. et al. Prevalencia al nacimiento de síndrome de Down en la ciudad de Manizales (Caldas-Colombia) durante el periodo 2004–2005. Biosalud. 69. https://link.gale.com/apps/doc/A258132055/IFME?u=anon~ab6dcaef&sid=googleScholar&xid=7f6e25b7 (2008).
Korayem, M. & Bakhadher, W. Craniofacial manifestations of Down syndrome: A review of literature. Ac. J. Sci. Res. 3, 176–181. https://doi.org/10.15413/ajsr.2019.0502 (2019).
Article Google Scholar
Hennequin, M., Faulks, D., Veyrune, J.-L. & Bourdiol, P. Significance of oral health in persons with Down syndrome: A literature review. Dev. Med. Child. Neu. 41, 275–283. https://doi.org/10.1111/j.1469-8749.1999.tb00599 (1999).
Article CAS Google Scholar
Oliveira, A. C. B., Paiva, S. M., Campos, M. R. & Czeresnia, D. Factors associated with malocclusions in children and adolescents with Down syndrome. Am. J. Orth Dent. Orth. 133, 489-e1 (2008).
Google Scholar
Vicente, A. et al. Craniofacial morphology in down syndrome: A systematic review and meta-analysis. Sci Rep 10, 19895. https://doi.org/10.1038/s41598-020-76984-5 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Suárez-Guerrero, J. L., Gómez Higuera, P. J. I., Arias Flórez, J. S. & Contreras-García, G. A. Mucopolisacaridosis: Características clínicas, diagnóstico y de manejo. Rev. Chil. Ped. 87, 295–304. https://doi.org/10.1016/j.rchipe.2015.10.004 (2016).
Article Google Scholar
Ortiz-Quiroga, D., Ariza-Araújo, Y. & Pachajoa, H. Calidad de vida familiar en pacientes con síndrome de Morquio tipo IV-A. Una mirada desde el contexto colombiano (Suramérica). Rehabilitación. 52, 230–237. https://doi.org/10.1016/j.rh.2018.07.002/ (2018).
Article Google Scholar
Tomatsu, S. et al. Mutation and polymorphism spectrum of the GALNS gene in mucopolysaccharidosis IVA (Morquio A). Hum. Mut. 26, 500–512. https://doi.org/10.1002/humu.20257 (2005).
Article CAS PubMed Google Scholar
Morrone, A. et al. Molecular testing of 163 patients with Morquio A (Mucopolysaccharidosis IVA) identifies 39 novel GALNS mutations. Mol. Gen. Metab. 112, 160–170. https://doi.org/10.1016/j.ymgme.2014.03.004 (2014).
Article CAS Google Scholar
Zanetti, A. et al. Molecular basis of mucopolysaccharidosis IVA (Morquio A syndrome): A review and classification of GALNS gene variants and reporting of 68 novel variants. Hum. Mut. 42, 1384–1398. https://doi.org/10.1002/humu.24270 (2021).
Article CAS PubMed Google Scholar
Lores, J., Prada, C. E., Ramírez-Montaño, D., Nastasi-Catanese, J. A. & Pachajoa, H. Clinical and molecular analysis of 26 individuals with Noonan syndrome in a reference institution in Colombia. Am. J. Med. Gen. Part C. 184, 1042–1051. https://doi.org/10.1002/ajmg.c.31869 (2020).
Article Google Scholar
Visnapuu, V., Peltonen, S., Alivuotila, L., Happonen, R.-P. & Peltonen, J. Craniofacial and oral alterations in patients with Neurofibromatosis 1. Orph. J. Rar. Dis. 13, 131. https://doi.org/10.1186/s13023-018-0881-8 (2018).
Article Google Scholar
Park, S., Kim, J., Song, T.-Y. & Jang, D.-H. Case Report: The success of face analysis technology in extremely rare genetic diseases in Korea: Tatton–Brown–Rahman syndrome and Say-Barber –Biesecker–Young–Simpson variant of ohdo syndrome. Front. Gen. 13, 903199. https://doi.org/10.3389/fgene.2022.903199 (2022).
Article Google Scholar
Pascolini, G., Calvani, M. & Grammatico, P. First Italian experience using the automated craniofacial gestalt analysis on a cohort of pediatric patients with multiple anomaly syndromes. It. J. Ped. 48, 91. https://doi.org/10.1186/s13052-022-01283-w (2022).
Article CAS Google Scholar
Aldridge, K. et al. Brain phenotypes in two FGFR2 mouse models for Apert syndrome. Dev. Dyn. 239, 987–997. https://doi.org/10.1002/dvdy.22218 (2010).
Article CAS PubMed PubMed Central Google Scholar
Enlow, D.H., & Hans, M.G. Essentials of facial growth (ed. Saunders) (Saunders, 1996).
Martínez-Abadías, N. et al. Facial Biomarkers Detect Gender-Specific Traits for Bipolar Disorder. FASEB. J. 35. https://doi.org/10.1096/fasebj.2021.35.S1.03695 (2021).
Adhikari, K., Chacón-Duque, J. C., Mendoza-Revilla, J., Fuentes-Guajardo, M. & Ruiz-Linares, A. The Genetic Diversity of the Americas. Ann. Rev. Gen. Hum. Gen. 18, 277–296. https://doi.org/10.1146/annurev-genom-083115-022331 (2017).
Article CAS Google Scholar
Conley, A. B. et al. A comparative analysis of genetic ancestry and admixture in the Colombian Populations of Chocó and Medellín. G3 (Bethesda, Md) 7, 3435–3447. https://doi.org/10.1534/g3.117.1118 (2017).
Article CAS PubMed Google Scholar


Acknowledgements

We are grateful for the voluntary collaboration of all participants, including children and their families. We thank Colegio Ecológico Scout and Universidad Icesi for granting us permission to organize the photographic sessions in Colombia, and Dr. Nelläker for help with accessing the European database. We thank Max Rubert for technical photographic assistance. We also thank the reviewers and editor for their insightful comments, which have greatly improved the quality of our manuscript. We acknowledge support from Proyecto COL0012168-1097 Interfacultades-ICESI, Grup de Recerca Consolidat (2021 SGR 00706), and Biological Anthropological Master UB-UAB.

Author information

These authors contributed equally: Luis M. Echeverry-Quiceno and Estephania Candelo.

Authors and Affiliations

Departament de Biologia Evolutiva, Ecologia i Ciències Ambientals (BEECA), Facultat de Biologia, Universitat de Barcelona (UB), Av. Diagonal, 643. Planta 2, 08028, Barcelona, Spain
Luis M. Echeverry-Quiceno & Neus Martínez-Abadías
Centro de Investigaciones en Anomalías Congénitas y Enfermedades Raras (CIACER), Universidad ICESI, Cali, Colombia
Estephania Candelo, Eidith Gómez, Paula Solís, Diana Ramírez, Diana Ortiz & Harry Pachajoa
Servicio de Genética Clínica, Fundación Valle del Lili, Cali, Colombia
Estephania Candelo & Harry Pachajoa
HER - Human-Environment Research Group, La Salle - Universitat Ramon Llull, Barcelona, Spain
Alejandro González & Xavier Sevillano
Universidad ICESI, Cali, Colombia
Juan Carlos Cuéllar

Contributions

L.M.E., E.C., H.P. and N.M.A. designed the study and wrote the manuscript; L.M.E., E.C., E.G., P.S., D.R., D.O., J.C.C., H.P. and N.M.A. organized and performed data collection; L.M.E., E.C., A.G., X.S. and N.M.A. performed data analysis and prepared the figures. All authors reviewed the manuscript.

Corresponding author

Correspondence to Neus Martínez-Abadías.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Echeverry-Quiceno, L.M., Candelo, E., Gómez, E. et al. Population-specific facial traits and diagnosis accuracy of genetic and rare diseases in an admixed Colombian population. Sci Rep 13, 6869 (2023). https://doi.org/10.1038/s41598-023-33374-x

Received: 10 December 2022
Accepted: 12 April 2023
Published: 27 April 2023
DOI: https://doi.org/10.1038/s41598-023-33374-x
