Next generation phenotyping for diagnosis and phenotype–genotype correlations in Kabuki syndrome

Hennocq, Quentin; Willems, Marjolaine; Amiel, Jeanne; Arpin, Stéphanie; Attie-Bitach, Tania; Bongibault, Thomas; Bouygues, Thomas; Cormier-Daire, Valérie; Corre, Pierre; Dieterich, Klaus; Douillet, Maxime; Feydy, Jean; Galliani, Eva; Giuliano, Fabienne; Lyonnet, Stanislas; Picard, Arnaud; Porntaveetus, Thantrira; Rio, Marlène; Rouxel, Flavien; Shotelersuk, Vorasuk; Toutain, Annick; Yauy, Kevin; Geneviève, David; Khonsari, Roman H.; Garcelon, Nicolas

doi:10.1038/s41598-024-52691-3

Download PDF

Article
Open access
Published: 28 January 2024

Next generation phenotyping for diagnosis and phenotype–genotype correlations in Kabuki syndrome

Quentin Hennocq ORCID: orcid.org/0000-0001-7882-8287^1,2,3,4,5,6,
Marjolaine Willems⁷,
Jeanne Amiel^1,4,8,
Stéphanie Arpin⁹,
Tania Attie-Bitach^1,4,8,
Thomas Bongibault^1,5,
Thomas Bouygues^1,5,
Valérie Cormier-Daire^1,4,8,
Pierre Corre^10,11,
Klaus Dieterich¹²,
Maxime Douillet¹,
Jean Feydy¹³,
Eva Galliani^2,3,4,
Fabienne Giuliano¹⁴,
Stanislas Lyonnet^1,4,8,
Arnaud Picard^2,3,4,
Thantrira Porntaveetus¹⁵,
Marlène Rio^1,4,8,
Flavien Rouxel⁷,
Vorasuk Shotelersuk¹⁶,
Annick Toutain⁹,
Kevin Yauy⁷,
David Geneviève⁷^na1,
Roman H. Khonsari^1,2,3,4,5^na1 &
…
Nicolas Garcelon¹

Scientific Reports volume 14, Article number: 2330 (2024) Cite this article

1945 Accesses
Metrics details

Subjects

Abstract

The field of dysmorphology has been changed by the use Artificial Intelligence (AI) and the development of Next Generation Phenotyping (NGP). The aim of this study was to propose a new NGP model for predicting KS (Kabuki Syndrome) on 2D facial photographs and distinguish KS1 (KS type 1, KMT2D-related) from KS2 (KS type 2, KDM6A-related). We included retrospectively and prospectively, from 1998 to 2023, all frontal and lateral pictures of patients with a molecular confirmation of KS. After automatic preprocessing, we extracted geometric and textural features. After incorporation of age, gender, and ethnicity, we used XGboost (eXtreme Gradient Boosting), a supervised machine learning classifier. The model was tested on an independent validation set. Finally, we compared the performances of our model with DeepGestalt (Face2Gene). The study included 1448 frontal and lateral facial photographs from 6 centers, corresponding to 634 patients (527 controls, 107 KS); 82 (78%) of KS patients had a variation in the KMT2D gene (KS1) and 23 (22%) in the KDM6A gene (KS2). We were able to distinguish KS from controls in the independent validation group with an accuracy of 95.8% (78.9–99.9%, p < 0.001) and distinguish KS1 from KS2 with an empirical Area Under the Curve (AUC) of 0.805 (0.729–0.880, p < 0.001). We report an automatic detection model for KS with high performances (AUC 0.993 and accuracy 95.8%). We were able to distinguish patients with KS1 from KS2, with an AUC of 0.805. These results outperform the current commercial AI-based solutions and expert clinicians.

Genome-wide association studies

Article 26 August 2021

Demographic bias in misdiagnosis by computational pathology models

Article 19 April 2024

Genome-wide association analyses identify 95 risk loci and provide insights into the neurobiology of post-traumatic stress disorder

Article 18 April 2024

Introduction

Kabuki syndrome (KS) is a rare genetic disorder, with an estimated prevalence of 1:86,000 to 1:32,000^1,2,3. The typical KS face includes long palpebral fissures associated with eversion of the lateral third of the lower eyelid; long and heavy lashes giving the impression of made-up eyes; broad, arched and interrupted eyebrows; broad, depressed nasal tip; and prominent, cupped ears^1,2,4. Extra-facial features include mild to moderate intellectual disability, visceral malformations, skeletal dysplasia and immunological manifestations⁵. KS has been described in all ethnic groups^6,7.

More than 80% of KS patients have a pathogenic variant in the coding regions of KMT2D (KS type 1, KS1, OMIM147920), and around 10% of patients have a pathogenic variant in the KDM6A gene (KS type 2, KS2, OMIM300128)^8,9,10,11,12.

Improving syndrome screening in clinical genetics is a crucial challenge in reducing diagnostic wandering. In France, the 7000 rare diseases identified to date represent 4.5% of the population, half of which affect children under the age of 5 with 10% of deaths between 0 and 5. Around 50% of patients are not diagnosed, and for the remaining 50%, diagnostic wandering reaches an average of 5 years¹³. Diagnostic wandering is defined by the failure to define the precise cause of a disease after having performed all available investigations. Applications of Artificial Intelligence (AI) are increasing in healthcare^14,15,16,17. The field of dysmorphology has been changed by these new methods, under the name of Next Generation Phenotyping (NGP)¹⁸. Publications comparing human performances to NGP are flourishing^19,20,21,22, and some suggest that digital tools do it better than human experts in terms of diagnosis: Dudding-Byth et al.²³ showed a better performance of NGP compared to clinicians in a group of ten genetic syndromes, not including KS; Rouxel et al.⁵ compared the performance of the DeepGestalt technology¹⁸ using the Face2Gene online tool (FDNA Inc. Boston, MA, USA) to the performances of clinicians trained in the recognition of KS1 and KS2.

The aim of this study was to develop a NGP model for the diagnosis of KS and for distinguishing KS1 from KS2. We trained and validated the model on a large national and international multi-center cohort of patients of all ages and ethnicities. The specificity of this approach was the integration of lateral pictures, including the outline of the cranial vault and the position of the ears, as well as frontal pictures and the morphology of the external ear.

Materials and methods

The study was approved by the Comité Éthique et Scientifique pour les Recherches, les Études et les Évaluations dans le domaine de la Santé (CESREES), №4570023bis, the Commission Nationale Informatique et Libertés (CNIL), №MLD/MFI/AR221900, the Institutional Review Board, Faculty of Medicine, Chulalongkorn University (IRB 264/62), and in accordance with the 1964 Helsinki declaration and its later amendments. Informed and written consents were obtained from the legal representatives of each child or from the patients themselves if they were of age.

Photographic dataset

We included most pictures from the photographic database of the Maxillofacial surgery and Plastic surgery department of Hôpital Necker—Enfants Malades (Assistance Publique—Hôpitaux de Paris), Paris, France. This database contains 594,000 photographs from 22,000 patients, and all pictures since 1995 were taken by a professional medical photographer using a Nikon D7000 device in standardized positions.

We included retrospectively and prospectively, from 1995 to 2023, all frontal and lateral pictures of patients diagnosed with KS. The photographs were not calibrated. All patients had genetic confirmation of KS (KMT2D or KDM6A). We excluded all photographs taken after any surgerical procedure that could have modified the craniofacial morphology. Multiple photographs per patient corresponded to different ages of follow-up. Duplicates were excluded.

Controls were selected among patients admitted for lacerations, trauma, infection and various skin lesions, without any record of chronic conditions. More precisely, follow-up for any type of chronic disease was considered as an exclusion criterion. The reports were retrieved using the local data warehouse Dr Warehouse²⁴. For each patient, the best lateral view was included.

Data from five other medical genetics departments were also included according to the same criteria: (1) Montpellier University Hospital (n = 32), (2) Grenoble University Hospital (n = 1), (3) Tours University Hospital (n = 1), (4) King Chulalongkorn Memorial Hospital Bangkok, Thailand (n = 8), and (5) Lausanne University Hospital, Lausanne, Switzerland (n = 1).

Validation set

For designs №1 and №2, we randomly selected a group of individuals corresponding to 10% of the number of patients with KS, and the equivalent number of control patients. These patients were removed from the training set. The two sets were therefore independent.

Landmarking

We used three different templates based on 105 landmarks for the frontal views, 73 for the lateral views and 41 for the external ear pictures. We developed an automatic annotation model for each template following a pipeline including: (1) detection of the Region Of Interest (ROI) and (2) automatic placement of the landmarks.

For ROI detection, a Faster Region-based Convolutional Neural Network (RCNN) model was trained after data augmentation (images and their + 10° and 10° rotations), with a learning rate of 0.001, a batch size of 4, a gamma of 0.05 and 2000 iterations, optimized and split into two stages: ROI detection and determination of profile laterality.

(1) ROI detection—Faster RNN trained on 15,633 images, after data augmentation (images and their + 10° and − 10° rotations): 6186 frontal images (2062 × 3) and 9447 right and left profile images (3159 × 3). The batch size was 2, learning rate was 0.0025, and the maximum number of iterations was 2800.

(2) Determination of profile laterality—Pre-trained ResNet50 network²⁵ using the Pytorch library²⁶. The training images included 1570 left profiles and 1579 right profiles. The batch size was 16, an Adam optimizer²⁷ was used with a learning rate of 0.001, a step of 7, and a gamma of 0.1, trained over 25 epochs.

For the automatic placement of landmarks, we used a patch-based Active Appearance Model (AAM) using the menpo library on Python 3.7²⁸. We have previously reported the relevance of this approach²⁹. We used two-scale landmarking: the model for frontal pictures was trained on 904 manually annotated photographs, with a first stage of dimensioning (diagonal = 150), a patch shape of [(15, 15), (23, 23)] and 50 iterations and a second stage without resizing, with a patch shape of [(20, 20), (30, 30)] and 10 new iterations. The model for profile pictures was trained on 1,439 manually annotated photographs, with a first stage of dimensioning (diagonal = 150), a patch shape of [(15, 15), (23, 23)] and 25 iterations and a second stage without resizing, with a patch shape of [(15, 15), (23, 23)] and 5 new iterations. The model for ears was trained on 1221 manually annotated photographs, with a first stage of dimensioning (diagonal = 100), a patch shape of [(15, 15), (23, 23)] and 50 iterations and a second stage without resizing, with a patch shape of [(20, 20), (30, 30)] and 20 new iterations. All three models used the Lucas Kanade optimizer³⁰.

Each automatically annotated photograph was checked by two authors blinded for the diagnosis, QH and MD, and landmarks were manually re-positioned when necessary, using landmarker.io³¹. The Intraclass Correlation Coefficient (ICC) was computed between the raters. ICC values greater than 0.9 corresponded to excellent reliability of the manual annotation³².

Geometric morphometrics

We performed Generalized Procrustes Analysis (GPA)³³ on all landmark clouds using the geomorph package on R³⁴. Since the data were uncalibrated photographs, ROI sizes were not available: shape parameters only were assessed and not centroid sizes. Procrustes coordinates were processed using Principal Component Analysis (PCA) for dimension reduction. We retained the principal components explaining 99% of the total variance in cumulative sum. The last 1% was considered as negligible information.

Texture extraction

We partitioned the frontal and profile pictures into key areas and applied textural feature extraction methods to each zone, allowing to check the results and determine which zone had contributed most to the diagnosis.

We defined 14 key areas that could potentially contribute to diagnosis: 11 on frontal views (right/left eyes, right/left eyebrows, glabella, forehead, nasal tip, philtrum, right/left cheeks, and chin) and 3 on lateral views (pre-auricular region, eye, and zygoma relief). Each zone was extracted automatically using the previously placed landmarks.

We used the Contrast Limited Adaptative Histogram Equalization (CLAHE) algorithm for histogram equalization, as previously reported before the use of feature extractors^35,36. CLAHE enhanced contrast by evenly dispersing gray values³⁷ and by reducing the influences of illumination during picture capture and of skin color. Kiflie et al. recommended CLAHE as a first choice equalization method³⁸.

Gray-Level Co-occurrence Matrix (GLCM) methods, as proposed by Haralick³⁹, are based on the estimation of the second-order joint conditional probability density functions, which characterize the spatial relationships between pixels. GLCM is commonly used in texture analysis^40,41, for instance in radiomics on CT-scan or MRI images^42,43,44 or for skin texture assessment⁴⁵. In GLCM, the co-occurrence matrix contains information on entropy, homogeneity, contrast, energy and correlation between pixels. GLCM includes 28 features, taking into account the average and range for each item of information and for each zone, representing 28 × 14 = 394 textural features for each patient.

Stratification using metadata

The textural features and the geometric principal components were combined for further analysis. To consider associated metadata (age and gender) and the fact that we included more than one photograph per patient (that is the non-independence of the data), a mixed model was designed for each feature. The variables to be explained were the features (geometric and textural), with age, gender and ethnicity considered as explanatory variables. A random effect on age and individuals was introduced. The equation of the mixed model was:

$${\varvec{Features}}_{{{\varvec{i}},{\varvec{j}}}} \sim \alpha + age. \beta_{1} + gender.\beta_{2} + ethnicity.\beta_{3} + age.\beta_{1,i} + \varepsilon_{i,j}$$

where $age.\beta_{1,i}$ corresponded to a random slope for age per individual, and $\varepsilon_{i,j}$ was a random error term. We did not use an interaction term between age and gender and age and ethnicity as it did not increase the likelihood of the model. Age, gender and ethnicity are significant factors in dysmorphology^46,47.

The residuals of each feature were computed to consider potential biases linked to the metadata:

$${\varvec{\varepsilon}}_{{{\varvec{i}},{\varvec{j}}}} = {\varvec{Features}}_{{{\varvec{i}},\user2{ j}}} - \alpha + age. \beta_{1} + gender.\beta_{2} + ethnicity.\beta_{3} + age.\beta_{1,i}$$

Classification model

The inputs to the model were the residuals from the linear models described above, for each geometric or textural feature. We used eXtreme Gradient Boosting (XGBoost), a supervised machine learning classifier, for all the analyses⁴⁸. We chose a tree-based booster, and the loss function to be minimized was a logistic regression for binary classification. We set several hyperparameters to improve the performance and effect of the machine learning model: learning rate = 0.3, gamma = 0, maximum tree depth = 6. The model with the lowest error rate was chosen for analysis. We separated the dataset into a training set and a testing set, and a five-fold cross-validation was used to define the ideal number of iterations to avoid overfitting.

The chosen model with the ideal number of iterations was then used on the independent validation set to test performances, by plotting accuracy and AUC. The Receiver Operating Characteristics (ROC) curves were plotted in R using the plotROC package⁴⁹. We used the DeepGestalt tool proposed by Face2Gene CLINIC on our validation set, to be able to compare its performance (accuracies).

Uniform Manifold Approximation and Projection (UMAP) representations

The residuals $\varepsilon_{i,j}$ were represented using UMAP for visual clustering, a nonlinear dimension reduction technique⁵⁰. We retained the residuals associated with features with a classification gain (in their cumulative sum) > 0.75 in the importance matrix associated with the XGboost model. A k (local neighborhood size) value of 15 was used. A cosine metric was introduced to compute distances in high dimensional spaces: the effective minimal distance between embedded points was $10^{ - 6}$. The three conditions of UMAP, namely uniform distribution, local constancy of the Riemannian metric and local connectivity were verified. UMAP analyses were performed using the package umap on R⁵¹ (Fig. 1).

Classification designs

1.
Design №1, syndrome diagnosis support: KS was tested against controls in a binary classification.
2.
Design №2, genotype–phenotype correlations: KS1 and KS2 were tested in binary classifications.
3.
Design №3, genotype–phenotype correlations: KS1 Protein-Altering Variants (PAVs) and Protein-Truncating Variants (PTVs) were tested in binary classifications.

Ethics approval

This study was performed in line with the principles of the Declaration of Helsinki. Approval was granted by the CESREES (17/06/2021, 4570023).

Consent to participate

Written informed consent was obtained from the parents.

Consent to publish

The authors affirm that human research participants provided informed consent for publication of the images in Figs. 1, 4 and 7.

Results

Population description

Ranging between 1998 and 2023, we included 1448 frontal and lateral facial photographs, corresponding to 634 patients. The mean age was 7.2 ± 4.2 years and ranged from 0 to 40.2 years; 52% were girls. Ethnicity was 92% Caucasian, 6% African or Caribbean, and 3% Asian.

The control group comprised 1084 photographs, corresponding to 527 patients with a mean age of 7.0 ± 4.6 years. Fifty-four percent were girls and ethnicities were 93% Caucasian, 5% African/Caribbean, and 2% Asian.

The KS group comprised 364 photographs, corresponding to 107 patients with a mean age of 7.8 ± 6.7 years. Forty-two percent were girls and ethnicities were 85% Caucasian, 7% African/Caribbean, and 8% Asian. Seventy-eight percent of patients were KS1 (Table 1).

Table 1 Clinical description of the cohort.

Full size table

Two patients had a genetically confirmed diagnosis of KS, but we had no information on the causal gene. We thus collected information on genetic variation for 105 KS individuals with 82 (78%) and 23 (22%) with variations in KMT2D (KS1) and KDM6A (KS2) respectively.

In the KS1 group, 74% of variants were PTVs, with 49% nonsense variants leading to a premature stop codon (24% non-sense, 24% frameshift) and 26% splice donor site variants. Eighteen percent were PAVs, with 17% missense variants and 1% in-frame indel.

In the KS2 group, 78% of variants were PTVs, with 43% nonsense variants leading to a premature stop codon (30% non-sense, 13% frameshift), 30% splice donor site variants and 4% a large deletion. Nine percent were missense PAVs (Table 2).

Table 2 Molecular description of the cohort.

Full size table

Design №1 : KS vs controls

1.
Phenotype

We confirmed the usual characteristics described in KS: high and arched eyebrows, long palpebral fissures, and large and prominent ears (Fig. 2).

2.
Classification

We were able to distinguish KS vs controls in the independent validation group with an accuracy of 95.8% (78.9–99.9%, p < 0.001). AUCs were comparable in the training set (0.994) and in the validation set (0.993) (Fig. 3, Table 3).

Table 3 Classification performances for design №1 (KS vs controls) in the validation group.

Full size table

Ten out of eleven patients were correctly predicted as KS with our model, and this performance was the same using Face2Gene CLINIC (Supp. Table 1). In addition, we were able to predict all control patients (Fig. 4, Table 4).

Table 4 Confusion matrix for design №1 (KS versus controls) in the validation group.

Full size table

Design №2 : KS1 vs KS2

1.
Phenotype

KS2 individuals had a rounder face (HP:0000311), a shorter nose (HP:0003196), a thicker upper lip (HP:0000215), anteverted nostrils (HP:0000463), and a shorter midface (HP:0011800). There was no obvious difference in the eyebrows and eyes. The external ears were more elongated vertically in KS2 (HP:0400004), with a hypoplastic lobe (HP:0000385), and with a counter-clockwise rotation. The conch seemed more vertical in KS1 (Fig. 5).

2.
Classification

The model was able to distinguish KS1 from KS2 with an empirical AUC of 0.805 (0.729–0.880, p < 0.001) (Figs. 6, 7). This trend was found in the validation group, with an accuracy of 70% without reaching the significance threshold (Tables 5 and 6).

Table 5 Classification performances for design №2 (KS1 versus KS2) in the validation group.

Full size table

Table 6 Confusion matrix for design №2 (KS1 versus KS2) in the validation group.

Full size table

Design №3: PTV vs PAV in KS1

The model was unable to detect a difference in facial phenotype between KS1 patients with a PTV compared to KS1 patients with a PAV (0.555 [0.419–0.690], p = 0.786) (Fig. 8).

Discussion

The model we report distinguished KS from controls in the independent validation group with an accuracy of 95.8% (78.9–99.9%, p < 0.001). Only 1 patient out of 24 was classified as ‘control’ while she had KS (accuracy 96%). In the KS group, 10 out of 11 patients were correctly classified (accuracy 91%). Using the Face2Gene CLINIC tool on KS patients (because DeepGestalt technology is not capable of recognizing non-syndromic patients) 1 patient out of 11 could not be analyzed and could not be classified as KS (accuracy 91%). Performances were therefore comparable. Interestingly, the patient not recognized by our model and by Face2Gene CLINIC was of African ethnicity, highlighting the lack of training data for non-Caucasian patients. The distribution of ethnic groups varies greatly from one center to another, which is why we believe it is important to encourage international collaborations in the field of Next Generation Phenotyping.

The model we report was also capable to distinguish KS1 from KS2 with an empirical AUC of 0.805 (0.729–0.880, p < 0.001). Rouxel et al.⁵ showed that the Face2Gene RESEARCH tool distinguished KS1 from KS2 in a cohort of 66 patients with an AUC of 0.722 (p = 0.022). The same team showed a classification accuracy of 61% (20/33) by clinical genetics experts between KS1 and KS2. The performance of our model was at least comparable to Face2Gene RESEARCH and seemed to outperform that of clinical experts.

Rouxel et al.⁵ explained that KS1 patients had a longer face and nose, a thin upper lip vermilion and a longer midface in comparison to KS2 patients, who have a rounder face, a thicker vermilion and anteverted nostrils. Our study reports new phenotypic features not seen on frontal images alone for KS2, such as a particular morphology of the external ear, longer along the vertical axis and with counter-clockwise rotation.

Phenotype-genotype correlations have been reported in KS for extra-facial anomalies. Cardiovascular abnormalities, namely ventricular septal defects, coarctation of the aorta, atrial septal defects, bicuspid aortic valve, patent ductus arteriosus, and hypoplastic left heart syndrome^{52,53,53,54,55} are more prevalent in KS2 compared to KS1^1,56. Persistent hypoglycemia due to pituitary hormone deficiency, adrenal insufficiency, growth hormone deficiency and dysregulated insulin secretion by the pancreatic β-cells^57,58 are also more frequent in KS2¹⁰, possibly because the inhibition of KDM6A increases the release of insulin from pancreatic islet cells, as suggested by mouse models^1,59. Urinary tract anomalies, such as horseshoe kidneys and renal hypoplasia, seem to be more frequent in KS1, and genital defects such as cryptorchidism and hypospadias could be more frequent in KS2^56,60,61.

Rouxel et al.⁵ underline the lack of Asian patients in their evaluation, and proposed that larger series were needed to better define phenotypical differences between KS1 and KS2, and the general dependance of the phenotype with ethnicity^6,12. The collaboration with an Asian clinical genetics center (Bangkok) is thus a strong point of this study.

The use of textural feature extraction allowed our model account for typical KS characteristics not recognized by geometric analysis (Procrustes) alone. The lateral sparsening of the eyebrows and heavy lashes giving the impression of make-up eyes were thus included into in the classification.

Barry et al.¹ reported a large meta-analysis including 152 articles and 1369 individuals with KS and assessed the prevalence of the different types of pathogenic variation per gene. The majority of KMT2D variants were truncating (non-sense 34%, frameshift 34%), then missense (23%) and finally splice site variants (9%). The majority of KDM6A variants were truncating (frameshift 36% > non-sense 27%), followed by splice site (20%), and missense (18%). We found similar results, with a higher prevalence of truncating non-sense variants for both genes. There was a higher prevalence of splice donor site variants, with 26% for KMT2D and 30% for KDM6A. Some authors report a more severe clinical outcomes in patients with non-sense variants than in patients with a frameshift variant¹. Faundes et al.⁵⁶ found more severe neurodevelopmental anomalies in patients with protein-truncating mutations in the KS2 group. Shah et al.⁶² reported ophthalmological anomalies such as strabismus, blue sclerae, microphthalmia and refractive anomalies that were more severe in patients with a non-sense variant, and less frequent in patients with a frameshift variant. Our model did not find any significant difference in facial phenotype between PTV and PAV.

Conclusion

Here we report an automatic detection model for KS including the face, profiles and ears, with performances (AUC 0.993 and accuracy 95.8%) comparable to those of Face2Gene, on an independent validation set. These performances were achieved using an international cohort of 107 patients with a confirmed molecular diagnosis of KS. Using the same model, we were able to separate patients with KS1 (KMT2D) from KS2 (KDM6A), with an AUC of 0.805. These results seem to at least outperform Face2Gene and support the possibility of using a phenotype-first strategy to diagnose KS and detect its two causal genes.

Data availability

The code is available to readers on the website https://framagit.org/imagine-plateforme-bdd/mfdm/. The datasets (photographs) supporting the current study have not been deposited in a public repository because of their identifiable nature.

References

Barry, K. K. et al. From genotype to phenotype—a review of Kabuki syndrome. Genes (Basel) 13, 1761 (2022).
Article ADS PubMed CAS Google Scholar
Niikawa, N. et al. Kabuki make-up (Niikawa-Kuroki) syndrome: A study of 62 patients. Am. J. Med. Genet. 31, 565–589 (1988).
Article PubMed CAS Google Scholar
White, S. M. et al. Growth, behavior, and clinical findings in 27 patients with Kabuki (Niikawa-Kuroki) syndrome. Am. J. Med. Genet. A 127A, 118–127 (2004).
Article PubMed CAS Google Scholar
Kuroki, Y., Suzuki, Y., Chyo, H., Hata, A. & Matsui, I. A new malformation syndrome of long palpebral fissures, large ears, depressed nasal tip, and skeletal anomalies associated with postnatal dwarfism and mental retardation. J. Pediatr. 99, 570–573 (1981).
Article PubMed CAS Google Scholar
Rouxel, F. et al. Using deep-neural-network-driven facial recognition to identify distinct Kabuki syndrome 1 and 2 gestalt. Eur. J. Hum. Genet. 30, 682–686 (2022).
Article PubMed CAS Google Scholar
Adam, M. P. & Hudgins, L. Kabuki syndrome: A review. Clin. Genet. 67, 209–219 (2005).
Article PubMed CAS Google Scholar
Bögershausen, N. et al. Mutation update for Kabuki syndrome genes KMT2D and KDM6A and further delineation of X-Linked Kabuki syndrome subtype 2. Hum. Mutat. 37, 847–864 (2016).
Article PubMed Google Scholar
Lederer, D. et al. Deletion of KDM6A, a Histone Demethylase Interacting with MLL2, in three patients with Kabuki Syndrome. Am. J. Hum. Genet. 90, 119–124 (2012).
Article PubMed PubMed Central CAS Google Scholar
Paděrová, J. et al. Molecular genetic analysis in 14 Czech Kabuki syndrome patients is confirming the utility of phenotypic scoring. Clin. Genet. 90, 230–237 (2016).
Article PubMed Google Scholar
Banka, S. et al. Novel KDM6A (UTX) mutations and a clinical and molecular review of the X-linked Kabuki syndrome (KS2). Clin. Genet. 87, 252–258 (2015).
Article PubMed CAS Google Scholar
Bögershausen, N. & Wollnik, B. Unmasking Kabuki syndrome. Clin. Genet. 83, 201–211 (2013).
Article PubMed Google Scholar
Ng, S. B. et al. Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome. Nat. Genet. 42, 790–793 (2010).
Article PubMed PubMed Central CAS Google Scholar
DGOS. Les maladies rares. Ministère de la Santé et de la Prévention https://sante.gouv.fr/soins-et-maladies/prises-en-charge-specialisees/maladies-rares/article/les-maladies-rares (2023).
Rajkomar, A., Dean, J. & Kohane, I. Machine learning in medicine. N. Engl. J. Med. 380, 1347–1358 (2019).
Article PubMed Google Scholar
Choy, G. et al. Current applications and future impact of machine learning in radiology. Radiology 288, 318–328 (2018).
Article PubMed Google Scholar
Novoa, R. A., Gevaert, O. & Ko, J. M. Marking the path toward artificial intelligence-based image classification in dermatology. JAMA Dermatol. 155, 1105–1106 (2019).
Article PubMed Google Scholar
Loftus, T. J. et al. Artificial Intelligence and surgical decision-making. JAMA Surg. 155, 148–158 (2020).
Article PubMed PubMed Central Google Scholar
Gurovich, Y. et al. Identifying facial phenotypes of genetic disorders using deep learning. Nat. Med. 25, 60–64 (2019).
Article PubMed CAS Google Scholar
Zhang, Q. et al. Molecular and phenotypic expansion of alström syndrome in Chinese patients. Front. Genet. 13, 808919 (2022).
Article PubMed PubMed Central CAS Google Scholar
Javitt, M. J., Vanner, E. A., Grajewski, A. L. & Chang, T. C. Evaluation of a computer-based facial dysmorphology analysis algorithm (Face2Gene) using standardized textbook photos. Eye 36, 859–861 (2022).
Article PubMed Google Scholar
Latorre-Pellicer, A. et al. Evaluating Face2Gene as a tool to identify cornelia de lange syndrome by facial phenotypes. Int. J. Mol. Sci. 21, E1042 (2020).
Article Google Scholar
Mishima, H. et al. Evaluation of Face2Gene using facial images of patients with congenital dysmorphic syndromes recruited in Japan. J. Hum. Genet. 64, 789–794 (2019).
Article PubMed Google Scholar
Dudding-Byth, T. et al. Computer face-matching technology using two-dimensional photographs accurately matches the facial gestalt of unrelated individuals with the same syndromic form of intellectual disability. BMC Biotechnol. 17, 90 (2017).
Article PubMed PubMed Central Google Scholar
Garcelon, N. et al. A clinician friendly data warehouse oriented toward narrative reports: Dr Warehouse. J. Biomed. Inform. 80, 52–63 (2018).
Article PubMed Google Scholar
Koonce, B. ResNet 50 63–72 (Springer, 2021). https://doi.org/10.1007/978-1-4842-6168-2_6.
Paszke, A. et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library. https://doi.org/10.48550/arXiv.1912.01703 (2019).
Kingma, D. P. & Ba, J. Adam: A Method for Stochastic Optimization. https://doi.org/10.48550/arXiv.1412.6980 (2017).
Alabort-i-Medina, J., Antonakos, E., Booth, J., Snape, P. & Zafeiriou, S. Menpo: A comprehensive platform for parametric image alignment and visual deformable models. In Proceedings of the 22nd ACM international conference on Multimedia 679–682 (ACM, 2014). https://doi.org/10.1145/2647868.2654890.
Hennocq, Q. et al. An automatic facial landmarking for children with rare diseases. Am. J. Med. Genet. Part A 2022, 145 (2022).
Google Scholar
Lucas, B. & Kanade, T. An Iterative Image Registration Technique with an Application to Stereo Vision (IJCAI) Vol. 81 (Springer, 1981).
Google Scholar
landmarker.io. The Menpo Project. https://www.menpo.org/landmarkerio/ (2022).
Bartko, J. J. The intraclass correlation coefficient as a measure of reliability. Psychol. Rep. 19, 3–11 (1966).
Article PubMed CAS Google Scholar
Rohlf, F. J. & Slice, D. Extensions of the procrustes method for the optimal superimposition of landmarks. Syst. Zool. 39, 40–59 (1990).
Article Google Scholar
Baken, E. K., Collyer, M. L., Kaliontzopoulou, A. & Adams, D. C. geomorph v4.0 and gmShiny: Enhanced analytics and a new graphical interface for a comprehensive morphometric experience. Methods Ecol. Evol. 12, 2355–2363 (2021).
Article Google Scholar
Avcı, H. & Karakaya, J. A novel medical image enhancement algorithm for breast cancer detection on mammography images using machine learning. Diagn. (Basel) 13, 348 (2023).
Google Scholar
Anifah, L., Purnama, I. K. E., Hariadi, M. & Purnomo, M. H. Osteoarthritis classification using self organizing map based on Gabor Kernel and contrast-limited adaptive histogram equalization. Open Biomed. Eng. J. 7, 18 (2013).
Article PubMed PubMed Central Google Scholar
Huang, C., Li, X. & Wen, Y. AN OTSU image segmentation based on fruitfly optimization algorithm. Alexandr. Eng. J. 60, 183–188 (2021).
Article Google Scholar
Kiflie, A., Tesema Tufa, G. & Salau, A. O. Sputum smears quality inspection using an ensemble feature extraction approach. Front. Public Health 10, 1032467 (2023).
Article PubMed PubMed Central Google Scholar
Haralick, R. M., Shanmugam, K. & Dinstein, I. Textural features for image classification. IEEE Trans. Syst. Man Cybern. 3, 610–621 (1973).
Article Google Scholar
Mohanaiah, P., Sathyanarayana, P. & GuruKumar, L. Image Texture Feature Extraction Using GLCM Approach Vol. 3 (Springer, 2013).
Google Scholar
Löfstedt, T., Brynolfsson, P., Asklund, T., Nyholm, T. & Garpebring, A. Gray-level invariant Haralick texture features. PLoS One 14, e0212110 (2019).
Article PubMed PubMed Central Google Scholar
Mundt, P. et al. Periaortic adipose radiomics texture features associated with increased coronary calcium score-first results on a photon-counting-CT. BMC Med. Imaging 23, 97 (2023).
Article PubMed PubMed Central Google Scholar
Adelsmayr, G. et al. Three dimensional computed tomography texture analysis of pulmonary lesions: Does radiomics allow differentiation between carcinoma, neuroendocrine tumor and organizing pneumonia?. Eur. J. Radiol. 165, 110931 (2023).
Article PubMed Google Scholar
Peng, B. et al. Preoperative computed tomography-based tumoral radiomic features prediction for overall survival in resectable non-small cell lung cancer. Front. Oncol. 13, 1131816 (2023).
Article PubMed PubMed Central Google Scholar
Ou, X., Pan, W. & Xiao, P. In vivo skin capacitive imaging analysis by using grey level co-occurrence matrix (GLCM). Int. J. Pharm. 460, 28–32 (2014).
Article PubMed CAS Google Scholar
Muenke, M., Adeyemo, A. & Kruszka, P. An electronic atlas of human malformation syndromes in diverse populations. Genet. Med. 18, 1085–1087 (2016).
Article PubMed Google Scholar
Burchard, E. G. et al. The importance of race and ethnic background in biomedical research and clinical practice. N. Engl. J. Med. 348, 1170–1175 (2003).
Article PubMed Google Scholar
Chen, T. & Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 785–794 (Association for Computing Machinery, 2016). https://doi.org/10.1145/2939672.2939785.
Sachs, M. C. plotROC: A tool for plotting ROC curves. J. Stat. Softw. 79, 2 (2017).
Article PubMed PubMed Central Google Scholar
McInnes, L., Healy, J. & Melville, J. UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction. https://doi.org/10.48550/arXiv.1802.03426 (2020).
R Core Team. European Environment Agency. https://www.eea.europa.eu/data-and-maps/indicators/oxygen-consuming-substances-in-rivers/r-development-core-team-2006 (2020).
Hughes, H. E. & Davies, S. J. Coarctation of the aorta in Kabuki syndrome. Arch. Dis. Child. 70, 512–514 (1994).
Article PubMed PubMed Central CAS Google Scholar
Digilio, M. C. et al. Congenital heart defects in molecularly proven Kabuki syndrome patients. Am. J. Med. Genet. A 173, 2912–2922 (2017).
Article PubMed CAS Google Scholar
Cheon, C.-K. & Ko, J. M. Kabuki syndrome: Clinical and molecular characteristics. Korean J. Pediatr. 58, 317–324 (2015).
Article PubMed PubMed Central Google Scholar
Yoon, J. K. et al. The strong association of left-side heart anomalies with Kabuki syndrome. Korean J. Pediatr. 58, 256–262 (2015).
Article PubMed PubMed Central Google Scholar
Faundes, V. et al. Clinical delineation, sex differences, and genotype-phenotype correlation in pathogenic KDM6A variants causing X-linked Kabuki syndrome type 2. Genet. Med. 23, 1202–1210 (2021).
Article PubMed PubMed Central CAS Google Scholar
Yap, K. L. et al. Congenital hyperinsulinism as the presenting feature of Kabuki syndrome: Clinical and molecular characterization of 9 affected individuals. Genet. Med. 21, 233–242 (2019).
Article PubMed CAS Google Scholar
Gole, H., Chuk, R. & Coman, D. Persistent hyperinsulinism in Kabuki syndrome 2: Case report and literature review. Clin. Pract. 6, 848 (2016).
Article PubMed PubMed Central Google Scholar
Gibson, C. E. et al. Congenital hyperinsulinism in infants with turner syndrome: Possible association with monosomy X and KDM6A haploinsufficiency. Horm. Res. Paediatr. 89, 413–422 (2018).
Article PubMed CAS Google Scholar
Courcet, J.-B. et al. Clinical and molecular spectrum of renal malformations in Kabuki syndrome. J. Pediatr. 163, 742–746 (2013).
Article PubMed CAS Google Scholar
Cetinkaya, E., Misirlioğlu, E. D., Vidinlisan, S., Baydar, Z. & Ozhan, Z. R. Hypospadias in a patient with Kabuki make-up (Niikawa-Kuroki) syndrome. J. Pediatr. Endocrinol. Metab. 14, 803–805 (2001).
Article PubMed CAS Google Scholar
Shah, S. S. et al. Insights into the genotype-phenotype relationship of ocular manifestations in Kabuki syndrome. Am. J. Med. Genet. A 191, 1325–1338 (2023).
Article PubMed CAS Google Scholar

Download references

Funding

This work was supported by the ‘Agence Nationale de la Recherche’, ’Investissements d’Avenir’ program (ANR-10-IAHU-01), France 2030 grant “Face4Kids” (ANR-21-PMRB-0004), Health Systems Research Institute (66-101, 66-122), Thailand Science Research and Innovation Fund Chulalongkorn University, and National Research Council of Thailand (N42A650229).

Author information

These authors contributed equally: David Geneviève and Roman H. Khonsari.

Authors and Affiliations

Imagine Institute, INSERM UMR1163, 75015, Paris, France
Quentin Hennocq, Jeanne Amiel, Tania Attie-Bitach, Thomas Bongibault, Thomas Bouygues, Valérie Cormier-Daire, Maxime Douillet, Stanislas Lyonnet, Marlène Rio, Roman H. Khonsari & Nicolas Garcelon
Service de chirurgie maxillo-faciale et chirurgie plastique, Hôpital Necker-Enfants Malades, Assistance Publique-Hôpitaux de Paris, Paris, France
Quentin Hennocq, Eva Galliani, Arnaud Picard & Roman H. Khonsari
Centre de Référence des Malformations Rares de la Face et de la Cavité Buccale MAFACE, Filière Maladies Rares TeteCou, Paris, France
Quentin Hennocq, Eva Galliani, Arnaud Picard & Roman H. Khonsari
Faculté de Médecine, Université de Paris Cité, 75015, Paris, France
Quentin Hennocq, Jeanne Amiel, Tania Attie-Bitach, Valérie Cormier-Daire, Eva Galliani, Stanislas Lyonnet, Arnaud Picard, Marlène Rio & Roman H. Khonsari
Laboratoire ‘Forme et Croissance du Crâne’, Faculté de Médecine, Hôpital Necker-Enfants Malades, Assistance Publique-Hôpitaux de Paris, Université Paris Cité, Paris, France
Quentin Hennocq, Thomas Bongibault, Thomas Bouygues & Roman H. Khonsari
Hôpital Necker-Enfants Malades, 149 rue de Sèvres, 75015, Paris, France
Quentin Hennocq
Département de Génétique Médicale, Maladies Rares et Médecine Personnalisée, Génétique clinique, CHU Montpellier, Centre de référence anomalies du développement SOOR, INSERM U1183, Montpellier University, Montpellier, France
Marjolaine Willems, Flavien Rouxel, Kevin Yauy & David Geneviève
Service de médecine génomique des maladies rares, Hôpital Necker-Enfants Malades, Assistance Publique-Hôpitaux de Paris, Paris, France
Jeanne Amiel, Tania Attie-Bitach, Valérie Cormier-Daire, Stanislas Lyonnet & Marlène Rio
Service de Génétique, CHU Tours, UMR 1253, iBrain, Université de Tours, Inserm, Tours, France
Stéphanie Arpin & Annick Toutain
Nantes Université, CHU Nantes, Service de chirurgie maxillo-faciale et stomatologie, 44000, Nantes, France
Pierre Corre
Nantes Université, Oniris, UnivAngers, CHU Nantes, INSERM, Regenerative Medicine and Skeleton, RMeS, UMR 1229, 44000, Nantes, France
Pierre Corre
Univ. Grenoble Alpes, Inserm, U1209, IAB, CHU Grenoble Alpes, 38000, Grenoble, France
Klaus Dieterich
HeKA team, INRIA, 75012, Paris, France
Jean Feydy
MEDISYN Genetics, Lausanne, Switzerland
Fabienne Giuliano
Center of Excellence in Genomics and Precision Dentistry, Department of Physiology, Faculty of Dentistry, Chulalongkorn University, Bangkok, Thailand
Thantrira Porntaveetus
Center of Excellence for Medical Genomics, Department of Pediatrics, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
Vorasuk Shotelersuk

Authors

Quentin Hennocq
View author publications
You can also search for this author in PubMed Google Scholar
Marjolaine Willems
View author publications
You can also search for this author in PubMed Google Scholar
Jeanne Amiel
View author publications
You can also search for this author in PubMed Google Scholar
Stéphanie Arpin
View author publications
You can also search for this author in PubMed Google Scholar
Tania Attie-Bitach
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Bongibault
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Bouygues
View author publications
You can also search for this author in PubMed Google Scholar
Valérie Cormier-Daire
View author publications
You can also search for this author in PubMed Google Scholar
Pierre Corre
View author publications
You can also search for this author in PubMed Google Scholar
Klaus Dieterich
View author publications
You can also search for this author in PubMed Google Scholar
Maxime Douillet
View author publications
You can also search for this author in PubMed Google Scholar
Jean Feydy
View author publications
You can also search for this author in PubMed Google Scholar
Eva Galliani
View author publications
You can also search for this author in PubMed Google Scholar
Fabienne Giuliano
View author publications
You can also search for this author in PubMed Google Scholar
Stanislas Lyonnet
View author publications
You can also search for this author in PubMed Google Scholar
Arnaud Picard
View author publications
You can also search for this author in PubMed Google Scholar
Thantrira Porntaveetus
View author publications
You can also search for this author in PubMed Google Scholar
Marlène Rio
View author publications
You can also search for this author in PubMed Google Scholar
Flavien Rouxel
View author publications
You can also search for this author in PubMed Google Scholar
Vorasuk Shotelersuk
View author publications
You can also search for this author in PubMed Google Scholar
Annick Toutain
View author publications
You can also search for this author in PubMed Google Scholar
Kevin Yauy
View author publications
You can also search for this author in PubMed Google Scholar
David Geneviève
View author publications
You can also search for this author in PubMed Google Scholar
Roman H. Khonsari
View author publications
You can also search for this author in PubMed Google Scholar
Nicolas Garcelon
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors read and approved the final manuscript. Conceptualization: Q.H., RH.K., N.G., D.G. Data curation: Q.H., M.D., T.B., T.B., Formal analysis: Q.H., J.F., T.B., T.B. Funding acquisition: N.G., RH.K., S.L., V.CD., Investigation: Q.H., RH.K., N.G., D.G. Methodology: Q.H., N.G., D.G., RH.K., J.F., K.Y., F.R. Project administration: S.L., N.G., M.R., J.A. Resources: A.T., D.G., M.W., K.D., S.A., T.P., F.G., V.S., P.C. Software: Q.H., T.B., T.B. Supervision: Q.H., RH.K., N.G., D.G. Validation: Q.H., RH.K., N.G., D.G. Visualization : Q.H., RH.K., N.G., D.G. Writing-original draft: Q.H. Writing-review & editing: Q.H., RH.K., N.G., D.G., A.T., D.G., M.W., K.D., S.A., T.P., F.G., V.S., P.C.

Corresponding author

Correspondence to Quentin Hennocq.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Table 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hennocq, Q., Willems, M., Amiel, J. et al. Next generation phenotyping for diagnosis and phenotype–genotype correlations in Kabuki syndrome. Sci Rep 14, 2330 (2024). https://doi.org/10.1038/s41598-024-52691-3

Download citation

Received: 20 December 2023
Accepted: 22 January 2024
Published: 28 January 2024
DOI: https://doi.org/10.1038/s41598-024-52691-3

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Genome-wide association studies

Demographic bias in misdiagnosis by computational pathology models

Genome-wide association analyses identify 95 risk loci and provide insights into the neurobiology of post-traumatic stress disorder

Introduction

Materials and methods

Photographic dataset

Validation set

Landmarking

Geometric morphometrics

Texture extraction

Stratification using metadata

Classification model

Uniform Manifold Approximation and Projection (UMAP) representations

Classification designs

Ethics approval

Consent to participate

Consent to publish

Results

Population description

Design №1 : KS vs controls

Design №2 : KS1 vs KS2

Design №3: PTV vs PAV in KS1

Discussion

Conclusion

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Table 1.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links