Introduction

Several techniques used in forensic sciences rely on subjective operator-dependent procedures1. The decision-making process behind these procedures requires experience and may lead to error rates with a significant impact in practice2. Important contributions of forensic dentistry to forensic sciences emerged from radio-diagnostic procedures, such as dental charting for human identification3,4,5 and dental staging for age estimation6,7,8,9,10. Computer-based tools were developed to create a man–machine interface and reduce bias on the operator’s side. Software like KMD PlassData DVI™ (KMD s/a, Ballerup, Denmark) added quality control procedures to the reconciliation process, made disaster victim identification less time-consuming, and enabled more straightforward human identifications11. In dental age estimation, promising automated techniques reduced the number of manual interactions needed to allocate developmental stages to teeth examined on radiographs12. While dental charting has a fundamental role in comparative human identification, dental age estimation contributes indirectly as a reconstructive factor.

Among the reconstructive factors, sex plays a fundamental part in narrowing lists of missing persons13. When biological/physical sex-related parameters are available, they may lead to a binary segregation of the victims (into males and females) and limit the number of required antemortem (AM) and postmortem (PM) comparisons14. A recent systematic literature review with over a hundred eligible studies highlighted the importance of dentomaxillofacial features in the process of sexual dimorphism15. According to the authors, the existing techniques for sexual dimorphism based on teeth can be biochemical (e.g. from the analysis of dental tissues), metric (namely measuring teeth), or non-metric (e.g. relying on dental morphology)15. Biochemical techniques seem to be more accurate15 and represent the current state of the art when it comes to dental analyses. However, the application of these techniques in practice is restricted because they require advanced facilities and tools that are not usually available in most medicolegal institutes, especially in developing countries.

The most common techniques debated in the current scientific literature fall within the group of metric analyses, in which linear measurements (mesiodistal width and intercanine distance) and volumetric assessments can be performed ex vivo or through 2D (radiographic/photographic) or 3D (tomographic scan) imaging16. In this context, examiner reproducibility is a drawback, since millimetric measurements and volumetric analyses require extensive calibration and training. In order to reduce operator-dependent interactions, artificial intelligence could figure as an option to enhance the diagnostic performance of sex estimation techniques. Machine learning algorithms are known to learn underlying relationships in data and support the decision-making process (or even make decisions without requiring explicit instructions)17. In 1989, the concept of the Convolutional Neural Network (CNN) was introduced and demonstrated enormous potential for tasks related to computer vision. CNNs are among the best learning algorithms for understanding images and have demonstrated exemplary performance in tasks related to image segmentation, classification, detection, and retrieval18. One of the most outstanding features of CNNs is their ability to explore spatial or temporal correlation in the data. The CNN topology is divided into several learning stages that consist of a combination of convolutional layers, non-linear processing units, and subsampling layers19. Since the late ’90s, several improvements in the learning architecture of CNNs have been made to enable the assessment of large, heterogeneous, complex, and multiclass datasets19. The proposed innovations included the modification of image processing units, optimization for the assessment of parameters and hyperparameters, new “design” patterns, and layer connectivity18,20,21.

In this scenario, artificial intelligence could find productive grounds for the use of radiographic datasets and could be challenged for sexual dimorphism. However, given the existing scientific literature and the morphological parameters currently known to be dimorphic (e.g. the maxillary sinuses22), testing the performance of machine learning algorithms to estimate the sex of adults would be merely confirmatory. In order to propose a real challenge to artificial intelligence, sexual dimorphism could be performed with a sample of children and juveniles—a population in which anthropological indicators of sex are not well-pronounced or at least not fully expressed.

In country-specific jurisdictions, the admissibility of evidence in Court depends on several technical aspects, including knowledge of the method’s error (a factor included in Daubert’s rule, for instance). With that in mind, testing forensic solutions developed with artificial intelligence and investigating the accuracy of the method (and its inherent error) are initial steps prior to implementing computer-aided tools in practice. This diagnostic study aimed to use a radiographic dataset in a machine learning setup to promote an automated process of sexual dimorphism based on dentomaxillofacial features of children and juveniles.

Materials and methods

Ethical aspects and study design

This was a diagnostic study with retrospective sample collection. The methodological architecture was based on a medical imaging dataset used to feed machine learning within the context of artificial intelligence. Informed consent was waived because the study was observational and relied on retrospective sampling from a pre-existing image database, but ethical approval was obtained from the Ethics Committee in Human Research of Faculdade Sao Leopoldo Mandic. The Declaration of Helsinki (DoH, 2013) was followed to assure ethical standards in this medical research. The sample was collected from a pre-existing institutional image database; hence, no patient was prospectively exposed to ionizing radiation merely for research purposes. All the images that populated the database were obtained for diagnostic, therapeutic, or follow-up reasons.

Sample and participants

The sample consisted of panoramic radiographs (n = 4003; 1809 males and 2194 females) collected according to the following eligibility criteria: Inclusion criteria—radiographs of male and female Brazilian individuals aged between 6 and 22.9 years. Exclusion criteria—panoramic radiographs missing patient information on sex, date of birth, or date of image acquisition; visible bone lesions and anatomic deformity; the presence of implants and extensive restorative materials; severely displaced and/or supernumerary teeth. The radiographs were obtained from a private oral imaging company in the Central-Western region of Brazil. The images were imported to an EliteBook 15.6" FHD laptop with an i5 processor (Hewlett-Packard, Palo Alto, CA, USA) for analysis.

The annotations were accomplished by three trained observers with experience in forensic odontology, supervised by a forensic odontologist with 11 years of practice in the field. A bounding-box tool was used to annotate the region of interest in the Darwin V7 (V7 Labs, London, UK) software package23. Vertically (y-axis), the upper limit of the box covered the apical region of the most superior teeth, whilst the lower limit covered the apical region of the most inferior teeth. Laterally (x-axis), the box ended right after the third molars, bilaterally. The final selection of the region of interest was represented by a rectangular box covering all the teeth visible in the panoramic radiograph. The images were anonymized for annotation, hiding age and sex information. The software registered the annotations, which were later tested for association with sex.

Pre-processing and training approach

The full dataset of panoramic radiographs was initially divided into the age groups “under 15 years” (n = 2254) and “equal to or older than 15 years” (n = 1749). This division was intended to challenge the network with regard to sexual dimorphism. In children, sexual dimorphism is more difficult because the expression of external sexual features is not pronounced. The age of 15 years represents a transitional point to a fully developed permanent dentition (except for the third molars)8. Normally, all the permanent teeth will have fully developed crowns around this age8. The roots, if not fully developed, will present a late stage of formation16. In each age group (< 15 years vs. ≥ 15 years), a single classification problem was established (sexual dimorphism), with a binary outcome expected for both sex (male vs. female) and age (< 15 years vs. ≥ 15 years). Hence, four classes were considered in this study: males under 15 vs. females under 15; and males 15 or over vs. females 15 or over (Fig. 1).
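The four-class setup above amounts to a simple labeling rule crossing binary sex with the 15-year age threshold; the function and label names below are illustrative and not taken from the study's code.

```python
def assign_class(sex: str, age_years: float) -> str:
    """Map a radiograph's metadata to one of the four study classes:
    binary sex (male/female) crossed with the 15-year age threshold."""
    age_group = "under15" if age_years < 15 else "15plus"
    return f"{sex}_{age_group}"

# A 14.9-year-old female falls in the younger group,
# a 15.0-year-old male in the older one.
labels = [assign_class("female", 14.9), assign_class("male", 15.0)]
```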

Figure 1
figure 1

Model structured for this study showing the workflow from sampling, image processing, annotation, cross-validation, training/validation to classification.

Next, the images were pre-processed preserving a high level of detail and signal-to-noise ratio while avoiding photometric nonlinearity and geometric distortion. Initially, in this study, we used eight CNN architectures, namely DenseNet121, InceptionV3, Xception, InceptionResNetV2, ResNet50, ResNet101, MobileNetV2, and VGG16. DenseNet121 was selected because it is one of the most successful models of recent times and is available from open sources (e.g. PyTorch, TensorFlow, and the Keras API). Additionally, it must be noted that DenseNet121 outperformed the other architectures during a pilot study that we performed with 100 epochs (Table 1). Table 2 shows the characteristics of the architecture models used in this study.

Table 1 Summarized results of the metrics of the seven models evaluated in a pilot test to support the decision-making process for the selection of a network.
Table 2 Specifics of the CNN architectures applied and tested in this study.

In this study, we evaluated the DenseNet121 architecture using two training approaches: From Scratch (FS) and Transfer Learning (TL). With FS, the network weights are not inherited from a previous model but are randomly initialized. This approach 1) requires a larger training set, 2) carries a higher risk of overfitting28, since the network has no experience from previous training sessions, and 3) relies entirely on the input data to define all inherent weights. However, it allows the creation of a network topology tailored to a specific problem/question. TL is a method that reuses models trained on other tasks as a starting point for new domains of interest. Consequently, the network borrows data (with original labels) or extracts knowledge from related fields to obtain the highest possible performance in the area of interest24,25. As per standard practice, TL can be applied using a base neural network as a fixed feature extractor: the images of the target dataset are fed to the deep neural network, and the features generated as input to the final-layer classifier are extracted26. From these features, a new classifier is built and the model is created. Specifically for the last layers of the base network, a fine-tuning strategy is added, in which the weights of previous layers are also modified. We used pre-trained weights based on the ImageNet model27 and implemented transfer learning to best fit our dataset.
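The TL and FS setups described above can be sketched with the Keras API. Only the DenseNet121 base, the ImageNet weights, and the SGD optimizer come from the study; the input size, pooling choice, and four-class softmax head are illustrative assumptions.

```python
from tensorflow import keras

def build_model(num_classes=4, weights="imagenet", lr=1e-3):
    """DenseNet121 base plus a new classifier head.

    weights="imagenet" gives the Transfer Learning (TL) setup, with the
    base frozen as a fixed feature extractor; weights=None gives the
    From Scratch (FS) counterpart with randomly initialized weights.
    The 224x224 input size is an assumption for illustration.
    """
    base = keras.applications.DenseNet121(
        weights=weights, include_top=False,
        input_shape=(224, 224, 3), pooling="avg")
    base.trainable = weights is None  # freeze pre-trained features for TL
    model = keras.Sequential([
        base,
        keras.layers.Dense(num_classes, activation="softmax"),
    ])
    model.compile(optimizer=keras.optimizers.SGD(learning_rate=lr),
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model

# tl_model = build_model(weights="imagenet")  # TL: frozen ImageNet features
# fs_model = build_model(weights=None)        # FS: random initialization
# Fine-tuning would later set base.trainable = True and re-compile
# with a much smaller learning rate.
```

The fine-tuning stage mentioned above then unfreezes the base layers and continues training so the pre-trained weights shift only slightly.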

To avoid overfitting and improve the generalizability of the evaluated models (given the limited number of images in the dataset), we used a computational framework (Keras29) with pre-processing layers to create a pipeline of image data augmentation layers, which can also be used as independent pre-processing code in non-Keras workflows30. These layers apply random augmentation transformations to a batch of images and are active only during training30. Table 3 presents each layer with its respective implemented parameters.

Table 3 Image data augmentation layers and parameters.
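A pipeline of this kind can be sketched as follows; the specific layers and parameter values below are assumptions standing in for those listed in Table 3, and the layer names follow the current Keras API.

```python
import tensorflow as tf
from tensorflow import keras

# Illustrative augmentation pipeline; layers and parameters are
# placeholders for those in Table 3.
augmentation = keras.Sequential([
    keras.layers.RandomRotation(0.02),           # fraction of a full circle
    keras.layers.RandomTranslation(0.05, 0.05),  # height/width shift factors
    keras.layers.RandomZoom(0.1),
    keras.layers.RandomContrast(0.1),
])

# The random transformations are applied only in training mode;
# in inference mode the images pass through unchanged.
batch = tf.random.uniform((2, 64, 64, 1))
augmented = augmentation(batch, training=True)
```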

A stochastic gradient descent (SGD) optimizer was used to optimize the training process. We initially set a base learning rate of 1 × 10⁻³. The base learning rate was decreased to 6 × 10⁻⁶ with increased iterations. In the validation process, we used the k-fold cross-validation method31,32. The dataset was divided into 5 (k) mutually exclusive subsets of the same size (five sets of 20% of the sample). This strategy reserves one subset (20%) for testing, while the remaining k − 1 subsets (80%) are used to estimate the parameters (training). The five sets rotated over five repetitions for each training approach (TL and FS), meaning that each training run used a different (randomly selected) dataset built from the original sample. Hence, images used during the training process were not used in the subsequent validation stage within the same k-fold training-test split. After this process, quantification of the model accuracy is feasible.
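The 5-fold partitioning described above amounts to the following index bookkeeping (a minimal sketch; the study's actual splitting code and random seed are not known):

```python
import random

def k_fold_splits(n_samples, k=5, seed=42):
    """Partition sample indices into k mutually exclusive folds of
    near-equal size. Each fold serves once as the 20% test set while
    the remaining k - 1 folds form the 80% training set."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    splits = []
    for i in range(k):
        test = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        splits.append((train, test))
    return splits

# One (train, test) split per fold for the study's 4003 radiographs.
splits = k_fold_splits(4003)
```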

Diagnostic metrics

To evaluate the (radio-diagnostic) classification performance of the proposed architecture, the loss, overall accuracy, F1-score, precision, recall, and specificity were selected as performance metrics (Table 4). In the training stage, the internal weights of the model are updated over several iterations. We monitored each iteration during training, registering the weights that gave the model its best predictive power, as determined by the overall accuracy metric.

Table 4 Diagnostic metrics used to evaluate the performance of the investigated CNN architectures.
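The metrics in Table 4 all derive from the binary confusion counts; a minimal sketch (the counts below are hypothetical, for illustration only):

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard metrics derived from a binary confusion matrix:
    overall accuracy, precision, recall (sensitivity), specificity,
    and F1-score (harmonic mean of precision and recall)."""
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    specificity = tn / (tn + fp)
    f1 = 2 * precision * recall / (precision + recall)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "specificity": specificity, "f1": f1}

# Hypothetical counts for illustration only.
metrics = diagnostic_metrics(tp=80, fp=10, fn=20, tn=90)
```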

Additionally, this study quantified the performance of the CNN in a confusion matrix33 for FS and TL. The matrix contains information about the true (real) and predicted classifications accomplished by the CNN. This approach helps in finding and reducing bias and variance issues and enables adjustments capable of producing more accurate results. Another approach used in this study was the Receiver Operating Characteristic (ROC) curve34, a diagnostic tool for analyzing classification performance in terms of sensitivity, specificity, and the area under the curve (AUC). Visual outcomes were illustrated with gradient-weighted class activation mapping (Grad-CAM) to indicate the region of the panoramic radiograph that was most activated during the machine-guided decision to classify females and males. The study was performed on a Linux machine with Ubuntu 20.04, an Intel® Core(TM) i7-6800K processor, 2 Nvidia® GTX Titan Xp 12 GB GPUs, and 64 GB of DDR4 RAM. All models were developed using TensorFlow API version 2.535 and Keras version 2.529. Python 3.8.10 was used for algorithm implementation and data wrangling36.
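The AUC summarizing each ROC curve can be computed directly from predicted scores as the probability that a randomly chosen positive case outranks a randomly chosen negative one (the Mann-Whitney formulation); a minimal sketch with hypothetical scores:

```python
def roc_auc(labels, scores):
    """AUC as the fraction of positive/negative pairs in which the
    positive case receives the higher score (ties count as half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical predicted probabilities for five cases.
auc = roc_auc([1, 1, 1, 0, 0], [0.9, 0.8, 0.4, 0.5, 0.2])
```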

Results

The performance of the DenseNet121 architecture tested with the FS and TL approaches showed that the former had an overall accuracy rate of 0.71 with a specificity rate of 0.87. With TL, the overall accuracy increased to 0.82 with a specificity rate of 0.92; between k-folds 1–5, TL accuracy ranged between 0.81 and 0.83. All the other metrics quantified in this study confirmed the superior performance of TL over FS (Table 5).

Table 5 Quantified performances of DenseNet121 with FS and TL architectures.

A deeper look at FS and TL considering the metrics of loss and accuracy per epoch is presented in Figs. 2 and 3, respectively. In both approaches, loss (the combination of errors after iterations) decreases progressively with the epochs, while accuracy increases, both during the training and validation setups. TL, however, shows a more evident reduction of loss over time, within a shallow curve that ends close to zero by the end of the 100 epochs. This phenomenon is not observed in FS. Additionally, the accuracy of TL is represented by a more curvilinear improvement that starts above 0.5 and increases to nearly 1. In FS, the accuracy curve starts above 0.6 (initially better) and stabilizes around 0.9. These outcomes show that TL improved more over sequential iterations.

Figure 2
figure 2

Graphs representing the loss and evolutionary accuracy of the training process and learning validation with From Scratch (FS) architecture in DenseNet121.

Figure 3
figure 3

Graphs representing the loss and evolutionary accuracy of the training process and learning validation with Transfer Learning (TL) architecture in DenseNet121.

Figure 4 shows the confusion matrix for the performance of DenseNet121 in classifying males and females in the age groups below and at or above 15 years. In the older group, the FS approach reached 0.83 and 0.72 for the correct classification of females and males, respectively. In the younger group, the classification rates decreased to 0.79 and 0.53, respectively. With TL, the correct classification of females and males in the older group reached 0.87 and 0.84, respectively, while in the younger group the classification rates were 0.80 and 0.83, respectively. The superior performance of TL over FS within DenseNet121 is visualized in Fig. 5.

Figure 4
figure 4

Normalized Confusion Matrix with the classification frequencies for each group set in the learning model. Outcomes presented for DenseNet121 using From Scratch (FS) and Transfer Learning (TL) architectures.

Figure 5
figure 5

Receiver Operating Characteristic (ROC) curves to MultiClass analyses using DenseNet121 with From Scratch (FS) and Transfer Learning (TL) architectures.

ROC curves for FS showed AUC values of 0.87 and 0.82 for the classification of females and males at or above the age of 15 years, and 0.79 and 0.74 for females and males below the age of 15 years. The AUC obtained with TL reached 0.91 and 0.90 for females and males in the older age group, and 0.87 for both sexes in the younger age group.

Finally, Fig. 6 shows the gradient-weighted class activation mapping (Grad-CAM) in which stronger signals (reddish) were observed around the crowns of anterior and posterior teeth. Weak signals (blueish) were observed in root and bone regions.

Figure 6
figure 6

Samples of images representing the four classes used for the classification process with the representation of the Gradient-weighted Class Activation Mapping (Grad-CAM) and the scaled representation of the heatmap.

Discussion

Sexual dimorphism is a crucial step in the anthropological process of building the biological profile of the deceased37. In general, sex-related differences between males and females are expressed as changes in the shape and size of anatomic structures38. Puberty is a biological landmark that triggers more evident differences between males and females39. Over time, these differences will manifest especially in the pelvic bones and the skull40. Teeth, however, are known for their resistance to environmental effects (extrinsic factors) and systemic health conditions (intrinsic factors); and are available for forensic examination in most cases. Moreover, the radiographic visualization of dental anatomy is optimal given the highly mineralized tissues of crown and root(s). This study proposed the use of artificial intelligence for the radio-diagnostic task of sexual dimorphism from human teeth.

A preliminary challenge proposed to test the artificial intelligence in this study was the inclusion of anatomically immature individuals in the sample. Early in life, the human skeleton has not yet been fully influenced by hormonal changes, and the maxillofacial bones are still similar between males and females in childhood. More specifically, the age limits of the addressed population were 6 and 22.9 years—an interval that covers children, adolescents, and young adults. Deciduous and some permanent teeth, on the other hand, will express full development in childhood. The permanent mandibular first molar, for instance, shows apex closure around the age of 7.5 years. Aris et al.39 explain that teeth that fully develop long before puberty may have observable dimorphic features that can be explored even before the expression of skeletal dimorphism. Hence, the rationale at this point was to test the performance of the artificial intelligence in a scenario in which the mandible, maxillae, and other skull bones would not play a major role in sexual dimorphism, giving teeth the chance to express their dimorphic potential.

The radiographic aspect of the present study differs from the (physical) anthropological assessment of Aris et al.39, because our study has the preliminary and fundamental scope of screening teeth (or tooth regions) that can play a more important part in distinguishing males and females. In a future step, teeth and tooth regions detected as dimorphic in the present study could be tested and validated by means of physical examination (i.e. ex vivo studies). Among the main advantages of the radiographic approach are the visualization of dental anatomy, including the internal aspects of the crown and roots (namely the pulp chamber and root canals, respectively), and the possibility of retrospective dataset sampling from existing databases—which is hampered in observational anthropological/archaeological studies.

The DenseNet121 architecture running with the TL training approach over 100 epochs led to the best performance for sexual dimorphism. Particularly, the training accuracy remained high (above 80%) between epochs 19 and 100, while the validation accuracy ranged between 70% and 83% after epoch 31. Consequently, the average accuracy of TL was 82%, with an average specificity of 92% in the total sample. Some authors claim41 that when the entire skeleton is available for anthropological assessment, the accuracy of sexual dimorphism can reach 100%. This phenomenon is justified by the contribution of the pelvic bones and skull to the analyses. Studies solely based on teeth present much lower estimates. Paknahad et al.42, for instance, performed a study with bitewing radiographs and reported an accuracy of 68% for sexual dimorphism based on odontometric assessments of the deciduous second molars (mandibular and maxillary). In our study, the higher accuracy rates are possibly justified by the integral assessment of dental anatomy (all the visible bidimensional features of the teeth were considered) in the process of sexual dimorphism—instead of specific linear measurements. In the study of Paknahad et al.42, only the widths of the enamel, dentin, and pulp space were considered. Moreover, our study assessed radiographs of 4003 individuals, while the previous authors42 sampled only 124 individuals. In practice, a preliminary overall accuracy of 82% (specificity of 92%) corroborates DenseNet121 with the TL approach as a proper tool for radiographic sexual dimorphism.

The purpose of the present study, however, was to challenge the artificial intelligence even further. To that end, the sample was divided into males and females below and at or above the age of 15 years. ROC curves obtained during the analyses per age category showed AUC values of 0.90 and 0.91 for males and females over the age of 15, respectively, while in the younger group the AUC was 0.87 for both males and females. These outcomes confirm that, in fact, sexual dimorphism is more challenging among children (in this case, between 6 and 14.9 years). In both groups, however, the AUC was considered excellent for diagnostic accuracy tests43. Consequently, the features assessed from panoramic radiographs in the present study had enough discriminant power to distinguish males and females with accurate performance.

The Grad-CAM images obtained in our study showed a similar region of activation in both age groups. In general, the activation region was more centralized and horizontal, surrounding the crowns of the anterior and posterior teeth. These outcomes are corroborated by studies that show the dimorphic value of canines44,45 and incisors41 between males and females.

This is a preliminary study to understand the discriminant power of dental morphology to distinguish males and females using panoramic radiographs. At this point, these outcomes should not be translated to practice since they currently serve to screen regions of teeth that may weigh more for sexual dimorphism. A few cases in the scientific literature reported the use of postmortem panoramic radiographs for human identification46,47. In these cases, the current findings could have a more tangible application. For anthropological practices in single cases and mass disasters, more comprehensive knowledge of radiographic sexual dimorphism is needed, especially when it comes to the effects of age on dental morphological features.

Conclusion

The dentomaxillofacial features assessed on panoramic radiographs in the present study showed discriminant power to distinguish males and females with excellent accuracy. Higher accuracy rates were observed among adolescents and young adults (older group) compared with children (younger group). The DenseNet121 architecture with the TL approach led to the best outcomes compared with FS. The regions with the strongest activation signals for machine-guided sexual dimorphism were around the crowns of the anterior and posterior teeth.