Introduction

Age estimation is a key step in human identification, legal practice and clinical research. Some areas of interest are in the aid of identification of missing people, crimes against minors, adoption procedures, invalid identification documents and mass migration1,2,3,4. For such reasons, a reliable standardized age estimation tool to differentiate minors from adults is of great interest1,5.

Based on research on the distinguishable features of skull bones, symphysis pubis, long bones, hand bones and permanent dentition2,6,7,8,9,10, the Study Group on Forensic Age Diagnostics of the German Society of Legal Medicine (AGFAD) presented the criteria for age estimation in lawsuits. Dental examination is highly reliable because dental development is less influenced by both internal and external factors1,4. With the exception of the third molars, permanent dentition completed its development between 12 and 14 years of age. Third molars are estimated to develop between the ages of 15.7 and 23.3 years, and because this timeframe intercrosses the legal age of 18, they can serve as a discriminant tool to distinguish adults from minors1,2,3,5.

In Portugal, the age of criminal and legal responsibility is 14 years old. If a crime is committed by an individual between 12 and 16 years old, he will be tried in a juvenile court. In the event of a conviction, the individual may be condemned to serve time in a closed educational center. On the other hand, if the person who commits the crime has between 16 and 18 years of age, may be considered adult and will be judged under general criminal laws. For all purposes, the legal age of adulthood in Portugal is 18 years old11,12.

In 2008, Cameriere et al.13 proposed the Third Molar Maturation Index (I3M) as a predictive tool to discriminate adult age. The I3M relies on the relationship between chronological age and the measures of the open apices of the lower left third molar. The bidimensional widths of the apical and tooth lengths are measured and used to calculate the I3M. Then, a cutoff value (0.08) is set to differentiate minors from adults13.

Over the last years, I3M was successfully validated worldwide, in Europe2,13,14,15,16,17,18,19,20,21,22,23, Africa1,24,25,26,27, Asia5,28,29,30,31, America3,32,33,34,35,36 and Oceania37. Overall, I3M had considerable reliability among all countries, however, to date, this method is not validated for a Portuguese sample. Therefore, investigating if the I3M is suitable for the Portuguese population would be of great interest.

Given the evidence on high population movements in the Portuguese population, contributing to a higher level of diversity than some neighboring populations38,39, we aimed to test the validity of the I3M in a Portuguese representative sample, by using panoramic radiographs (PRs).. Secondly, we looked for the influence of sex on its validity. Our null hypothesis was that the I3M has no validity to discriminate adult age in this Portuguese sample.

Materials and methods

Source of data and sample size

This retrospective observational study has received approval from the Egas Moniz Ethics Committee (ID 887). Written informed consent was obtained for each participant, during the first appointment at the Egas Moniz Dental Clinic (EMDC), (Almada, Portugal). Regarding participants under 18, a parent and/or legal guardian and this study gave and signed the informed consent. This research was conducted in accordance with the Declaration of Helsinki, as revised in 2013.

The present study follows the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) reporting guidelines40 for validation of prediction models. This study was conducted on a triple-blind basis with respect to: (1) diagnosis and clinical outcome; (2) data collection; and (3) analysis.

Participants

A consecutive sample of 1267 digital PRs, taken between September 2017 and January 2020, were considered for this study.

The inclusion criteria were: the presence of the lower third molar; known chronological age between 12 and 24 years old5,13,36; and absence of evident bone pathologies or systemic diseases that may affect tooth development. Exclusion criteria included spatial orientation of the third molar that prevents correct measuring, endodontic treatment and/or coronary restoration on the third molar, extensive cavities or abnormal dental anatomy, congenital anomalies and poor-quality or distorted X-rays41.

Outcome, predictors and measurement reproducibility

All digital PRs were converted to JPEG documents for the examination with ImageJ image processing software (Graphics Suite X7, Ottawa, Canada) by one trained and calibrated observer (JAN). The I3M index of each evaluated third molar was performed according to the Cameriere et al. method13. If the root development of the third molar is complete, then I3M = 0.0. If not, I3M was calculated as the distance between the inner sides of the open apex (A and B) divided by the tooth length (C) (Fig. 1). In case of I3M < 0.08, the individual is classified as 18 years old or older and if I3M ≥ 0.08, the individual as considered a minor.

Figure 1
figure 1

Measurement of the third molar index.

The Kappa correlation index was used to test the agreement of classification of individuals younger than 18 years, aged 18 years or older. The Intraclass Correlation Coefficient (ICC) was used to analyze the measurements of open apices. Previously, twelve PRs were randomly chosen from the total sample, measured and remeasured one week later by the same researcher (JAN). Kappa index indicated a perfect agreement for the inter-examiner analysis (κ = 1.00). Intraclass correlation coefficient (ICC) showed absolute agreement (ICC = 0.96), according to Landis and Koch42.

Statistical analysis

Data analysis was performed using SPSS Statistics v. 25.0 for Windows (IBM; Armonk, New York, USA). The estimated I3M index result was compared against the actual chronological age to determine the method’s performance in global and according to sex (females and males). For this purpose, contingency tables were used to calculate true positive (TP), true negative (TN), false positive (FP) and false negative (FN) values (Table 1). Then, several performance indicators were determined and detailed in Table 143. Performance measurement was assessed through binary and multiclass area under the curve (AUC), through receiver operating characteristics (ROC) analysis. The correspondent 95% confidence intervals (95% CI) were also determined. Bayes post-test probability (p) of being 18 years or older is computed to discriminate between those who are or are not aged 18 years or more. According to Bayes’ theorem, p may be written as:

$$p = \frac{Se \,x \,po}{{Se\, x \,po + \left( {1 - Sp} \right)\left( {1 - po} \right)}}$$

In the post-test probability p, p0 defines the probability that a participant is 18 years or older given that he or she is aged between 12 and 24 years, in the target population. In this study, we calculated the probability p0 as the proportion of participants between 18 and 24 years of age who live in Portugal and those who are aged between 12 and 24 years. This value (p0) was considered to be 0.55 (global, males and females) according to the Portuguese National Statistics Institute - Instituto Nacional de Estatística (INE) (https://www.ine.pt/xportal).

Table 1 Diagnostic performance indicators used in the comparative analysis.

Results

Participants

A total of 778 digital PRs (336 males and 442 females) met the inclusion criteria, with 489 being excluded due to absence of third molars. The distribution of age and sex is depicted in Table 2. The mean ages of the males and females, aged between 12 and 24 years, were 18.1 ± 3.0 years old and 17.7 ± 3.0, respectively, without statistically significant difference (p = 0.078).

Table 2 Participants age and sex distribution (n = 778).

Model performance

The estimated age of majority was correlated with the chronological age (p < 0.001). The I3M values decreased as age increased across all age groups in both sexes, showing that the lower third molar mineralization occurred earlier in females than in males (Fig. 2).

Figure 2
figure 2

Boxplot of relationship between chronological age and I3M of open apices of lower left third molar, according to females and males.

Then, pooled data of both sexes as well as separately was analysed (Table 3). The overall results were 92.8% (95% CI 91–94.6%), 90.7% (95% CI 88.7–92.8%) and 94.9% (95% CI 93.3–96.4%) for accuracy, sensitivity and specificity, respectively. Concerning the LR+ and the LR−, their values were as follows: 17.7 (95% CI 12.5–20.3) and 0.10 (95% CI 0.08–0.12). Global post-test probability was 95.6% (95% CI 94.1–97.0%).

Table 3 Performance values (in percentage), derived from 2 × 2 contingency tables (with correspondent 95% confidence intervals), of test of age of majority in the Portuguese population.

Males were better classified into adults or minors (94%; 95% CI 92.4–95.7%) than females (91.2%; 95% CI 89.9–93.8%). Specificity was better for females (95.7%; 95% CI 94.3–97.1%) compared to males (93.5%; 95% CI 91.8–95.3%) while sensitivity was better for males (94.9%; 95% CI 93.3–96.4%) compared to females (87.5%; 95% CI 85.2–89.8%). Both estimating post-test probability results were excellent; in males was 94.7% (95% CI 93.1–96.3%) and 96.2% (95% CI 94.2–97.5%) in females. The LR+ and LR− were 20.5 (95% CI 15.0–23.3) and 0.13 (95% CI 0.11–0.15), respectively, in females and 14.6 (95% CI 9.9–17.1) and 0.06 (95% CI 0.04–0.08), respectively, in males.

Data from articles showing results with cutoff value 0.08 in various European samples

Also, we compared the results of the present study with previous European studies (Tables 4 and 5 present).

Table 4 Global performance values (in percentage, with correspondent 95% confidence intervals), of test of age of majority in several European samples.
Table 5 Performance values (in percentage, with correspondent 95% confidence intervals), discriminated by sex, of test of age of majority in several European samples.

Discussion

The present study is the first to test the validity of I3M in a Portuguese sample. Overall, the null hypothesis was rejected, that is, the I3M is a reliable tool to discriminate adult age in this Portuguese population. Further, I3M was more accurate in male participants. These results have important implications because this tool has potential to be used in legal and criminal settings.

Discriminating minors from adults is important to prevent legal wrongful procedures, specially in cases where valid identification documents are lacking, in order to prevent legal wrongful procedures4,32. Notwithstanding, age estimation is challenging, particularly in the differentiation teens from young adults (aged 15 years old to early 20 s), as the physical appearance and characterization are not clearly related of being an adult1,4. From 15 years of age, the third molars are only teeth not fully developed, as this clinical circumstance is useful in forensic sciences44.

In the Portuguese context, age estimation methods involving dental measures were reported in previous studies45,46,47,48,49,50. Caldas et al. (2011)46 have evaluated third molars, however the method and the age range of these participants were not comparable. Thus, to the best of our knowledge, this is the first study that validates the use of the I3M cutoff value of 0.08 in the Portuguese population.

Cameriere et al. (2008)13 introduced a new methodology for age estimation based on the ratio between measures of the open apices and height of the third molar. Since then, the I3M has been validated in every continent with very strong results, showing only slight variations among different ethnicities.

The sensibility indicates the I3M ability to correctly identify individuals who are 18 years or older (I3M < 0.08). On the other hand, specificity is the ability to discriminate individuals younger than 18 years old (I3M ≥ 0.08). Santiago et al. (2018)41 reported that I3M outperforms in individuals younger than 18 years due to the higher specificity of this tool. This is key in forensic science because if a minor is wrongly processed as an adult it would violate its rights23,41.

To better integrate the results of our study within the European scenario, we analyzed every European study that have analyzed the validity of I3M (Fig. 3). Our pooled overall results on accuracy (92.8%; 95% CI 91.0–94.6%) outperformed all previous studies that presented pooled data2,13,17,22,23 . Both sensitivity and specificity pooled results also surpassed most previous European studies2,22,23 except for Cameriere et al.13 (98%) and De Luca et al.17 (95.7%; 95% CI 92.1–98.0%). The estimated post-test probability of the pooled data (95.6%; 95% CI 94.1–97.0%) showed similar results with previous studies.

Figure 3
figure 3

Pooled results from different European populations.

When analysing our data regarding males and females (Fig. 4), we demonstrated alike maturation of third molars in both males and females, which is in line with similar studies14,15,16,19,20,21,22,23,44,51. Previous European studies present slightly better accuracy and sensibility in males than in females14,15,16,19,20,21,22,23,44,51. Contrary, females presented better specificity than males14,15,16,19,20,21,22,23,44,51. Therefore, I3M seems to be slightly more accurate in males, although it is equivalent to the female participants in this Portuguese population. The effect of age on I3M performance is proposed to rely on the maturation development according to sex as women tend to develop at a younger ages than men52. This maturation difference is suggested to explain the I3M results discrepancies between both sexes, however more studies are needed to fully understand the biological reasons upon this subject19,41,51.

Figure 4
figure 4

Discriminated results by sex, from different European populations.

Estimated post-test probability of both male (94.7%; 95% CI 93.1–96.3%) and female (96.2%; 95% CI 94.8–97.5%) demonstrated results compatible with other European studies2,14,15,19,20,21,22,23,44,51. It is important to refer that Zelic et al.20 and Kelmendi et al.21 reported results very close to 100% in females, and Gulsahi et al.19 reported results of 100% in both sexes, although the sample size was one of the smallest in all European studies (n = 293), which may influence these results.

The likelihood ratio is a practical measure of diagnostic accuracy. The higher the value of LR + , more the test has the capability of establishing the tested condition; an LR + value greater than 10 defines it as a good diagnostic test. In our study, balanced values of LR + and LR − have been achieved. Although I3M presented higher LR + in females (20.5; 95% CI 15–23.3) then males (14.6; 95% CI 9.9–17.1), I3M is an excellent prediction of the probability of majority; and lower LR − (0.06; 95% CI 0.04–0.08) in males than (0.13; 95% CI 0.11–0.15) in females, prove that the test is also an excellent tool at classifying individuals younger than 18 years of age, despite the sex of the individuals19,23.

In conclusion, I3M is a suitable method in legal and forensic purposes to identify minors from adults in the Portuguese population. Further, the cut-off value used (0.08) is predictable and useful to discriminating individuals younger than 18 years of age (high specificity in both sexes).