Introduction

Mandibular shape has been studied extensively, both in 2-dimensional and 3-dimensional (3D) form. Changes during ontogeny involve vertical development of the ramus, increase of the gonial angle prominence, forward growth and backward inclination of the symphysis, and vertical extension of the alveolar process1,2,3,4,5. Variability within the population includes ramus and body width, the angle between the ramus and body, the height of the alveolar process and symphysis, and the inclination and width of the symphysis; sex differences, and results of aging and tooth loss, are additional sources of variation6,7,8,9,10,11.

In addition to its evident biological significance, understanding human mandibular form and its variations is clinically useful, particularly in cases of osseous defects resulting from trauma, pathology, congenital abnormalities, or, more frequently, tooth loss and subsequent alveolar bone resorption. Reconstruction of such defects can be guided by symmetry, if the contralateral side is present, or by employing an atlas of normative variation tailored to the remaining parts12,13,14,15,16. This atlas-based approach is commonly followed in the anthropology domain, where severely damaged specimens are reconstructed by thin plate spline (TPS) warping of a reference template17. A dense landmark configuration that comprehensively covers the specimen is obviously essential for detailed and accurate results.

The aims of this study were (a) to create a 3D atlas of human mandibular shape variability, using a dense landmark configuration uniformly covering the mandibular surface, which can be used for mandibular reconstruction, and (b) to explore potential sex and age differences in mandibular size and shape in an adult sample.

Materials and Methods

Ethical approval

All methods were carried out in accordance with relevant guidelines and regulations. The Scientific Committee of the Dental Association of Attica, Greece, and the Research Ethics Committee of the School of Dentistry, National and Kapodistrian University of Athens, gave ethical approval for this work (protocols 2151 and 109777, respectively). Informed consent was obtained from all subjects involved in the study.

Sample

The sample consisted of cone-beam computed tomography (CBCT) images, in the form of DICOM files, from the archives of a dental imaging centre and the Department of Oral Diagnosis and Radiology of the School of Dentistry, the National and Kapodistrian University of Athens. The data had been obtained previously for diagnostic purposes, unrelated to this study and all patients had signed consent forms for use of their images for research purposes. Voxel size ranged between 0.2 and 0.4 mm. The images were anonymized, retaining only age and sex data. For this study, we excluded patients who presented with gross anatomical deformities (e.g. hemifacial microsomia, missing condyle or ramus) but included patients with normal morphological variations, missing or extracted teeth, and alveolar bone resorption due to tooth loss.

We accrued patients until we reached 50 adult males and 50 adult females of a comparable age distribution (Table 1, Fig. 1). This size is considered sufficient for obtaining reliable data using geometric morphometric tools18,19. Seventy-six patients had at least one tooth missing, excluding the third molars, with an average number of 2.9 missing teeth among them (Table 1). All patients were of Greek ethnicity and sex was based on birth assignment.

Table 1 Demographics of the sample.
Figure 1
figure 1

Histogram of age (years) for the female (red) and male (blue) groups.

Preprocessing

All data processing and analysis was performed using the Viewbox 4 software (dHAL Software, Kifissia, Greece). We loaded the DICOM files and subsampled the volume by 50% for faster processing, if voxel size was 0.2 mm or smaller. The histogram of the voxel values was cropped by setting the top 0.2% of the voxels to the 99.8% value, to enhance contrast. The bone threshold was computed using Otsu multilevel histogram thresholding20 with three levels (air, soft tissues and bone). Segmentation was based on the computed bone threshold and manually refined at each axial slice using the software’s paintbrush, by the same observer (AC). Due to frequent artifacts from metallic restorations and prostheses, teeth could not always be segmented with confidence, and tooth surfaces should be considered unreliable. A triangular mesh surface was subsequently constructed from the segmentation using a variant of the marching cubes algorithm21. The internal bone structure was deleted and holes were closed, obtaining a water-tight mesh of the mandibular surface, including the dentition. Finally, a remeshing filter was applied, to increase triangle regularity and remove degenerate triangles22. The final meshes averaged approximately 130,000 vertices and 260,000 triangles. The average edge length of the triangles was 0.44 mm. All surfaces were uploaded and are available from the Zenodo repository (https://doi.org/10.5281/zenodo.6335430).

Landmarking

Digitization of the meshes was based on a template of curves and landmarks, with an even distribution over the entire mandibular surface, except for the dentition (Fig. 2, Supplementary Table S1). The template was based on a previously used template of 415 landmarks8,23, which we refined and augmented. We used a simplified symmetric mandibular mesh as the base for placing the landmarks. The mesh was modified by expanding it outwards by 1.5 mm, thus making it thicker overall, but most importantly in the ramus region (Supplementary Fig. S1). This avoided a common problem of the landmarks projecting on the wrong side of a thin ramus, and made ‘inflating’ the landmarks along the normals to the surface unnecessary24. The curves were cubic splines, adjusted to the mandibular surface by control points, and used for sliding the curve semilandmarks. There were 9 fixed landmarks, 84 curve semilandmarks and 426 surface semilandmarks, free to slide over the mesh surface, for a total of 519 points.

Figure 2
figure 2

Curves and landmarks shown on one of the meshes of the sample. Surface semilandmark colours facilitate detection of gross placement errors. Green lines: curves for sliding. See Supplementary Table S1 for definition of curves and landmarks.

Landmarks Condyle lateral, Condyle medial, Gonion and Gnathion were automatically located by the software as extrema of the mesh surface along pre-specified directions (Supplementary Table S1). The condylar poles (Condyle lateral, Condyle medial) were defined as the two points farther apart from each other along the condylar axis. Recognizing that this definition is circular, we implemented an iterative procedure (Supplementary Fig. S2): first, the lateral and medial points were placed on the condyle, at the farthest and closest point from the midsagittal plane, respectively, to define an initial condylar axis. Then, each point was repositioned outwards along the condylar axis and re-projected to the closest point on the mesh. Due to the ellipsoid shape of the condylar head, the projections fall closer to the true poles and define a new condylar axis. A few iterations of this step were sufficient to stabilize the points at the medial and lateral poles, at the farthest distance between them.

The curves Notch, Coronoid, and Posterior-inferior were also located automatically based on mesh curvature data and extremal directions. The procedure was similar to that used for the condylar poles, but the direction of outward movement was fixed (superior, inferior or posterior, depending on the curve). The Alveolar buccal and Alveolar lingual curves were digitized manually to follow the alveolar bone crest. All other landmarks were initially placed by a thin plate spline (TPS) warping of the template to the previously digitized points and curves, and then projected on the closest point of the mesh surface. To easily detect projection errors at this step and subsequent sliding steps, the surface semilandmarks were coloured differently, based on their anatomical position (Supplementary Fig. S1 and Fig. 2). Digitization was inspected and corrected manually. All digitizations were performed by the same investigator (AC).

Geometric morphometrics

Shape analysis was based on the traditional toolbox of 3D geometric morphometric (GM) methods. After each mesh was digitized, the semilandmarks were allowed to slide against the sample average shape, to minimize bending energy25,26,27, then all landmarks were re-projected on their corresponding curves or on the mesh surface. Sliding and projecting was repeated five times, each time reducing the sliding step in a linear fashion, to avoid oscillations in landmark position. The average shape was then re-computed and used as the reference configuration for a subsequent iteration of sliding and projecting. Finally, we aligned all configurations using the generalized Procrustes alignment (GPA) method28, computed centroid size, and ran a principal component analysis (PCA) to extract the most significant shape variation patterns as principal components (PC). PCA was performed both in shape space and form space, which includes the logarithm of the centroid size—ln(CS)—as an extra variable17. The final landmark coordinates are available from Zenodo (https://doi.org/10.5281/zenodo.6335430).

The region of the gonial angle has been reported to differ between the sexes, apparently due to stronger muscle attachment in males, causing gonial eversion and ramus flexure29,30. In order to investigate potential sex differences in this area, we ran a regional GPA and PCA analysis, confined to the landmarks of the ramus, on the right side. This included a total of 123 landmarks (Supplementary Fig. S3).

Additionally, we computed the volume enclosed by the mesh surface, and calculated the mesh normalized centroid size as the square root of the average sum of the squared distances of the mesh vertices to their centroid. Interpretation of these measurements needs caution, as the meshes also included the teeth (see “Discussion” section).

Statistical analysis

Digitization error was tested by repeating the digitization of 20 randomly selected meshes by a second investigator (DJH). We assessed measurement error by comparing the Procrustes distances between repeats to the extent of the sample in shape space and also by a Procrustes ANOVA31,32.

Shape patterns were visualized by TPS warping the average shape along the PC directions over the range of ± 3 standard deviations (SD). The distribution of the sample was inspected in shape space by a 3D plot of the relevant PCs. Shape comparison between the sexes was based on the Procrustes distance between the group means, and on Goodall’s F test on the squared Procrustes distances, both using permutation tests (100,000 permutations without replacement). A multivariate general linear model (MANOVA) was run, with PCs as the dependent variables, sex as the fixed factor, and age, centroid size, volume, and number of missing teeth as covariates. The model was then adjusted based on the interaction effects and multicollinearity.

Sex shape differences were visualized by TPS warping of the average shape along the trajectory connecting the average male and female shapes, exaggerated for clarity by three times the Procrustes distance between the sexes in shape space. Linear discriminant analysis (LDA), with a leave-one-out cross-validation (jackknifing) procedure, was used to compute the percentage of correctly classified subjects by sex, based on shape and size. For LDA and MANOVA, we used the non-trivial PCs, computed as those for which both the ‘random average under permutation’ (Avg-Rnd) and ‘eigenvalue under randomization’ (Rnd-Lambda) stopping criteria were satisfied (100 permutations)33. We tested static allometry by regression of the ln(CS) on the shape PCs. Sex differences in mandibular size were also tested by comparing the mandibular volumes and mesh centroid sizes.

Geometric morphometric tests (including PCA stopping criteria) and visualizations were performed by Viewbox (dHAL Software, Kifissia, Greece); other statistical tests were performed using MorphoJ version 1.08.0131, StatsDirect 1.8.10 (StatsDirect Ltd, Wirral, UK), PAST 4.0934 and IBM SPSS Statistics for Windows, version 28.0 (IBM Corp., Armonk, NY, USA).

Results

Digitization error

The average Procrustes distance between the 20 repeated digitizations was 2.9% of the extent of the sample in shape space, as determined by double the distance of the farthest specimen from the centre of the sample (diameter of hypersphere enclosing the sample). The repeated digitizations virtually coincided with the originals in the PC1-PC2 plot (Supplementary Fig. S4). Based on the Procrustes ANOVA (Supplementary Table S2), digitization error explained 0.4% of the total variation and repeatability was 0.99232.

Shape analysis

The plot of the sample in shape space is shown in Fig. 3 and the plot in form space in Supplementary Fig. S5. The Avg-Rnd and Rnd-Lambda stopping criteria33 showed 12 non-trivial PCs for the shape space analysis and 13 PCs for the form space analysis. In shape space, the first 3 PCs described 49% of the shape variance; the first 12 PCs described 80% of the variance (Supplementary Table S3).

Figure 3
figure 3

The sample plotted in shape space. Red: females, blue: males. Crosses show average of each group and standard deviations.

Shape variation, as described by the first 3 PCs of shape space, is shown in Fig. 4 (average shape warped along each PC at − 3 and + 3 standard deviations). PC1 described variability in the width of the mandible, the height of the ramus and the alveolar process. PC2 described variability in the angle between the ramus and the corpus, whereas PC3 described variability in the height of the coronoid process, the inclination of the symphysis and the prominence of the mandibular angle.

Figure 4
figure 4

Shape variations described by the first 3 PCs. Average shape warped along each PC at − 3 and + 3 standard deviations (green and purple, respectively). Top row: PC1, middle row: PC2, bottom row: PC3.

Univariate correlations

Table 2 shows correlations between the variables sex, age, ln(CS), number of missing teeth and mesh volume. Sex was related to landmark centroid size and mesh volume, and age was related to the number of missing teeth. There was no correlation between sex and the number of missing teeth. Volume and number of missing teeth showed a low correlation, which is not significant if a Bonferroni correction is applied to Table 2.

Table 2 Correlation matrix of independent variables.

The correlation between volume and ln(CS) was high for the whole sample (r2 = 0.45), and lower for each group separately (females: r2 = 0.19, P = 0.0016; males: r2 = 0.28, P < 0.0001). A higher correlation might be expected, but the volume effect of tooth loss and ensuing alveolar bone resorption is primarily in the vertical dimension, without significantly influencing the average distance of the landmarks from the centroid, which is located midsagittaly between the alveolar processes.

Sex differences

The male and female groups differed significantly in shape space (permutation tests; Procrustes distance between means: P < 0.0002; Goodall’s F test: P < 0.0002), mainly along PC2 and PC3. Figure 5 shows the intergroup shape difference of the average male and female shapes, exaggerated 3 times along the male–female vector, keeping the same centroid size. Males showed a wider mandible at the gonial angles, but narrower at the condyles and coronoids, a higher ramus with higher condylar and coronoid processes, more pronounced antegonial notch, and a more posteriorly inclined symphysis with prominence at menton. In contrast, females had a larger mandibular angle, a wider ramus and a more gracile body. These are relative differences, with size adjusted to be equal between the two groups.

Figure 5
figure 5

Superimposition of the male and female shapes. To enhance differences, the average male and female shapes were moved apart in shape space by three times the original Procrustes distance between them, along the male–female shape vector. Both shapes are scaled to the same centroid size. Male: blue, female: red.

There was a clear sex difference in mandibular size, expressed as the mesh centroid size (males larger by 8.9%), the mandibular volume (25%), or the landmark centroid size (8.3%) (Table 3, Fig. 6). The plot of the sample in form space showed very little overlap (Supplementary Fig. S5). A multivariate general linear model (MANOVA) of the first 12 PCs in shape space, as dependent variables, showed that sex and number of missing teeth were significant factors (Table 4).

Table 3 Size variables. Student’s t-test.
Figure 6
figure 6

Jitter strip plots of ln(CS) and mesh volume for the female (red) and male (blue) groups.

Table 4 General multivariate linear model of PCs 1-12 in shape space, by sex and number of missing teeth.

Discriminant analysis using the first 4 principal components in form space (where PC1 is heavily weighted on centroid size) showed a 93% correct sex classification. This reduced to 91% when including only PC1 and PC2 and did not increase beyond 93% even by including all 13 non-trivial PCs. LDA using the PCs of shape space had a correct classification of 59% when including the first 2 PCs, increasing to within the 66–72% range when adding PCs 3–12 (Supplementary Table S4).

Static allometry

Regression of ln(CS) on the shape variables of the pooled sample showed a significant but low correlation, less than 5% of the shape variance explained by size (10,000 permutations, predicted variance = 4.44%, P < 0.0001). However, this was lost when regressing each group separately (Table 5). In contrast, mandibular volume was related to the shape variables in both groups separately, and in the pooled sample, explaining almost 13% of shape variance in the female group. The number of missing teeth was also related to shape, but only in the male group and the pooled sample (Table 5).

Table 5 Correlations between independent variables and shape variables (PCs in shape space), tested with 10,000 permutations.

Age

There was no statistically significant correlation between age and shape, or between age and centroid size (Table 5 and Table 2, respectively).

Ramus analysis

As expected, a clear size and shape difference was also observed for the isolated ramus. Centroid size of males was larger by 11.8% (t-test, P < 0.0001). Shape differed significantly (P = 0.0003, 10,000 permutations). Similar shape differences were noted to those observed for the whole mandible (Fig. 7). We could not observe gonial angle eversion in the male group, nor ramus flexure.

Figure 7
figure 7

Superimposition of the male (blue) and female (red) groups showing shape differences of the ramus (here exaggerated). Middle: female ramus transparent; right: both rami transparent. Note female outline (red) curving towards the lingual side, and difference in condylar axis angulation.

Discussion

The morphology of the mandible does not provide a large number of distinctly identifiable landmarks to attain comprehensive coverage of its surface, potentially leading to loss of important phenotypic information. However, even if achievable, dense landmarking would not be sufficient unless complemented with reasonable confidence of landmark homology (correspondence) across specimens. We recognize two main approaches for establishing correspondences and landmarking of 3D surface meshes; both use a reference template (atlas) with the landmarks of interest already identified on it. The first approach performs a rigid alignment of the template mesh to match the target, followed by deformable registration to refine the match, and then transfers the landmarks from the atlas to the target mesh. Examples are ALPACA35,36 and MeshMonk37,38, which mainly differ in their non-rigid registration algorithm. Variants of this approach abolish landmarks altogether and achieve correspondences between the vertices of the meshes directly14,39.

The second approach needs digitization of a (relatively small) subset of the landmarks on both meshes, then performs a TPS warping of the template, driven by this subset of landmarks, and transfers the remaining points to the target mesh, usually by projection on the mesh surface. Thus, the first approach is essentially a mesh-to-mesh registration, whereas the second is a TPS warping of point configurations. This method belongs to the geometric morphometric toolbox17,28,40 and is usually followed by sliding of semilandmarks to enhance geometric correspondence25,26 and mapping of the specimens in a shape space via generalized Procrustes alignment41 and PCA.

The shape space is the final goal of all methods, as it represents a statistical shape model13,14 which describes population variability and can be used both as a reference for testing novel shapes and as a generative source for creating plausible anatomy. We consider GM methods preferable, due to their solid statistical foundation and excellent visualization tools. Although highly dense models are produced from methods that establish correspondences at the mesh vertex level13,14,39, the anatomical correspondence (homology) is not guaranteed26,42.

Studies of the mandible with dense landmarking are scarce. Most studies limit the landmarks on the anterior and posterior ramus ridges, the mandibular notch, the inferior outline, and the symphysis outline on the midsagittal plane, in addition to ubiquitous conventional landmarks, such as Gonion, Gnathion, Coronoid and the condylar poles5,6,7,9,43,44,45,46. The total number of landmarks ranges from below 20 (e.g.45) to above 100 (e.g. 11344, 30146 or 10005); however, seldom is the mandibular surface between ridges landmarked5,8,23,36.

In creating an atlas of shape variability, the design of the template is a key factor24. The base mesh does not need to be detailed or be one of the meshes of the sample; indeed, simpler geometries sometimes work better24. We used a simplified symmetric mandibular mesh, expanded outwards by 1.5 mm, to avoid the common problem of the landmarks projecting on the wrong surface24, especially in the area of the gonial angle, where the ramus can be thin and the gonial angle everted. We aimed for a large number of landmarks, dispersed evenly over the whole surface, to capture both the shape of the main ridges and the smooth areas in-between. The number of fixed landmarks was small, limited to the condyle poles, the coronoid processes, Gonion and Gnathion. Gonion is a problematic landmark, showing high identification error, both in 2D and 3D digitizations47,48,49. Although we could set this as a sliding semilandmark, or remove it altogether, we opted to retain it as a sentinel between the ramus and the corpus, to avoid the curve and surface semilandmarks from invading the wrong area. However, we located it algorithmically using a clearly defined geometric procedure, to reduce identification error (Supplementary Table S1). Gnathion was similarly located, as the farthest point from the condyle on the midsagittal plane. Locating landmarks algorithmically avoids subjectivity, and ensures repeatability and validity, although biological homology is debatable42. The curve semilandmarks were few and sparsely dispersed, to allow them to adjust by sliding, since a very high density effectively prohibits sliding and reverts to equidistant sampling. The surface semilandmarks were dispersed on the template mesh via a diffusion algorithm, to ensure an initially even distribution.

Our sample was a convenience sample, imaged for various reasons, most commonly for dental implant planning and third molar pre-extraction evaluation. Although there is no assurance that the average coincides with the average of the population, research has shown minimal effect of including even extreme cases50. Sample size was adequate for assessing average shape and shape variability18,19. Sex grouping was based on birth-assignment. Although several factors can affect the phenotype, such as genetic variations, sex chromosomes, epigenetic variation, hormones, environmental factors, and others51, it was not possible to investigate them in the present sample.

Landmarking was accurate, as shown by the repeated digitizations, because the curves were placed on well-defined ridges and the remaining surface points were located by a TPS warping of the template followed by sliding. Fixed landmarks were placed by automated heuristics (e.g. Gnathion, Gonion) further minimizing subjective identification52. The only curves that required full human intervention were the alveolar buccal and lingual curves.

Almost 80% of the shape variance was described by the first 12 PCs, the first 3 of these describing 49%, so most of the shape variability was included in only a few PCs, comparable to previous work7,36. Van der Wel et al.11 report a larger spread of shape variance among the PCs; this can be attributed to their sample comprising patients treated by orthognathic surgery, therefore potentially more extreme cases, and to segmentation artifacts in the area of the teeth, due to metallic orthodontic appliances. The number of landmarks is a significant factor affecting the percentage of shape variance distributed between the PCs. Studies with a few landmarks report a large fraction of variance in the first few PCs because shape is more coarsely measured (e.g. 14 landmarks: 67% shape variance in the first 2 PCs53, 13 landmarks: 61% variance in the first 2 PCs9,45).

The shape patterns were similar to those reported elsewhere, mainly related to mandibular width, angulation between the ramus and corpus, inclination of the symphysis and prominence of the gonial angle. PC1 described mandibular width variation, in relation to ramus height and corpus length (Fig. 4) whereas PC2 mainly described the ramus-corpus angulation. The same primary patterns are seen in the work of van der Wel et al.11, and potentially Fournier et al.36 and Kim et al.39, although the visualizations in those publications do not facilitate a direct comparison. The shape patterns obviously depend on several factors, including ethnicity, age, sex, and degree of edentulism. Our sample was mono-ethnic, equally divided by sex, and of low edentulism prevalence (average number of teeth missing: 2.2, Table 1), so the results need to be assessed under these conditions.

A significant effect of edentulism on mandibular shape has been observed9,45,54, which we detected here, but only in the male group and the pooled sample. In addition to reduction in the height of the alveolar process, loss of teeth was associated with retraction of the anterior alveolar area with relative prominence of menton, increase of the gonial angle and intercondylar distance, and posterior inclination of the ramus (Supplementary Fig. S6). We mention these associations with great caution, even though they agree with previous reports overall9,45,54, since our sample contained very few patients with many (> 5) missing teeth, did not contain completely edentulous mandibles, and alveolar resorption had not progressed significantly in several patients.

A clear sex difference was evident, both regarding size and shape, as expected for an adult sample10. Centroid size, computed from the mesh vertices, was 8.9% larger in males; however, mesh volume was 25% larger. A discrepancy between the two is expected because, with scaling, volume increases to the third power, whereas centroid size to the first power. However, the expected volume change would be larger, at 29% (1.0893 = 1.291). This can be explained by the mandible’s shape and the position of the centroid, which lies in empty space, on the midsagittal plane, at the level of the molars. The distance of the landmarks relative to the centroid is affected mostly by variation in mandibular width, and not so much by variation in ramus height, ramus anteroposterior width, or ramus and corpus thickness, factors that significantly affect volume. The superimposition of the size-adjusted averages (Fig. 5) shows differences in shape that explain this discrepancy between volume and centroid size ratios, e.g. a higher ramus and a more pronounced gonial and symphyseal area in the males.

The mesh-based size and volume measurements included the tooth regions. Any sex differences assessed by these variables could therefore be confounded by an unequal number of missing teeth between the groups, or by sex differences in tooth size. The first factor was not pertinent here because the degree of edentulism was similar between the groups (Table 1), but the second factor could be relevant, as males tend to have larger teeth than females55,56. However, the dentition area is small relative to the whole mesh, and tooth-size sex differences are also too small for them to be of concern here. In any case, these measurements are of low reliability, additionally because volume segmentation was often uncertain due to streaking artefacts.

The landmark-based centroid size difference was similar to the mesh-based difference (8.3% vs. 8.9%, respectively) and is considered more reliable as it is not affected by inclusion of teeth. Size differences have been noted in all previous studies of adult samples. Franklin et al.7 reported almost the same centroid size difference (7.8%) for their sample of 30 mandibles. Vallabh et al.57 list various linear measurements, of which ramus height shows the largest relative difference between sexes (14%) whereas width measurements are comparable to our centroid size ratio (intercondylar width: 5.6% and intergonial width 8.7%). This difference in intercondylar and intergonial widths is also reflected in the shape difference we detected (Fig. 5). Kranioti et al.58, in a sample of the same ethnic origin as ours, report a comparable inter-gonial width difference of 7.6%, giving a sex classification accuracy of 71%.

Shape differences were less pronounced than size differences, as seen when comparing Fig. 3 and Supplementary Fig. S5. In addition to a higher ramus, more pronounced gonial and mental areas, males showed a wider inter-gonial distance. Such sex differences have been noted by other investigators as well7,8,53.

One of the traits considered a male characteristic is gonial eversion, presumably arising from a strong masseteric attachment. However, evidence suggests that this difference is lower than initially assumed29,59. To overcome the qualitative nature of this trait, Oettlé et al.59 applied GM methods confined to the posterior ramal and gonial areas and obtained quantitative data. Although they detected differences between males and females, mainly in the extent and location of the eversion, the accuracy of sexing was below 75%. Our sample showed a larger inter-gonial width in males (Fig. 5), but there was no clear gonial eversion when examining the gonial area. On the contrary, the female outline curved towards the lingual and the male outline was straight (Fig. 7). A difference in condylar axis angulation was also observed.

Another alleged dimorphic trait is ramus flexure, an “angulation of the posterior border of the mandibular ramus at the level of the occlusal surface of the molars”30. Although the initial results for this trait were positive, later evidence is conflicting10,29,60,61. Unfortunately, this trait is qualitative and could not be incorporated into the GM analysis. Visual inspection of the posterior ramus border did not show flexure in our sample (Fig. 7).

Discriminant analysis based on the first 2–4 PCs in form space was very successful in assigning subjects to their correct group (91–93% accuracy). However, this was achieved mainly due to size differences, similarly to previous research using linear measurements between anatomical landmarks10,58. Classification accuracy was higher than that reported by previous studies using conventional linear and angular measurements. Kranioti et al.58 report an accuracy of 80% based on 2 linear variables in a Greek population. Other reports vary, from around 75% to almost 90%, using univariate or multivariate models62,63,64,65,66,67,68,69. Franklin et al.70 report an exceptionally high accuracy of 95%, but this needs to be interpreted with caution, as 10 variables were applied on a sample of 40 mandibles, suggesting a danger of overfitting.

We did not detect an age-related shape change, even though the degree of edentulism was related to both age and shape. This is in contrast to some previous reports, e.g., Costa-Mendes et al.53. As noted above, our sample size was relatively small in the higher and lower age bins; however, we had a similar degree of edentulism in both sex groups, whereas this was not recorded in53 and could have biased the results.

The atlas (average form and variability patterns) can be used clinically as a guide for planning surgical mandibular reconstruction. In cases of missing or deformed parts, the intact mandible can be used to fit the model and obtain a plausible anatomical form of the remaining. Virtual reconstruction based on statistical shape models has been demonstrated with good results12,15,16. It is evident that dense landmarking, for detailed representation of the anatomy, and matching of the population from which the model was constructed to the patient characteristics, are essential factors of success.

Limitations

Future research on mandibular shape variability and sex differences could improve on this study in several ways. First, the template could be augmented with more points on the condylar head, so variability in condylar form could be investigated in detail. However, a GM analysis, focused on that region, instead of encompassing the whole mandible, would probably be more appropriate. Such an analysis could be performed on the mental region, as well, as this has also been reported to show sex differences. A notable challenge with regional analyses lies in delineating the region to be studied, particularly when clear morphological boundaries are lacking.

A second limitation was related to sample composition. Ours was of Greek ethnicity, and contained few patients in the tail ends of the age distribution, making it difficult to evaluate age effects. Also, time of tooth loss was not available. The effect of tooth loss on alveolar bone resorption, and subsequent functional issues that depend on prosthetic rehabilitation are expected to affect mandibular shape, but could not be evaluated here. Our analysis showed that tooth loss and sex were the main factors related to mandibular shape; since tooth loss is heavily tied to age, it is not easy to uncover a potential age-related effect of sex. A sample with minimal edentulism over a wide age range would be valuable in this respect.