Ageing and degeneration analysis using ageing-related dynamic attention on lateral cephalometric radiographs

Zhang, Zhiyong; Liu, Ningtao; Guo, Zhang; Jiao, Licheng; Fenster, Aaron; Jin, Wenfan; Zhang, Yuxiang; Chen, Jie; Yan, Chunxia; Gou, Shuiping

doi:10.1038/s41746-022-00681-y

Download PDF

Article
Open access
Published: 27 September 2022

Ageing and degeneration analysis using ageing-related dynamic attention on lateral cephalometric radiographs

Zhiyong Zhang^1,2,3^na1,
Ningtao Liu ORCID: orcid.org/0000-0002-5590-3229^4,5^na1,
Zhang Guo ORCID: orcid.org/0000-0001-7586-4020⁶,
Licheng Jiao⁴,
Aaron Fenster⁵,
Wenfan Jin⁷,
Yuxiang Zhang²,
Jie Chen²,
Chunxia Yan² &
…
Shuiping Gou⁴

npj Digital Medicine volume 5, Article number: 151 (2022) Cite this article

2624 Accesses
2 Altmetric
Metrics details

Subjects

Abstract

With the increase of the ageing in the world’s population, the ageing and degeneration studies of physiological characteristics in human skin, bones, and muscles become important topics. Research on the ageing of bones, especially the skull, are paid much attention in recent years. In this study, a novel deep learning method representing the ageing-related dynamic attention (ARDA) is proposed. The proposed method can quantitatively display the ageing salience of the bones and their change patterns with age on lateral cephalometric radiographs images (LCR) images containing the craniofacial and cervical spine. An age estimation-based deep learning model based on 14142 LCR images from 4 to 40 years old individuals is trained to extract ageing-related features, and based on these features the ageing salience maps are generated by the Grad-CAM method. All ageing salience maps with the same age are merged as an ARDA map corresponding to that age. Ageing salience maps show that ARDA is mainly concentrated in three regions in LCR images: the teeth, craniofacial, and cervical spine regions. Furthermore, the dynamic distribution of ARDA at different ages and instances in LCR images is quantitatively analyzed. The experimental results on 3014 cases show that ARDA can accurately reflect the development and degeneration patterns in LCR images.

Estimating chronological age through learning local and global features of panoramic radiographs in the Korean population

Article Open access 09 December 2023

Automatic detection of mesiodens on panoramic radiographs using artificial intelligence

Article Open access 29 November 2021

Performance evaluation of a deep learning model for automatic detection and localization of idiopathic osteosclerosis on dental panoramic radiographs

Article Open access 23 February 2024

Introduction

In the past few decades, the world’s population has been ageing dramatically as countries have a rising life expectancy resulting from improved healthcare. This trend emerged first in developed countries but is now also common in developing countries. The aged population is currently at its highest level in human history, and according to the United Nations’world population ageing report, the number of people over the age of 60 years in the world will climb to 1.4 billion by 2030.

Population ageing does raise some formidable new challenges in the cost of social security, health care, and the well-being of the elderly¹. The increase in the total number and the elderly population proportion results in the decline in the proportion of the social labor force and rising pension costs². In addition, population ageing is also a signal of the advent of a tidal wave of chronic and non-communicable diseases such as cardiovascular disease, cancer, diabetes, and chronic respiratory diseases, which draws the attention of researchers and the government to understand and cope with the ageing and degeneration of humans from different perspectives.

Many methods have been applied to the research of human ageing, such as genomics, where the expression levels of RNAs were studied as a function of ageing to characterize age-associated changes in skeletal muscle gene expression of healthy individuals³, proteomics, where a proteomic ageing clock comprised of proteins was proposed to predict the human age in the studies of human proteomics ageing⁴, and clinical investigations, which focused on how dietary and pharmacological interventions promote a healthy lifespan by influencing energy intake and circadian rhythms⁵.

Compared with genomics, proteomics, and clinical investigation approaches, radiomics methods have advantages in cost, dataset scale, and convenience. There have been some radiomics studies for human ageing, such as analyzing the correlation between cervical spine alignment and ageing by manually measuring geometric features⁶, and evaluating the 3D mandibular dental changes using the registration of digital models⁷.

The above-mentioned studies have attempted to understand or characterize the ageing of humans from different perspectives or to explore the factors that cause and delay ageing.

Within these studies, it is easy to find that the ageing process of the human body is highly correlated to age changes. At different ages, there are obvious age-related changes in various tissues, while ageing is also a direct manifestation of the increase of age. In most age estimation tasks, the basis of age estimation comes from the characteristics of ageing^8,9,10, which inspires us to extract ageing features by an age estimation task.

With the rapid growth of data scale and computing power, deep learning methods have been widely used in healthcare. Gialluisi et al. used a deep neural network to predict mortality and hospitalization risk with multiple circulating biomarkers¹¹. Lima et al. used deep neural networks with electrocardiograms to predict the age of patients and explored the correlation between the difference between predicted and actual age and death¹². In many medical image analysis tasks, the convolutional neural networks (CNNs) have achieved promising performances in recent years, including classification¹³, detection¹⁴, segmentation^15,16, registration¹⁷, and general feature characterization¹⁸. Deep learning methods are also widely used for age estimation tasks because of the strong representational capabilities and performance far exceeds that of handcrafted feature extraction methods. Deep learning methods have been applied to age estimation in various applications and imaging modalities. For example, specially designed CNNs were used in the task of estimating age with shoeprint images that commonly used in forensics¹⁹ and masked 3D keen MRI images²⁰. X-ray images of hand-wrist bones and teeth were also widely used in age estimation tasks with deep learning models^8,21.

The lateral cephalometric radiograph (LCR) image, which is one of the most commonly used dental X-ray radiographs in dental clinics, are selected as our research objects. The LCR image can provide more clinical clues about ageing features since it contains more regions such as craniofacial bones and the cervical spine than intraoral periapical radiographs (IPR) images and dental panoramic radiographs (DPR) images. As a result, an age estimation deep learning model trained on a large scale LCR image dataset was used as a promising method for characterizing ageing-related features in LCR images.

Although it is still difficult to track the complete development and ageing process of an individual due to the difficulty of sampling and continuous tracking of a single individual, a large number of study samples distributed across different ages can be used to characterize the process of human development and ageing. In addition, features based on a large number of samples are more objective and general than those obtained from a single individual. Therefore, a fully automated radiomics feature learning method on LCR images is proposed to discover the radiomics ageing representation and relationship between development/ageing and age based on a large number of samples distributed across consecutive ages.

In this study, ageing salience, the numerical form of ageing-related attention, is used to characterize the degree of correlation between regions in an image and ageing in an orderly manner, which can be used to intuitively visualize the drastic degree of ageing changing in LCR images. The general ageing attention obtained from the feature extractor is named ageing-related dynamic attention (ARDA), and the value of ARDA is the average ageing salience as it represents developmental and ageing patterns common to a large number of subjects. The proposed method can show the distribution of average ageing salience and ageing region and their dynamic changes in the LCR images.

In this study, an automatic human development and ageing analysis methodology using deep learning on the LCR images is proposed. Based on this methodology, the quantitative dynamic distribution of ARDA on LCR images is demonstrated. Three ARDA concentrated regions including the teeth, craniofacial, and cervical spine regions are found by the distribution of ARDA. The proposed method not only validates the findings of previous studies of human ageing, but also demonstrates the change process of ageing regions that are not discovered by traditional ageing research methods. The performance of age estimation for adult subjects is significantly improved by the proposed ARDA-constrained model, and the ARDA-guided age estimation model provides a new perspective and solution for research of clinical degenerative diseases and forensic practice.

Results

The LCR image

LCR images are the most commonly-used dental X-ray radiographs in dental clinics. All the LCR images in this study were acquired with a Cranex D digital X-ray unit (Soredex, Tuusula, Finland) for diagnosis and therapeutic purposes. The exposure parameters for the LCR images were 73 kV and 7 mA, with an exposure time of 11.7 s. All LCR images are standardized and archived in the Digital Imaging and Communications in Medicine (DICOM) format.

The LCR images contained craniofacial bones, teeth, and C1-C5 of the cervical spine, which can provide more information about the development and ageing than IPR and DPR images. As shown in Fig. 1a, we labeled several instances in LCR images according to the anatomical structure to quantitatively analyze ageing salience.

Ageing representation using ARDA in LCR images

The quantified ARDA of each instance in LCR images is shown in Fig. 1b. Only regions with average ageing salience larger than the threshold were considered to be the ageing-significant regions. Setting the ageing-significant region threshold can filter out the ageing irrelevant regions and avoid the influence of instance size in the quantitative analysis of ARDA. The thresholds of ageing-significant region for each LCR image were set to the median, 75-th and 90-th percentile of its corresponding average ageing salience. For the ARDA at each age, the mean value of average ageing salience in the ageing-significant region contained in each instance was calculated as the quantified ARDA of the instance.

According to the quantified ARDA at the instance scale of ARDA, all the instances in the LCR images are highly correlated to ageing before the age of 9 years. When the threshold is increased, the changing patterns of ARDA in the age dimension remained unchanged, while the sphenoid and temporal bones show more obvious ageing correlations in the instance dimension as the distribution of ARDA is more concentrated in these two instances. From the perspective of the dynamic changes with age, the ARDA is significant but attenuates rapidly during the rapid development period. After that, the average ageing salience increases and tends to change slowly.

As shown in Fig. 1b, c, ARDA is distributed widely in all regions of LCR images as evident during the growth and development period, especially from 4 to 9 years old. After that, although ageing salience and ageing regions grow steadily, they are concentrated in several partial regions in the LCR image. This shows that the development process of the human body is widely reflected in various regions contained in LCR images, while ageing is mainly concentrated in several local regions. In detail, the ageing salience and the ageing region in the ARDA maps before the age of 10 years are greater than that after the age of 10 years, but both decrease until the age of 12 years. The blank regions without any tissue in the LCR images shrink with the development of the tissue, resulting in these regions also showing high ageing salience.

After the age of 12 years, the ARDA concentrates in the zygomatic, maxilla, and mandibular bones steadily. The common feature of these instances is that ARDA is evenly and extensively distributed in them. They still show high ageing salience in the quantitative average salience analysis, even if there is not extremely salience spot in them. After the age of 20 years, the eye socket and sphenoid bone also show ageing salience occasionally. At later ages, both the area and the salience of the ageing-significant region increase slowly.

From an instance point of view, before the age of 9 years, the parietal, frontal, and maxilla bones show a strong ageing correlation, which is consistent with the development stage of the skull and maxilla. At the ages of 10 to 13, the zygomatic, maxilla, and mandibular bones show high ageing salience, suggesting that there are rapid developments and changes in the craniofacial bones. Moreover, the regions with obvious average ageing salience do not appear accidentally, but usually expand or strengthen from an existing one, which suggests that ageing is a slow and progressive process relative to developmental processes, and it extends usually from one tissue to the surrounding region. It is worth noting that there is a particularly conspicuous region above the external auditory canal, where the ARDA is more visible than other regions in the LCR images.

Three ARDA concentrated regions: teeth, craniofacial and cervical spine

As shown in Fig. 1c, the ageing regions in the ARDA map remain consistent after the age of 12 years, which can be roughly divided into three ARDA concentrated regions: the teeth, craniofacial, and cervical spine as demonstrated in Fig. 6a. One or two single regions of the three regions were used to train the baseline network for age estimation. The age estimation performances in Tables 1, 2, and Fig. 2, show that similar performance can be achieved as long as one of the three regions of the teeth, craniofacial, and cervical spine regions are included when LCR images are used for age estimation in an uncontrolled environment. The similar performance can relieve the requirement of age estimation for complete human tissues in LCR images and provide a new perspective and the clinical method, especially in the practice of forensic medicine. This confirms that ARDA can represent well the distribution of real ageing and development information on LCR images and shows that the developmental and ageing information in the LCR images is redundant. See the Supplementary Table 1 for the pearson correlation coefficient and p-value between real ages and the predicted ages.

Table 1 Tabulated measurements results of the mean of absolute error (MAE) ± standard deviation of the absolute error (SD) (years).

Full size table

Table 2 Tabulated measurements results of the cumulative score-5 (CS-5) and mean cumulative score-5 (MCS-5) (%).

Full size table

**Fig. 2: The performance comparison of age estimation in each age group.**

In the comparison of overall age estimation performance, the teeth region contains more development/ageing information than the other two regions, while the craniofacial region contains the least. Specifically, the performance of age estimation using the teeth region is the best in the 4–25 age group, while the cervical spine region performs best for the older subjects in the 26–40 age group. In the 26–40 age group, the accuracy and stability of the age estimation using the cervical spine region suggest the unique importance of this region for ageing research of mature adult subjects.

The dynamic distribution of ARDA in ARDA concentrated regions

More detailed ARDA maps of the three ARDA concentrated regions: the teeth, craniofacial, and cervical spine are shown in Figs. 3, 4, and 5, respectively, and the quantitative analysis at the instance scale of these regions is illustrated in Fig. 6b, c, d, respectively. As demonstrated in both the ARDA maps and quantitative analysis, the common feature of the ARDAs in the three regions is that their distribution range and average ageing salience continued to increase with age from 4 to 40 years, unlike the pattern of its distribution in the LCR images. However, similar to the distribution of ARDA in LCR images, the most obvious ageing regions (i.e., the red or dark red regions in the ARDA map) in these regions change steadily with age.

**Fig. 3: The ARDA map of the teeth region from the ages of 4 to 40 years.**

**Fig. 4: The ARDA map of the craniofacial region from 4 to 40 years old.**

**Fig. 5: The ARDA map of the cervical spine region from the ages of 4 to 40 years old.**

**Fig. 6: Illustration of quantified ARDA in ARDA concentrated regions.**

In the teeth region, as shown in Fig. 6b, the mandibular and maxilla are the main instances showing ageing salience, which can be reflected in the ARDA map in Fig. 3 where the upper teeth and lower teeth are the main regions of the ARDA distributions. In addition, the ARDA in the lower teeth region is much more obvious than the upper teeth at age ranges from 4 to 40 years except for the ages before 10 years and after 37 years.

In the craniofacial region, the difference in the quantified ARDA of each instance is not obvious, while the quantified ARDA of the eye socket and maxilla bone is slightly greater than that of other instances. The ARDA map of the craniofacial region in Fig. 4 shows that the growth of the average ageing salience is flat, and there are two most obvious ageing regions in the temporal bone. In addition, the two regions that always show ARDA are the frontal bone and the eye socket.

In the cervical spine region, as shown in Fig. 6d, the average ageing salience of C2 is obviously higher than that of other instances before the age of 25 years. After that, ARDA starts to be evenly distributed across instances. In the age range of 4 to 40 years, the quantified ARDA of the vertebral bodies is higher than that of the spinous processes, which indicates that the ARDA is widely distributed in the vertebral bodies.

Furthermore, the effects of increasing the threshold of the ageing-significant region on the quantitative ARDA in the teeth region and craniofacial region mainly appear in the sphenoid, temporal, and mandibular bones in the instance dimension, while the effect on the pattern of change in the age dimension is not significant. In the case of the cervical spine region, the pattern of the quantified ARDA changes moderately with age, while the increase in the threshold causes fluctuations.

In summary, the distribution of ARDA is basically the same as the ageing region proposed by existing studies^{22,23,24,25,26}, and there are several regions with obvious average ageing salience are worthy of being verified by subsequent studies such as the upper and lower posterior teeth, maxillary tuberosity in Fig. 3, the frontal bone and eye socket of in Fig. 4, and the spinous processes of C2–C5 in Fig. 5.

Discussion

In this study, we propose a human development and ageing characterizing method using a deep learning model as feature extractor. The proposed method reveals the distribution of ARDA in LCR images and dynamic changes in time (ages from 4 to 40 years old) and space (instances in LCR image). Our results demonstrate that the ARDA changes dynamically with age and show the changing patterns of ARDA. The age estimation experiments show that ARDA and age information are distributed mainly in three independent regions. As well, the performance and stability of age estimation were improved by the proposed ARDA.

The proposed method can accurately describe the ageing salience and dynamics of the human craniofacial and cervical spine with advantages over previous studies in terms of objectivity, dynamics, and efficiency.

The age estimation method with ARDA constraint was designed in this study. The constraint of ARDA can significantly improve the stability of the model for age estimation, which reduces the overall standard deviation (SD) of the mean absolute error (MAE) from 2.24 to 1.54 years (31.25% reduction). The SD of the absolute error in the age estimation for the 26–40 age group decreased from 5.06 to 2.90 years (42.69% reduction), which shows that the proposed ARDA can improve the stability for the older subjects’age estimation. In addition, the age estimation accuracy for subjects of 26–40 years is also improved by ARDA, in which the MAE decreased from 3.61 to 3.08 years (14.68% reduction). The age estimation error of younger subjects increased slightly by the ADRA constraint, compared to the baseline model. Therefore, the proposed retest method combines the advantages of age estimation with ARDA constraint and age estimation without ARDA constraint and obtains the smallest MAE under the premise of improving stability. In practice, the retest method described in Section should be a feasible solution. The performance of age estimation experiments with incomplete images demonstrates the redundancy of age information in the completed LCR images, which can provide new perspectives and workflows for age estimation and forensic practice.

Craniofacial development is a complex and irreversible process, involving age-related changes in bone and soft tissue. The developmental process that can be quantified allows researchers to have a clearer picture of the development of craniofacial bones and teeth. Previous related ageing and developmental studies attempted to characterize ageing changes in tissues from the perspective of geometric measurements, which include measuring changes in head circumference²⁷ and skull volume²⁸, and measuring sutural growth displacement of the maxilla²⁹, etc. The ageing-significant regions revealed by the proposed ARDA map corresponding developmental period are basically consistent with the developing regions of craniofacial and teeth proven in previous related ageing studies.

At the age of 0 to 7 years, the skull is in the most intense period of growth. Generally, the skull volume of a child can exceed 90% of that of an adult by the age of 7 years, while the growth after the age of 10 years is minimal^27,28,30. As shown in Fig. 7a, the cranial region in the ARDA map between the age of 4 to 7 years shows significant ageing salience, which disappears in the ARDA map after the age of 10 years. The change in ageing salience is highly consistent with the findings of the previous studies.

**Fig. 7: Examples of regions in ARDA map showing ageing salience that can be cross validated with existing research.**

The growth of maxillary length is mainly through bone deposition with the palatal suture and the maxillary tuberosity and peaks at the age of 11 years^29,31. The growth center is at the posterior margin of the maxilla³² indicated by the arrow in Fig. 7a, which shows strong salience, while the salience in this region disappears consistently after the age of 11 years.

The temporary teeth dentition fully erupts at the age of 2.5 years, and the permanent tooth germs develop before the age of 6 years, after which, the temporary teeth are replaced during the period from 6 to 14 years of age. Therefore, the ageing salience of the upper and lower teeth region at this period is associated with the eruption and replacement of teeth. The wear of teeth appears after the teeth fully erupt, so the ageing salience in this region after the age of 15 years is related to tooth wear. As shown in Fig. 7b, the ageing salience in the upper and lower dentitions regions gradually increases but is not the most significant after the age of 15 years, while in Fig. 7c the ageing salience is significant. The difference in ageing salience between the same region on complete and incomplete LCR images is caused by that the ARDA represents ageing salience in a relative or ordinal manner. The ageing salience of a given region is not only related to the absolute degree of its ageing change but also depends on its relative rate of change in the image.

In summary, the average ageing salience shown in the ARDA maps and the quantified ARDA can also be well explained by findings from studies related to craniomaxillofacial.

In addition, the ageing-significant regions of the teeth and maxillary sinus ageing shown in the ARDA map during ageing period are also consistent with the findings of related studies on tooth wear^22,23 and maxillary sinus ageing^24,25,26. Researchers have conducted in-depth research on the ageing changes of the eye socket^33,34, in which the eye sockets are proven to increase with age. In the cervical spine region, research shows that structural changes of the cervical spine begin in middle age, but sometimes earlier³⁵. Intervertebral disc degeneration begins at adolescence, and as it progresses, it can also lead to morphological alterations of the vertebral bodies, while the cervical lordosis increased with age³⁶.

Although few related studies that can validate our method in ageing period, our method is widely validated by the existing research during the growth and development period. Therefore, we can reasonably speculate that our proposed ARDA is still reliable during ageing and that the unidentified significant regions of ageing/development revealed by our method are also reasonably valuable for follow-up studies.

Current research tends to estimate the age based on ageing changes in the bones. But there are few studies on ageing itself and its salience as well as regional dynamic changes with age. The feasibility of our method in representing the development and ageing of the craniofacial and cervical spine provides a new perspective and ageing metrics for ageing research.

Our method can capture the distribution of developmental/ageing quantitative salience on LCR images of healthy individuals aged 4 to 40 years. The analysis in time (age) and space (instance) dimensions shows that the ageing salience exhibits regularity, which is general, objective, and credible based on a large-scale subject set. Therefore, the regularity of ARDA distribution in different ages and different instances proposed in this study can be initially used as a clinical quantitative auxiliary index to detect whether there is abnormal ageing/development in a subject.

As shown in Fig. 8, which demonstrates the distribution of our proposed ARDA across instances and ages for all 3014 subjects in the testing set. The distribution of quantified ARDA is narrower for each instance, especially when the age is greater than 10 years, which suggests that the ARDA has the potential as a standard indication of ageing and development. Assuming that the distribution of the ARDA follows a normal distribution, the highlighted point in Fig. 8 indicates that for a healthy subject of 20 years old, the probability that the quantified ARDA of mandibular bone generated by our method is in the range of 0.059 to 0.119 is 95.45%. The distribution range of ARDA width decreases with the increase of the threshold of the ageing-significant region. In this sense, we suggest that the ARDA has potential as a standard indication of ageing and development.

**Fig. 8: The distribution of ARDA across instances and ages.**

It should be noted that the feasibility of ARDA as a quantitative ageing/developmental indicator still needs further validation, such as the inclusion of gender factors, a wider range of ages, and a larger data dataset, which will be the focus of our future studies. The index can only be used as a computer-aided diagnosis but is not a viable substitute for fully automated diagnosis by physicians due to the size and age range limitations of the current dataset and the lack of cross-validation in subsequent ageing and developmental studies.

In addition to validating the findings of existing studies, our model also finds a few ageing-significant regions that have not been validated by previous studies. For example, the circumferential (before the age of 10 years) and punctate (after the age of 10 years) regions above the external auditory canal, are also shown correspondingly in the ARDA map of the craniofacial region. However, tissues in LCR images are overlapping, and ARDA on 2D scale cannot accurately represent the ageing salience of tissues in 3D space. Feature work on age-related changes in this area will be carried out on 3D images to eliminate the limitation of tissue overlap on 2D X-ray images.

Physiological changes in the human can be roughly divided into two stages, development and ageing. The developmental and ageing forms of face can reflect the differences between these two stages³⁷. The age range from 4 to 40 years chosen in this study could basically cover the development and ageing periods. Children under the age of 4 years are rarely examined with lateral radiographs, and X-ray examination cannot be performed on research subjects for research purposes due to medical ethical reasons. Thus, we chose the age of 4 years as the lower limit of our study age range. In the related research on age estimation using traditional manual feature extraction methods and deep learning^8,38, the performance of age estimation decreases with the age of the subjects during the ageing period, which is also a challenge for related studies³⁹. In the study of age estimation using orthopantomogram images and 3D facial images related to LCR images, the estimated age performance is significantly worse after the age of 40 years^8,40. In addition, the number of orthodontic patients in northwestern China decreased rapidly from the age of about 15 years as shown in Table 3, which also results in poor performance of age estimation and theinability to obtain accurate and generalized ageing characteristics. Therefore, the age of 40 years was set as the upper age limit for this study.

Table 3 The age distribution of the LCR images.

Full size table

Limited by the data we collected, our study covers a partial ageing period of people aged from 4 to 40 years. However, based on the verifiability of the ageing and its changing patterns in the LCR images represented by the proposed ARDA, especially the consistency with the findings of existing studies during the development period, future work will extend the ARDA to the whole life span of humans and more anatomical features in application scenarios where the scale of the training data allows.

Another limitation of our proposed ARDA is that it characterizes ageing salience in a relative manner, i.e., highlighting the most ageing-significant regions in LCR images and cannot be used to establish an accurate mapping relationship with physical quantities such as area, rate of tissue change, etc. Correlating ARDA with physical quantities of ageing changes and extending it as an index for aided diagnosis of ageing abnormalities is also a feasible direction for future work.

Methods

In this study, an ageing and degeneration analysis method using ARDA on LCR images is proposed. The overview of the proposed method is shown in Fig. 10. The method mainly consists of four modules: (1) a pre-trained deep learning model for ageing feature extraction, (2) ARDA generation, (3) acquisition of three ARDA concentrated regions, and (4) quantitative analysis of ageing and degeneration distribution. Among them, the ageing features extracted by the pre-trained age estimation model are used to generate ageing salience maps and obtain the ARDA map. Three ARDA concentrated regions of the LCR image are divided based on the ARDA distributions. Finally, the ARDA of both the LRC image and the ARDA concentrated regions are quantitatively analyzed at the pixel and the instance scales.

Ethics statement

This study is approved by the Affiliated Stomatological Hospital of Xi’an Jiaotong University Health Science Center (Approval number: xjkqll[2022]NO.30). The study was non-interventional and retrospective, all participants in the study signed the written informed consent, and the LCR images used in this data were anonymized. A sampled and desensitized example dataset was shared in source code repository.

Study population

We obtained LCR images from the Stomatological Hospital of Xi’an Jiaotong University Health Science Center, China. The bit depth of the images in the dataset is 16 bpp, and the size of most images is 2144 × 2304.

The age of the subjects was calculated by subtracting the imaging date from the date of birth and dividing by 365.25 (due to leap years) and rounding to the nearest hundredth. 20174 LCR images were divided into 7 age groups according to age after screening the unqualified images shown in Fig. 9. The age and gender distribution of the subjects are shown in Table 3. The size distribution of the LCR images is shown in Table 4.

**Fig. 9: Examples of some typical unqualified images.**

Table 4 The size distribution of LCR images.

Full size table

The proposed ARDA is robust to low quality images since it is based on the block-scale, high-dimensional, and abstract features of the images, and it represents the ageing regions without emphasizing pixel-scale precision. The manifestation of ARDA is the representation of the difference between the center of the ageing region from the surrounding region. Therefore, the impact on the accuracy of ARDA of the possible low quality image with blurring, low resolution, and noise is limited.

Ageing feature extraction

In this study, the ageing features of LCR images were extracted by a pre-trained CNN for age estimation. When the CNN was trained to estimate age, the smaller the error, the more accurately the ageing characteristics in the LCR image can be extracted. For the age estimation tasks using large LCR images (larger than 1500 × 2000), the balance of performance and computing resource requirements needs to be considered.

EfficientNet-B0 can achieve the best result with the fewest parameters compared with other CNNs, so it was used as the baseline age estimation model and ageing feature extractor. The performance comparison for age estimation and the number of parameters from the CNN models that perform well on natural images is shown in Table 5. These models were trained using a tiny version of the training set for quick comparison.

Table 5 Age estimation performance comparison of ageing feature extractors.

Full size table

Efficient-B0 was trained for age estimation on the training set, in which the pre-processing process of the LCR images included contrast enhancement, shape fixing, and resizing. In addition, random affine transformation and random horizontal flip were also used for data augmentation.

The main block of Efficient-B0 is a mobile inverted bottleneck (MBConv)^41,42, to which the squeeze-and-excitation optimization⁴³ was also added. The structure of Efficient-B0 is shown in Table 6. See the Supplementary Note 1 for a detailed description of the EfficientNet model structure.

Table 6 The structure of Efficient-B0.

Full size table

The loss function for ageing feature extractor was set as L1 loss:

$$L=\frac{1}{N}\mathop{\sum }\limits_{n=1}^{N}| {y}_{n}^{\prime}-{y}_{n}|$$

(1)

where N is the number of samples, y_n and ${y}_{n}^{\prime}$ are the age and predicted age of n-th LCR image.

The training strategy, image augmentation and preprocessing for the training process are provided in the Supplementary Note 2 and Supplementary Note 3, respectively. The fitted regression models on the true age and the age predicted by different age estimation methods are show in Supplementary Figure 1. The p-values of f-test and t-test between different age estimation methods are shown in Supplementary Table 2 and Supplementary Table 3, respectively.

ARDA generation

With the pursuit of the interpretability of deep learning methods, some methods for gradient visualization of deep CNNs have been proposed, including CAM⁴⁴, Grad-CAM⁴⁵, etc. The gradient visualization method can be used not only to visualize the salience region of the model, but also to represent the ageing-related attention by the ageing feature extractor. The details of the ARDA generation module are shown in Fig. 10. The global average of the gradient of the feature map in the pre-trained model is defined as its weight, and the weight of the k-th feature map inputted into the fully connected layer is calculated by

$${w}_{k}=\frac{1}{Z}\mathop{\sum}\limits_{i}\mathop{\sum}\limits_{j}\frac{\partial \hat{y}}{\partial {F}_{ij}^{k}}$$

(2)

where, Z is the number of pixels in feature map F, $\hat{y}$ is the output of network, and ${F}_{ij}^{k}$ is the value of pixel (i, j) in the k-th feature map. The ageing salience map M is obtained by calculating the weighted sum of all the feature maps:

$$M=ReLU\left(\mathop{\sum}\limits_{k}{w}_{k}{F}^{k}\right)$$

(3)

The ARDA map of age a is generated by:

$${A}_{a}=\frac{1}{{N}_{a}}\mathop{\sum }\limits_{n=1}^{{N}_{a}}{M}_{a,n}$$

(4)

where a ∈ {4, 5, …, 39, 40}, N_a is the number of samples with age a in the training set, M_a,n is the ageing salience map corresponding to n-th LCR image of age a, and ∑ is element-wise summation.

**Fig. 10: Overview of the method used in this study.**

Here, the ageing salience maps are generated by the pre-trained age estimation network which can extract generic ageing features from LCR images.

The ARDA is obtained by merging all ageing salience maps of the same age. The general distribution of the ageing salience on the LCR images can be displayed, while avoiding the effect of abnormal samples. The ARDA generated for each age can show the dynamic changes of ageing salience with age in LCR images.

Age-related changes are not significant in adults. Therefore, it is difficult to estimate accurately adult age using only LCR images. Here, we propose a method to apply salience maps as the attention constraint of the neural network, which can make use of the knowledge learned by the baseline model on all training data to all input LCR images and enable the baseline model to focus more on ageing-relevant elements.

In the training phase of the ARDA-constrained age estimation, the LCR images older than the selected KA and their corresponding salience maps are used as the input of the baseline model. Otherwise, only the LCR images are used as the input. This distinction is acceptable because age labels are available during the training phase. However, in the testing phase, as the age labels are not available, we applied a different strategy, i.e., if the predicted age from the age estimation model is greater than KA, we concatenate the LCR images and their corresponding ageing salience map as the input of the network and estimate the age again. The latter estimation result is regarded as the final estimated age. Otherwise, the result of the first test is directly used as the final estimated age. We call this method retest.

Acquisition of three ARDA concentrated regions

From forensic practice, we know that a complete LCR image is sometimes difficult to obtain, and the ageing salience of the local region in the LCR images can be achieved by analyzing the distribution of ARDA on the LCR images. The distribution of ageing salience on the LCR images represented by the ARDA is shown in Fig. 1c. These ARDA concentrated regions are the teeth, craniofacial and cervical spine regions. Under the guidance of ARDA, each LCR image in our dataset is divided into three overlapping parts with the same strategy, as the position of each part in the LCR image is fixed. We mapped the ARDA to color, ranging from blue to red, to visualize the distribution of ageing salience and ageing regions, i.e., the closer the color of the pixel is to red, the larger the ARDA of the pixel. An LCR image of an individual of the age of 28 years and its corresponding ageing salience map are shown overlapping to show the relative position relationship between the ARDA map and the LCR image in Figs. 3, 4, and 5.

Figure 6 a shows a specific LCR image with a width ${{{\mathcal{W}}}}$ and height ${{{\mathcal{H}}}}$, in which the upper left corner was set as the origin of the coordinate system, the positive x-axis is in the down direction, and the positive y-axis is in the right direction. Table 7 summarizes the coordinate of the upper left corner and the lower right corner of the three ARDA concentrated regions.

Table 7 The position of the three ARDA concentrated regions. Set the upper left corner of the LCR image as the origin of the coordinate system.

Full size table

Each ARDA concentrated region of the LCR image is selected or discarded as the input of the model for comparing the performance of the age estimation and providing guidance for forensic practice, The ARDA maps of each region are also generated as the same way as we obtain the ARDA maps of the complete LCR images.

Quantitative analysis of ageing and degeneration

In this study, the quantified analysis of ARDA is performed in the time and space dimensions. In the time dimension, the patterns of dynamic salience and region of ageing with age in LCR images are analyzed. In the spatial dimension, the quantitative ageing salience of LCR images is represented on the pixel scale and the instance scale.

As shown in Fig. 1a, the size of the instances in the LCR images varies greatly. To eliminate the effect of size differences on the quantified ARDA analysis, we only calculate the mean ARDA of the ageing-significant region within each instance. The quantified ARDA generation of the instances is detailed in Supplementary Note 4.

In this study, the ageing-significant region threshold is set to the median value, the 75-th quantile and the 90-th quantile of the average ageing salience corresponding to a specific age. In Fig. 6b, c, d, the three surface plots from the bottom to top are the cases in which the ageing-significant region threshold was set to the median, 75-th and 90-th percentile of the average ageing salience, respectively.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The data used in this study is not open access due to privacy and security concerns. After obtaining the sharing agreement, it can be shared with third parties for reasonable use, relevant requests should be addressed to C.Y. (yanchunxia@mail.xjtu.edu.cn) or Z.Z. (zzy20011126@mail.xjtu.edu.cn). To enable a complete run of the code shared in this study, a minimum amount of desensitized sample data is shared with the code. The source data underlying Figs. 1c, 2, 3, 4, 5, and 6b–d are provided as a Source Data file.

Code availability

Source code of this study is provided at https://github.com/LiuNingtao/ARDA.

References

Börsch-Supan, A. The impact of global ageing on labour, product and capital markets. Information and communication technologies for active ageing 7–34 (2009).
Bloom, D. E., Canning, D. & Fink, G. Implications of population ageing for economic growth. Oxford Rev. Econ. Policy 26, 583–612 (2010).
Article Google Scholar
Tumasian, R. A. et al. Skeletal muscle transcriptome in healthy aging. Nat. Commun. 12, 1–16 (2021).
Article Google Scholar
Johnson, A. A., Shokhirev, M. N., Wyss-Coray, T. & Lehallier, B. Systematic review and analysis of human proteomics aging studies unveils a novel proteomic aging clock and identifies key processes that change with age. Ageing Res. Rev. 60, 101070 (2020).
Article CAS PubMed Google Scholar
Acosta-Rodríguez, V. A., Rijo-Ferreira, F., Green, C. B. & Takahashi, J. S. Importance of circadian timing for aging and longevity. Nat. Commun. 12, 1–12 (2021).
Article Google Scholar
Chen, Y. et al. The change of cervical spine alignment along with aging in asymptomatic population: a preliminary analysis. Eur. Spine J. 26, 2363–2371 (2017).
Article PubMed Google Scholar
Garib, D. et al. Three-dimensional mandibular dental changes with aging. Am. J. Orthod. Dentofac. Orthop. 159, 184–192 (2021).
Article Google Scholar
Vila-Blanco, N., Carreira, M. J., Varas-Quintana, P., Balsa-Castro, C. & Tomas, I. Deep neural networks for chronological age estimation from opg images. IEEE Trans. Medical Imaging 39, 2374–2384 (2020).
Article PubMed Google Scholar
Jónsson, B. A. et al. Brain age prediction using deep learning uncovers associated sequence variants. Nat. Commun. 10, 1–10 (2019).
Article Google Scholar
Lee, H. et al. Fully automated deep learning system for bone age assessment. J. Digit. Imaging 30, 427–441 (2017).
Article PubMed PubMed Central Google Scholar
Gialluisi, A. et al. Exploring domains, clinical implications and environmental associations of a deep learning marker of biological ageing. Eur. J. Epidemiol. 37, 35–48 (2022).
Article PubMed Google Scholar
Lima, E. M. et al. Deep neural network-estimated electrocardiographic age as a mortality predictor. Nat. Commun. 12, 1–10 (2021).
Article Google Scholar
Esteva, A. et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature 542, 115–118 (2017).
Article CAS PubMed PubMed Central Google Scholar
Chiang, T.-C., Huang, Y.-S., Chen, R.-T., Huang, C.-S. & Chang, R.-F. Tumor detection in automated breast ultrasound using 3-d cnn and prioritized candidate aggregation. IEEE Trans. Med. Imaging 38, 240–249 (2018).
Article PubMed Google Scholar
Anthimopoulos, M. et al. Semantic segmentation of pathological lung tissue with dilated fully convolutional networks. IEEE J Biomed. Health Inform. 23, 714–722 (2018).
Article PubMed Google Scholar
Zhou, Z., Siddiquee, M. M. R., Tajbakhsh, N. & Liang, J. Unet++: a nested u-net architecture for medical image segmentation. In Deep learning in medical image analysis and multimodal learning for clinical decision support, 3–11 (Springer, 2018).
Ferrante, E., Dokania, P. K., Silva, R. M. & Paragios, N. Weakly supervised learning of metric aggregations for deformable image registration. IEEE J Biomed. Health Inform. 23, 1374–1384 (2018).
Article PubMed Google Scholar
Cearns, M., Hahn, T. & Baune, B. T. Recommendations and future directions for supervised machine learning in psychiatry. Trans. Psych. 9, 1–12 (2019).
Google Scholar
Hassan, M. et al. Deep learning analysis and age prediction from shoeprints. Forensic Sci. Int. 327, 110987 (2021).
Article PubMed Google Scholar
Mauer, M. A. D. et al. Automated age estimation of young individuals based on 3d knee mri using deep learning. Int. J Legal Med. 135, 649–663 (2021).
Article PubMed Google Scholar
Li, S. et al. A deep learning-based computer-aided diagnosis method of x-ray images for bone age assessment. Complex & Intelligent Systems 1–11 (2021).
Jeon, C.-L., Pak, S. & Woo, E. J. The correlation between the tooth wear of the first molar and the estimated age from the auricular surfaces in a joseon dynasty population, south korea. Int. J. Osteoarchaeology 30, 759–768 (2020).
Article Google Scholar
Faillace, K. E., Bethard, J. D. & Marks, M. K. The applicability of dental wear in age estimation for a modern american population. Am. J Phys. Anthropol. 164, 776–787 (2017).
Article PubMed Google Scholar
Solheim, T. Recession of periodontal ligament as an indicator of age. J. Forensic Odonto-stomatology 10, 32 (1992).
CAS Google Scholar
Mendelson, B. & Wong, C.-H. Changes in the facial skeleton with aging: implications and clinical applications in facial rejuvenation. Aesthetic Plastic Surgery 44, 1151–1158 (2020).
Article PubMed Google Scholar
Jeon, A. et al. Anatomical changes in the east asian midface skeleton with aging. Folia Morphologica 76, 730–735 (2017).
Article CAS PubMed Google Scholar
Dokladal, M. Growth of the main head dimensions from birth up to twenty years of age in czechs. Hum. Biol. 31, 90–109 (1959).
CAS PubMed Google Scholar
Epstein, H. T. Phrenoblysis: Special brain and mind growth periods. i. human brain and skull development. Develop. Psychobiol.: J. Int. Soc. Develop. Psychobiol. 7, 207–216 (1974).
Article CAS Google Scholar
Björk, A. & Skieller, V. Growth of the maxilla in three dimensions as revealed radiographically by the implant method. Br. J. Orthodontics 4, 53–64 (1977).
Article Google Scholar
Bastir, M., Rosas, A. & O’Higgins, P. Craniofacial levels and the morphological maturation of the human skull. J. Anatomy 209, 637–654 (2006).
Article Google Scholar
Iseri, H. & Solow, B. Growth displacement of the maxilla in girls studied by the implant method. Eur. J. Orthodontics 12, 389–398 (1990).
Article CAS Google Scholar
Enlow, D. H. & Bang, S. Growth and remodeling of the human maxilla (1965).
Ching, J. A., Ford, J. M. & Decker, S. J. Aging of the adult bony orbit. J. Craniofacial Surgery 31, 1082–1085 (2020).
Article Google Scholar
Chon, B., Zhang, K. R., Hwang, C. J. & Perry, J. D. Longitudinal changes in adult bony orbital volume. Ophthal. Plastic Reconst. Surgery 36, 243–246 (2020).
Article Google Scholar
Hirsch, C., Schajowicz, F. & Galante, J. Structural changes in the cervical spine: a study on autopsy specimens in different age groups. Acta Orthopaedica Scandinavica 38, 1–77 (1967).
Article Google Scholar
Yukawa, Y., Kato, F., Suda, K., Yamagata, M. & Ueta, T. Age-related changes in osseous anatomy, alignment, and range of motion of the cervical spine. part i: Radiographic data from over 1,200 asymptomatic subjects. Eur. Spine J. 21, 1492–1498 (2012).
Article PubMed PubMed Central Google Scholar
Fu, Y., Guo, G. & Huang, T. S. Age synthesis and estimation via faces: a survey. IEEE Trans. Pattern Anal. Mach. Intelligence 32, 1955–1976 (2010).
Article Google Scholar
Ge, Z.-p, Ma, R.-h, Li, G., Zhang, J.-z & Ma, X.-c Age estimation based on pulp chamber volume of first molars from cone-beam computed tomography images. Forensic Sci. Int. 253, 133–e1 (2015).
Article PubMed Google Scholar
Milošević, D., Vodanović, M., Galić, I. & Subašić, M. Automated estimation of chronological age from panoramic dental x-ray images using deep learning. Expert Syst. Appl. 189, 116038 (2022).
Article Google Scholar
De Angelis, D. et al. Age estimation from canine volumes. La Radiologia Medica 120, 731–736 (2015).
Article PubMed Google Scholar
Sandler, M., Howard, A., Zhu, M., Zhmoginov, A. & Chen, L.-C. Mobilenetv2: Inverted residuals and linear bottlenecks. In Proceedings of the IEEE conference on computer vision and pattern recognition, 4510–4520 (2018).
Tan, M. et al. Mnasnet: Platform-aware neural architecture search for mobile. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2820–2828 (2019).
Hu, J., Shen, L. & Sun, G. Squeeze-and-excitation networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, 7132–7141 (2018).
Zhou, B., Khosla, A., Lapedriza, A., Oliva, A. & Torralba, A. Learning deep features for discriminative localization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016).
Selvaraju, R. R. et al. Grad-cam: Visual explanations from deep networks via gradient-based localization. In Proceedings of the IEEE international conference on computer vision, 618–626 (2017).

Download references

Acknowledgements

This study was supported by the National Natural Science Foundation of China (No. 62102296), the Natural Science Foundation of Shaanxi Province (No. 2019ZDLGY03-02-02, No. 2020JM-413, No. 2022JQ-661 and No. 2021SF-189), the Fundamental Research Funds for the Central Universities (No. XJS222215 and No. XJS222221), the Research Industrialization Plan of Xi’an (No. XA2020-RGZNTJ-0075), the Opening Project of Key Laboratory of Shaanxi Province for Craniofacial Precision Medicine Research, College of Stomatology, Xi’an Jiaotong University (No. 2017LHM-KFKT004), and the Science and Technology of Xi’an city (No. 20YXYJ001(2)).

Author information

These authors contributed equally: Zhiyong Zhang, Ningtao Liu.

Authors and Affiliations

Key Laboratory of Shaanxi Province for Craniofacial Precision Medicine Research, College of Stomatology, Xi’an Jiaotong University, Xi’an, 710004, Shaanxi, China
Zhiyong Zhang
College of Forensic Medicine, Xi’an Jiaotong University Health Science Center, Xi’an, 710061, Shaanxi, China
Zhiyong Zhang, Yuxiang Zhang, Jie Chen & Chunxia Yan
Department of Orthodontics, the Affiliated Stomatological Hospital of Xi’an Jiaotong University, Xi’an, 710004, Shaanxi, China
Zhiyong Zhang
Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education, School of Artificial Intelligence, Xidian University, Xi’an, 710071, Shaanxi, China
Ningtao Liu, Licheng Jiao & Shuiping Gou
Robarts Research Institute, Western University, London, N6A 3K7, ON, Canada
Ningtao Liu & Aaron Fenster
Academy of Advanced Interdisciplinary Research, Xidian University, Xi’an, 710071, Shaanxi, China
Zhang Guo
Department of Radiology, the Affiliated Stomatological Hospital of Xi’an Jiaotong University, Xi’an, 710004, Shaanxi, China
Wenfan Jin

Authors

Zhiyong Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Ningtao Liu
View author publications
You can also search for this author in PubMed Google Scholar
Zhang Guo
View author publications
You can also search for this author in PubMed Google Scholar
Licheng Jiao
View author publications
You can also search for this author in PubMed Google Scholar
Aaron Fenster
View author publications
You can also search for this author in PubMed Google Scholar
Wenfan Jin
View author publications
You can also search for this author in PubMed Google Scholar
Yuxiang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jie Chen
View author publications
You can also search for this author in PubMed Google Scholar
Chunxia Yan
View author publications
You can also search for this author in PubMed Google Scholar
Shuiping Gou
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

N.L., Z.Z., S.G., L.J. and C.Y. participated in the designing of the study, topic definition, and review of relevant studies. S.G. and N.L. designed the overall research methodology with the support of Z.Z. and C.Y. Deep learning models and statistical analysis were designed and implemented by N.L. Statistical data analysis, figures, and tables were done by N.L. and Z.Z. with the support of A.F. N.L. and Z.Z. wrote the first draft. S.G., A.F. and Z.G. contributed greatly to subsequent versions of the manuscript. J.C., Y.Z. and W.J. participated in the collection and desensitization of the data required for this study. All authors critically reviewed the paper, all authors have a clear understanding of the content, results, and conclusions of the study and agree to submit this manuscript for publication. The corresponding author (S.G. and C.Y.) declare that all authors listed meet the authorship criteria and that no other authors involved in this study are omitted. S.G. and C.Y. are ultimately responsible for this article.

Corresponding authors

Correspondence to Chunxia Yan or Shuiping Gou.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Material

Reporting Summary

Source data

Source Data File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhang, Z., Liu, N., Guo, Z. et al. Ageing and degeneration analysis using ageing-related dynamic attention on lateral cephalometric radiographs. npj Digit. Med. 5, 151 (2022). https://doi.org/10.1038/s41746-022-00681-y

Download citation

Received: 10 February 2022
Accepted: 22 August 2022
Published: 27 September 2022
DOI: https://doi.org/10.1038/s41746-022-00681-y

Subjects

Abstract

Similar content being viewed by others

Estimating chronological age through learning local and global features of panoramic radiographs in the Korean population

Automatic detection of mesiodens on panoramic radiographs using artificial intelligence

Performance evaluation of a deep learning model for automatic detection and localization of idiopathic osteosclerosis on dental panoramic radiographs

Introduction

Results

The LCR image

Ageing representation using ARDA in LCR images

Three ARDA concentrated regions: teeth, craniofacial and cervical spine

The dynamic distribution of ARDA in ARDA concentrated regions

Discussion

Methods

Ethics statement

Study population

Ageing feature extraction

ARDA generation

Acquisition of three ARDA concentrated regions

Quantitative analysis of ageing and degeneration

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Material

Reporting Summary

Source data

Source Data File

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links