5. Discussion
In the field of forensic science, the determination of an individual's age is of paramount importance. Consequently, the methodology employed must be both reliable and accurate. This indicates that the degree of accuracy should be high and the mean discrepancy between dental age and chronological age should be as minimal as feasible [
66].
The age estimation methods that have been documented in the existing literature are based on the evaluation of specific indicators that assess the stage of dental development that the individual has reached. The earliest known studies on this subject are dated to the 19th century [
17]. One of the earliest methods based on dental indicators is the Schour and Massler method, which was developed in 1941. This method involves the morphological evaluation of tooth development through the use of diagrams that illustrate the expected developmental stages of deciduous and permanent teeth [
62]. Subsequently, Nolla et al. [
63] developed a ten-stage chart for age estimation, which was considered an important milestone in the field of age estimation, prompting the rapid development of new methods [
11,
17,
63,
64,
65]. Currently in literature, the most commonly used methods for age estimation in children are reported to be the Willems and Demirjian methods [
66].
Demirjian et al. [
64] developed a method that can be used as a universal tool for assessing dental maturity and estimating dental age in children. Demirjian's method is the first in the literature to provide visualization of the stages of tooth development, descriptive criteria, radiographic examples of each stage, and selection rules for decision-making at borderline stages. In the Willems method, the stages of tooth development described in the Demirjian method are used to estimate tooth age by means of maturity tables that provide the age in years directly [
11,
64,
65].
The classical methods that have been introduced in the literature have been developed through the analysis of large datasets comprising a large number of participants. However, these methods have several shortcomings, including the fact that results are often population-specific and rely on time-consuming manual procedures that are susceptible to observer subjectivity [
16,
67].
A further limitation of classical manual radiologic tooth age estimation techniques is that they lack the requisite number of stages to enable the closest possible monitoring of the growth process. Another disadvantage is the difficulty in selecting a method that allows researchers to distinguish teeth that are not sufficiently differentiated in terms of developmental stage [
16].
The objective of the research was to estimate the age of patients from a real-time dataset comprising patient records and OPG images using AI-based methodologies. This study employed deep CNN with 1D and 2D architectures for feature extraction and a MG-RF for estimating age. The proposed system was evaluated by examining the discrepancies between the actual and predicted values. The results demonstrated a strong correlation between the two variables. Furthermore, the performance of this system was evaluated in comparison to that of the conventional system in terms of MAE, RMSE, MSE, SD, and R2 score. The analytical outcomes revealed that the proposed system outperformed the conventional system, exhibiting a SD rate of 0.0004, an MAE rate of 0.0079, an MSE rate of 0.00027, an RMSE value of 0.0888, and an R2 value of 0.9999.
The performance of the reference methods and machine learning algorithms utilized in the study conducted by Galibourg et al.[
66] was evaluated with analogous metrics to those employed in our study, namely R², MAE, RMSE, and SD. The researchers reported that age estimation with machine learning methods demonstrated superior performance compared to manual methods based on radiographic tooth staging from childhood to early adulthood. The results of the study are presented in
Table 3, along with a comparison to the findings of our study.
The study of Galibourg et al. was carried out with a methodology based entirely on machine learning. Demirjian and Willems methods were used as the explanatory system in the training of the machine learning system. These methods focus on the left seven mandibular permanent teeth for age estimation.
Our study was carried out with the aim of performing age estimation on OPGs sections containing the left seven mandibular permanent teeth in a fully automatic manner without any explanatory system. From the radiographs and patient records that constitute the dataset of our study, feature extraction was performed using 1D-DCNN and 2D-DCNN architecture from deep learning methods. In the regression step, RF and GA methods were modified and combined and age estimation was performed.
When our study is compared with the study of Galibourg et al. in terms of performance, it is observed that our study has a superior performance than the manual predictions and all machine learning approaches evaluated in the related study. In addition, it was found that the MAE value decreased significantly with the system used in our study. It is thought that the reason for the decrease in the MAE value may be due to the use of deep learning algorithms in the feature extraction step of our study, due to the perception of various age-related indicators that cannot be detected by the human eye by deep learning algorithms and thus more information is transferred to the MG-RF algorithm. In the MG-RF step, the RF algorithm reduces the similarity between individual trees as a methodology. For this reason, it is thought to increase the robustness of the final model by selecting the point of departure from a random subset of the input features at each step in the tree building process. In addition, in our study, it is thought that the integration of four different approaches into the system together with the use of machine learning techniques following deep learning processes results in a significant decrease in the error rate and a significant increase in the performance of the system.
In our study, age estimation is performed completely automatically without using any explanatory reference system. The fact that our study eliminates the disadvantages of exposure to human interpretation and the subjectivity of human observers by automating the age estimation task can be argued as a distinct advantage of our study. In addition, the fact that no explanatory system is used and that it is an automatic method makes our study easy to use, fast and reproducible.
Tao et al. [
69] proposed a machine learning-based approach to improve the accuracy of tooth age estimation in 2020 using a dataset of 1636 OPGs of 787 male and 849 female individuals aged 11 to 19 years. In the study, tooth age estimation is considered as a regression problem.
In the methodology of the study, manual measurements were first performed using the Demirjian method and the Willems method. The attributes were determined by entering the real ages of the patients into the system. Then, the MLP algorithm, which is a feed-forward artificial neural network from machine learning approaches, was trained with these features and experiments were conducted. The performance of the proposed system was evaluated using MAE, MSE, RMSE metrics. It is reported that the proposed system outperforms the reference manual methods in terms of all performance metrics [
69]. The findings of the study and the findings of our study are presented in
Table 4.
The findings of our study, as indicated by the RMSE, MSE, and MAE metrics, demonstrate significantly enhanced performance in comparison to the results obtained for both female and male groups in the study conducted by Tao et al. This discrepancy may be attributed to the fact that the group under examination in our study comprises younger individuals (6-15 years old), and the variations in the age distribution of these individuals. Given that age indicators of growth and development decline with age, it is a well-established fact that studies in the field of age estimation obtain more accurate results with younger study populations [
17].
In our study, it is thought that the combination of deep learning and machine learning methods significantly enhances the performance. The use of deep learning techniques allows for the establishment of connections that are not discernible to the human eye, thereby enabling the inclusion of age indicators that cannot be calculated manually in the system under study. It can be proposed that this may be the source of the observed improvement in the system's performance. Furthermore, our study employs an automated system that is not contingent on any explanatory framework. Therefore, our study has the advantages of being fast, repeatable, and less susceptible to human interpretation.
In their 2021 study, Shen et al. [
79] employed a series of machine learning systems to estimate age. The dataset utilized in the study comprises 748 OPGs of 356 female and 392 male individuals between the ages of 5 and 13. The study employed a methodology based on random forest (RF), support vector machines (SVM), and linear logistic regression (LR). The machine learning models were trained with the manually realized Cameriere method as an explanatory system and gender information. The target value is set to the subject's chronological age. The accuracy of the proposed systems for estimating age is evaluated based on the following metrics: R², ME, RMSE, MSE, and MAE. The results are then compared with those obtained using the European and Chinese formulas of the Cameriere method, which were employed in the training of the system. The findings of the study and the findings of our study are shown in
Table 5. [
79].
The performance of the systems proposed in the study is comparable to that observed in our own study in terms of the R² value. It is, however, noteworthy that the error rates of the systems proposed by Shen et al. are significantly higher than those observed in our study with respect to other performance metrics evaluated. It is thought that this discrepancy may be attributed to the utilization of deep convolutional neural networks in the feature extraction phase of the present study, the incorporation of certain age indicators that cannot be discerned by the human eye, and the combination of numerous techniques.
In 2017, Čular et al. [
71] conducted a study utilizing OPGs of 203 individuals between the ages of 10 and 25. In this study, the researchers proposed a semi-automated system based on deep learning techniques to estimate tooth age by examining the mandibular right third molar on OPGs. The researchers employed two statistical computer vision models, namely the Active Shape Model (ASM) and the Active Appearance Model (AAM), which have been extensively utilized in face recognition, gender estimation, and medical image interpretation and feature extraction in previous studies, to extract the features describing the right mandibular wisdom teeth selected for this investigation. In the training set, the images were manually segmented. In the study, the extracted features were presented as input to an artificial neural network, specifically the Radial Basis Network, and age estimation was performed as the output. The findings of the age estimation performance of the study were evaluated using the mean absolute error (MAE) in years. The findings of that study indicated that the system demonstrated superior performance when AAM feature extraction was employed. Although AGM was reported to perform better in this study, it was reported that the age estimation performance of the system was adversely affected when AAM and ASM were applied together [
71].
The researchers noted that the MAE value of less than 3 years represents a promising preliminary result. Should the prediction error be reduced in future studies, the system proposed in their study may prove a viable option for use within the scope of forensic sciences. Furthermore, the researchers indicated that the system proposed in this study offers two key advantages: minimal user input and the ability to function without the input of an experienced dentist. [
71] The findings of the study and the findings of our study are shown in
Table 6.
A direct comparison between our study and that of Čular et al. [
71] is not feasible due to differences in methodology and age group. As there are a greater number of age indicators of growth and development in younger individuals within the existing literature, the MAE value is typically reported to be lower in studies involving this age group. The present study focuses on individuals between the ages of 6 and 15. Accordingly, the lower MAE value observed in our study relative to that reported by Čular et al. is an anticipated outcome.
In contrast to the present study, the fact that our study does not entail a manual step such as segmentation over radiographs while estimating age can be demonstrated as an advantage of the system utilized in our study. Furthermore, as our study employed OPGs sections encompassing the left seven mandibular permanent teeth, the third molars, which are the most common congenitally missing teeth, were not required, thus conferring another advantage to our study.
In 2021, Wallraff et al. [
73] conducted a study on automatic age estimation to reduce the estimation error, which is a disadvantage of traditional manual age estimation methods. In this study, the researchers proposed a deep learning system based on supervised regression to perform age estimation. The study was conducted on 14000 OPGs of individuals between the ages of 11 and 20. The system proposed in the study uses raw OPGs as input. Images do not need to be pre-processed and cropped. The researchers reported that the focus of this study was 11-20 years of age, as teeth develop in a predictable pattern during the first two decades of life. Radiographs in the dataset that were affected by various external factors such as low-contrast images, diseases and jaw malpositions were not excluded. In this study, age estimation is considered as a regression problem and ResNet18 algorithm is used as the network architecture in the proposed system. The MAE of the proposed system was reported to be 1.08 with an SD of +1.41 and an error rate of 17.52%. The researchers stated that the dataset in their study provided comprehensive data for the age range of 11-20 years and that individuals aged 0-20 years should be included in the proposed system in the future [
73].
As indicated in the literature, age-related indicators tend to decline as growth and development progress, resulting in lower error rates in studies that examine younger age groups. In the study conducted by Wallraff et al. [
73], individuals below the age of 11 were excluded from the analysis. A comparison of our study with that of Wallraff et al. revealed a significantly lower MAE value for our study. It is postulated that this finding is attributable to the fact that our study focused on younger individuals.
In their 2020 study, Vila-Blanco et al. proposed a method based on deep learning algorithms for fully automatic age estimation from OPGs. This method was designed to overcome the limitations of traditional methods, which are affected by observer subjectivity and time-consuming manual operations. The study was conducted on OPGs of 2,289 individuals from the Spanish population, with an age range of 4.5 to 89 years. Only OPGs of the requisite quality were included in the study. In contrast to other studies in the literature, this study did not exclude OPGs with any of the following: orthodontic brackets or appliances, dentures, implants, restorative materials, fillings, endodontic treatment, foreign bodies such as caries, missing teeth, residual tooth roots, or earrings, and external elements such as distorted or blurred images. The images in question have been identified as defective within the system. In this study, convolutional neural networks from deep learning algorithms were employed as the methodology. Two distinct network architectures, namely DANet and DASNet, were devised for the purposes of this study. These architectures were designed and trained specifically for the purpose of age estimation in the context of this study [
16].
The results of the study indicate a strong correlation between the DANet and DASNet systems and age. The coefficient of determination was R²=0.87 for DANet and R²=0.90 for DASNet. In the assessment of the data across all age groups, MAE was found to be 2.84 years. As the age of the individuals decreased, MAE value was found to be 0.78 years for DANet and 0.75 years for DASNet in the group of individuals younger than 15 years.
In contrast to the methodology employed by Vila-Blanco et al., our study utilized a OPG section encompassing seven left mandibular permanent teeth as the input data for the system. In the study conducted by Vila-Blanco et al., the raw OPGs were utilized as the input data, and no exclusion criteria were employed. However, radiographs with inadequate acquisition quality were designated as defective and submitted to the system. The present study was conducted using high-quality radiographs of individuals without any dental or bony pathology. It is believed that this contributed to the success of the system used in our study.
With regard to the age group under examination, our study encompasses a much younger population than that considered by Vila-Blanco et al. Given that developmental age indicators are more prevalent in younger age groups, it may be posited that the superior performance of the Vila-Blanco et al. study can be attributed to the findings of our study. Moreover, the integration of deep learning and machine learning methodologies is believed to be a contributing factor to the enhanced performance of the system in our study.