The mandibular cortical index (MCI) is a valuable screening tool for osteoporosis on dental panoramic radiographs; however, inter-examiner variability remains a significant challenge. This study aimed to evaluate the diagnostic performance and reproducibility of a closed-type generative AI (NotebookLM, Google) compared with those of eight dentists of varying experience levels. One hundred radiographs were evaluated in two sessions separated by an interval of at least two weeks. The intra-examiner reliability of the AI was exceptionally high (κ = 0.987), and its processing speed was approximately six times faster than that of the dentists. However, agreement between the AI and the dentists remained at "slight agreement" or lower (κ < 0.2), statistically rejecting the null hypothesis of diagnostic equivalence. Notably, a "two-level discrepancy" was observed, in which the AI interchanged Class 1 (normal) and Class 3 (severe) in over 10% of cases. In contrast, the dentists demonstrated a significant learning effect, with inter-examiner agreement improving between sessions. These results suggest that although generative AI offers superior speed and reproducibility, its current decision-making logic deviates fundamentally from human expert criteria. Future integration should focus on hybrid models in which AI serves as a standardized feedback tool while dentists provide the final confirmatory diagnosis.
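The agreement values above (κ = 0.987 for intra-examiner reliability; κ < 0.2 for AI–dentist agreement) are Cohen's kappa coefficients. As a minimal illustrative sketch (not the study's actual analysis code or data), unweighted kappa between two raters' MCI class labels can be computed from the observed and chance-expected agreement:

```python
from collections import Counter

def cohens_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's kappa between two raters' categorical labels.

    kappa = (p_o - p_e) / (1 - p_e), where p_o is observed agreement
    and p_e is agreement expected by chance from the marginal frequencies.
    """
    assert len(ratings_a) == len(ratings_b), "raters must score the same cases"
    n = len(ratings_a)
    # Observed agreement: fraction of cases where the two raters match.
    p_o = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Expected agreement under independence, from each rater's label frequencies.
    count_a = Counter(ratings_a)
    count_b = Counter(ratings_b)
    labels = set(ratings_a) | set(ratings_b)
    p_e = sum(count_a[label] * count_b[label] for label in labels) / (n * n)
    return (p_o - p_e) / (1 - p_e)

# Hypothetical MCI classifications (Class 1-3) by two raters for six radiographs.
rater_1 = [1, 1, 2, 2, 3, 3]
rater_2 = [1, 1, 2, 3, 3, 3]
kappa = cohens_kappa(rater_1, rater_2)
```

In practice a weighted kappa is often preferred for ordinal scales like the MCI, since it penalizes the two-level Class 1/Class 3 confusions noted above more heavily than adjacent-class disagreements; library implementations such as scikit-learn's `cohen_kappa_score` support this via a `weights` argument.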