Version 1
: Received: 23 January 2024 / Approved: 23 January 2024 / Online: 24 January 2024 (07:16:18 CET)
How to cite:
Brion, A.; Dang, L.-A.; Poncelet, E.; Ferret, L.; Vermersch, M.; Nicolas, L. Comparative Clinical Evaluation of 3 Artificial Intelligence Algorithms for Breast Cancer Screening with Mammography. Preprints2024, 2024011706. https://doi.org/10.20944/preprints202401.1706.v1
Brion, A.; Dang, L.-A.; Poncelet, E.; Ferret, L.; Vermersch, M.; Nicolas, L. Comparative Clinical Evaluation of 3 Artificial Intelligence Algorithms for Breast Cancer Screening with Mammography. Preprints 2024, 2024011706. https://doi.org/10.20944/preprints202401.1706.v1
Brion, A.; Dang, L.-A.; Poncelet, E.; Ferret, L.; Vermersch, M.; Nicolas, L. Comparative Clinical Evaluation of 3 Artificial Intelligence Algorithms for Breast Cancer Screening with Mammography. Preprints2024, 2024011706. https://doi.org/10.20944/preprints202401.1706.v1
APA Style
Brion, A., Dang, L. A., Poncelet, E., Ferret, L., Vermersch, M., & Nicolas, L. (2024). Comparative Clinical Evaluation of 3 Artificial Intelligence Algorithms for Breast Cancer Screening with Mammography. Preprints. https://doi.org/10.20944/preprints202401.1706.v1
Chicago/Turabian Style
Brion, A., Mathilde Vermersch and Laurent Nicolas. 2024 "Comparative Clinical Evaluation of 3 Artificial Intelligence Algorithms for Breast Cancer Screening with Mammography" Preprints. https://doi.org/10.20944/preprints202401.1706.v1
Abstract
Objectives: Few studies have evaluated and compared artificial intelligence (AI) models for breast cancer detection on the same database. The purpose is to compare the performances of 3 artificial intelligence algorithms on the same dataset at a free operative threshold and to evaluate how the performance is impacted by the chosen threshold.
Materials: In this retrospective single-center study, a dataset of 314 2D screening mammograms performed between 2012 and 2020 was established with a prevalence of 19.6% of histologically proven cancer. Three AI constructors using CE (European conformity) marking algorithms have agreed to take part in the study by submitting the cancer-enriched cohort to their algorithm. They chose a free operative threshold to distinguish benign and malignant mammograms. The most suspicious lesion was marked and correctly located by the AI algorithm. Statistical parameters were analyzed and compared between the algorithms.
Results: Regarding both sensitivity and specificity, at the chosen threshold, AI 1 had the best compromise between sensitivity (74%) and specificity (79%). AI 2 had a statistically lower sensitivity (52%; p<0.05) with a higher specificity (98.4%) than the other AI algorithms. AI 3 had a nonsignificant sensitivity difference (69.9%) but a significantly lower specificity (45.3%; p<0.001) than the other two.
The performances varied depending on the chosen threshold; when the AI 2 threshold was lowered, the sensitivity increased (69.9%), while a higher specificity (86.1%) was maintained.
Conclusion: Two AI algorithms stood out in terms of performance when the threshold was optimized, resulting in an acceptable sensitivity and specificity.
Keywords
Breast neoplasia; Artificial intelligence; Algorithm; Mammography
Subject
Medicine and Pharmacology, Other
Copyright:
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.