Submitted:
30 June 2025
Posted:
01 July 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
Study Objective
2. Materials and Methods
Study Design and Patient Selection
- BI-RADS 4 or 5 lesions on contrast-enhanced mammography (CEM),
- Histologically confirmed invasive breast cancer,
- Complete imaging workup (mammography, ultrasound, and CEM).
Data Management and CEM Protocol
- Demographics (age, patient ID),
- Imaging metadata (ACR density categories, BPE grade),
- Quantitative measurements (glandular dimensions, volumetric parameters).
Dataset
Neural Network Architecture for BPE and Breast Density Prediction
- Input Layer: Numerical variables including BPE score, ACR density grade, and patient age.
- Hidden Layer 1: Dense layer with 64 neurons, ReLU activation.
- Hidden Layer 2: Dense layer with 32 neurons, ReLU activation.
- Hidden Layer 3: Dense layer with 16 neurons, ReLU activation.
- Output Layer: Single neuron with linear activation, predicting a continuous standardized BPE or density score.

Model Implementation and Training Configuration
- Adam optimizer with learning rate 0.001,
- L2 regularization (λ = 0.01),
- Dropout rate of 30%,
- Mean Squared Error (MSE) as the loss function.
Clinical Relevance of Training Optimization
AI Integration into Radiological Workflow
- Image acquisition and preprocessing,
- Feature extraction (BPE grade, density category, patient age),
- Model prediction generating continuous BPE estimates or density reclassification,
- Radiologist review where AI outputs served as a second opinion, particularly in borderline BI-RADS C/D cases known for higher interobserver variability,
- Final clinical decision by the radiologist, preserving clinical autonomy.
Statistical Analysis
3. Results
Correlation Analysis
Model Performance
- Baseline linear regression (Excel): MSE = 0.864, R2 = 14.4%
- Optimized linear regression (scikit-learn): MSE = 0.641, R2 = 20.3%
- Neural network (TensorFlow): MSE = 0.638, R2 = 23.3%
Clinical Validation
Interpretation
4. Discussion
- 35% reduction in interpretation time (4.1 → 2.7 min/case),
- 22% fewer false positives in BI-RADS C/D cases,
- Structured reports that facilitated multidisciplinary communication.
5. Clinical Outlook
6. Study Limitations
- Sample size was modest (n = 213), limiting statistical power and reducing the training potential of more complex models.
- Single-center design: All cases were acquired on a single CEM device (GE Senographe Pristina) using a uniform protocol, potentially restricting generalizability to other clinical environments or imaging systems.
- Patient selection bias: Exclusion of patients undergoing hormone replacement therapy—known to affect BPE—may have limited biological heterogeneity.
- Lack of multimodal input: The model did not include data from MRI, ultrasound, or relevant clinical factors (e.g., hormonal status, genetic risk), which could enhance prediction robustness.
- Model constraints: Although the DNN showed better performance metrics, gains were statistically non-significant and required greater architectural complexity. Its limited interpretability and reliance on small datasets echo prior challenges in both general AI applications [28] and contrast-enhanced mammography [29].
- No prospective or multicenter validation: Generalizability remains unconfirmed outside the original dataset.
7. Future Directions
- Hybrid Probabilistic Models: Bayesian neural networks could offer not only high accuracy but also uncertainty estimates—critical for borderline BI-RADS cases.
- Multimodal Integration: Combining CEM data with radiomics, genomics, and patient history may disentangle technical noise from true biological signals.
- Federated Learning and Validation: Multi-institutional studies, using platforms like the NYU Breast Cancer Screening Dataset, may allow privacy-preserving, demographically diverse training.
- Explainability Tools: Methods such as SHAP or Layer-wise Relevance Propagation (LRP) can enhance model transparency and foster radiologist trust.
- Prospective Trials: Longitudinal studies should evaluate whether AI-assisted CEM interpretation reduces false positives and interval cancer rates while improving patient outcomes.
8. Conclusions
Author Contributions
Funding
Informed Consent Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| BPE | Background Parenchimal Enhancement |
| OV | Observer variability |
| CEM | contrast-enhanced mammography |
| ANN | artificial neural networks |
| BD | Breast density |
Appendix A - Technical Glossary
DNN Architecture Specifications
Training Configuration
Clinical Validation Metrics
Clinical Validation Outcomes
Key Terminological Additions
Appendix B - Simplified Tutorial - Using the 213-Patient Dataset
- Young patients with high density and mild BPE
- Middle-aged patients with heterogeneous density and minimal BPE
- Older patients with lower density and moderate BPE
- 149 records for training
- 32 for validation
- 32 for independent testing
Appendix C - Simplified Tutorial - Using VinDr-Mammo Dataset
- Histogram equalization, enhancing contrast across dense tissue regions
- Pixel normalization, rescaling intensity values to the [0,1] range
-
Case 1: 48-year-old, BI-RADS D
- ○
- AI Prediction: 3.8
- ○
- Reference: 4.0 → Error: 0.2
-
Case 2: 55-year-old, BI-RADS C
- ○
- AI Prediction: 2.5
- ○
- Reference: 3.0 → Error: 0.5
-
Case 3: 62-year-old, BI-RADS D
- ○
- AI Prediction: 4.1
- ○
- Reference: 4.0 → Error: 0.1
- Agreement with radiologist consensus: 94%
- Mean Absolute Error (MAE): 0.65 BI-RADS points
- False Positive Reduction: 22% relative to baseline
- Improved standardization of density assessments
- Reduction in inter-reader variability (κ: 0.45 to 0.82)
- 35% reduction in case interpretation time (mean: 4.1 to 2.7 minutes)
References
- Sprague, B.L.; Conant, E.F.; Onega, T.; et al. Variation in Mammographic Breast Density Assessments Among Radiologists in Clinical Practice: A Multicenter Observational Study. J Natl Cancer Inst. 2021, 113, 391–399. [Google Scholar] [CrossRef] [PubMed]
- Youk, J.H.; Gweon, H.M.; Son, E.J.; Kim, J.A. Observer variability in Breast Imaging Reporting and Data System (BI-RADS) density assessment: Comparison between digital mammography and digital breast tomosynthesis. Eur Radiol. 2020, 30, 66–76. [Google Scholar]
- Jochelson, M.S.; Lampen-Sachar, K.; Gibbons, G.; et al. Do clinical and radiologist characteristics impact mammographic interpretation? A Breast Cancer Surveillance Consortium study. J Clin Oncol. 2014, 32, 2299–2306. [Google Scholar]
- Sung, J.S.; Lebron, L.; Keating, D.; et al. Background parenchymal enhancement at breast MR imaging: Normal patterns, diagnostic challenges, and potential for false-positive and false-negative interpretation. Radiographics. 2014, 34, 234–247. [Google Scholar] [CrossRef]
- McCormack, V.A.; dos Santos Silva, I. Breast density and parenchymal patterns as markers of breast cancer risk: A meta-analysis. Cancer Epidemiol Biomarkers Prev. 2006, 15, 1159–1169. [Google Scholar] [CrossRef]
- Boyd, N.F.; Guo, H.; Martin, L.J.; et al. Mammographic density and the risk and detection of breast cancer. N Engl J Med. 2007, 356, 227–236. [Google Scholar] [CrossRef]
- Rhodes, D.J.; Radecki Breitkopf, C.; Ziegenfuss, J.Y.; et al. Clinical Utility of Breast Density Awareness: Knowledge of Breast Density and Its Consequences Among Breast Clinic Patients. J Am Coll Radiol. 2020, 17, 1389–1394. [Google Scholar]
- DenseBreast-info.org. Legislation & Regulatory Activity. 2023 [cited 2023 Nov 15]. Available from: https://densebreast-info.org/legislation/.
- Sardanelli, F.; Aase, H.S.; Ålvarez, M.; et al. Position paper on screening for breast cancer by the EUSOBI International Breast Density Working Group. Eur Radiol. 2017, 27, 2737–2743. [Google Scholar] [CrossRef]
- Girometti, R.; Zanotel, M.; Londero, V.; et al. Comparison of background parenchymal enhancement in contrast-enhanced mammography and magnetic resonance imaging. Eur Radiol. 2022, 32, 404–413. [Google Scholar]
- Kim, S.A.; Chang, J.M.; Cho, N.; et al. Interreader Variability in the BI-RADS Final Assessment and Management Recommendations for Breast MRI. J Breast Cancer 2020, 23, 197–209. [Google Scholar]
- Berg, W.A.; Campassi, C.; Langenberg, P.; Sexton, M.J. Breast Imaging Reporting and Data System: Inter- and Intraobserver Variability in Feature Analysis and Final Assessment. AJR Am J Roentgenol. 2000, 174, 1769–1777. [Google Scholar] [CrossRef] [PubMed]
- Rhodes, D.J.; Radecki Breitkopf, C.; Ziegenfuss, J.Y.; et al. Clinical Utility of Breast Density Awareness: Knowledge of Breast Density and Its Consequences Among Breast Clinic Patients. J Am Coll Radiol. 2020, 17, 1389–1394. [Google Scholar]
- Magni, V.; Cozzi, A.; Muscogiuri, G.; Benedek, A.; Rossini, G.; Fanizza, M.; Di Giulio, G.; Sardanelli, F. Background parenchymal enhancement on contrast-enhanced mammography: Associations with breast density and patient’s characteristics. Radiol Med. 2024, 129, 1303–1312. [Google Scholar] [CrossRef] [PubMed]
- Nissan, N.; Sevilimedu, V.; Gluskin, J.; Arita, Y.; Keating, D.M.; D’Alessio, D.; Fruchtman-Brot, H.; Ochoa-Albiztegui, R.E.; Sung, J.S.; Jochelson, M.S. Hormonal Regulation of Background Parenchymal Enhancement at Contrast-enhanced Mammography. Radiology 2025, 314, e241158. [Google Scholar] [CrossRef]
- Di Grezia, G.; Nazzaro, A.; et al. From Variability to Standardization: The Impact of Breast Density on Background Parenchymal Enhancement in Contrast-Enhanced Mammography and the Need for a Structured Reporting System. 2025. [CrossRef]
- Nguyen, H.T.; Nguyen, H.Q.; Pham, H.H.; Lam, K.; Le, L.T.; Dao, M.; Vu, V. VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography. Sci Data 2023, 10, 277. [Google Scholar] [CrossRef]
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; et al. Scikit-learn: Machine Learning in Python. J Mach Learn Res. 2011, 12, 2825–2830. [Google Scholar]
- Abadi, M.; Barham, P.; Chen, J.; et al. TensorFlow: A System for Large-Scale Machine Learning. 12th USENIX Symposium on Operating Systems Design and Implementation. 2016, 265–283. [Google Scholar]
- Prechelt, L. Early Stopping - But When? In: Orr GB, Müller KR (eds) Neural Networks: Tricks of the Trade. Lecture Notes in Computer Science, vol 1524. Springer. 1998.
- Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning (Section 7.8: Early Stopping). MIT Press. 2016.
- Kim, S.A.; et al. Variability in Dense Breast Reporting: A Multicenter Analysis. Radiology. 2023, 306, 220–229. [Google Scholar]
- Girometti, R.; et al. Interval Cancers in Dense Breasts: A 10-Year Retrospective Study. Eur Radiol. 2024, 34, 123–134. [Google Scholar]
- Youk, J.H.; et al. False-Positive Rates in CEM for Dense Breasts. J Breast Imaging. 2023, 5, 89–97. [Google Scholar]
- Gelman, A.; et al. Bayesian Networks for Medical Decision Support. Stat Med. 2016, 35, 3899–3916. [Google Scholar]
- Sardanelli, F.; et al. Global Standards for Dense Breast Management. Eur Radiol. 2025, 35, In Press. [Google Scholar]
- Ripaud, E.; Jailin, C.; Quintana, G.I.; et al. Deep-learning model for background parenchymal enhancement classification in contrast-enhanced mammography. Phys Med Biol. 2024, 69, 115013. [Google Scholar] [CrossRef] [PubMed]
- Schulz, M.-A.; Yeo, B.T.T.; Vogelstein, J.T.; Mourão-Miranda, J.; Kather, J.N.; Kording, K.; Richards, B.; Bzdok, D. Different scaling of linear models and deep learning in UKBiobank brain images versus machine-learning datasets. Nature Communications 2020, 11, 4238. [Google Scholar] [CrossRef]
- Jailin, C.; Mohamed, S.; Iordache, R.; Milioni De Carvalho, P.; Ahmed, S.Y.; Abdel Sattar, E.A.; Moustafa, A.F.I.; Gomaa, M.M.; Kamal, R.M.; Vancamberg, L. AI-Based Cancer Detection Model for Contrast-Enhanced Mammography. Bioengineering 2023, 10, 974. [Google Scholar] [CrossRef] [PubMed]




| BI-RADS Category | Description | Number of Patients (n) | Percentage (%) |
|---|---|---|---|
| A | Almost entirely fatty | 12 | 5.6% |
| B | Scattered areas of fibroglandular | 31 | 14.6% |
| C | Heterogeneously dense | 95 | 44.6% |
| D | Extremely dense | 75 | 35.2% |
| Total | 213 | 100% |
| Model | MSE | R2 | AUC | Precision | Recall | Inference Time (ms) |
|---|---|---|---|---|---|---|
| Excel (Baseline) | 0.864 | 14.4% | - | - | - | - |
| Linear (scikit-learn) | 0.641 | 20.3% | 0.73 | 0.70 | 0.68 | 0.8 |
| DNN (Primary) | 0.638 | 23.3% | 0.75 | 0.72 | 0.69 | 12.4 |
| DNN (VinDr-Mammo) | 0.652 | 22.9% | 0.74 | 0.70 | 0.68 | 13.1 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).