Submitted:
16 April 2025
Posted:
18 April 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
3. Results
3.1. Benford’s Law Analysis
- “Number of services charged”: The observed distribution of leading digits in this category significantly deviated from the expected Benford distribution, with a χ2 value of 26.302 (eight degrees of freedom) and a p-value between 0.0005 and 0.005 (Figure 1, Table 1). This deviation suggests potential anomalies in the billing process.

| Leading digit | Benford probability | Observed probability |
|---|---|---|
| 1 | 30,10 | 9,38 |
| 2 | 17,61 | 9,38 |
| 3 | 12,49 | 18,75 |
| 4 | 9,69 | 3,13 |
| 5 | 7,92 | 6,25 |
| 6 | 6,69 | 21,88 |
| 7 | 5,80 | 12,50 |
| 8 | 5,12 | 6,25 |
| 9 | 4,58 | 12,50 |

| Leading digit | Benford probability | Observed probability |
|---|---|---|
| 1 | 30,10 | 21,88 |
| 2 | 17,61 | 37,50 |
| 3 | 12,49 | 12,50 |
| 4 | 9,69 | 0,00 |
| 5 | 7,92 | 0,00 |
| 6 | 6,69 | 15,63 |
| 7 | 5,80 | 0,00 |
| 8 | 5,12 | 3,13 |
| 9 | 4,58 | 9,38 |

| Leading digit | Benford probability | Observed probability |
|---|---|---|
| 1 | 30,10 | 3,23 |
| 2 | 17,61 | 12,90 |
| 3 | 12,49 | 29,03 |
| 4 | 9,69 | 19,35 |
| 5 | 7,92 | 19,35 |
| 6 | 6,69 | 6,45 |
| 7 | 5,80 | 6,45 |
| 8 | 5,12 | 3,23 |

| Leading digit | Benford probability | Observed probability |
|---|---|---|
| 1 | 30,10 | 41,94 |
| 2 | 17,61 | 3,23 |
| 3 | 12,49 | 0,00 |
| 4 | 9,69 | 3,45 |
| 5 | 7,92 | 0,00 |
| 6 | 6,69 | 10,34 |
| 7 | 5,80 | 13,79 |
| 8 | 5,12 | 17,24 |
| 9 | 4,58 | 6,90 |

| Leading digit | Benford probability | Observed probability |
|---|---|---|
| 1 | 30,10 | 32,26 |
| 2 | 17,61 | 29,03 |
| 3 | 12,49 | 9,68 |
| 4 | 9,69 | 9,68 |
| 5 | 7,92 | 0,00 |
| 6 | 6,69 | 6,45 |
| 7 | 5,80 | 3,23 |
| 8 | 5,12 | 3,23 |
| 9 | 4,58 | 6,45 |

| Leading digit | Benford probability | Observed probability |
|---|---|---|
| 1 | 30,10 | 13,33 |
| 2 | 17,61 | 13,33 |
| 3 | 12,49 | 16,67 |
| 4 | 9,69 | 16,67 |
| 5 | 7,92 | 16,67 |
| 6 | 6,69 | 10,00 |
| 7 | 5,80 | 10,00 |
| 8 | 5,12 | 3,33 |
| 9 | 4,58 | 0,00 |
3.2. Outlier Detection
3.2.1. Grubbs’ Test
- “Number of services charged”: One of two large university health centers in the country was identified as an outlier, with a GG-value of 4.906761. This result suggests that its billing practices differ significantly from the norm, possibly due to factors like its size, patient demographics, potentially including a larger number of elderly patients, or unique billing practices.
- “Total number of points charged”: The same large university healthcare center as mentioned above was identified as an outlier, with a GG-value of 4.955756, indicating unusual point billing practices.
- “Number of points per examination”: A specialized privately owned dermatology center was identified as an outlier, with a GG-value of 4.494556, indicating unusually high number of points per examination.
- “Average examination value (€)”: The same specialized dermatology center was an outlier, with a GG-value of 6.243344, suggesting unusually high examination values. Another dermatology clinic within a primary healthcare center in a major city was also identified as an outlier, with a GG-value of 3.062068.
- “Number of first examinations”: The same specialized dermatology center was again identified as an outlier, with a GG-value of 4.680405, suggesting unusually high first examination numbers.
- “Total number of examinations”: The same specialized dermatology center as mentioned previously was identified as an outlier, with a GG-value of 4.458205, suggesting unusually high total examination numbers with a larger number of follow-up examinations.
3.2.2. Hampel’s Test
- 7.
- “Number of services charged”, and “total number of points charged”: No outliers were detected using Hampel’s test for these categories.
- 8.
- “Number of points per examination”: A specialized privately owned dermatology center and a primary healthcare center in a major city, both mentioned previously, were identified as outliers. This result suggests that these centers charge an unusually high number of points per examination.
- 9.
- “Average examination value (€)”: A specialized privately owned dermatology center was again identified as an outlier, with the primary healthcare center in a major city also being close to the outlier threshold.
- 10.
- “Number of first examinations”: The same one of the two large university health centers mentioned previously was identified as an outlier. This suggests that the center’s first examination numbers are unusually high compared to others.
- 11.
- “Total number of examinations”: The large university health center was again identified as an outlier. This suggests that the center may have a high number of follow-up examinations due to a larger number of elderly patients, or unique billing practices.
3.2.3. T-Test
- 12.
- “Number of services charged”: The large university health center was identified as an outlier. This again suggests that the center’s service billing practices differ significantly from the norm.
- 13.
- “Total number of points charged”: No outliers were detected.
- 14.
- “Number of points per examination”: The specialized privately owned dermatology center mentioned previously was identified as an outlier. This again indicates that the center charges an unusually high number of points per examination.
- 15.
- “Average examination value (€)”: The specialized privately owned dermatology center was consistently identified as an outlier. This suggests that the center’s average examination value is significantly higher than expected.
- 16.
- “Number of first examinations”: The large university health center was again identified as an outlier. This indicates that its first examination numbers are unusually high.
- 17.
- “Total number of examinations”: The large university health center mentioned previously was an outlier. This suggests that the center may also have had an unusually high number of follow-up examinations.
4. Discussion
4.1. Strategic Use of Historical Data for Evaluating Fraud Detection Methods in Healthcare Billing
4.2. Assessing Data Credibility and Authenticity with Benford’s Law: Practicality and Usefulness in Healthcare Fraud Detection
4.3. Pinpointing Anomalies with Outlier Detection Tests: Practicality and Usefulness of Grubbs’, Hampel’s, and T-Tests in Healthcare Billing Analysis
4.4. Future Research Directions
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Conflicts of Interest
Abbreviations
| HIIS | Health Insurance Institute of Slovenia |
| MAD | Median absolute deviation |
| DRG | Diagnosis-related group |
References
- Najar AV, Alizamani L, Zarqi M, Hooshmand E. A global scoping review on the patterns of medical fraud and abuse: integrating data-driven detection, prevention, and legal responses. Arch Public Health, 2025; 83: 43. [CrossRef]
- Herland M, Bauder RA, Khoshgoftaar TM. Approaches for identifying U.S. medicare fraud in provider claims data. Health Care Manag Sci, 2020; 23: 2–19. [CrossRef]
- Hamid Z, Khalique F, Mahmood S, Daud A, Bukhari A, Alshemaimri B. Healthcare insurance fraud detection using data mining. BMC Med Inform Decis Mak, 2024; 24: 112. [CrossRef]
- Johnson JM, Khoshgoftaar TM. Data-Centric AI for Healthcare Fraud Detection. SN Comput Sci, 2023; 4: 389. [CrossRef]
- Antunes AM, Teixeira D, Sousa F. Benford’s Law: the fraud detection’s left hand. 2023 18th Iberian Conference on Information Systems and Technologies (CISTI). Aveiro, Portugal: IEEE, 2023.
- Orita M, Hagiwara Y, Moritomo A, Tsunoyama K, Watanabe T, Ohno K. Agreement of drug discovery data with Benford’s law. Expert Opin Drug Discov, 2013; 8: 1–5. [CrossRef]
- Goh, C. Applying visual analytics to fraud detection using Benford’s law. J Corp Accounting Finance, 2020; 19 31: 202–8. [CrossRef]
- Zhang, F. Using Outliers Detection in Policy Analysis: A Pilot Case Study of the Detection and Analysis of Average Healthcare Expense in China. Health Systems and Policy Research, 2015; 2.
- No authors listed: https://www.google.com/search?q=Miller-Nigrini+Excel+Benford+Tester&rlz=1C1GCEB_enSI923SI923&oq=Miller-Nigrini+Excel+Benford+Tester&gs_lcrp=EgZjaHJvbWUyBggAEEUYOTIGCAEQIxgnMgYIAhAjGCcyBwgDEAAY7wUyBwgEEAAY7wUyBwgFEAAY7wUyBwgGEAAY7wUyCggHEAAYgAQYogTSAQkxNzE5ajBqMTWoAgCwAgA&sourceid=chrome&ie=UTF-8.
- Adikaram KKLB, Hussein MA, Effenberger M, Becker T. Data Transformation Technique to Improve the Outlier Detection Power of Grubbs’ Test for Data Expected to Follow Linear Relation. Journal of Applied Mathematics, 2015; 2015: 1–9. [CrossRef]
- Urvoy M, Autrusseau F. Application of Grubbs’ test for outliers to the detection of watermarks. Proceedings of the 2nd ACM workshop on Information hiding and multimedia security. Salzburg Austria: ACM, 2014.
- Tripathy, SS. Comparison of Statistical Methods for Outlier Detection in Proficiency Testing Data on Analysis of Lead in Aqueous Solution. AJTAS, 2013; 2: 233. [CrossRef]
- Skaik, YA. The bread and butter of statistical analysis “t-test”: Uses and misuses. Pak J Med Sci, 1969; 31. [CrossRef]
- Ben-Gal, I. Outlier Detection. In: Maimon O, Rokach L, eds. Data Mining and Knowledge Discovery Handbook. Boston, MA: Springer US, 2009.
- Albreht T, Polin K, Pribaković Brinovec R, et al. Slovenia: Health System Review. Health Syst Transit, 2021; 23: 1–183.
- OECD, European Commission. Health at a Glance: Europe 2024: State of Health in the EU Cycle. OECD, 2024. [CrossRef]
- Prevolnik Rupel V, Došenović Bonča P. Is Value-Based Health Care Just the Latest Fad or can it Transform the Slovenian Health Care System? Slovenian Journal of Public Health, 2023; 62: 55–8.
- European Central Bank. A methodology for automatised outlier detection in high-dimensional datasets: an application to euro area banks’ supervisory data. LU: Publications Office, 2018. https://data.europa.eu/doi/10.2866/357467 (accessed 1 Apr 2025).
- Teitcher JE, Bockting WO, Bauermeister JA, Hoefer CJ, Miner MH, Klitzman RL. Detecting, preventing, and responding to “fraudsters” in internet research: ethics and tradeoffs. J Law Med Ethics, 2015; 43: 116‐33.
- Kilicoglu, H. Biomedical text mining for research rigor and integrity: tasks, challenges, directions. Brief Bioinform, 2018; 19: 1400‐1414. [CrossRef]
- Zhaksylyk A, Zimba O, Yessirkepov M, Kocyigit BF. Research Integrity: Where We Are and Where We Are Heading. J Korean Med Sci, 2023; 38: e405. [CrossRef]
- Azevedo CDS, Gonçalves RF, Gava VL, Spinola MM. A Benford’s law based method for fraud detection using R Library. MethodsX, 2021; 8: 101575. [CrossRef]
- Bosnjak M, Dahm S, Kuhnert R, Weihrauch D, Rosario AS, Hurraß J, Schmich P, Wieler LH. Potential COVID-19 test fraud detection: Findings from a pilot study comparing conventional and statistical approaches. J Health Monit, 2024; 9: e12100. PMCID: PMC11262742. [CrossRef] [PubMed]
- Auzins A, Aboltins A, Lismanis A. Identification of external shock impact on agri-food sector by applying outlier detection tests: case of Latvia. 2024. [CrossRef]
- Kokolanski Z, Dimcev V. Outlier Detection Approach in Sensor-to-Microcontroller Interfaces. 2024 XV International Symposium on Industrial Electronics and Applications (INDEL). Banja Luka, Bosnia and Herzegovina: IEEE, 2024.
- Ding K, Zhang J, Ding H, Liu Y, Chen F, Li Y. Fault detection of photovoltaic array based on Grubbs criterion and local outlier factor. IET Renewable Power Gen, 2020; 14: 551–9. [CrossRef]
- Ayinoluwa Feranmi Kolawole, Shukurat Opeyemi Rahmon. Designing secure data pipelines for medical billing fraud detection using homomorphic encryption and federated learning. Int J Sci Res Arch, 2024; 10: 1210–22. [CrossRef]
- Chouhan N, Kediya S, Wagh U, Deshpande P, Karmore P, Das D. A Meta-Analysis of AI in Fraud Detection: Evaluating the Effectiveness of Different Algorithms and Data Sources. 2024 2nd DMIHER International Conference on Artificial Intelligence in Healthcare, Education and Industry (IDICAIEI). Wardha, India: IEEE, 2024.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).