Submitted:
25 September 2025
Posted:
26 September 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction

- Solution of the above-mentioned pitfall of the Bayes theorem within a Naïve Bayes classifier framework.
- Empirical benchmark showing a robust classification performance of the plausible naïve Bayes classifier using the Pareto Density Estimation (PDE).
- Visualization of the class-conditional likelihoods and posteriors to support model interpretability.
2. Materials and Methods
2.1. Bayes Classification
2.2. Density Estimation
2.3. Pareto Density Estimation
2.4. Smoothed Pareto Density Estimation
2.5. Plausible Naïve Bayes classification
2.6. Practical Considerations
2.7. Interpretability of PDENB
2.8. Benchmark Datasets and Conventional Naïve Bayes Algorithms
3. Results
3.1. Classification Performance
3.2. Interpretable Naïve Bayes Classifier








3.3. A Baseline for the distinction of Blood vs. Bone marrow biological population frequencies
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Data Availability
Acknowledgements
Conflict of interest
References
- [Bellman, 1961] Bellman, R. E.: Adaptive control processes: a guided tour, Princeton university press, ISBN: 1400874661, 1961.
- [Bickel/Frühwirth, 2006] Bickel, D. R., & Frühwirth, R.: On a fast, robust estimator of the mode: Comparisons to other robust estimators with applications, Computational Statistics & Data Analysis, Vol. 50(12), pp. 3500-3530, 2006.
- [Blackman/Tukey, 1958] Blackman, R. B., & Tukey, J. W.: The measurement of power spectra from the point of view of communications engineering—Part I, Bell System Technical Journal, Vol. 37(1), pp. 185-282, 1958.
- [Breiman, 2001] Breiman, L.: Random forests, Machine Learning, Vol. 45(1), pp. 5-32, 2001.
- [Buitinck; et al. , 2011] Buitinck, L., Louppe, G., Blondel, M., Pedregosa, F., Mueller, A., Grisel, O.,... Varoquaux, G.: Scikit-learn: Machine Learning in Python, Journal of Machine Learning Research, Vol. 12, pp. 2825--2830, 2011.
- [Chatterjee, 2021] Chatterjee, S.: A new coefficient of correlation, Journal of the American Statistical Association, Vol. 116(536), pp. 2009-2022, 2021.
- [Chicco/Jurman, 2020] Chicco, D., & Jurman, G.: The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation, BMC Genomics, Vol. 21(1), pp. 1-13, 2020.
- [Chicco; et al. , 2021] Chicco, D., Tötsch, N., & Jurman, G.: The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation, BioData mining, Vol. 14, pp. 1-22, 2021.
- [Comon, 1992] Comon, P.: Independent Component Analysis, In Lacoume, J. L. (Ed.), Higher-Order Statistics, pp. 29-38, France, Elsevier, doi https://hal.archives-ouvertes.fr/hal-00346684, 1992.
- [Cover/Hart, 1967] Cover, T. M., & Hart, P. E.: Nearest neighbor pattern classification, IEEE transactions on information theory, Vol. 13(1), pp. 21-27, 1967.
- [Devroye; et al. , 2013] Devroye, L., Györfi, L., & Lugosi, G.: A probabilistic theory of pattern recognition, (Vol. 31), Springer Science & Business Media, ISBN: 1461207118, 2013.
- [Devroye/Lugosi, 2000] Devroye, L., & Lugosi, G.: Variable kernel estimates: on the impossibility of tuning the parameters, Springer, ISBN: 1461271118, 2000.
- [Domingos/Pazzani, 1997] Domingos, P., & Pazzani, M.: On the optimality of the simple Bayesian classifier under zero-one loss, Machine Learning, Vol. 29, pp. 103-130, 1997.
- [Dua/Graff, 2019] Dua, D., & Graff, C. (2019). UCI machine learning repository. Retrieved from: http://archive.ics.uci.edu/ml.
- [Duda; et al. , 2001] Duda, R. O., Hart, P. E., & Stork, D. G.: Pattern Classification, (Second Edition ed.), New York, USA, John Wiley & Sons, ISBN: 0-471-05669-3, 2001.
- [Ekblom, 1972] Ekblom, H.: A Monte Carlo investigation of mode estimators in small samples, Journal of the Royal Statistical Society: Series C (Applied Statistics), Vol. 21(2), pp. 177-184, 1972.
- [Fernández, Delgado; et al. , 2014] Fernández Delgado, M., Cernadas García, E., Barro Ameneiro, S., & Amorim, D. G.: Do we need hundreds of classifiers to solve real world classification problems?, Journal of Machine Learning Research, Vol. 15, pp., 2014.
- [Fritsch/Carlson, 1980] Fritsch, F. N., & Carlson, R. E.: Monotone piecewise cubic interpolation, SIAM Journal on Numerical Analysis, Vol. 17(2), pp. 238-246, 1980.
- [Fukunaga/Kessell, 1973] Fukunaga, K., & Kessell, D.: Nonparametric Bayes error estimation using unclassified samples, IEEE transactions on information theory, Vol. 19(4), pp. 434-440, 1973.
- [Fukunaga/Kessell, 1971] Fukunaga, K., & Kessell, D. L.: Estimation of classification error, IEEE Transactions on computers, Vol. 100(12), pp. 1521-1527, 1971.
- [Gastwirth, 1971] Gastwirth, J. L.: A general definition of the Lorenz curve, Econometrica: Journal of the Econometric Society, Vol., pp. 1037-1039, 1971.
- [Gastwirth/Glauberman, 1976] Gastwirth, J. L., & Glauberman, M.: The interpolation of the Lorenz curve and Gini index from grouped data, Econometrica: Journal of the Econometric Society, Vol., pp. 479-483, 1976.
- [Good, 2013] Good, P.: Permutation tests: a practical guide to resampling methods for testing hypotheses, Springer Science & Business Media, ISBN: 147573235X, 2013.
- [Hall/Frank, 2008] Hall, M. A., & Frank, E.: Combining naive bayes and decision tables, Vol. 21, Proc. FLAIRS, pp. 318-319, 2008.
- [Harmeling; et al. , 2004] Harmeling, S., Meinecke, F., & Müller, K.-R.: Injecting noise for analysing the stability of ICA components, Signal processing, Vol. 84(2), pp. 255-266, 2004.
- [Hoffmann; et al. , 2022] Hoffmann, J., Thrun, M. C., Röhnert, M., Von Bonin, M., Oelschlägel, U., Neubauer, A.,... Brendel, C.: Identification of critical hemodilution by artificial intelligence in bone marrow assessed for minimal residual disease analysis in acute myeloid leukemia: the Cinderella method., Cytometry: Part A, Vol. 103(4), pp. 304–312, doi 10.1002/cyto.a.24686, 2022.
- [Hotelling, 1933] Hotelling, H.: Analysis of a complex of statistical variables into principal components, Journal of educational psychology, Vol. 24(6), pp. 417, 1933.
- [John/Langley, 1995] John, G. H., & Langley, P.: Estimating continuous distributions in Bayesian classifiers, Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence, 10.48550/arXiv.1302.4964, Morgan and Kaufman, San Mateo, 1995.
- [Jones/Lotwick, 1984] Jones, M., & Lotwick, H.: Remark AS R50: a remark on algorithm AS 176. Kernal density estimation using the fast Fourier transform, Journal of the Royal Statistical Society. Series C (Applied Statistics), Vol. 33(1), pp. 120-122, 1984.
- [Karlis; et al. , 2003] Karlis, D., Saporta, G., & Spinakis, A.: A simple rule for the selection of principal components, Communications in Statistics-Theory and Methods, Vol. 32(3), pp. 643-666, 2003.
- [Keating/Scott, 1999] Keating, J. P., & Scott, D. W.: A primer on density estimation for the great homerun race of 1998, Stats, Vol. 25, pp. 16-22, 1999.
- [Li, 2024] Li, S.: FNN: Fast Nearest Neighbor Search Algorithms and Applications (Version 1.1.4.1). Retrieved from https://CRAN.R-project.org/package=FNN, 2024.
- [Loizou/Maybank, 1987] Loizou, G., & Maybank, S. J.: The nearest neighbor and the bayes error rates, Ieee Transactions on Pattern Analysis and Machine Intelligence, Vol. (2), pp. 254-262, 1987.
- [Lux/Rinderle-Ma, 2023] Lux, M., & Rinderle-Ma, S.: DDCAL: Evenly Distributing Data into Low Variance Clusters Based on Iterative Feature Scaling, Journal of Classification, Vol., 10.1007/s00357-022-09428-6, pp., doi 10.1007/s00357-022-09428-6, 2023.
- [Meyer; et al. , 2024] Meyer, D., Dimitriadou, E., Hornik, K., Weingessel, A., Leisch, F., Chang, C.-C., & Lin, C.-C.: e1071: Misc Functions of the Department of Statistics, Probability Theory Group (Formerly: E1071), TU Wien (Version 1.7-16). Retrieved from https://CRAN.R-project.org/package=e1071, 2024.
- [Michal Majka, 2024] Michal Majka, R. C. T.: naivebayes: High Performance Implementation of the Naive Bayes Algorithm (Version 1.0.0), CRAN. Retrieved from https://CRAN.R-project.org/package=naivebayes, 2024.
- [Mitchell, 1997] Mitchell, T. M.: Machine Learning, (24th ed.), India, McGraw-Hill Education, ISBN: 978-1-25-909695-2, 1997.
- [Novo, 2022] Novo, D.: A comparison of spectral unmixing to conventional compensation for the calculation of fluorochrome abundances from flow cytometric data, Cytometry Part A, Vol. 101(11), pp. 885-891, 2022.
- [Pearson, 1901] Pearson, K.: LIII. On lines and planes of closest fit to systems of points in space, The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science, Vol. 2(11), pp. 559-572, 1901.
- [Plank; et al. , 2021] Plank, K., Dorn, C., & Krause, S. W.: The effect of erythrocyte lysing reagents on enumeration of leukocyte subpopulations compared with a no-lyse-no-wash protocol, International Journal of Laboratory Hematology, Vol. 43(5), pp. 939-947, 2021.
- [Rish, 2001] Rish, I.: An empirical study of the naive Bayes classifier, Vol. 3, Citeseer, Proc. IJCAI 2001 workshop on empirical methods in artificial intelligence, pp. 41-46, 2001.
- [Roever; et al. , 2023] Roever, C., Raabe, N., Luebke, K., Ligges, U., Szepannek, G., Zentgraf, M., & Meyer, D.: klaR: Classification and Visualization (Version 1.7-3), CRAN. Retrieved from https://CRAN.R-project.org/package=klaR 2023.
- [Scott, 2015] Scott, D. W.: Multivariate density estimation: theory, practice, and visualization, John Wiley & Sons, ISBN: 0471697559, 2015.
- [Shapiro, 2005] Shapiro, H. M.: Practical flow cytometry, John Wiley & Sons, ISBN: 0471434035, 2005.
- [Silverman, 1982] Silverman, B. W.: Algorithm AS 176: Kernel density estimation using the fast Fourier transform, Journal of the Royal Statistical Society. Series C (Applied Statistics), Vol. 31(1), pp. 93-99, 1982.
- [Silverman, 1984] Silverman, B. W.: Spline smoothing: the equivalent variable kernel method, The annals of Statistics, Vol., pp. 898-916, 1984.
- [Silverman, 1998] Silverman, B. W.: Density estimation for statistics and data analysis, London, Chapman and Hall, ISBN: 0-412-24620-1, 1998.
- [Thrun, 2018] Thrun, M. C.: Projection Based Clustering through Self-Organization and Swarm Intelligence, (Ultsch, A. & Hüllermeier, E. Eds. , 9: Extended doctoral dissertation, Heidelberg, Springer, ISBN: 978-3658205393, 2018.
- [Thrun, 2021] Thrun, M. C.: The Exploitation of Distance Distributions for Clustering, International Journal of Computational Intelligence and Applications Vol. 20(3), pp. 2150016, doi 10.1142/S1469026821500164, 2021.
- [Thrun; et al. , 2020] Thrun, M. C., Gehlert, T., & Ultsch, A.: Analyzing the Fine Structure of Distributions, PloS one, Vol. 15(10), pp. e0238835, doi 10.1371/journal.pone.0238835 2020.
- [Thrun; et al. , 2022] Thrun, M. C., Hoffman, J., Röhnert, M., Von Bonin, M., Oelschlägel, U., Brendel, C., & Ultsch, A.: Flow Cytometry datasets consisting of peripheral blood and bone marrow samples for the evaluation of explainable artificial intelligence methods, Data in Brief, Vol. 43, pp. 108382, doi 10.1016/j.dib.2022.108382, 2022.
- [Thrun/Märte, 2025] Thrun, M. C., & Märte, J.: Memshare: Memory Sharing for Multicore Computation in R with an Application to Feature Selection by Mutual Information using PDE, arXiv:2509.08632 (preprint), Vol., 10.48550/arXiv.2509.08632 pp., doi 10.48550/arXiv.2509.08632, 2025.
- [Thrun/Ultsch, 2018] Thrun, M. C., & Ultsch, A.: Effects of the payout system of income taxes to municipalities in Germany, in Papież, M. & Śmiech, S. (eds.), 12th Professor Aleksander Zelias International Conference on Modelling and Forecasting of Socio-Economic Phenomena, pp. 533-542, Cracow: Foundation of the Cracow University of Economics, Cracow, Poland, 2018.
- [Thrun/Ultsch, 2020] Thrun, M. C., & Ultsch, A.: Using Projection based Clustering to Find Distance and Density based Clusters in High-Dimensional Data, Journal of Classification, Vol. 38(2), pp. 280-312, doi 10.1007/s00357-020-09373-2, 2020.
- [Ultsch, 2003] Ultsch, A.Optimal density estimation in data containing clusters of unknown structure, technical report, 34,University of Marburg, Department of Mathematics and Computer Science, Report No., 2003.
- [Ultsch, 2005] Ultsch, A.: Pareto density estimation: A density estimation for knowledge discovery, In Baier, D. & Werrnecke, K. D. (Eds.), Innovations in classification, data science, and information systems, pp. 91-100, Berlin, Germany, Springer, doi, 2005.
- [Ultsch; et al. , 2024] Ultsch, A., Hoffman, J., Röhnert, M., Von Bonin, M., Oelschlägel, U., Brendel, C., & Thrun, M. C.: An Explainable AI System for the Diagnosis of High Dimensional Biomedical Data, BioMedInformatics, Vol. 4, pp. 197-218, doi 10.3390/biomedinformatics4010013, 2024.
- [Ultsch/Lötsch, 2015] Ultsch, A., & Lötsch, J.: Computed ABC Analysis for Rational Selection of Most Informative Variables in Multivariate Data, PloS one, Vol. 10(6), pp. e0129767, doi 10.1371/journal.pone.0129767, 2015.
- [Ultsch/Lötsch, 2022a] Ultsch, A., & Lötsch, J.: Euclidean distance-optimized data transformation for cluster analysis in biomedical data (EDOtrans), BMC bioinformatics, Vol. 23(1), pp. 233, 2022a.
- [Ultsch/Lötsch, 2022b] Ultsch, A., & Lötsch, J.: Robust classification using posterior probability threshold computation followed by Voronoi cell based class assignment circumventing pitfalls of Bayesian analysis of biomedical data, International Journal of Molecular Sciences, Vol. 23(22), pp. 14081, 2022b.
- [van den Heuvel/Zhan, 2022] van den Heuvel, E., & Zhan, Z.: Myths about linear and monotonic associations: Pearson’sr, Spearman’s ρ, and Kendall’s τ, The American Statistician, Vol. 76(1), pp. 44-52, 2022.
- [Wainberg et al., 2016] Wainberg, M., Alipanahi, B., & Frey, B. J.: Are random forests truly the best classifiers?, The Journal of Machine Learning Research, Vol. 17(1), pp. 3837-3841, 2016.
- [Zaidi et al., 2013] Zaidi, N. A., Cerquides, J., Carman, M. J., & Webb, G. I.: Alleviating naive Bayes attribute independence assumption by attribute weighting, Journal of Machine Learning Research, Vol. 14(Jul), pp. 1947-1988, 2013.


| N | DIM | Class No. | Cases per class | Pearson | Spearman | Kendall | XICOR | |
| Cell populations | 5121 | 14 | 13 | 1128, 312, 829, 525, 768, 122, 76, 135, 283, 630, 229, 31, 53 | 0-0.99 | 0-0.98 | 0-0.9 | 0-0.85 |
| Crabs (Sex) | 200 | 5 | 2 | 100, 100 | 0 | 0-0.07 | 0-0.03 | 0.01-0.24 |
| Crabs (SP) | 200 | 5 | 2 | 100, 100 | 0 | 0-0.07 | 0-0.03 | 0.01-0.24 |
| Dermatology | 358 | 34 | 6 | 111, 60, 71, 48, 48, 20 | 0-0.94 | 0-0.98 | 0-0.94 | 0-0.85 |
| Iris | 150 | 4 | 3 | 50, 50, 50 | 0.12-0.96 | 0.17-0.94 | 0.08-0.81 | 0.08-0.72 |
| MiceProtein | 1080 | 77 | 8 | 150, 150, 135, 135, 135, 135, 105, 135 | 0-1 | 0-1 | 0-1 | 0-0.99 |
| Penguins | 344 | 4 | 3 | 152, 68, 124 | 0 | 0-0.19 | 0-0.12 | 0-0.25 |
| Satellite | 6435 | 36 | 6 | 1533, 703, 1358, 626, 707, 1508 | 0-0.96 | 0-0.96 | 0.02-0.85 | 0.02-0.76 |
| Spam | 4601 | 57 | 2 | 2788, 1813 | 0-1 | 0-0.94 | 0-0.94 | 0-0.93 |
| Swiss | 200 | 6 | 2 | 100, 100 | 0.06-0.74 | 0.05-0.75 | 0.03-0.59 | 0.01-0.43 |
| Wine | 178 | 13 | 3 | 59, 71, 48 | 0-0.86 | 0.01-0.88 | 0.01-0.7 | 0-0.6 |
| WCBCD | 569 | 30 | 2 | 212, 357 | 0-1 | 0-1 | 0-0.99 | 0-0.97 |
| CoverType | 581012 | 55 | 7 | 211840, 283301, 35754, 2747, 9493, 17367, 20510 | 0-0.79 | 0-0.82 | NA | NA |
| LetterRecognition | 20000 | 16 | 26 | 796, 755, 805, 783, 773, 748, 766, 789, 747, 792, 787, 753, 758, 775, 736, 734, 752, 761, 803, 768, 764, 786, 783, 813, 739, 734 | 0-0.85 | 0-0.87 | NA | NA |
| PDENB | GNB | NPNB | 7NN | PyGNB | klaRGNB | klaRNPNB | e1071GNB | |||||||||
| Cell populations | 0.98 | 0 | 0.97 | 0 | 0.98 | 0 | 0.16 | 0 | 0.97 | 0 | 0.97 | 0 | 0.98 | 0 | 0.97 | 0 |
| CoverType | 0.44 | 0 | 0.22 | 0 | 0.41 | 0 | 0.86 | 0 | 0.22 | 0 | 0.22 | 0 | 0.41 | 0 | 0.22 | 0 |
| Crabs (Sex) | 0.91 | 0.1 | 0.92 | 0.1 | 0.92 | 0.1 | 0.81 | 0.1 | 0.92 | 0.1 | 0.92 | 0.1 | 0.92 | 0 | 0.92 | 0.1 |
| Crabs (SP) | 0.99 | 0 | 0.99 | 0 | 0.97 | 0 | 0.88 | 0.1 | 0.99 | 0 | 0.99 | 0 | 0.97 | 0 | 0.99 | 0 |
| Dermatology | 0.95 | 0 | 0.82 | 0 | 0.9 | 0.1 | 0.81 | 0.1 | 0.85 | 0 | NA | NA | 0.9 | 0.1 | 0.82 | 0 |
| Iris | 0.95 | 0.1 | 0.94 | 0.1 | 0.94 | 0.1 | 0.95 | 0 | 0.94 | 0.1 | 0.94 | 0.1 | 0.94 | 0.1 | 0.94 | 0.1 |
| LetterRecognition | 0.72 | 0.1 | 0.66 | 0.1 | 0.72 | 0.1 | 0.18 | 0.1 | 0.66 | 0.1 | 0.66 | 0.1 | 0.72 | 0.1 | 0.66 | 0.1 |
| MiceProtein | 0.84 | 0 | 0.75 | 0 | 0.84 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | 0.75 | 0 |
| Penguins | 0.98 | 0 | 0.98 | 0 | 0.97 | 0 | 0.97 | 0 | 0.98 | 0 | 0.98 | 0 | 0.97 | 0 | 0.98 | 0 |
| Spam | 0.68 | 0.1 | 0.52 | 0 | 0.38 | 0 | 0.58 | 0 | 0.68 | 0 | NA | NA | 0.37 | 0 | 0.52 | 0 |
| Satellite | 0.77 | 0 | 0.6 | 0 | 0.62 | 0 | 0.69 | 0 | 0.75 | 0 | 0.6 | 0 | 0.62 | 0 | 0.6 | 0 |
| Swiss | 0.98 | 0 | 0.99 | 0 | 0.99 | 0 | 0.98 | 0 | 0.99 | 0 | 0.99 | 0 | 0.99 | 0 | 0.99 | 0 |
| WCBCD | 0.89 | 0 | 0.89 | 0.1 | 0.87 | 0 | 0.93 | 0 | 0.89 | 0.1 | 0.89 | 0.1 | 0.88 | 0 | 0.89 | 0.1 |
| Wine | 0.97 | 0.1 | 0.96 | 0.1 | 0.96 | 0.1 | 0.95 | 0.1 | 0.96 | 0.1 | 0.96 | 0.1 | 0.96 | 0.1 | 0.96 | 0.1 |
| PDENB | PyGNB | NPNB | klaRNPNB | e1071GNB | GNB | kNN7 | klaRGNB | |
| Grade | 2.46 | 4 | 4.21 | 4.46 | 4.86 | 4.93 | 5.54 | 5.54 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).