Submitted:
15 July 2023
Posted:
17 July 2023
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Principal Component Analysis
2.2. Bootstrap
2.3. Principal Components Analysis with bootstrap confidence interval
2.4. Approaches using bootstrap
- The maximum eigenvalue can be found by repeatedly resampling dataset of (B) times. Using the resampled dataset I's covariance matrices.
- The difference between the true and bootstrap eigenvalues (d) should be calculated.
- A 0.025 percentile and a b=0.975 percentile of d are needed to calculate the confidence interval 95%. The appropriate confidence interval is (true-b, true+a).
3. Results
3.1. Protein data
3.2. Chemical data
4. Discussion
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Einbeck, J.; Kalantan, Z.; Kruger, U. Practical Considerations on Nonparametric Methods for Estimating Intrinsic Dimensions of Nonlinear Data Structures. International Journal of Pattern Recognition and Artificial Intelligence 2020. Vol. 34, No. 9, 2058010.
- Pearson, K. LIII. On lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science 1901, 2(11), 559-572.
- Fisher, R.; Mackenzie, W. Studies in crop variation. II. The manurial response of different potato varieties, Journal of Agricultural Science 1923, 13 (1923) 311-32.
- Wold, H. (1973). Nonlinear estimation by iterative least squares procedures, in F. David (Editor), Research Papers in Statistics, Wiley, New York, 1966, pp. 411-444.
- Hotelling, H. (1933) Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology, 24, 417-441.DOI.org/10.1037/h0071325.
- Rao, Y. (1964). Multivariate Statistical Data Analysis- Principal Component Analysis (PCA). DOI: 10.5455/ijlr.20170415115235. [CrossRef]
- Alqahtani, N. and Kalantan, Z.I. (2020). Gaussian Mixture Models Based on Principal Component and Application. Mathematical Problems in Engineering. Hindawi. Vol.2020. ID 1202307. https://doi.org/10.1155/2020/1202307. [CrossRef]
- Efron, B (1979). Bootstrap methods: Another Look at the Jackknife. The Annals of Statistics,7, 1-26. DOI: 10.1007/978-1-4612-4380-9_41. [CrossRef]
- Efron, B., & Tibshirani, R. (1993). An Introduction to the Bootstrap. Chapman and Hall. DOI:10.1201/9780429246593. [CrossRef]
- Jaki, T., Su, T., Kim, M., & Lee Van Horn, M. (2017). An Evaluation of the bootstrap for model validation in mixture models. Communications in Statistics-Simulation and Computation, 0, 1-11. DOI:10.1080/03610918.2017.1303726. [CrossRef]
- Fang, Y., & Wang, J. (2012). Selection of the number of clusters via the bootstrap method. Computation Statistics and Data Analysis,56, 468-477. DOI: 10.1016/j.csda.2011.09.003. [CrossRef]
- Binhimd, S., & Coolen, F. (2020). Nonparametric Predictive Inference Bootstrap with Application to Reproducibility of the Two-Sample Kolmogorov–Smirnov Test. Journal of Statistical Theory and Practice. DOI:10.1007/s42519-020-00097-5. [CrossRef]
- Binhimd, S., & Almalki, B. (2019). Bootstrap Methods and Reproducibility Probability. American Scientific Research Journal for Engineering, Technology, and Sciences (ASRJETS).
- Daudin, J. J., Duby, C., & Trecourt, P. (1988). Stability of principal component analysis studied by the bootstrap method. Statistics: A journal of theoretical and applied statistics, 19(2), 241-258.
- Chateau, F., & Lebart, L. (1996). Assessing sample variability in the visualization techniques related to principal component analysis: bootstrap and alternative simulation methods. In COMPSTAT: Proceedings in Computational Statistics 12th Symposium held in Barcelona, Spain, 1996 (pp. 205-210). Physica-Verlag HD.
- Linting, M., Meulman, J. J., Groenen, P. J., & Van der Kooij, A. J. (2007). Stability of nonlinear principal components analysis: An empirical study using the balanced bootstrap. Psychological methods, 12(3), 359.
- Hennig, C. (2007). Cluster-wise assessment of cluster stability. Computational Statistics & Data Analysis, 52(1), 258-271.








| Variable (food) | |||||
|---|---|---|---|---|---|
| Red_Meat | -0.303 | -0.056 | -0.298 | -0.646 | 0.322 |
| White_Meat | -0.311 | -0.237 | 0.624 | 0.037 | -0.300 |
| Eggs | -0.427 | -0.035 | 0.182 | -0.313 | 0.079 |
| Milk | -0.378 | -0.185 | -0.386 | 0.003 | -0.200 |
| Fish | -0.136 | 0.647 | -0.321 | 0.216 | -0.290 |
| Cereal | 0.438 | -0.233 | 0.096 | 0.006 | 0.238 |
| Starch | -0.297 | 0.353 | 0.243 | 0.337 | 0.736 |
| Nuts | 0.420 | 0.143 | -0.054 | -0.330 | 0.151 |
| Fruits_Vegetables | 0.110 | 0.536 | 0.407 | -0.462 | -0.234 |
| Initial | Bootstrap Mean | CI- P2.5 | CI- P97.5 | CI- MEI | CI- MES | |
|---|---|---|---|---|---|---|
| Dim 1 | -44.516 | 46.954 | 38.577 | 56.741 | 38.041 | 55.867 |
| Dim 2 | -18.167 | 19.771 | 14.508 | 26.407 | 13.628 | 25.913 |
| Dim 3 | 12.532 | 13.015 | 9.343 | 16.828 | 9.219 | 16.811 |
| Dim 4 | 10.607 | 8.861 | 5.597 | 12.260 | 5.457 | 12.265 |
| Dim 5 | 5.154 | 5.028 | 3.119 | 7.589 | 2.750 | 7.306 |
| Dim 6 | 3.613 | 3.120 | 1.886 | 4.725 | 1.684 | 4.555 |
| Dim 7 | 3.018 | 1.872 | 0.946 | 3.039 | 0.778 | 2.967 |
| Dim 8 | 1.292 | 0.938 | 0.370 | 1.524 | 0.349 | 1.527 |
| Dim 9 | 1.101 | 0.441 | 0.080 | 0.953 | -0.001 | 0.884 |
| Variable | |||||
|---|---|---|---|---|---|
| pH | 0.094 | 0.573 | 0.365 | 0.082 | 0.024 |
| ALKALINITYmeql | 0.395 | -0.044 | -0.077 | -0.199 | 0.127 |
| CO2free | -0.085 | -0.496 | -0.486 | 0.216 | 0.013 |
| NNH4mgl | 0.314 | -0.202 | -0.012 | -0.475 | 0.504 |
| NNO3mgl | 0.276 | -0.110 | 0.258 | 0.726 | 0.414 |
| SRPmglP | 0.403 | -0.029 | -0.052 | 0.062 | -0.363 |
| TPmgl | 0.378 | 0.112 | -0.205 | 0.050 | -0.364 |
| TSSmgl | -0.106 | 0.351 | -0.566 | 0.334 | 0.160 |
| CONDUCTIVITYmScm | 0.402 | -0.086 | 0.016 | 0.069 | 0.222 |
| TSPmglP | 0.403 | 0.005 | -0.077 | 0.053 | -0.362 |
| Chlorophyllamgl | 0.114 | 0.476 | -0.432 | -0.167 | 0.295 |
| Initial | Bootstrap Mean | CI- P2.5 | CI- P97.5 | CI- MEI | CI- MES | |
|---|---|---|---|---|---|---|
| Dim 1 | 51.067 | 51.319 | 49.330 | 53.395 | 49.308 | 53.330 |
| Dim 2 | 18.355 | 18.465 | 17.150 | 20.068 | 16.997 | 19.932 |
| Dim 3 | 12.961 | 12.879 | 11.529 | 14.170 | 11.540 | 14.217 |
| Dim 4 | 6.191 | 6.284 | 5.262 | 7.424 | 5.170 | 7.399 |
| Dim 5 | 4.014 | 4.031 | 3.289 | 4.844 | 3.240 | 4.822 |
| Dim 6 | 3.296 | 3.118 | 2.516 | 3.700 | 2.519 | 3.716 |
| Dim 7 | 1.767 | 1.766 | 1.465 | 2.185 | 1.419 | 2.112 |
| Dim 8 | 1.476 | 1.382 | 1.033 | 1.693 | 1.049 | 1.715 |
| Dim 9 | 0.640 | 0.539 | 0.248 | 0.793 | 0.225 | 0.853 |
| Dim 10 | 0.197 | 0.184 | 0.108 | 0.295 | 0.092 | 0.277 |
| Dim 11 | 0.036 | 0.034 | 0.019 | 0.054 | 0.017 | 0.051 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).