Submitted:
03 July 2025
Posted:
07 July 2025
You are already at the latest version
Abstract
Keywords:

1. Introduction and Literature Review
2. Data and Research Methodology
2.1. Data Collection and Pre-Processing
- Innovative: these are the firms that have reported new developments, such as introducing a new product or service to the market, implementing a new production process or method, adopting new management practices, or exploring new ways of selling goods or services.
- Fast growing: these are the cases where the annual turnover increases by more than 20%.
- Open to equity financing: whether it reported equity as either a relevant funding source or one used in the past six months.
2.2. Machine Learning Algorithms

- Logistic Regression
- K-Nearest Neighbor
- Random Forest
- Support Vector Machines with RBF and linear kernel
- Naïve Bayes Classifier
- AdaBoost
- Easy Ensemble
- Balanced Bagging Classifier
- Gradient Boosting Trees
2.3. Forecasting Performance Metrics
- True Positives (TP): The number of instances that the model correctly predicts the positive class (in our case the SME to be predicted as investment-ready, class 1).
- True Negatives (TN): The number of instances that the model correctly predicts the negative class (in our case the SME to be predicted as non-investment-ready, class 0).
- False Positives (FP): The number of instances that the model incorrectly predicts the positive class when the actual class is negative.
- False Negatives (FN): The number of instances that the model incorrectly predicts the negative class when the actual is positive.


2.4. Variable Importance Measure (VIM)
3. Empirical Results



- 267 TP predictions (investment-ready) out of 374, or 71.3%.
- 1440 TN predictions (not-investment-ready) out of 1814 or 79.3%.
- 374 FP predictions (predicted as investment-ready but are not) or 20.6%
- 107 FN predictions (predicted as not-investment-ready but are not) or 28.6%.
- The most important predictor appears to be the firm’s “confidence in negotiations with equity investors or venture capital firms”. High levels of negotiation confidence likely reflect firm preparation, understanding of investor expectations, and stronger business fundamentals, all of which are critical traits for attracting external investment.
- The second most important predictor is “financing growth”, which indicates that the firms that are actively seeking and managing financial growth, tend to be investment ready. This highlights the importance of proactive financial planning and scaling strategies as indicators of a firm’s investment potential.
- “Factors in the future of financing of the firm” is ranked third in top predictors. This variable points to the importance of future financial planning. Investors may favor firms that not only demonstrate current performance but also show foresight in securing future financing.
- “External financing factors” including market conditions or access to funding channels is also an essential feature that influences the model’s decision-making process. Firms capable of navigating external factors-influences may be more successful in attracting external investors.
- “Autonomous organization type”, relating to the structure of the organization plays the fourth more important role on predicting an investment ready SME. Autonomous firms might be more agile and able to innovate and thus they draw investors’ interest.
- “Willingness of investors to invest in the enterprise”, relating to the investors’ sentiment towards the firm.


4. Conclusion
Appendix A
| Model | TN | FP | FN | TP |
| Gradient Boosting | 1440 | 374 | 107 | 267 |
| Logistic Regression | 1413 | 401 | 104 | 270 |
| Easy Ensemble Classifier | 1406 | 408 | 106 | 268 |
| Balanced Random Forest Classifier | 1386 | 428 | 108 | 266 |
| Random Forest | 1506 | 308 | 136 | 238 |
| SVC | 1457 | 357 | 127 | 247 |
| Balanced SVC | 1368 | 446 | 111 | 263 |
| AdaBoost | 1393 | 421 | 124 | 250 |
| MultinomialNB | 1352 | 462 | 121 | 253 |
| Balanced MultinomialNB | 1354 | 460 | 123 | 251 |
| Balanced KNeighbors | 1396 | 418 | 155 | 219 |
| KNeighbors | 1745 | 69 | 321 | 53 |
| Model | Balanced Accuracy | ROC-AUC |
| Gradient Boosting | 0.754 | 0.815 |
| Logistic Regression | 0.750 | 0.811 |
| Easy Ensemble Classifier | 0.746 | 0.804 |
| Balanced Random Forest Classifier | 0.738 | 0.810 |
| Random Forest | 0.733 | 0.807 |
| SVC | 0.732 | 0.793 |
| Balanced SVC | 0.729 | 0.800 |
| AdaBoost | 0.718 | 0.794 |
| MultinomialNB | 0.711 | 0.790 |
| Balanced MultinomialNB | 0.709 | 0.789 |
| Balanced KNeighbors | 0.678 | 0.743 |
| KNeighbors | 0.552 | 0.640 |
| Model |
Precision class 0 |
Recall class 0 |
F1 class 0 |
Precision class 1 |
Recall class 1 |
F1 Class 1 |
| Gradient Boosting | 0.931 | 0.794 | 0.857 | 0.417 | 0.714 | 0.526 |
| Logistic Regression | 0.931 | 0.779 | 0.848 | 0.402 | 0.722 | 0.517 |
| Easy Ensemble Classifier | 0.930 | 0.775 | 0.845 | 0.396 | 0.717 | 0.510 |
| Balanced Random Forest Classifier | 0.928 | 0.764 | 0.838 | 0.383 | 0.711 | 0.498 |
| Random Forest | 0.917 | 0.830 | 0.872 | 0.436 | 0.636 | 0.517 |
| SVC | 0.920 | 0.803 | 0.858 | 0.409 | 0.660 | 0.505 |
| Balanced SVC | 0.925 | 0.754 | 0.831 | 0.371 | 0.703 | 0.486 |
| AdaBoost | 0.918 | 0.768 | 0.836 | 0.373 | 0.668 | 0.478 |
| MultinomialNB | 0.918 | 0.745 | 0.823 | 0.354 | 0.676 | 0.465 |
| Balanced MultinomialNB | 0.917 | 0.746 | 0.823 | 0.353 | 0.671 | 0.463 |
| Balanced KNeighbors | 0.900 | 0.770 | 0.830 | 0.344 | 0.586 | 0.433 |
| KNeighbors | 0.845 | 0.962 | 0.899 | 0.434 | 0.142 | 0.214 |










References
- Commission Recommendation of 6 May 2003 concerning the definition of micro, small and medium-sized enterprises (notified under document number C(2003) 1422), Official Journal L 124 , 20/05/2003 P. 0036 – 0041, https://eur-lex.europa.eu/eli/reco/2003/361/oj.
- Chonsawat, N., & Sopadang, A. (2020). Defining SMEs’ 4.0 Readiness Indicators. Applied Sciences, 10(24), 8998. [CrossRef]
- Cusolito, A. P., Dautovic, E., & McKenzie, D. (2021). Can government intervention make firms more investment-ready? A randomized experiment in the Western Balkans. Review of Economics and Statistics, 103(3), 428–442. [CrossRef]
- Douglas, E. J., & Shepherd, D. A. (2002). Exploring investor readiness: Assessments by entrepreneurs and investors in Australia. Venture Capital, 4(3), 219–236. [CrossRef]
- Du, J., & Rada, R. (2010). Machine learning and financial investing. In Handbook of Research on Machine Learning Applications and Trends: Algorithms, Methods, and Techniques (pp. 375–398). IGI Global. [CrossRef]
- Dumitrescu, E., Hué, S., Hurlin, C., & Tokpavi, S. (2021). Machine learning or econometrics for credit scoring: Let’s get the best of both worlds. HAL Working Paper hal-02507499. Available at: https://hal.archives-ouvertes.fr/hal-02507499.
- European Central Bank (ECB) (2023). Survey on the Access to Finance of Enterprises (SAFE) Dataset. Retrieved from https://www.ecb.europa.eu/stats/ecb_surveys/safe/html/data.en.html.
- Fellnhofer, K. (2015). Literature review: Investment readiness level of small and medium-sized companies. International Journal of Managerial and Financial Accounting, 7(3/4), 268–284. [CrossRef]
- Friedman, J. H. (2001). Greedy function approximation: A gradient boosting machine. The Annals of Statistics, 29(5), 1189–1232. [CrossRef]
- Friedman, J. H. (1999). Stochastic Gradient Boosting. Technical Report. Stanford University.
- Khan, A. H., Shah, A., Ali, A., Shahid, R., Zahid, Z. U., Sharif, M. U., Jan, T., & Zafar, M. H. (2023). A performance comparison of machine learning models for stock market prediction with novel investment strategy. PLOS ONE, 18(9), e0286362. [CrossRef]
- Li, M. (2023). Financial investment risk prediction under the application of information interaction Firefly Algorithm combined with Graph Convolutional Network. PLOS ONE, 18(9), e0291510. [CrossRef]
- Mason, C. M., & Harrison, R. T. (2001). "Investment readiness": A critique of government proposals to increase the demand for venture capital. Regional Studies, 35(7), 663–668. [CrossRef]
- Mason, C. M., & Kwok, J. (2010). Investment readiness programmes and access to finance: A critical review of design issues. Working Paper 10-03, Hunter Centre for Entrepreneurship, University of Strathclyde.
- Mullainathan, S., & Spiess, J. (2017). Machine learning: An applied econometric approach. Journal of Economic Perspectives, 31(2), 87–106. [CrossRef]
- Okfalisa, et al. (2021). Measuring the effects of different factors influencing the readiness of SMEs towards digitalization: A multiple perspectives design of decision support system. Decision Science Letters, 10, 425–442. [CrossRef]
- Owen, R., Botelho, T., Hussain, J., & Anwar, O. (2023). Solving the SME finance puzzle: An examination of demand and supply failure in the UK. Venture Capital, 25(1), 31–63. [CrossRef]
- Pirola, F., Cimini, C., & Pinto, R. (2020). Digital readiness assessment of Italian SMEs: a case-study research. Journal of Manufacturing Technology Management, 31(5), 1045–1083. [CrossRef]
- Shobana, G., & Umamaheswari, K. (2021). Forecasting by machine learning techniques and econometrics: A review. In Proceedings of the Sixth International Conference on Inventive Computation Technologies (ICICT 2021). IEEE Xplore.
- Vasilescu, L. G. (2009). ‘Investment readiness’—aligning capital demand and supply. [Details needed from original publication].
- World Bank (2023). Small and Medium Enterprises (SMEs) Finance – Improving SMEs’ access to finance and finding innovative solutions to unlock sources of capital. Retrieved from: https://www.worldbank.org.
- Zana, D., & Barnard, B. (2019). Venture capital and entrepreneurship: The cost and resolution of investment readiness. SSRN Electronic Journal. Available at SSRN: . [CrossRef]
- Norvig, Russel (2020). Artificial Intelligence. A Modern Approach (4rth Edition) Pearson Series.
- Cover, T.M., & Hart, P.E. (1967). Nearest neighbor pattern classification. IEEE Trans. Inf. Theory, 13, 21-27.
- L. Breiman. (2001) Random Forests, Machine Learning, 45(1), 5-32.
- Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine Learning, 20(3), 273–297.
- Cox, D.R. (1958) The regression analysis of binary sequences. Journal of the Royal Statistical Society: Series B (Methodological), 20(2), pp.215–232.
- Freund, Y., & Schapire, R.E. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. European Conference on Computational Learning Theory.
- Liu, X., Wu, J., & Zhou, Z. (2009). Exploratory Undersampling for Class-Imbalance Learning. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 39, 539-550.
- Breiman, L. (1996). Bagging predictors. Machine Learning, 24(2), 123–140.
- C.D. Manning, P. Raghavan and H. Schuetze (2008). Introduction to Information Retrieval. Cambridge University Press, pp. 234-265.
- Brodersen, K.H.; Ong, C.S.; Stephan, K.E.; Buhmann, J.M. (2010). The balanced accuracy and its posterior distribution. Proceedings of the 20th International Conference on Pattern Recognition, 3121-24.
- Gogas, P.; Papadimitriou, T.; Sofianos, E. (2019) Money Neutrality, Monetary Aggregates and Machine Learning. Algorithms, 12,137.
- Peykani, P., Peymany Foroushany, M., Tanasescu, C., Sargolzaei, M., & Kamyabfar, H. (2025). Evaluation of Cost-Sensitive Learning Models in Forecasting Business Failure of Capital Market Firms. Mathematics, 13(3), 368. ECB Data References: European Central Bank (2023). Survey on the Access to Finance of Enterprises (SAFE). Retrieved from: https://www.ecb.europa.eu/stats/ecb_surveys/safe/html/data.en.html. ECB (2023). SAFE Methodology Documentation. Retrieved from: https://data.ecb.europa.eu/data/datasets/SAFE/structure. ECB (2023). Dataset Structure Documentation. Retrieved from: https://data.ecb.europa.eu/data/datasets/SAFE/structure. ECB (2023). SAFE Data Information. Retrieved from: https://data.ecb.europa.eu/data/datasets/SAFE/data-information. [CrossRef]




| Model | FP | FN | TMC |
| Gradient Boosting | 438 | 94 | 908 |
| Logistic Regression | 379 | 107 | 914 |
| Easy Ensemble Classifier | 409 | 103 | 924 |
| Random Forest | 441 | 99 | 936 |
| AdaBoost | 386 | 111 | 941 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).