Submitted:
27 April 2025
Posted:
28 April 2025
You are already at the latest version
Abstract
Keywords:
I. Introduction
II. Data Collection
III. Analysis and Result
A. Placeholder Subsection
- 1) Data Preprocessing and Feature Selection
- 2) Logistic Regression Methodology
- 3) Study Results
IV. Related Work
V. Conclusion
References
- D. Jaime, “Goblin: Neo4j maven central dependency graph,” Sep 2024. [Online]. [CrossRef]
- A. Abdellatif, Y. Zeng, M. Elshafei, E. Shihab, and W. Shang, “Simplifying the search of npm packages,” Information and Software Technology, vol. 126, p. 106365, 2020. [CrossRef]
- D. Jaime, J. E. Haddad, and P. Poizat, “Navigating and exploring software dependency graphs using Goblin,” in Proceedings of the International Conference on Mining Software Repositories (MSR), 2025.
- H. Borges, A. Hora, and M. T. Valente, “Understanding the factors that impact the popularity of GitHub repositories,” in Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME), 2016, pp. 334–344. [CrossRef]
- H. Borges and M. T. Valente, “What’s in a GitHub star? understanding repository starring practices in a social coding platform,” Journal of Systems and Software, pp. 112–129, 2018. [CrossRef]
- K. Aggarwal, A. Hindle, and E. Stroulia, “Co-evolution of project documentation and popularity within GitHub,” in Proceedings of the ACM/IEEE Working Conference on Mining Software Repositories (MSR), 2014, pp. 360–363. [CrossRef]
- J. Zhu, M. Zhou, and A. Mockus, “Patterns of folder use and project popularity: A case study of GitHub repositories,” in Proceedings of the ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM), 2014. [CrossRef]
- H. Sajnani, V. Saini, J. Ossher, and C. V. Lopes, “Is popularity a measure of quality? an analysis of Maven components,” in Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME), 2014, pp. 231–240. [CrossRef]
- T. Wang, S. Wang, and T.-H. P. Chen, “Study the correlation between the readme file of GitHub projects and their popularity,” J. Syst. Softw., Nov 2023. [CrossRef]
- A. Zerouali, T. Mens, G. Robles, and J. M. Gonzalez-Barahona, “On the diversity of software package popularity metrics: An empirical study of npm,” in Proceedings of the IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER), 2019, pp. 589–593. [CrossRef]
- J. Han, S. Deng, X. Xia, D. Wang, and J. Yin, “Characterization and prediction of popular projects on GitHub,” in Proceedings of the IEEE Annual Computer Software and Applications Conference (COMPSAC), 2019, pp. 21–26. [CrossRef]
- S. Mujahid, R. Abdalkareem, and E. Shihab, “What are the characteristics of highly-selected packages? a case study on the npm ecosystem,” Journal of Systems and Software, Apr 2023. [CrossRef]


| Top 20% | Bottom 20% | |||
|---|---|---|---|---|
| Feature | Source | Mean/Median/Min/Max | Mean/Median/Min/Max | -value & Cohen’s |
| Star Count | GitHub | 2162.74 / 415.0 / 31 / 229891 | 6.41 / 3.0 / 0 / 31 | 0.05 0.57 (M) |
| Fork Count | GitHub | 418.42 / 53.0 / 0 / 47070 | 3.12 / 1.0 / 0 / 194 | 0.05 0.33 (S) |
| Pull Requests | GitHub | 27.44 / 30.0 / 0 / 87 | 10.58 / 3.0 / 0 / 87 | 0.05 1.39 (L) |
| Subscriber Count | GitHub | 60.99 / 18.0 / 0 / 6616 | 4.67 / 3.0 / 0 / 418 | 0.05 0.48 (M) |
| License | GitHub | ———————— | ———————— | —————– |
| Tags Count | GitHub | 23.56 / 30.0 / 0 / 30 | 10.35 / 5.0 / 0 / 30 | 0.05 1.26 (L) |
| Open Issues Count | GitHub | 18.64 / 23.0 / 0 / 30 | 3.41 / 0.0 / 0 / 30 | 0.05 1.58 (L) |
| Closed Issues Count | GitHub | 27.77 / 30.0 / 0 / 30 | 11.06 / 4.0 / 0 / 30 | 0.05 1.65 (L) |
| Contributors Count | GitHub | 18.22 / 20.0 / 0 / 30 | 3.84 / 2.0 / 0 / 30 | 0.05 1.60 (L) |
| Commits Count | GitHub | 29.66 / 30.0 / 0 / 30 | 25.57 / 30.0 / 0 / 30 | 0.05 0.65 (M) |
| README Exists | GitHub | ———————— | ———————— | —————– |
| About Info | GitHub | 5.59 / 5.0 / 1 / 71 | 4.33 / 4.0 / 0 / 47 | 0.05 0.31 (S) |
| Closed Issues Percentage | GitHub | 0.64 / 0.52 / 0.0 / 1.0 | 0.55 / 0.70 / 0.0 / 1.0 | 0.05 0.26 (S) |
| Usages | Maven Neo4j | 406.84 / 1.0 / 0 / 1716315 | 19.45 / 0.0 / 0 / 64281 | 0.05 0.05 (N) |
| Dependencies | Maven Neo4j | 5.40 / 4.0 / 0 / 271 | 0.55 / 0.70 / 0.0 / 1.0 | 0.05 0.06 (N) |
| Popularity 1 Year | Maven Neo4j | 9.39 / 0.0 / 0 / 89815 | 1.33 / 0.0 / 0 / 5516 | 0.05 0.02 (N) |
| Release Frequency | Maven Neo4j | 0.09 / 0.02 / 0.0 / 122.4 | 0.11 / 0.01 / 0.0 / 10.0 | 0.05 -0.04 (N) |
| Release Count | Maven Neo4j | 31.12 / 8.0 / 1 / 1288 | 9.09 / 3.0 / 1 / 691 | 0.05 0.36 (S) |
| Vulnerabilities | Maven Neo4j | 0.01 / 0.0 / 0.0 / 20.5 | 0.00 / 0.0 / 0.0 / 165.0 | 0.05 0.00 (N) |
| Metric | Wald | p-Value | Significance |
|---|---|---|---|
| License | 1326.15 | < 0.0001 | *** |
| Commits Count | 1066.31 | < 0.0001 | *** |
| README Exists | 336.01 | < 0.0001 | *** |
| About Info | 658.34 | < 0.0001 | *** |
| Dependencies | 13.21 | 0.0003 | *** |
| Usages | 475.63 | < 0.0001 | *** |
| Closed Issues Percentage | 9.26 | 0.0023 | ** |
| Release Frequency | 11.00 | 0.0009 | ** |
| Vulnerabilities | 3.39 | 0.066 | . |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).