Submitted:
20 December 2023
Posted:
21 December 2023
Read the latest preprint version here
Abstract
Keywords:
1. INTRODUCTION


- Exploring existing methodologies to design an effective recommender system in an e-learning environment
- Analyzing existing methodologies and comparing them to identify the most promising area for the future.
- Identify knowledge gaps in the existing literature and identify future challenges.
- Identifying the objective of each research and comparing objectives of existing literature.
- Evaluate the performance of existing methodologies and compare them with each other to identify the one methodology that is outperforming others
2. Survey Methodology
3. Literature Review
3.1. Graph-Based Techniques
3.2. Content-Based and Collaborative Filtering
3.1.1. Content-Based Filtering
3.1.2. Collaborative Filtering
3.1.3. Matrix Factorization
3.3. Search Query Based Recommendations
3.4. Educational Recommender System Using Big Data
3.5. Recommendations Using Forum Posts
| Reference | Methodology | Performance measure | Results |
| [53] | Fuzzy linguistics and ontology-based approach | User satisfaction | 88% |
| [55] | Semantic analysis and collaborative filtering | MAE | 0.25 |
| [56] | Semantic analysis and SVM | Precision | 0.59 |
| Recall | 0.65 | ||
| Accuracy | 65% |
3.6. Other Emerging Recommendation Techniques in Education
3.7. Critical Analysis
4. Comparative Analysis
4.1. Objective Based Clustering
4.2. Dataset analysis
4.3. Performance analysis
4.4. Comparison of Best Scheme From Each Category Existing Schemes
5. Research Challenges and Gaps
5.1. The Complex Relationship Between LOs
5.2. Cold Start Problem
5.3. Increased Processing Time
5.4. Limited Availability of Public Data
5.5. Sematic Study
6. Optimal solution
7. Proposed Solution
8. Conclusion
References
- Goldberg D, Nichols D, Oki BM, Terry D (1992) Using collaborative filtering to weave an information tapestry. Commun ACM 35:61–70.
- Schafer B, Konstan J, Riedl J (1999) Recommender Systems in E-Commerce. 1st ACM Conference on Electronic Commerce, Denver, Colorado, United States. [CrossRef]
- Kim H-T, Kim E, Lee J-H, Ahn CW (2010) A recommender system based on genetic algorithm for music data. In: 2010 2nd International Conference on Computer Engineering and Technology. pp V6-414-V6-417.
- Azaria A, Hassidim A, Kraus S, Eshkol A, Weintraub O, Netanely I (2013) Movie recommender system for profit maximization. In: Proceedings of the 7th ACM conference on Recommender systems. Association for Computing Machinery, New York, NY, USA, pp 121–128.
- Gomez-Uribe CA, Hunt N (2016) The Netflix Recommender System: Algorithms, Business Value, and Innovation. ACM Trans Manage Inf Syst 6:13:1-13:19.
- Ragab AHM, Mashat AFS, Khedra AM (2012) HRSPCA: Hybrid recommender system for predicting college admission. In: 2012 12th International Conference on Intelligent Systems Design and Applications (ISDA). pp 107–113.
- Natividad MCB, Gerardo BD, Medina RP (2019) A fuzzy-based career recommender system for senior high school students in K to 12 education. In: IOP Conference Series: Materials Science and Engineering. IOP Publishing, p 012025.
- Estrela D, Batista S, Martinho D, Marreiros G (2017) A Recommendation System for Online Courses. In: Advances in Information Systems and Technologies. pp 195–204.
- UNICEF (2021) COVID-19 and School Closures: One year of education disruption. In: UNICEF DATA. https://data.unicef.org/resources/one-year-of-covid-19-and-school-closures/. Accessed 16 Sep 2021.
- Impey C, Formanek M (2021) MOOCS and 100 Days of COVID: Enrollment surges in massive open online astronomy classes during the coronavirus pandemic. Social Sciences & Humanities Open 4:100177.
- Rafiq MS, Jianshe X, Arif M, Barra P (2021) Intelligent query optimization and course recommendation during online lectures in E-learning system. J Ambient Intell Human Comput. [CrossRef]
- Gupta S, Sabitha AS (2019) Deciphering the attributes of student retention in massive open online courses using data mining techniques. Educ Inf Technol 24:1973–1994.
- Gamage D, Fernando S, Perera I (2015) Factors leading to an effective MOOC from participiants perspective. In: 2015 8th International Conference on Ubi-Media Computing (UMEDIA). pp 230–235.
- Rawat B, Samriya JK, Pandey N, Wariyal SC (2020) Enriching ‘user item rating matrix’ with resource description framework for improving the accuracy of recommendation in E-learning environment. Materials Today: Proceedings. [CrossRef]
- Nabizadeh AH, Gonçalves D, Gama S, Jorge J, Rafsanjani HN (2020) Adaptive learning path recommender approach using auxiliary learning objects. Computers & Education 147:103777.
- Niknam M, Thulasiraman P (2020) LPR: A bio-inspired intelligent learning path recommendation system based on meaningful learning theory. Education and Information Technologies 25:3797–3819.
- Mu R (2018) A survey of recommender systems based on deep learning. Ieee Access 6:69009–69022.
- Nabizadeh AH, Leal JP, Rafsanjani HN, Shah RR (2020) Learning path personalization and recommendation methods: A survey of the state-of-the-art. Expert Systems with Applications 159:113596.
- Peška L, Tashu TM, Horváth T (2019) Swarm intelligence techniques in recommender systems - A review of recent research. Swarm and Evolutionary Computation 48:201–219.
- Nikzad–Khasmakhi N, Balafar MA, Reza Feizi–Derakhshi M (2019) The state-of-the-art in expert recommendation systems. Engineering Applications of Artificial Intelligence 82:126–147.
- George G, Lal AM (2019) Review of ontology-based recommender systems in e-learning. Computers & Education 142:103642.
- Obeidat R, Duwairi R, Al-Aiad A (2019) A collaborative recommendation system for online courses recommendations. In: 2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML). IEEE, pp 49–54.
- Zhang M, Zhu J, Wang Z, Chen Y (2019) Providing personalized learning guidance in MOOCs by multi-source data analysis. World Wide Web 22:1189–1219.
- Corbett AT, Anderson JR (1994) Knowledge tracing: Modeling the acquisition of procedural knowledge. User modeling and user-adapted interaction 4:253–278.
- Wan S, Niu Z (2018) An e-learning recommendation approach based on the self-organization of learning resource. Knowledge-Based Systems 160:71–87.
- Muangprathub J, Boonjing V, Chamnongthai K (2020) Learning recommendation with formal concept analysis for intelligent tutoring system. Heliyon 6:e05227.
- Zhou Y, Huang C, Hu Q, Zhu J, Tang Y (2018) Personalized learning full-path recommendation model based on LSTM neural networks. Information Sciences 444:135–152.
- Hochreiter S, Schmidhuber J (1997) Long Short-Term Memory. Neural Computation 9:1735–1780.
- Dorigo M, Maniezzo V, Colorni A (1996) Ant system: optimization by a colony of cooperating agents. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 26:29–41.
- Well-Separated Clusters and Optimal Fuzzy Partitions: Journal of Cybernetics: Vol 4, No 1. https://www.tandfonline.com/doi/abs/10.1080/01969727408546059. Accessed 18 Feb 2023.
- Shi D, Wang T, Xing H, Xu H (2020) A learning path recommendation model based on a multidimensional knowledge graph framework for e-learning. Knowledge-Based Systems 195:105618.
- Balabanović M, Shoham Y (1997) Fab: content-based, collaborative recommendation. Commun ACM 40:66–72.
- Resnick P, Iacovou N, Suchak M, Bergstrom P, Riedl J (1994) GroupLens: an open architecture for collaborative filtering of netnews. In: Proceedings of the 1994 ACM conference on Computer supported cooperative work. Association for Computing Machinery, New York, NY, USA, pp 175–186.
- Xiao J, Wang M, Jiang B, Li J (2018) A personalized recommendation system with combinational algorithm for online learning. Journal of Ambient Intelligence and Humanized Computing 9:667–677.
- Neamah AA, El-Ameer AS (2018) Design and Evaluation of a Course Recommender System Using Content-Based Approach. In: 2018 International Conference on Advanced Science and Engineering (ICOASE). IEEE, pp 1–6.
- Tarus JK, Niu Z, Kalui D (2018) A hybrid recommender system for e-learning based on context awareness and sequential pattern mining. Soft Computing 22:2449–2461.
- Agrawal R, Imieliński T, Swami A (1993) Mining association rules between sets of items in large databases. In: Proceedings of the 1993 ACM SIGMOD international conference on Management of data. Association for Computing Machinery, New York, NY, USA, pp 207–216.
- Klašnja-Milićević A, Ivanović M, Vesin B, Budimac Z (2018) Enhancing e-learning systems with personalized recommendation based on collaborative tagging techniques. Appl Intell 48:1519–1535.
- De Medio C, Limongelli C, Sciarrone F, Temperini M (2020) MoodleREC: A recommendation system for creating courses using the moodle e-learning platform. Computers in Human Behavior 104:106168.
- Zhang H, Huang T, Lv Z, Liu S, Yang H (2019) MOOCRC: A highly accurate resource recommendation model for use in MOOC environments. Mobile Networks and Applications 24:34–46.
- Hinton GE, Osindero S, Teh Y-W (2006) A fast learning algorithm for deep belief nets. Neural Comput 18:1527–1554.
- Rumelhart DE, McClelland JL (1987) Information Processing in Dynamical Systems: Foundations of Harmony Theory. In: Parallel Distributed Processing: Explorations in the Microstructure of Cognition: Foundations. MIT Press, pp 194–281.
- Pang Y, Liao C, Tan W, Wu Y, Zhou C (2018) Recommendation for MOOC with learner neighbors and learning series. In: International Conference on Web Information Systems Engineering. Springer, pp 379–394.
- Agrawal R, Srikant R (1994) Fast Algorithms for Mining Association Rules. pp 487–499.
- Kaufman L, Rousseeuw P (1990) Finding Groups in Data: An Introduction To Cluster Analysis. Wiley, New York ISBN 0-471-87876-6. [CrossRef]
- Murad DF, Heryadi Y, Isa SM, Budiharto W (2020) Personalization of study material based on predicted final grades using multi-criteria user-collaborative filtering recommender system. Educ Inf Technol 25:5655–5668.
- Segal A, Gal K, Shani G, Shapira B (2019) A difficulty ranking approach to personalization in E-learning. International Journal of Human-Computer Studies 130:261–272.
- Symeonidis P, Malakoudis D (2019) Multi-modal matrix factorization with side information for recommending massive open online courses. Expert Systems with Applications 118:261–271.
- Lalitha TB, Sreeja PS (2020) Personalised Self-Directed Learning Recommendation System. Procedia Computer Science 171:583–592.
- Zhang H, Huang T, Lv Z, Liu S, Zhou Z (2018) MCRS: A course recommendation system for MOOCs. Multimedia Tools and Applications 77:7051–7069.
- Dahdouh K, Dakkak A, Oughdir L, Ibriz A (2019) Large-scale e-learning recommender system based on Spark and Hadoop. Journal of Big Data 6:1–23.
- Hou Y, Zhou P, Xu J, Wu DO (2018) Course recommendation of MOOC with big data support: A contextual online learning approach. In: IEEE INFOCOM 2018-IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS). IEEE, pp 106–111.
- Porcel C, Ching-López A, Lefranc G, Loia V, Herrera-Viedma E (2018) Sharing notes: An academic social network based on a personalized fuzzy linguistic recommender system. Engineering Applications of Artificial Intelligence 75:1–10.
- Zadeh LA (1974) The Concept of a Linguistic Variable and its Application to Approximate Reasoning. In: Fu KS, Tou JT (eds) Learning Systems and Intelligent Robots. Springer US, Boston, MA, pp 1–10.
- Albatayneh NA, Ghauth KI, Chua F-F (2018) Utilizing learners’ negative ratings in semantic content-based recommender system for e-learning forum. Journal of Educational Technology & Society 21:112–125.
- Ntourmas A, Daskalaki S, Dimitriadis Y, Avouris N (2021) Classifying MOOC forum posts using corpora semantic similarities: a study on transferability across different courses. Neural Comput & Applic. [CrossRef]
- Boser BE, Guyon IM, Vapnik VN (1992) A training algorithm for optimal margin classifiers. In: Proceedings of the fifth annual workshop on Computational learning theory. Association for Computing Machinery, New York, NY, USA, pp 144–152.
- G S, M P, Dubey P, Drolia AR, S S (2020) Subjective Areas of Improvement: A Personalized Recommendation. Procedia Computer Science 172:235–239.
- Dwivedi P, Kant V, Bharadwaj KK (2018) Learning path recommendation based on modified variable length genetic algorithm. Education and information technologies 23:819–836.
- Lin J, Li Y, Lian J (2020) A novel recommendation system via L0-regularized convex optimization. Neural Computing and Applications 32:1649–1663.
- Kim T, Yun Y, Kim N (2021) Deep Learning-Based Knowledge Graph Generation for COVID-19. Sustainability 13:2276.
- Shahid, H., Ashraf, H., Javed, H., Humayun, M., Jhanjhi, N. Z., & AlZain, M. A. (2021). Energy optimised security against wormhole attack in iot-based wireless sensor networks. Comput. Mater. Contin, 68(2), 1967-81.
- Wassan, S., Chen, X., Shen, T., Waqar, M., & Jhanjhi, N. Z. (2021). Amazon product sentiment analysis using machine learning techniques. Revista Argentina de Clínica Psicológica, 30(1), 695.
- Almusaylim, Z. A., Zaman, N., & Jung, L. T. (2018, August). Proposing a data privacy aware protocol for roadside accident video reporting service using 5G in Vehicular Cloud Networks Environment. In 2018 4th International conference on computer and information sciences (ICCOINS) (pp. 1-5). IEEE.
- 65 Ghosh, G., Verma, S., Jhanjhi, N. Z., & Talib, M. N. (2020, December). Secure surveillance system using chaotic image encryption technique. In IOP conference series: materials science and engineering (Vol. 993, No. 1, p. 012062). IOP Publishing.
- Humayun, M., Alsaqer, M. S., & Jhanjhi, N. (2022). Energy optimization for smart cities using iot. Applied Artificial Intelligence, 36(1), 2037255.
- 67 Kaur, R., Verma, S., Jhanjhi, N. Z., & Talib, M. N. (2021, August). A comprehensive survey on load and resources management techniques in the homogeneous and heterogeneous cloud environment. In Journal of Physics: Conference Series (Vol. 1979, No. 1, p. 012036). IOP Publishing.
- 68 Alotaibi, A. F. (2021). A comprehensive survey on security threats and countermeasures of cloud computing environment. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 12(9), 1978-1990.
- 69 Bashir, I. R. A. M., Hamid, B. U. S. H. R. A., Jhanjhi, N. Z., & Humayun, M. A. M. O. O. N. A. (2020). Systematic literature review and empirical study for success factors: client and vendor perspective. J Eng Sci Technol, 15(4), 2781-2808.
- 70 Ponnusamy, V. (Ed.). (2016). Biologically-Inspired Energy Harvesting through Wireless Sensor Technologies. IGI Global.
- 71 Ponnusamy, V., Jung, L. T., Ramachandran, T., & Zaman, N. (2017, April). Bio-inspired energy scavenging in wireless ad hoc network. In 2017 International Conference on Innovations in Electrical Engineering and Computational Technologies (ICIEECT) (pp. 1-5). IEEE.
- 72 Abbas, S. F., Shahzad, R. K., Humayun, M., Jhanjhi, N. Z., & Alamri, M. (2019). SOA Issues and their Solutions through Knowledge Based Techniques—A Review. Int. J. Comput. Sci. Netw. Secur., 19(1), 8-21.
- 73 Humayun, M., Jhanjhi, N. Z., Talib, M. N., Shah, M. H., & Suseendran, G. (2021). Cybersecurity for Data Science: Issues, Opportunities, and Challenges. Intelligent Computing and Innovation on Data Science: Proceedings of ICTIDS 2021, 435-444.
- 74 Kumar, P., Kumar, R., Aljuhani, A., Javeed, D., Jolfaei, A., & Islam, A. N. (2023). Digital twin-driven SDN for smart grid: A deep learning integrated blockchain for cybersecurity. Solar Energy, 263, 111921.
- 75 Mukherjee, D., Ghosh, S., Pal, S., Akila, D., Jhanjhi, N. Z., Masud, M., & AlZain, M. A. (2022). Optimized Energy Efficient Strategy for Data Reduction Between Edge Devices in Cloud-IoT. Computers, Materials & Continua, 72(1).
- 76, E. Ndashimye, N. I. Sarkar, and S. K. Ray, “A Multi-criteria based handover algorithm for vehicle-to-infrastructure communications,” Computer Networks, vol. 185, no. 202152, Article ID 107652, 2020.
- 77 Ray, S. K., Pawlikowski, K., & Sirisena, H. (2009). A fast MAC-layer handover for an IEEE 802.16 e-based WMAN. In AccessNets: Third International Conference on Access Networks, AccessNets 2008, Las Vegas, NV, USA, October 15-17, 2008. Revised Papers 3 (pp. 102-117). Springer Berlin Heidelberg.
- 78 Srivastava, R. K., Ray, S., Sanyal, S., & Sengupta, P. (2011). Mineralogical control on rheological inversion of a suite of deformed mafic dykes from parts of the Chottanagpur Granite Gneiss Complex of eastern India. Dyke Swarms: Keys for Geodynamic Interpretation: Keys for Geodynamic Interpretation, 263-276.
- 79 Ray, S. K., Sinha, R., & Ray, S. K. (2015, June). A smartphone-based post-disaster management mechanism using WiFi tethering. In 2015 IEEE 10th conference on industrial electronics and applications (ICIEA) (pp. 966-971). IEEE.
- 80 Chaudhuri A, Ray S (2015) Antiproliferative activity of phytochemicals present in aerial parts aqueous extract of Ampelocissus latifolia (Roxb.) planch. on apical meristem cells. Int J Pharm Bio Sci 6(2):99–108.
- 81 Hossain, A., Ray, S. K., & Sinha, R. (2016, December). A smartphone-assisted post-disaster victim localization method. In 2016 IEEE 18th International Conference on High Performance Computing and Communications; IEEE 14th International Conference on Smart City; IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS) (pp. 1173-1179). IEEE.
- 82 Airehrour, D., Gutierrez, J., & Ray, S. K. (2018). A trust-based defence scheme for mitigating blackhole and selective forwarding attacks in the RPL routing protocol. Journal of Telecommunications and the Digital Economy, 6(1), 41-49.
- 83 Ray, S. K., Ray, S. K., Pawlikowski, K., McInnes, A., & Sirisena, H. (2010, April). Self-tracking mobile station controls its fast handover in mobile WiMAX. In 2010 IEEE Wireless Communication and Networking Conference (pp. 1-6). IEEE.
- 84 Dey, K., Ray, S., Bhattacharyya, P. K., Gangopadhyay, A., Bhasin, K. K., & Verma, R. D. (1985). Salicyladehyde 4-methoxybenzoylhydrazone and diacetylbis (4-methoxybenzoylhydrazone) as ligands for tin, lead and zirconium. J. Indian Chem. Soc.;(India), 62(11).
- 85 Airehrour, D., Gutierrez, J., & Ray, S. K. (2017, November). A testbed implementation of a trust-aware RPL routing protocol. In 2017 27th International Telecommunication Networks and Applications Conference (ITNAC) (pp. 1-6). IEEE.
- 86 Ndashimye, E., Sarkar, N. I., & Ray, S. K. (2016, August). A novel network selection mechanism for vehicle-to-infrastructure communication. In 2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress (DASC/PiCom/DataCom/CyberSciTech) (pp. 483-488). IEEE.
- 87 Ndashimye, E., Sarkar, N. I., & Ray, S. K. (2020). A network selection method for handover in vehicle-to-infrastructure communications in multi-tier networks. Wireless Networks, 26, 387-401.
- 88 Hussain, S. J., Ahmed, U., Liaquat, H., Mir, S., Jhanjhi, N. Z., & Humayun, M. (2019, April). IMIAD: intelligent malware identification for android platform. In 2019 International Conference on Computer and Information Sciences (ICCIS) (pp. 1-6). IEEE.
- 89 Shahid,H.,Ashraf,H.,Ullah,A.,Band,S.S.&Elnaffar,S.Wormholeattackmitigationstrategiesandtheirimpact onwirelesssensornetworkperformance: Aliteraturesurvey. InternationalJournalofCommunicationSystems35, e5311(2022). URLhttps://onlinelibrary.wiley.com/doi/abs/10.1002/dac.5311. https:// onlinelibrary.wiley.com/doi/pdf/10.1002/dac.5311.


| Authors | The topic of the survey conducted | Limitations |
| [17] | Recommender systems using deep learning in the existing literature | This survey was limited only to the deep learning techniques and left major areas like content-based and collaborative filtering |
| [18] | Survey on existing ITS | It only covered ITS while left all other techniques used for recommendations |
| [19] | Surveys on swarm intelligence | It only included researches using swarm intelligence |
| [20] | Recommending experts in a field | Limited scope of query-based search and information retrieval. |
| [21] | Ontology-based recommender systems in e-learning. | Studied only ontology-based models that mainly focus on personalization |
| Reference | Methodology | Performance measure | Results |
|---|---|---|---|
| [15] | Cluster mean | Accuracy | 90% |
| [16] | FCM and ACO | Learning gain | 33 |
| [23] | Historical BKT | Area under curve | 0.7695 |
| P-value | 0.0003 | ||
| [25] | Ontology based graph | Average Computation time | 0.07s |
| [26] | Formal concept analysis | Learning gain | 10.63 |
| [27] | LSTM | Precision | 0.45 |
| [31] | Knowledge graph | User satisfaction | 0.9 |
| Reference | Methodology | Performance measure | Results |
|---|---|---|---|
| [13] | CF and jena rules using RDF | Precision | 0.98 |
| Recall | 0.90 | ||
| RMSE | 0.25 | ||
| [22] | Apriori with clustering | Coverage | 60% |
| Apriori without clustering | 48% | ||
| SPADE with clustering | 56% | ||
| SPADE without clustering | 51% | ||
| [35] | KNN and naïve bayes | Precision | 0.75 |
| Recall | 0.50 | ||
| [36] | Hybrid (CF, CB and GSM combined) | precision | 0.35 |
| Recall | 0.45 | ||
| [38] | Tagging and Apriori | Coverage | 55% |
| [39] | CB and CF | Precision | 0.22 |
| Recall | 0.39 | ||
| [40] | DBN and RBN | RMSE | 0.67 |
| [43] | RLNLS | Precision | 0.34 |
| Recall | 0.38 | ||
| [46] | CF and KNN | MSE | 21.4 |
| [47] | Copeland method and Bayesian knowledge tracing | AP correlation (higher is better) | 0.77 |
| NDPM (lower is better |
0.28 | ||
| [48] | Matrix factorization | RMSE | 0.491 |
| Reference | Methodology | Performance measure | Results |
|---|---|---|---|
| [11] | Association rules mining | Accuracy | 90% |
| [49] | KNN and knowledge based filtering | Recall | ---- |
| Reference | Methodology | Performance measure | Results |
|---|---|---|---|
| [50] | Apriori | Analysis of generated rules | ---- |
| [51] | FP-growth | Execution time | 34s |
| [52] | Reinforcement learning | Average regret comparison | Decrease in regret as rounds increased |
| References | Methodology | Performance measure | Results |
|---|---|---|---|
| [58] | SVM | User satisfaction | Satisfactory performance |
| [59] | Collaborative filtering and genetic algorithm | Learner’s score | 90% |
| [60] | Convex optimization algorithm | Accuracy | 90% |
| Author | Technique | Limitations and future work |
| [14] | Their research used collaborative filtering while utilizing Jena rules using resource description framework (RDF). After utilizing the RDF and Jena rules engine, an enriched rating matrix will be generated. KNN is applied to further refine the results to generate final recommendations. The recommendation list is evaluated and then is presented to the user | Their proposed methodology utilized manual method to update RDF factbase and this method can be automized in future. [14] |
| [15] | Their proposed model focuses on covering all concepts instead of completing all lessons. Their proposed method also monitors the learner’s progress with the passage to time and if the recommended path is not showing satisfactory results, then the recommender recommends auxiliary LOs. The auxiliary LOs are the highest ranked LOs which has not been seen by the user | Their proposed methodology faced coldstart problem for initial LOs that can be solved in future work by automatically selecting the first LO. [15] |
| [11] | Their methodology focused on prompting user to enter information and to also give feedback regarding generated recommendations. Association rules mining is utilized for generating most relevant recommendations. | In future, Swarm intelligence and genetic algorithm can be utilized to improve results [11]. |
| [16] | Their research used hybrid methodology using both clustering and graph-based method to design an effective recommender system. This study focusses on LPR using ACO. Fuzzy C-means (FCM) clustering is used to assign learner to a cluster. | The proposed FCM and ACO are computationally intensive. They can be parallelized in future for more effective performance. In future, the same model can be tested for large audience with more diverse demographics [16]. |
| [22] | They utilized collaborative filtering along with clustering and association rules mining for better course recommendations. They also utilized sequence mining that helped in recommending courses in correct sequence. For clustering, they used k-means algorithm that uses Euclidean distance to groups similar students together. Aprioir algorithm was used for association rules mining while SPADE algorithm was used for sequence rules mining. If students are predicted to score higher, then the course is recommended to the students otherwise it is excluded from the list of recommendations. | ---- |
| [23] | They utilized several techniques like content analysis, assessment and behavioral analysis. Four variations of BKT were used for knowledge, tracing in the proposed model including Coarse BKT, multi-grained BKT, Fine BKT and Historical BKT. | In future, words embeddings can be used to construct knowledge graph instead of word co-occurrence to improve the quality of constructed graphs [23]. |
| [25] | This study is based on the concept of self-organizing LOs. The recommendation is based on both user’s explicit requirements and implicit needs that are captured by analyzing user’s behaviour. Recommendations are generated by constructed ontology based graph using user’s profile and similarity between LOs. | The one case study was not enough to test their proposed method. In future more case studies can be conducted to verify the effectiveness of their proposed methodology [25]. |
| [26] | An ITS was proposed in this research that aimed to adapt the system according to the user’s requirements based on his or her learning style. A personalized path was generated for each user utilizing the knowledge-based graph using co-occurrence of different objects. | Their proposed model has been tested only on limited set of students. More extensive evaluation can be performed on proposed model in future. |
| [27] | Their methodology proposed a separate path when there is no sufficient prior information to avoid cold start problem. The users are also divided into five main clusters based on their interaction with the platform and then learning effect is calculated using LSTM model. | In future, data streams of learners can be utilized to improve the generated learning paths. |
| [31] | Their proposed methodology constructed a multi-dimensional knowledge graph that stores class wise hierarchical structure that includes basic knowledge, algorithm and tasks. Their framework inputs a query form user, tokenize it and then collects the target LOs. It is possible for a system to generate multiple paths for one query. In that case the system recommends the path with highest score. | In future this system can be implemented for subjects other than computing and user’s profile can also be considered in further researchers. |
| [34] | Their methodology utilized content and collaborative filtering for relevant recommendation for MOOC courses. Their framework picks one of the techniques depending upon a threshold value. If threshold is met then collaborative filtering is used otherwise the algorithm will take the path of content-based filtering. | ---- |
| [35] | Their proposed methodology consists of two phases. The first phase uses KNN classifier to determining most relevant courses based on user’s profile and this portion of proposed methodology will be executed for every recommendation. The second phase is optional and is executed only when enough ratings are available in the database for the relevant courses. In second phase, naïve Bayes classifier is used to recommend courses based on user’s rating. | Even though one algorithm showed better precision for course recommendation but still the overall f-measure was not showing satisfactory results. Other hybrid algorithms like association rules mining, deep learning and collaborative filtering can be considered to improve results in future. |
| [36] | A hybrid recommender system was proposed by them that included content based collaborative filtering and sequence pattern mining to help in identifying a sequence in which courses should be studied. The proposed framework used both content and context aware course recommendations by utilizing the weblogs, LOs and user’s profile. GSM algorithm was used for identifying the recommended sequence of courses. | Emerging tools of machine learning and artificial intelligence can be utilized in future to generate more effective recommendations [36]. |
| [38] | Their proposed methodology utilized tag based Collaborative filtering. In their proposed method, the learners will write tags according to their requirements. | In future, some suggested tags can also be utilized keeping in view the experts’ generated tags to ensure vast coverage of LOs. [38] |
| [39] | They proposed MoodelREC teacher centric system that utilized popular LO repositories to crawl LOs. These recommendations can be used to design a new course hence the proposed mode provides an outline that an instructor can utilize to design a new course. | There are no standard LOs and LOs may differ in different LORs. Semantic solutions can be used to solve this problem in future. [39] |
| [40] | They combined deep belief network with collaborative filtering for more relevant recommendations. RBM are used for constructing neural networks for unsupervised learning. The unsupervised learning RMB model assigns initial values to the back-propagation model and hence eliminate the drawback of traditional method. | --- |
| [43] | They proposed a methodology that is based on RLNLS model. The RLNLS model utilized the learner’s interest and capacity to predict their behaviour in future. Collaborative filtering technique was used to find behaviour of neighbour learners while Hawkes process was used to predict the learning intensity. | In future, feature reduction can be performed to extract data only from relevant variables to maximize efficiency of the algorithm [43]. |
| [46] | Their proposed framework combines collaborative filtering with rule-based filtering and KNN classification to create a hybrid system. The recommendation hence generated will be context sensitive. | Their proposed methodology does not consider the authenticity of the user posting course. It can be improved in future by analyzing the profile of instructor. |
| [47] | Their proposed EduRank method used collaborative filtering along with social voting to evaluate the relevance of a course. The recommended questions are ranked in descending order based on their Copeland score while item response theory and Bayesian knowledge tracing were used for calculating item difficulty. AP correlation matrix was used to evaluate the proposed method and perform comparison with competing methodologies. | Their framework only considered questions and their solution time for difficulty-based ranking while not adding any solution to identify or help weak students. Work can be done in future to help students in coping with their weaknesses while utilizing their strengths to maximize learning. |
| [48] | They used multi-modal matrix factorization for course recommendation in MOOCs platform. The proposed methodology calculates either a course provides particular skill set that a user is looking for or not. The most frequent items in the neighborhood are ranked the highest. | The precision results showed between 1% to 3% accuracy for sparse datasets [48]. The lack of available ratings is affecting the accuracy of their proposed model and it be overcome in future by using hybrid techniques. |
| [49] | They proposed a method that focuses on the older audience who are self-learning and do not have access to any recommendations from teacher or instructor. Content based filtering, collaborative filtering and web mining were used to identify appropriate learning material on the internet which was later narrowed down according to personal needs of the learners. Clustering is done using KNN to group similar learners in one cluster to maximize personalization. | --- |
| [50] | They proposed Course recommender model that utilized big data to generate association rules in distributed environment. They utilized Hadoop Distributed File System (HDFS) while using Spark for data processing for association rules mining and Sqoop to import and export dataset. Course recommender used Apriori algorithm to generate recommendations of most relevant courses. | For future work, they recommended using deep learning for better utilization of big data in generating course recommendations [50]. |
| [51] | The purpose of their study was to discover most relevant course by implementing association rules mining on big data. They used Spark ML libraries to implement FP-Growth algorithm. For course recommendations, they analyzed the behaviour of learners who interacted with system in past by utilizing course enrollment and log data. | In future, big data can be utilized to analyze real time data |
| [52] | They utilized the feedback of the users and improved the model by using the concept of regret to improve recommendations for the future. | In future work can be done to handle the dynamically increasing courses dataset. (Hou et al. 2018) |
| [53] | They proposed a framework called sharing notes that is a web based forum designed to facilitate students by providing them personalized material. Sharing notes utilized collaborative and content based filtering to rate the material while fuzzy linguistics model was implemented to perform qualitative study on communication between different users. Ontologies were used to map the relationship between users and their preferred material. The final recommendations are generated after obtaining ratings from the trusted users. Sharing notes was tested with the help of 425 users amongst those 88% of the users shows satisfaction with their proposed recommender system. | In future their own specific metrics can be used to perform analysis of social networks, and incorporate them in new recommendation approaches [53]. |
| [55] | Their proposed methodology added the layer of semantic analysis along with ratings to improve the performance of generated recommendations. The semantics matrix was separately constructed and was used by utilizing cosine similarity. | In future difficulty level can also be added to generated more relevant recommendations [55]. |
| [56] | Their proposed solution performed semantic analysis on messages posted on discussion forums. Text pre-processing was performed only on content logistic related posts and semantic similarity calculations were used to performed semantic analysis. SVM was utilized in the end to generate recommendations for relevant posts. | In future, other classification models like LSTM can be utilized to improve accuracy [56]. |
| [58] | The proposed methodology takes the data of online courses as input and rank them into various difficulty levels. Similarly, students create their profile and their data is also extracted through the log files that records the data of a student interacting with the system. | Their proposed model did not consider the rating matrix that can include information about how other users interact with the recommender system. |
| [59] | They collected data from the learners profile and used variable length genetic algorithm for learning path recommendations. Collaborative filtering was used to find the popularity matrix which was later utilized by genetic algorithm for personalized course sequencing. | In future work can be done to calculate trust for different users and use this value in collaborative filtering [59]. |
| [60] | Their solution considered both formatted data like courses and unformatted data like web sources, documents, papers and publications. They performed clustering on the statistics of data rather than on raw data hence improving the processing speed. | In future, deep learning algorithms can also be explored |
| Reference | Year | Accuracy | Personalization | precision | Recall | f-measure | RMSE | Reinforcement learning | Serendipity | Normalized Discounted Cumulative Gain | coverage | Optimal course ranking | Transferability | Less execution time | Improved memory utilization | Query optimization | Learning path recommendation | Effective recommender | Early recommendation | To find association between courses | Difficulty based ranking | Student retention | Improvement in learning |
| [14] | 2020 | ✓ | ✓ | ||||||||||||||||||||
| [15] | 2020 | ✓ | |||||||||||||||||||||
| [11] | 2021 | ✓ | |||||||||||||||||||||
| [16] | 2020 | ✓ | |||||||||||||||||||||
| [22] | 2019 | ✓ | ✓ | ||||||||||||||||||||
| [23] | 2019 | ✓ | |||||||||||||||||||||
| [25] | 2018 | ✓ | |||||||||||||||||||||
| [26] | 2020 | ✓ | |||||||||||||||||||||
| [27] | 2018 | ✓ | ✓ | ✓ | |||||||||||||||||||
| [31] | 2020 | ✓ | |||||||||||||||||||||
| [34] | 2018 | ✓ | ✓ | ✓ | |||||||||||||||||||
| [35] | 2018 | ✓ | ✓ | ||||||||||||||||||||
| [36] | 2018 | ✓ | ✓ | ||||||||||||||||||||
| [38] | 2018 | ✓ | ✓ | ✓ | |||||||||||||||||||
| [39] | 2020 | ✓ | |||||||||||||||||||||
| [40] | 2019 | ✓ | ✓ | ✓ | |||||||||||||||||||
| [43] | 2018 | ✓ | |||||||||||||||||||||
| [46] | 2021 | ✓ | ✓ | ||||||||||||||||||||
| [47] | 2019 | ✓ | ✓ | ✓ | |||||||||||||||||||
| [48] | 2019 | ✓ | |||||||||||||||||||||
| [49] | 2020 | ✓ | |||||||||||||||||||||
| [50] | 2018 | ✓ | ✓ | ||||||||||||||||||||
| [51] | 2019 | ✓ | |||||||||||||||||||||
| [52] | 2018 | ✓ | ✓ | ||||||||||||||||||||
| [55] | 2018 | ✓ | ✓ | ✓ | |||||||||||||||||||
| [56] | 2021 | ✓ | |||||||||||||||||||||
| [59] | 2018 | ✓ | |||||||||||||||||||||
| [60] | 2020 | ✓ |
| Reference | Dataset | Source/platform |
| [14] | Retrived dataset from MOODLE server | MOODLE |
| [15] | Mooshak and ENKI | ENKI and Mooshak |
| [11] | data is collected from multiple online sources like emails, blog, reports, phone calls and social sites | Scrapping of publically available data |
| [16] | Dataset was created | UM Learn (University of Manitoba Learning Management System) Tools: a third-party open-source quiz tool called “Savsoft Quiz v3.0” |
| [22] | Harvard and MIT dataset | HarvardX-MITx Person-Course Academic Year 2013 De-Identified dataset |
| [23] | MOOCs | Coursera courses from Peking University (2 courses) |
| [25] | Own dataset | two courses taught by the authors |
| [26] | Own dataset | Own dataset |
| [27] | Junyi Academy -- http://www.junyiacademy.org/ a Chinese e-learning website which is established on open-source code released by Khan Academy (https://www.khanacademy.org/) in 2012 | Haw-Shiuan Chang, Hwai-Jung Hsu and Kuan-Ta Chen, "Modeling Exercise Relationships in E-Learning: A Unified Approach," International Conference on Educational Data mining (EDM), 2015 (Link: https://pslcdatashop.web.cmu.edu/DatasetInfo?datasetId=1198) |
| [31] | Crawled data from different educational websites (sources not mentioned) | Own dataset |
| [34] | Shanghai Lifelong Learning Network, | Shanghai Lifelong Learning Network, |
| [36] | LMS of university (name of university not mentioned) | - |
| [38] | An educational dataset, consisting of 120 learners, high school students, participants of the Center for Young Talents project, supported by the Schneider Electric DMS NS company, Novi Sad |
---- |
| [39] | (2018). SOFIA, Teachers Formation Platform. http://www.istruzione.it/pdgf/ Last |
Sofia |
| [40] | Own dataset | starC MOOC platform of Central China Normal University |
| [43] | http://jclass.pte.sh.cn | Mic-video Platform of ECNU |
| [46] | BOL repository including records from 2017 to 2018. variables include students’ personal information, learning activity scores, and perception and confidence scores toward each mandatory course |
BINUS online learning (BOL) |
| [47] | Koedinger, K.R., Baker, R., Cunningham, K., Skogsholm, A., Leber, B., Stamper, J., 2010. A data repository for the edm community: the pslc datashop. Handbook of educational data mining. pp. 43–55 (https://pslcdatashop.web.cmu.edu/KDDCup) |
KDD cup 2010 and an unpublished data of schools name K12 dataset by authors |
| [48] | http://merlot.org | Multimedia Educational Resource for Learning and MACE project (Real world object based access to architecture learning material–the mace experience. In: Proceedings of the World Conference on Educational Multimedia and Hypermedia (EDMEDIA 2010), Toronto, Canada (June 2010) (2010)) and Online Teaching (MERLOT) |
| [49] | Own dataset | Own dataset |
| [50] | IBM dataset for Apriori algorithm,T10I4D100K (http://fimi.ua.ac.be/data/),T25I10D10K (http://www.philippe-fournier-er.com/spmf/index.php?link=datasets.php)] (generated by a randomiteration of the database), Harvard and MIT published the edX learning data in 2012–2013, HMXPC13_DI_v2_5–14–14.csv (https://dataverse.harvard.edu/dataset.xhtml?persistentId= doi:10.7910/DVN/26147)] |
IBM and edX |
| [51] | Moodle. Open-source learning platform|Moodle.org. https ://moodl e.org/. | operational database of ESTenLigne platform that is built on LMS Moodle |
| [52] | Created their own dataset | Created their own dataset |
| [53] | http:// sharingnotes.ujaen.es/ |
Sharing notes |
| [55] | Created their own dataset | Created their own dataset |
| [56] | MOOCs | MOOC |
| [58] | Source not mentioned | One public one private dataset (source not mentioned) |
| [59] | Created their own dataset | Created their own dataset |
| [60] | data were gathered from several local Universities (Shandong University of Finance and Economics and Shandong University) |
Local universities dataset |
| Reference | Year | Accuracy | precision | Recall | f-measure | Execution time |
| [14] | 2020 | 0.98 | 0.90 | 0.93 | ||
| [27] | 2018 | 0.45 | ||||
| [35] | 2018 | 0.75 | 0.50 | 0.60 | ||
| [36] | 2018 | 0.35 | 0.45 | |||
| [50] | 2018 | 60s |
| Reference | Methodology | Results | Limitations |
| [14] | Combined jena rules with collaborative filtering | Showed 0.97 precision for active user and 0.64 precision value for non-active users. | Their proposed methodology utilized manual method to update RDF factbase and this method can be atomized in future. [14] |
| [11] | Association rules mining combined with multiple agents | The accuracy of their proposed model initially increased when the number of lectures increased but start to decrease once number of lectures reached 40. Overall the accuracy value stayed above 90% | In future, Swarm intelligence and genetic algorithm can be utilized to improve results [11]. |
| [27] | Hybrid methodology including clustering and LSTM | Their proposed model showed up to 45% precision for full course recommendation while up to 75% precision for ten-step recommendation | In future, data streams of learners can be utilized to improve the generated learning paths. |
| [52] | Reinforcement learning | The proposed algorithm showed around 0.85 value for reward even for massive dataset. | In future work can be done to handle the dynamically increasing courses dataset. (Hou et al. 2018) |
| [56] | Neural networks embedding were to calculate semantic similarity along with SVM. | The accuracy was measured in terms of transferability where world history trained dataset when tested on python course dataset showed 62% accuracy while when model trained on python was tested on world history, it showed 65% accurate results. | In future, other classification models like LSTM can be utilized to improve accuracy [56]. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
