Submitted:
17 July 2024
Posted:
18 July 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Random Forest Algorithm
1.1.1. Decision Tree
1.1.2. Ideas for the Construction of a Random Forest Algorithm
- 1)
- Draw an equal number of samples from the overall training sample, using the self-help method of sampling to ensure that each sample has an equal probability of appearing in different self-help sample sets.
- 2)
- In each self-help sample set, randomly select a portion of features to be used for training the classification tree, to avoid the influence of certain features on the model being too significant.
- 3)
- Construct multiple classification trees with different features to form a random forest.
- 4)
- Predict the new data in each classification tree and count the prediction results of each classification tree, and get the final classification results according to the voting results.
1.1.3. Parameterization
- 1)
- n_estimators
- 2)
- random_state
- 3)
- Max_features
- 4)
- Min_samples_lesf
- 5)
- Min_samples_split
1.2. Predictive Modeling Based on Random Forest Algorithm
1.3. Critical Feature Extraction and Performance Validation
- 1)
- If there is an AI learning tool, would you choose to use it?
- 2)
- Do you have any idea how to complete your assignments with the help of AI learning tools?
- 3)
- Do you have any idea how to complete quizzes with the help of AI learning tools?
- 4)
- What is your attitude towards the credibility of the AI learning tool in answering questions?
- 5)
- If you were to use an AI learning tool, what would you prefer to get out of it?
- 6)
- What safety aspects of using AI tools have you considered?
2. Classification of AI Usage Based on PCA-K-Means Algorithm
2.1. Feature Merging Based on the PCA Algorithm
2.2. Usage Classification Based on the K-Means Algorithm
- (1)
- Each data can be classified into only one class;
- (2)
- Each class contains at least one data.
- (1)
- For each sample point i, calculate a(i), which is the average distance between point i and all other points in the same cluster. This can be expressed by the following formula:where is the cluster containing sample point i, distance between sample points Ci and d(i,j) is the distance between sample points i and j.
- (2)
- For each sample point i, calculate b(i), which is the minimum average distance between point i and all other points not in the same cluster. This can be expressed by the following formula:where is an arbitrary cluster that does not contain sample point i.
- (3)
- For each sample point i, the profile coefficient s(i) is:Define s(i) = 0 if sample i is the only member of the cluster it is in (i.e., || = 1).
- (4)
- The average of the profile coefficients of all sample points is the profile coefficient of the entire data set:Where N is the total number of samples in the data set.
3. Conclusion
References
- Thomas K. F. Chiu, Qi Xia, Xinyan Zhou, Ching Sing Chai, and Miaoting Cheng. 2023. Systematic literature review on opportunities, challenges, and future research recommendations of artificial intelligence in education. Computers and Education: Artificial Intelligence 4, (January 2023), 100118. [CrossRef]
- Chunling Geng. 2024. Research on the effectiveness of interactive e-learning mode based on artificial intelligence in English learning. Entertainment Computing 51, (September 2024), 100735. [CrossRef]
- Sabrina Habib, Thomas Vogel, Xiao Anli, and Evelyn Thorne. 2024. How does generative artificial intelligence impact student creativity? Journal of Creativity 34, 1 (April 2024), 100072. [CrossRef]
- Manuela-Andreea Petrescu, Emilia-Loredana Pop, and Tudor- Dan Mihoc. 2023. Students’ interest in knowledge acquisition in Artificial Intelligence. Procedia Computer Science 225, (January 2023), 1028–1036. [CrossRef]
- Wilson Kia Onn Wong. 2024. The sudden disruptive rise of generative artificial intelligence? An evaluation of their impact on higher education and the global workplace. Journal of Open Innovation: Technology, Market, and Complexity 10, 2 (June 2024), 100278. [CrossRef]
- Zhou, P., Peng, R., Xu, M., Wu, V., Navarro-Alarcon, D. 2021. Path planning with automatic seam extraction over point cloud models for robotic arc welding. IEEE robotics and automation letters, 6(3),(April 2021), 5002-5009. [CrossRef]
- Jinxin Xu, Haixin Wu, Yu Cheng, Liyang Wang, Xin Yang, Xintong Fu, and Yuelong Su. 2024. Optimization of Worker Scheduling at Logistics Depots Using Genetic Algorithms and Simulated Annealing. arXiv:cs.NE/2405.11729 [cs.NE]. (May 2024). [CrossRef]





| parameter | value |
| n_estimators | 700 |
| random_state | 42 |
| Max_features | Auto |
| Min_samples_lesf | 1 |
| Min_samples_split | 2 |
| precision | recall | f1-score | support | |
| 1.0 | 0.85 | 0.87 | 0.86 | 290 |
| 2.0 | 0.86 | 0.86 | 0.86 | 346 |
| 3.0 | 0.90 | 0.87 | 0.89 | 272 |
| accuracy | / | / | 0.87 | 908 |
| macro avg | 0.87 | 0.87 | 0.87 | 908 |
| weighted avg | 0.87 | 0.87 | 0.87 | 908 |
| Predicted Labels | |||
| True Labels | 250 | 32 | 6 |
| 28 | 300 | 20 | |
| 17 | 18 | 240 | |
| precision | recall | f1-score | support | |
| 1.0 | 0.82 | 0.85 | 0.84 | 290 |
| 2.0 | 0.86 | 0.84 | 0.85 | 346 |
| 3.0 | 0.88 | 0.87 | 0.87 | 272 |
| accuracy | / | / | 0.85 | 908 |
| macro avg | 0.85 | 0.85 | 0.85 | 908 |
| weighted avg | 0.85 | 0.85 | 0.85 | 908 |
| Predicted Labels | |||
| 2500 | 31 | 12 | |
| True Labels | 35 | 290 | 20 |
| 19 | 17 | 240 | |
| 0 | 1 | 2 | |
| 0 | -3.549843 | 2.509355 | -0.158039 |
| 1 | 3.060733 | -2.079833 | 2.650664 |
| 2 | -2.514655 | -0.044689 | 0.844996 |
| 3 | -1.489571 | -0.883850 | 0.359377 |
| 4 | -1.204717 | 1.390635 | 0.790669 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).