Submitted:
30 April 2025
Posted:
30 April 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work
3. Basic Theory
3.1. Sparse Autoencoder
3.2. Stacking Ensemble Learning
3.3. Optimized Selection of the Stacking Ensemble Prediction Learner
4. Framework
4.1. Data Set
4.2. Data Preprocessing
4.3. Parameters Dimension Reduction
4.4. Model Training and Prediction
4.5. Evaluation
5. Results and Discussion
5.1. Analysis of the Predictive Results from a Single Model
5.2. Analysis of the Predictive Results of the Stacking Ensemble Learning Model

6. Conclusion
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Zhang Chao, Xia, Chun, Fang Junhua, et al. Influence of gasoline aromatic hydrocarbon content on particulate matter emission and its microphysical and chemical properties. Internal Combustion En-gine Engineering 2018, 39, 8–14. [Google Scholar]
- Zhu Xiao, Jiang Juncheng, Pan Yong, et al. Research on the prediction of octane number of alkanes based on molecular structure. Industrial Safety and Environmental Protection 2011, 37, 27–29. [Google Scholar]
- Lucimar V A, Nathalia D S A S, Vinicius R, et al. Effects of gasoline composition on engine performance, exhaust gases and operational costs. Renewable and Sustainable Energy Reviews 2021, 135, 110196. [Google Scholar] [CrossRef]
- Kelly J, Barlow C, Callis J. Prediction of gasoline octane numbers from near-infrared spectral features in the range 660—1215 nm. Anal Chem 1989, 61313–61320.
- Yu Fan, Fan Baoan, Huang Tiegai.Research progress of data mining in chemistry and chemical engineering[J]. Applied Chemical Industry, 2017, 46(1): 159-162, 166.
- Xu Xiao, Zhao Hongtao, Lu Kailiang, et al. BDMOS: Big data mining optimization system and its application in chemical process optimization. Computers and Applied Chemistry 2018, 35, 433–439. [Google Scholar]
- Tian Jingzhi, Du Xiaoxin, Zheng Yongjie, et al. Prediction of sulfur content in hydrodesulfurization diesel oil based on the PSO-BP neural network. Petrochemical Engineering 2017, 46, 62–67. [Google Scholar]
- Xu Zhanghua, Huang, Xuying, Lin Lu, et al. BP neural networks and random forest models to detect damage by Dendrolimus punctatus Walker. Journal of Forestry Research 2020, 31, 107–121. [Google Scholar] [CrossRef]
- Wang Wei, Wang, Kun, Yang Fan, et al. Construction and analysis of gasoline yield prediction model for fluid catalytic cracking unit (FCCU)based on GBDT and P-GBDT algorithm. Acta Petrolei Sinica (Petroleum Processing Section) 2020, 36.
- Qiu Aibo, Zhou Rujin, Qiu Songshan, et al. Research the progress of gasoline components and gasoline octane number prediction methods. Natural Gas Chemical Industry 2014, 39, 62–66. [Google Scholar]
- Kardamakis A, Pasadakis N. Autoregressive modeling of near-R spectra and MLR to predict RON values of gasoline. Fuel 2010, 89, 158–161. [Google Scholar] [CrossRef]
- Soo H P, Hong Y, Mubarakat S, et al. Detection of apple Marssonina blotch with PLSR, PCA, and LDA using outdoor hyperspectral imaging. Spectroscopy and Spectral Analysis 2020, 40, 1309–1314. [Google Scholar]
- Deng Chengxin, Xu, Jinlong, Zou Lianning, et al. Application and prospects of near infrared spectroscopy in crude oil analysis. In-spection and Quarantine 2019, 29, 128–131. [Google Scholar]
- Ghosh P, Hickeyk J, Jaffe S. Development of a detailed gasoline composition-based octane model. Industrial & Engineering Chemistry Research 2006, 45, 337–345. [Google Scholar]
- Han Zhiqi. National methanol vehicle pilot methanol gasoline co-lane number test research[D]. Xi'an: Chang'an University, 2015.
- Qiu Aibo, Zhou Rujin, Qiu Songshan, et al. Research progress of gasoline components and gasoline octane number prediction methods. Natural Gas Chemical Industry 2014, 39, 62–66. [Google Scholar]
- Zhang Baoquan, Ma Yali, Guan Rui, et al. A neural network method that can give the reliability of oil chromatographic fault diagnosis of oil-filled electrical equipment. Science Technology and Engineering 2021, 21, 1857–1864. [Google Scholar]
- Qin Yucui. Quantitative structure of organic mixture based on artifi-cial neural network research on the relationship of nature[D]. Xi'an:Xi'an Shiyou University, 2018.
- Wu Ping, Zhong Yihua, Yong Xue, et al. Application of data mining method in calculating the loss of gasoline octane number. Science Technology and Engineering 2022, 22, 4046–4054. [Google Scholar]
- Chen Yanzhan, Hu Hao, Ren Zichang, et al. Model Analysis of Gasoline Octane Loss in Catalytic Cracking Post Refining Unit Based on XG Boost and Improved Gray Wolf Optimization A lgorithm. Acta Petrolei Sinica(Petroleum Processing Section) 2022, 38, 208–219. [Google Scholar] [CrossRef]
- Shi Jiaqi, Zhang Jianhua. Load forecasting based on multimodel by stacking ensemble leaming. Proceedings of the CSEE 2019, 39, 4032–4042. [Google Scholar]
- Liu Jixiang, Zhang Qipei, Yang Zhihong, et al. Short-arm load forecasting method based on CNN-LSTM hybrid neural network model. Automation of Electric Power Systems 2019, 43, 131–137. [Google Scholar]
- Xu Huili. The study and improvement of stacking.Guangzhou: South China University of Technology, 2018.
- Lin Jun. Research on semantic classification model of teaching evaluation based on feature weighted stacking algorithm [D]. Guangzhou: South China University of Technology, 2020.
- Li Xiao, Wang, Xin, Zheng, Yihui, et al. Short-term wind load forecasting based on improved LSSVM and error forecasting correction. Power System Protection and Control 2015, 43, 63–69. [Google Scholar]
- Yang Jiajia, Liu, Guolong, Zhao, Junhua, et al. , A long -short-term memory-based deep learning method for industrial load forecasting. Electric Power Construction 2018, 39, 20–27. [Google Scholar]
- Li Bing, Han Rui, He Yigang, et al. Applications of the improved random forest algorithm in fault diagnosis of motor bearings. Proceedings of the CSEE 2020, 40, 1310–1319. [Google Scholar]
- Mei Guiqin. The research on improved Elman neural network and parameter optimization algorithm [D]. Chongqing: Southwest University, 2017.
- Tang Wei, Zhong Shiyuan, Shu Jiao, et al. Research on spatial load forecasting of distribution network based on GRA-LSSVM method. Power System Protection and Control 2018, 46, 76. [Google Scholar]






| Evs | Meanae | Mse | Medianae | R2 | |
| Lasso | 0.5865 | 0.1002 | 0.0365 | 0.0905 | 0.5822 |
| Ridge | 0.7073 | 0.0088 | 0.0015 | 0.0090 | 0.7010 |
| SVR | 0.7056 | 0.0939 | 0.0152 | 0.0741 | 0.6985 |
| ElA | 0.8815 | 0.0561 | 0.0061 | 0.0420 | 0.8796 |
| NB | 0.2186 | 0.1463 | 0.0402 | 0.1087 | 0.2092 |
| LR | 0.2839 | 0.1402 | 0.0365 | 0.1017 | 0.2826 |
| RF | 0.8787 | 0.0571 | 0.0066 | 0.0379 | 0.8694 |
| GBR | 0.8556 | 0.0666 | 0.0076 | 0.0523 | 0.8505 |
| ERT | 0.3213 | 0.1362 | 0.0349 | 0.1017 | 0.3134 |
| XGB | 0.6039 | 0.0106 | 0.0023 | 0.0004 | 0.6000 |
| StackingEL | 0.9478 | 0.0457 | 0.0031 | 0.0412 | 0.9387 |
| SAE+StackingEL | 0.9657 | 0.0356 | 0.0021 | 0.0280 | 0.9578 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).