Submitted:
16 April 2025
Posted:
18 April 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work
3. System Architecture
3.1. Computing Power Network
3.2. Computing Power Allocation System
4. Model Establishment
4.1. Problem Formulation
4.2. Computing Power Model
4.3. Objective Function
5. Improved SAC (iSAC) Algorithm
5.1. Algorithm Architecture
5.2. MDP Engineering
5.2.1. State Description
5.2.2. Action Description
5.2.3. Reward Engineering
5.3. Algorithm Implementation
| Algorithm 1 PER-iSAC |
| Input: discount factor , temperature coefficient , soft update coefficient , batch size n, learning rate |
| Output: policy |
| 1: Initialize Actor and Q Net 1, Q Net 2, Target Q Net 1, Target Q Net 2, hyper parameters |
| 2: Initialize Replay Buffer |
| 3: for each do |
| 4: Initialize environment |
| 5: for each do |
| 6: |
| 7: |
| 8: = sampling from action space based on |
| 9: = environment exec |
| 10: save () in Replay Buffer |
| 11: compute the priority p in Replay Buffer |
| 12: if > n then |
| 13: sample n samples from Replay Buffer according to p |
| 14: train with n samples |
| 15: update Q Net 1, Q Net 2, Actor with |
| 16: update Target Q Net 1, Target Q Net 2 |
| 17: update , p |
| 18: end if |
| 19: end for |
| 20: end for |
6. Experiment Results
6.1. Simulation Settings
6.2. Experimental Results
7. Conclusions and Futrue Works
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| AI | Artificial Intelligence |
| SAC | Soft Actor-Critic |
| PER | Prioritized Experience Replay |
| A3C | Asynchronous Advantage Actor Critic |
| PPO | Proximal Policy Optimization |
| TD | Temporal Difference |
References
- Merzougui, S.E.; Limani, X.; Gavrielides, A.; Palazzi, C.E.; Marquez-Barja, J. Leveraging 5G Technology to Investigate Energy Consumption and CPU Load at the Edge in Vehicular Networks. World Electr. Veh. J. 2024, 15, 171. [Google Scholar] [CrossRef]
- Huo, Y.; Zhang, C. A Review of Key Technologies for Environment Sensing in Driverless Vehicles. World Electr. Veh. J. 2024, 15, 290. [Google Scholar] [CrossRef]
- Ministry of Industry and Information Technology. Vehicle Network (Intelligent Connected Vehicles) Industry Development Action Plan. Vehicle Network (Intelligent Connected Vehicles) Industry Development Action Plan. 2018, December 25, 2018.
- Dai, Z.; Guan, Z.; Chen, Q.; Xu, Y.; Sun, F. Enhanced Object Detection in Autonomous Vehicles through LiDAR—Camera Sensor Fusion. World Electr. Veh. J. 2024, 15, 297. [Google Scholar] [CrossRef]
- Cui, H.; Lei, J. An Algorithmic Study of Transformer-Based Road Scene Segmentation in Autonomous Driving. World Electr. Veh. J. 2024, 15, 516. [Google Scholar] [CrossRef]
- Lu, S.; Shi, W. Vehicle Computing: Vision and Challenges. Journal of Information and Intelligence. 2022. [Google Scholar] [CrossRef]
- Abbas, N.; Zhang, Y.; Taherkordi, A.; Skeie, T. Mobile Edge Computing: A Survey. IEEE Internet of Things Journal. 2018, 5, 450–465. [Google Scholar] [CrossRef]
- Chougule, S.B.; Chaudhari, B.S.; Ghorpade, S.N.; Zennaro, M. Exploring Computing Paradigms for Electric Vehicles: From Cloud to Edge Intelligence, Challenges and Future Directions. World Electr. Veh. J. 2024, 15, 39. [Google Scholar] [CrossRef]
- Lu, S.; Yao, Y.; Shi, W. CLONE: Collaborative Learning on the Edges. IEEE Internet of Things Journal 2021, 8, 10222–10236. [Google Scholar] [CrossRef]
- Haarnoja, T.; Zhou, A.; Abbeel, P.; Levine, S. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. ArXiv. 2018, abs/1801.01290.
- Schaul, T.; Quan, J.; Antonoglou, I.; Silver, D. Prioritized Experience Replay. CoRR. 2015, abs/1511.05952.
- Pang, X.; Jia, W.; Li, H.; Gao, Q.; Liu, W. A Bi-Objective Optimal Scheduling Method for the Charging and Discharging of EVs Considering the Uncertainty of Wind and Photovoltaic Output in the Context of Time-of-Use Electricity Price. World Electr. Veh. J. 2024, 15, 398. [Google Scholar] [CrossRef]
- Shuai, H.; Li, F.F.; Pulgar-Painemal, H.; Xue, Y. Branching Dueling Q-Network-Based Online Scheduling of a Microgrid With Distributed Energy Storage Systems. IEEE Transactions on Smart Grid. 2021, 12, 5479–5482. [Google Scholar] [CrossRef]
- Shuai, H.; He, H. Online Scheduling of a Residential Microgrid via Monte-Carlo Tree Search and a Learned Model. IEEE Transactions on Smart Grid. 2020, 12, 1073–1087. [Google Scholar] [CrossRef]
- Hossain, M.B.; Pokhrel, S.R.; Choi, J. Orchestrating Smart Grid Demand Response Operations With URLLC and MuZero Learning. IEEE Internet of Things Journal. 2024, 11, 6692–6704. [Google Scholar] [CrossRef]
- Meidyani, B.; Sarno, R.; Nurlaili, A.L. Time and cost optimization using scheduling job shop and linear goal programming model. 2018 International Conference on Information and Communications Technology (ICOIACT). 2018, 555–560. [Google Scholar]
- Nayak, D.; Martha, V.S.; Threm, D.; Ramaswamy, S.; Prince, S.; Fahrnberger, G. Adaptive scheduling in the cloud - SLA for Hadoop job scheduling. 2015 Science and Information Conference (SAI). 2015, 832–837. [Google Scholar]
- Li, T.; Cui, W.; Cui, N. Soft Actor-Critic Algorithm-Based Energy Management Strategy for Plug-In Hybrid Electric Vehicle. World Electr. Veh. J. 2022, 13, 193. [Google Scholar] [CrossRef]
- Liu, Y.; Zhu, Y.; Yu, S.; Wang, Z.; Li, Z.; Chen, C.; Yang, L.; Lin, Z. Optimal Scheduling of Integrated Energy System Considering Virtual Heat Storage and Electric Vehicles. World Electr. Veh. J. 2024, 15, 461. [Google Scholar] [CrossRef]
- Saleem, U.; Liu, Y.; Jangsher, S.; Li, Y.; Jiang, T. Mobility-Aware Joint Task Scheduling and Resource Allocation for Cooperative Mobile Edge Computing. IEEE Transactions on Wireless Communications. 2021, 20, 360–374. [Google Scholar] [CrossRef]
- Han, Z.; Tan, H.; Li, X.; Jiang, S.H.; Li, Y.; Lau, F. OnDisc: Online Latency-Sensitive Job Dispatching and Scheduling in Heterogeneous Edge-Clouds. IEEE/ACM Transactions on Networking. 2019, 27, 2472–2485. [Google Scholar] [CrossRef]
- Zhang, K.; Mao, Y.; Leng, S.; Zhao, Q.; Li, L.; Peng, X.; Pan, L.; Maharjan, S.; Zhang, Y. Energy-Efficient Offloading for Mobile Edge Computing in 5G Heterogeneous Networks. IEEE Access. 2016, 4, 5896–5907. [Google Scholar] [CrossRef]
- Li, Y.Y.; Fang, Y.; Qiu, L. Joint Computation Offloading and Communication Design for Secure UAV-Enabled MEC Systems. In 2021 IEEE Wireless Communications and Networking Conference (WCNC). IEEE Press. 2021, 1–6.
- Ullah, I.; Lim, HK.; Seok, YJ. Optimizing task offloading and resource allocation in edge-cloud networks: a DRL approach. J Cloud Comp. 2023, 12, 112. [Google Scholar] [CrossRef]
- Nam, D. H. A Comparative Study of Mobile Cloud Computing, Mobile Edge Computing, and Mobile Edge Cloud Computing. Proceedings of CSCE 60160. 2023. [Google Scholar] [CrossRef]
- Tang, X.; Cao, C.; Wang, Y.; Zhang, S.; Liu, Y.; Li, M.; He, T. Computing power network: The architecture of convergence of computing and networking towards 6G requirement. China Communications 2021, 18, 175–185. [Google Scholar] [CrossRef]
- Andriulo, F.C.; Fiore, M.; Mongiello, M.; Traversa, E.; Zizzo, V. Edge Computing and Cloud Computing for Internet of Things: A Review. Informatics 2024, 11, 71. [Google Scholar] [CrossRef]
- Schulman, J.; Wolski, F.; Dhariwal, P.; Radford, A.; Klimov, O. Proximal Policy Optimization Algorithms. ArXiv. 2017, abs/1707.06347.
| 1 | Computing Power Units:1 KFLOPS = FLOPS, 1MFLOPS = FLOPS, 1GFLOPS = FLOPS, 1TFLOPS = FLOPS,1PFLOPS = FLOPS, 1EFLOPS = FLOPS |






| Servers | GPU Computing Power(TFLOPS) | GPU Storage(GB) | Idle Load Power(W) | Full Load Power(W) |
|---|---|---|---|---|
| S0 | 200~250 | 32 | 300~500 | 500~1000 |
| S1 | 140~160 | 24 | 150~350 | 350~500 |
| S2 | 130~150 | 24 | 150~300 | 300~500 |
| S3 | 100~120 | 16 | 50~150 | 200~450 |
| S4 | 110~130 | 16 | 50~150 | 200~500 |
| S5 | 100~120 | 8 | 50~100 | 150~300 |
| Parameters | Values |
|---|---|
| Computing Power Requirement | 200~4000 GFLOPs |
| Image Data Size | 4.8~160 MB |
| Model Data Size | 10~500 MB |
| Task Result Coefficient1 | 0.1~0.3 |
| Completion Time Coefficient2 | 1.2~1.5 |
| Link Speed | 5G: 100 Mbps~10 GbpsOptical Fiber Network: 200~400 Gbps |
| Communication Range | 10~500 m |
| Communication Delay | 1~2 ms |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).