Submitted:
26 November 2024
Posted:
26 November 2024
You are already at the latest version
Abstract




Keywords:
1. Introduction
2. System Model
2.1. Network Model
2.2. Communication Model
2.3. Computation Model
3. Problem Modeling
4. Problem Solution
- Sub-problem 1: Determining the Set of Sleeping Satellites
- Sub-problem 2: Aggregating Businesses from Sleeping Satellites to Working Satellites
- Step 1: Initialization. The control satellite initializes the process by collecting information on service distribution and satellite parameters.
- Step 2: Determine the Number of Working Satellites. Based on the current service volume, determine the number of satellites that will remain active.
- Step 3: Determine the Set of Active Satellites and the Set of Dormant Satellites. Classify satellites into active and dormant sets based on service demand.
- Step 4: Model the Transfer Problem Using MDP. Use a Markov Decision Process (MDP) to model the transfer problem for .
- Step 5: Perform Transfer Analysis of Using Double DQN. Conduct transfer analysis for using a Double Deep Q-Network (Double DQN).
- Step 6: Analyze Indicator Values, Record the Number of Working Satellites and Business Aggregation Plans. Analyze indicator values, and record the number of active satellites and the business aggregation scheme.
- Step 7: Check if All Dormant Satellite Services Are Considered. Verify if all services for dormant satellites have been considered. If not, return to continue the process; if yes, end the process.
4.1. Determining the Set of Sleeping Satellites
- Sorting Satellites: First, sort the business satellites in ascending order based on their business volume to obtain the list .
- Calculating the Number of Working Satellites: Determine the number of working satellites V using the relationship between the current business volume and the system’s maximum capacity as per Equation 22:where represents the current total business volume of ; is the maximum computable business volume per satellite (measured in bits); and denotes the ceiling function.
- Determining Satellite Sets: Based on the calculated number V and the sorted list , identify the set of working satellites and the set of sleeping satellites .
4.2. Aggregating Businesses
4.2.1. MDP Element Group
-
Define the elements of the MDP as follows:
- State Space ():where represents the current business volume of satellite .
-
Action Space ():Each action denotes assigning business to satellite .
-
Transition Probability (): The probability of assigning business to satellite is represented as:This denotes the probability of satellite transitioning from state to state after action .
- Reward Function (): The reward for assigning business to satellite is defined as:where is the energy difference associated with the business transfer; is the idle power consumption; and are the transmission and reception power consumptions, respectively; and and represent delay components.
- Discount Factor (): is a discount factor between 0 and 1 that balances the importance of current and future rewards.
4.2.2. Double DQN
- Online Network (): The primary network used for selecting actions.
- Target Network (): A secondary network with parameters that are periodically updated to stabilize training.
- Action Selection Using -Greedy Strategy: Based on the -greedy policy, an action is selected in a given state as per:where is a probability between 0 and 1 representing the likelihood of choosing a random action for exploration, and is the action-value function of the online network.
- Calculating Transition Probabilities: The probability of transferring business to satellite is computed as:where is the maximum business volume that satellite can handle, and represents the current business volume of satellite .
- Reward Calculation: The reward for taking action in state is calculated as:
- Updating the Q-Values Using the Bellman Equation: Update the Q-value of the online network using the Bellman equation as shown:where: where is the previous Q-value before the update, is the learning rate, is the discount factor, and are the parameters of the target network.
- Convergence Check: Perform a convergence check to determine if the Q-values have stabilized:where is a small threshold value. If remains below for multiple consecutive updates, the Q-values are considered to have converged, indicating that the optimal business aggregation strategy for has been found.
5. Simulation Analysis
5.1. Simulation Parameter Design
5.2. Strategy Simulation and Analysis
- Ant Colony Strategy (AC): This strategy generates aggregation decisions and resource aggregations by simulating ant colony behavior for global optimization.
- MDP-QL Strategy (MDP-QL): This strategy constructs a Q-table to generate aggregation decisions and resource aggregations, aiming to achieve global optimization.
6. Conclusion
Acknowledgments
References
- ITU. Framework and Overall Objectives of the Future Development of IMT for 2030 and Beyond. Available online: https://www.itu.int/en/publications/Pages/default.aspx (accessed on Day Month Year).
- Zhu, X.; Jiang, C. Creating Efficient Integrated Satellite-Terrestrial Networks in the 6G Era. IEEE Wireless Communications 2022, 29, 154–160. [Google Scholar] [CrossRef]
- Yue, P.; An, J.; Zhang, J.; Ye, J.; Pan, G. Low Earth Orbit Satellite Security and Reliability: Issues, Solutions, and the Road Ahead. IEEE Communications Surveys & Tutorials 2023, 25, 1604–1652. [Google Scholar]
- Angmo, D.; Yan, S.; et al. Toward Rollable Printed Perovskite Solar Cells for Deployment in Low-Earth Orbit Space Applications. ACS Applied Energy Materials 2024, 7, 1777–1791. [Google Scholar] [CrossRef]
- Song, Y.; Li, X.; et al. Energy-Aware Task Offloading and Resource Allocation in the Intelligent LEO Satellite Network. In Proceedings of the IEEE Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC); 2022; pp. 481–486. [Google Scholar]
- Al-Hraishawi, H.; Chatzinotas, S. Quantum Optimization Algorithm for LEO Satellite Communications Based on Cell-Free Massive MIMO. In Proceedings of the IEEE International Conference on Communications Workshops (ICC Workshops), 2023.
- Bhattacharjee, D.; Madoery, P.G.; et al. On-Demand Routing in LEO Mega-Constellations with Dynamic Laser Inter-Satellite Links. IEEE Transactions on Aerospace and Electronic Systems 2024. [Google Scholar] [CrossRef]
- Wang, Y.; Gui, K.A. An Energy Consumption Minimization Resource Allocation Strategy in Satellite-Terrestrial Communication Networks. Electronics Letters 2023, 59, e13025. [Google Scholar] [CrossRef]
- Raj, R.; Dharmaraja, S. Stochastic Modelling of Multi-Layer HAP-LEO Systems in 6G for Energy Saving: An Analytical Approach. Computer Communications 2023, 210, 22–34. [Google Scholar] [CrossRef]
- Gao, X.; Kaushik, A.; et al. Virtual Network Function Placement in Satellite Edge Computing with a Potential Game Approach. IEEE Transactions on Network and Service Management 2022, 19, 1243–1259. [Google Scholar] [CrossRef]
- Zhang, Y.; Wu, Q.; et al. Energy Drain Attack in Satellite Internet Constellations. In Proceedings of the IEEE/ACM International Symposium on Quality of Service (IWQoS); 2023; pp. 1–10. [Google Scholar]
- Lin, X.; Liu, A.; Han, C.; et al. LEO Satellite and UAVs Assisted Mobile Edge Computing for Tactical Ad-Hoc Network: A Game Theory Approach. IEEE Internet of Things Journal 2023. [Google Scholar] [CrossRef]





| Parameter | Value | Parameter | Value |
|---|---|---|---|
| Control satellite altitude | 2000 km | 1–3 W | |
| Business satellite altitude | 500–1500 km | 20–40 MHz | |
| Number of Control satellite | 1 | 20–40 MHz | |
| Number of business satellites | 5 | bit/s | |
| Number of businesses | 1000–9000 | f | cycle/s |
| 10–50 W | -140.4 dBm | ||
| 5–25 W | -110 dBm | ||
| 60–415 W | 300 cycle/bit | ||
| 5–8 W | 1 s | ||
| 3–6 W | 0.6 s | ||
| 45–55 W | bit– bit |
| Parameter | Value |
|---|---|
| Learning Rate (lr) | 0.001 |
| Epsilon Decay | 0.995 |
| Discount Factor () | 0.99 |
| Episodes | 500 |
| Number of Businesses | Delay Increment (s) | Energy Reduction Ratio |
|---|---|---|
| 1000 | 0.0161 | 47.87% |
| 3000 | 0.0227 | 22.01% |
| 5000 | 0.0203 | 11.34% |
| 7000 | 0.0304 | 7.83% |
| 9000 | 0.0299 | 4.36% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).