2. Related Works
RQ1: What clustering algorithms have been used in UAV-assisted wireless coverage, and how do they perform in disaster scenarios?
Clustering algorithms are vital for optimising UAV-assisted wireless coverage, particularly in disaster scenarios. Among these, K-means and K-means++ have been widely adopted for clustering ground users and UAV deployment. K-means++ enhances the initial centroid selection, significantly improving the user throughput and satisfaction ratio [
1,
2,
3]. The Hybrid K-means PSO approach combines K-means with PSO, achieving a more balanced performance in terms of coverage, energy efficiency, and reliability [
4]. Particle swarm optimization (PSO) is another widely used technique for optimising cluster coverage and 3D UAV placement problems. It exhibits faster convergence and requires fewer UAVs than K-means [
5,
6]. The Ellipse Clustering Algorithm further contributes by adjusting the UAV antenna parameters to maximise the user coverage probability while minimizing the transmission power [
7]. Fuzzy Logic and Fuzzy C-Means (FCM) offer dynamic solutions such as cluster head selection based on sensor node energy and storage levels [
2]. An enhanced version, Fit-FCM, integrates additional factors such as energy, distance, and trust to improve cluster head selection in UAV-based wireless sensor networks [
8]. DBSCAN, a density-based clustering algorithm, has been used in conjunction with convex optimisation for UAV trajectory planning and charger allocation. This integration reduces the number of chargers required while boosting the overall throughput [
9]. Meanwhile, Log Linear Learning has proven effective for UAV deployment in disaster-stricken regions by using a UAV coverage utility function to ensure comprehensive network coverage [
10]. These algorithms have different strengths and weaknesses in disaster scenarios. K-means is effective for initial coverage and user satisfaction, although its performance can vary based on the centroid initialisation and distance metrics used [
1,
3]. It supports user association, optimal UAV placement, and altitude selection to maximise data rates in emergencies [
11]. PSO-based algorithms not only reduce the number of UAVs required, but also outperform GA and artificial bee colony (ABC) methods in terms of execution time and energy efficiency [
6,
12]. Fuzzy logic and fit fuzzy c-means clustering improve the lifetime of the network and reduce packet loss, which are crucial for reliable communication in WSNs assisted by UAVs [
2,
8].
DBSCAN minimises charger use and increases throughput, offering a cost-effective disaster response solution [
9]. Finally, log-linear learning has demonstrated both generality and the best performance compared with optimal selection algorithms in emergency deployments [
10].
Clustering algorithms, such as K-means, PSO, fuzzy logic, FCM, DBSCAN, and log-linear learning, are critical for enhancing the effectiveness of UAV-assisted wireless coverage. These methods contribute significantly to improving coverage, energy efficiency, and system reliability, particularly when the terrestrial communication infrastructure is compromised by the environment. While K-means and PSO offer robust overall performance, algorithms such as Fuzzy Logic and DBSCAN provide specialised advantages in terms of energy conservation and operational cost, making them indispensable in emergency and disaster response scenarios.
Table 1 compares eight clustering methods, K-means, K-means++, PSO, FCM, DBSCAN, and log-linear learning, used in UAV-assisted disaster scenarios. It highlights each method’s type, disaster support capability, optimisation goals, and respective strengths and limitations. For instance, K-means++ offers simplicity and fast convergence but is sensitive to centroid initialisation, whereas DBSCAN is cost-effective in sparse networks but struggles with scalability. This comparison sets the foundation for why a hybrid adaptive clustering method is needed in post-disaster UAV networks (see
Table 1).
RQ2: How have machine learning or neural models been used to guide clustering or adapt path planning in dynamic environments?
Machine learning and neural models have been increasingly applied to guide clustering and adapt path planning in dynamic environments. One notable approach is the Two-Stage Learning-Based Model, where the first stage extracts features of the surrounding environment and predicts the trajectories of dynamic obstacles, and the second stage utilises reinforcement learning to plan a feasible and efficient path based on those predictions. This model demonstrates high predictability and processing capacity for dynamic obstacles and successfully executes planning tasks in various dynamic scenarios [
13]. In the realm of Neural Network-Guided Path Planning, methods such as NST-PRM and NST-RRT employ neural networks trained on datasets generated through probabilistic roadmaps (PRM) and rapidly exploring random trees (RRT). These techniques notably reduce the dataset generation time and enhance path-planning efficiency in environments with numerous obstacles [
14]. A Graph Neural Network (GNN)-Based Planner, further improves planning by leveraging prior exploration experience and minimising replanning costs under unpredictable conditions. This results in a high planning performance with fast speeds and low path costs, outperforming both traditional and other learning-based methods [
15]. Another advancement is the Dynamic Distance Transform Algorithm, in which a neural model adapts the distance transform method for dynamic contexts, forming a dynamically updating potential field. This ensures an effective local adaptation and optimal path planning in dynamic environments [
16].
In terms of Clustering for Path Planning, cluster-based routing for UAVs integrates online path planning, clustering-based network topology, reinforcement-learning-driven cluster management, and data routing. The outcomes include improved coverage, better adaptation to changing environments, enhanced packet delivery ratios, and reduced communication delays [
17]. Furthermore, combining clustering and neural networks for trajectory prediction, such as the use of DBSCAN and multicell neural networks (MCNN), produces accurate short-term 4D trajectory predictions, demonstrating their robustness and effectiveness in dynamic settings [
18,
19].
Among Hybrid and Reinforcement Learning Approaches, Q-Learning-Based Local Planning considers factors such as path length, safety, and energy consumption and shows reliable performance in both static and dynamic environments by avoiding the typical pitfalls of conventional algorithms [
20]. When Artificial Neural Networks (ANNs) are combined with Q-learning, the ANN serves as a path-planning controller, whereas Q-learning generates training samples. This hybrid strategy is more effective than using either technique alone to find optimal paths [
21]. In practical applications, Multi-Robot Systems (MRS) benefit from improved neural dynamics approaches that incorporate territorial mechanisms, resulting in robust and fair path planning in complex and changing environments [
22]. Similarly, Autonomous Vehicles leverage deep reinforcement learning and clustering-based strategies to develop both theoretical models and practical solutions for effectively navigating dynamic surroundings [
23,
24].
The integration of machine learning and neural models has significantly improved path planning and clustering performance in dynamic environments. By enhancing efficiency, adaptability, and accuracy, these methods, particularly those involving reinforcement learning, neural networks, and clustering, effectively address the challenges posed by dynamic obstacles and varying conditions.
Table 2 lists various ML- and NN-based frameworks that assist in UAV path planning and clustering in dynamic environments. It spans methods such as Two-Stage ML+RL models, GNN, neural potential fields, and hybrid RL-routing systems. The table clarifies their roles in clustering, path planning, adaptability to environmental changes, and trade-offs between complexity and data dependency. This provides the motivation for integrating ANN-guided clustering into the proposed model ( see
Table 2).
RQ3: What hybrid clustering or dynamic strategy selection methods exist, and how do they affect trajectory or resource efficiency?
Hybrid clustering and dynamic strategy selection methods have been introduced to enhance trajectory planning and resource efficiency in various applications. One of such approach is the Hybrid Selection Multi-Objective Evolutionary Algorithm (HSMEA), which combines angle and distance metrics for clustering individuals, followed by a hybrid selection mechanism. This method strikes an effective balance between convergence and diversity, making it suitable for complex multi-objective optimisation tasks [
25]. Another notable method, the Density-based K-means (DKGK) approach, utilises Differential Evolution (DE) to promote diversification, K-means for refinement, and a GA enhanced with heuristic crossover to ensure fast convergence. This combination outperforms conventional approaches, such as DE, GA, DE-K-means, and GA-K-means, by significantly reducing intra-cluster distances, leading to better clustering accuracy and efficiency [
26].
The hybrid grasshopper and differential evolution-based optimisation algorithm (HGDEOA) integrates adaptive strategies into DE, thereby enhancing its global search ability and reducing the risk of premature convergence. This results in improved energy stability, higher throughput, and extended network lifetime in Wireless Sensor Networks (WSNs) [
27]. Similarly, the Hybrid Deep Fixed K-Means (HDF-KMeans) approach merges Deep K-Means++ for advanced feature extraction and centroid initialisation with Fixed Centered K-Means to improve stability. This hybrid model enhances clustering accuracy and consistency, particularly in critical domains such as healthcare [
28].
Among the Dynamic Strategy Selection Methods, the Dynamic Genetic Algorithm (DGA) improves the automatic calculation of the number of clusters (k) and improves the population initialisation, genetic operators, and fitness functions. These improvements lead to more accurate clustering outcomes and better estimation of the cluster numbers [
29]. The Gaussian Mutation Adaptive Artificial Fish Swarm Algorithm (GAAFSA) incorporates Gaussian mutation and adaptive strategies to avoid local optima and early convergence, which boosts the network lifespan, increases packet reception rates, and reduces packet loss in Industrial Wireless Sensor Networks (IWSNs) [
30]. Meanwhile, adaptive density peak clustering with Fisher linear discriminant (ADPC-FLD) employs kernel density estimation and weighted Euclidean distances, along with adaptive strategies for selecting cluster centres. This significantly enhances the clustering accuracy and efficiency, particularly in high-dimensional data scenarios [
31].
Regarding the effects on trajectory and resource efficiency, methods such as HGDEOA and GAAFSA contribute significantly to energy efficiency and network longevity by optimising cluster head selection and data transmission in WSNs and IWSNs [
27,
30]. In terms of convergence and diversity, HSMEA and DKGK provide robust clustering solutions that enhance system performance [
25,
26]. For applications demanding high precision, such as healthcare and complex data analysis, techniques such as HDF-KMeans and ADPC-FLD deliver notable improvements in precision and consistency [
28,
31]. Hybrid clustering and dynamic strategy selection methods offer meaningful advancements in trajectory optimisation and resource efficiency. By enhancing energy stability, promoting balanced convergence and diversity, and increasing clustering accuracy, these techniques are valuable in a wide array of technical domains.
Table 3 compares hybrid approaches, such as HSMEA, DKGK, HGDEOA, and adaptive clustering methods. Each combines different algorithms (e.g., DE+GA+K-means) to optimize clustering accuracy, convergence, and resource use. The strengths of these models include improved energy stability and accuracy; however, their limitations often involve parameter tuning or domain-specific constraints. The discussion in this paper leverages this to justify the hybrid APC–DBSCAN + ANN framework (see
Table 3).
RQ4: What fitness functions or multi-objective optimisation approaches are most effective for UAV path planning in DRNs?
To identify the most effective fitness functions or multi-objective optimisation approaches for UAV path planning in DRNs, various insights have emerged from recent research. Several
fitness functions are typically used in this context. One core focus is on minimising both
distance and risk, as seen in methods that apply the Bézier theory and impose constraints such as turning angle and flight altitude [
32]. Similarly, another study incorporated travelling distance and risk along with height, angle, and slope limitations to guide UAV navigation [
33].
Energy efficiency is another critical criterion. Some models aim to reduce fuel consumption, altitude costs, and threat exposure during flights [
34]. In particular, Bézier curve-guided paths optimised via GAs and multi-objective swarm-based strategies are effective in improving energy use [
35]. In
multi-UAV systems, utility-based objectives such as maximising the number of people rescued are coupled with collision avoidance to ensure operational safety [
36]. Additionally,
Quality of Service (QoS) is emphasised in UAV-assisted Mobile Edge Computing (MEC) frameworks, where path planning considers geometric distance, risk level, and terminal user demand [
37]. Several techniques have been proposed for textbfmulti-objective approaches. A
knee-guided differential evolution algorithm directs the search toward optimal UAV paths with smooth trajectory generation [
32], whereas another variant incorporates an
adaptive selection mutation to improve refinement while preserving exploration [
33].
Reinforcement learning (RL) methods are effective in dynamic urban settings, optimising UAV paths under conditions of mobile obstacles and variable threats [
37].
GAs remain widely used for path optimization due to their versatility in complex optimisation. One approach normalizes the fitness criteria and employs swarm-based enhancements for energy efficiency [
35]. Another study combined an
adaptive GA for mission assignment with an
improved artificial bee colony method for optimal path planning [
38]. The
Hybrid Equilibrium Optimizer (HEO) uses techniques such as Gaussian distribution estimation and Lévy flight to divide populations and balance exploration and exploitation [
34]. Moreover,
Ant Colony Optimisation (ACO) has been improved for accurate 3D path planning, minimising flight path length, and terrain threats [
39].
Table 4 outlines the fitness criteria, such as distance, risk, energy, and QoS, alongside the optimisation technique used (e.g., DE, RL, GA, hybrid metaheuristics, ACO). This emphasises that effective UAV path planning must balance efficiency, safety, and adaptability. This reinforces why the proposed GA-based framework incorporates multiple weighted objectives ( see
Table 4).
Effective UAV path planning in DRNs using GA depends on fitness functions that measure distance, risk, energy consumption, and mission utility. Multi-objective optimisation techniques, such as differential evolution, reinforcement learning, GAs, hybrid optimisation, and ant colony optimisation, exhibit strong capabilities in navigating complex dynamic environments. Each technique offers unique strengths, ranging from convergence speed to adaptability, making it appropriate for different operational requirements. Expanding
Table 4,
Table 5 details the scope, strengths, and limitations of specific optimisation approaches, such as knee-guided DE, RL-based planning, GA+swarm hybrids, and enhanced ACO. It shows how each method addresses constraints such as smoothness, collision avoidance, and threat minimisation, further supporting the design choices in the hybrid framework (see
Table 5).
RQ5: How do clustering and optimisation approaches handle varying BS loss, user densities, or terrain constraints in DRNs?