Submitted:
20 May 2025
Posted:
20 May 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Problem Definition
2.1. Network-Based Uncapacitated Facility Location Problem (N-UFLP)
- Node Location Integration: Facilities and customers are co-located within a unified node set, where the first N nodes serve dual roles as both customers and candidate facility locations. This contrasts with traditional UFLP, which treats facilities and customers as distinct sets and introduces self-service constraints (e.g., a facility can serve itself). This approach better reflects practical applications such as telecommunications, logistics, and waste management systems, where real-world nodes (e.g., intersections, hubs, or urban locations) can act both as service origins and destinations, improving model fidelity to network realities [5,42,43].
- Path-Dependent Distance Metric: Transportation costs are determined by shortest-path distances on the network, calculated as the sum of edge weights between nodes using the Floyd-Warshall algorithm. Unlike classical UFLP models, which assume Euclidean distances or direct transportation costs, this approach dynamically computes path costs based on the network topology. For many network applications, e.g., transportation and waste collection, using shortest-path network distances instead of geometric distances is essential for operational feasibility, as real-world routes are dictated by the topology of the network and its connectivity [2,43,44].
- Network Distance Matrix: A precomputed M×M matrix stores all-pairs shortest path distances for M nodes, replacing conventional transportation cost matrices. Storing precomputed shortest-path distances allows large-scale problems to be solved more efficiently, enabling rapid cost evaluations within metaheuristic or exact optimization frameworks commonly used in both academic study and industrial practice [2,29].
- Network Connectivity Constraints: Accommodates disconnected subnetworks (e.g., isolated traffic zones) by introducing virtual edges to connect orphaned nodes to the nearest network component, ensuring full service coverage. This is especially vital in urban logistics, telecommunications, or emergency services, where service continuity across potentially disconnected or dynamically changing networks must be ensured [42,45]. Virtual or auxiliary links are routinely adopted in real-world applications to guarantee full coverage.
- Demand-Weighted Travelled Cost: Combines customer demand volumes with path distances to generate a demand-weighted travelled cost metric. Weighting travel costs by demand mirrors operational objectives in logistics, emergency services, and supply chain management, where both the volume and the distance traveled significantly influence the optimal facility placement and overall system efficiency [42,46].

- (a)
- Transportation Cost Rate (α): A scalar parameter α multiplies the effective distance to yield the transportation cost.
- (b)
- Facility Setup Cost (β): A scalar parameter β denotes the unitary fixed cost for opening a facility. This cost is assumed to be identical for all candidate locations.
- (c)
- Demand-weighted traveled distance (Wij): The transportation cost incurred when assigning customer demand hj to a facility at node i is given by Wij= hj · dij.
- Self-Service Feasibility:
- Path Consistency (Triangle-Inequality) Constraint:
- Variables xij and yi are the decision variables representing the assignment relationships and facility location, respectively
2.2. Notations
| Symbol | Definition |
| Sets | |
| I | Set of all nodes (∣I∣=M), including facilities and customers. |
| J | Subset of customer nodes/demand site (∣J∣=N), also candidate facilities. |
| S | Final set of selected facilities. S={s1,s2,…,sp}. p×1 Vector of indices. Output. |
| Asgmt. | 1×N Vector mapping customers to assigned facilities. Output. |
| K | Set of all candidate facility locations. K={1,2,...,N}. |
| Indices | |
| i, r ∈I | Indices for general nodes (facilities or customers). |
| j∈J | Index specifically for customer nodes. |
| t | Iteration index for facility selection, t=1,2,...,p,where p≤N. |
| k∈K | k∈K corresponds to a potential facility. |
| Parameters | |
| diradj | Adjacency-based distance between nodes i and r (direct edge length). |
| Dadj | M×M adjacency distance matrix. |
| dir | Shortest-path distance between nodes i and r. |
| D | M×M shortest-path distance matrix. |
| (hj)j∈J | Demand weight (trip attraction) at customer j (N×1 vector). Input. |
| Wij | Demand-weighted traveled distance from node i to node j. |
| α | Uniform facility setup cost. Input components. |
| β | Transportation cost rate per unit-distance-demand. Input. |
| p | Maximum allowed facilities. |
| Z(s) | TotalCost. Scalar objective value of solution S. Output. |
| Decision Variables | |
| yi | yi∈{0,1}: 1 if a facility is opened at node i; 0 otherwise. |
| xij | xij∈{0,1}: 1 if customer j is assigned to facility i; 0 otherwise. |
| Key Matrices | |
| NodeNames | M×1 cell array of unique node identifiers. Input components#. |
| NodeCoords | M×2 matrix of geographic coordinates (longitude, latitude). Input. |
| DistTable | L×3 matrix of adjacency distances (origin, destination, edge length). Input. |
| DistanceMatrix | M ×M matrix of precomputed shortest-path distances. Input. |
2.3. Assumptions
- A customer node can be served by a facility co-located at the same node.
- All facilities incur identical fixed setup costs, i.e., no site-dependent cost variations.
- Facilities are either fully open or closed; partial opening is prohibited.
- Each customer is entirely served by a single facility.
- There is no upper limit on the demand a facility can serve.
- All costs, demands, and distances are deterministic and known with certainty.
- The maximum number of facilities p is the square root of N (budget-constrained).
3. Methodology
3.1. Baseline Methods
3.1.1. Roulette Wheel Initialization (RWI)
- St-1 is the set of facilities already selected before iteration t.
- K\St-1 is the set of remaining candidate nodes.
- In the basic version one may set wk=(dkadj)min.
- st~Discrete(P(·)) denotes that st is drawn from the discrete distribution defined by the probabilities Pk.
3.1.2. Greedy Initialization (GI)
- St - denote the set of selected facilities after iteration t (S0=∅).
- let K\St-1 be the candidate nodes, unselected candidate nodes (recall K={1,2,...,N}).
- (a)
- Selection Criterion
- (b) Update
3.1.3. Neighborhood Search Algorithm (NS) and Greedy-initialized Neighborhood Search
3.1.4. Variable Neighborhood Search (VNS) and Greedy-initialized Variable Neighborhood Search
- N1: Single-swap operator.
- N2: Double-swap operator.
- N3: Demand-weighted perturbation.
3.2. Demand-Weighted Roulette Wheel Initialization (DWRWI)
3.2.1. Theory
- t: Iteration index for facility selection, t=1,2,...,p,where p≤N.
- St={s1,s2,...,st}: Set of currently selected facility locations by the end of iteration t.
- st: Newly selected facility at the t-th iteration, where st∈K\St-1.
- (a)
- Minimum Distance Calculation:
- (b) Weight Assignment:
- Demand Satisfaction: Prioritize nodes with higher demand (hk) to maximize service coverage.
- Spatial Dispersion: Favor Nodes farther from existing facilities (dk)minto avoid clustering while considering network topology.
- (a)
- Probability Normalization:
- Ensure valid probability distribution (∑Pk=1)
- Proportionally allocate selection likelihood based on relative weights.
3.2.2. Theorem (Correctness and Termination)
- Compute dkmin=mini∈St-1 D(k,i) (at most t-1 comparisons).
- Compute weight wk=dkmin·hk.
- Normalize weights and sample (both O(|K\St-1|)).
4. Results
4.1. Setting Up the Case Study
| Terminology | Abbrev. | Brief Explanation | Reference |
|---|---|---|---|
| Mixed Integer Programming with Branch-and-Price Algorithm |
MIP | Applies the Branch-and-Price algorithm | Corberán et al., 2019; Arslan et al., 2021; Barbato & Gouveia, 2024 |
| Greedy Algorithm with Drop Strategy | Greedy | A myopic algorithm that iteratively selects facility sites to minimize demand-weighted total distance. | Kuehn & Hamburger, 1963; Gokalp, 2020 |
| Genetic Algorithm | GA | Metaheuristic algorithm involving selection, crossover, and mutation. | Moreno-Perez et al., 1994; Alp et al., 2003; Fathali, 2006 |
| Lagrangian Relaxation Algorithm | Lagrangian | Generates feasible solutions and calculates lower bounds iteratively. | Beltran et al., 2006; Beltran-Royo et al., 2012; Nezhad et al., 2013 |
| Neighborhood Search Algorithm with RI | NS1964 | Iteratively updates facility locations and assignment relationships to minimize demand-weighted total distance. | Maranzana, 1964 |
| Neighborhood Search with DWRWI | NS2025 | Incorporates Demand-Weighted Roulette Wheel Initialization (DWRWI). | |
| Variable Neighborhood Search with RI | VNS1997 | Metaheuristic method with key steps: shaking, local search, and swap evaluation. | Hansen & Mladenović, 1997 |
| Variable Neighborhood Search with DWRWI | VNS2025 | Incorporates DWRWI and adjusts the p-range for enhanced performance. | |
| Neighborhood Search with GI | NSG/ NSGreedy |
Combines greedy initialization with the Neighborhood Search algorithm. | Kuehn & Hamburger, 1963; Gokalp, 2020 |
| Variable Neighborhood Search with GI | VNSG/ VNSGreedy |
Combines Greedy Initialization with the Variable Neighborhood Search algorithm. | Kuehn & Hamburger, 1963; Gokalp, 2020 |
| Demand-Weighted Roulette Wheel Initialization | DWRWI | A construction method, whereas NS2025 and VNS2025 belong to the DWRWI-based methods. | |
| Greedy Initialization | GI | A construction method, NSG and VNSG are both categorized as GI-based methods. | Kuehn & Hamburger, 1963; Gokalp, 2020 |
| Random Initialization | RI | A construction method, whereas NS1964 and VNS1997 belong to the RI-based methods. | Celebi et al., 2013 |
| Scenarios | Instances | Zones (Demands) | Nodes | Links |
|---|---|---|---|---|
| Methodology Demonstration (§4.2) | 6-points UFLP NetData | 6 | 6 | 5 |
| Analysis of Case Cohorts (§4.4) Application Analysis (SF inst., §4.5) |
SiouxFalls | 24 | 24 | 76 |
| Berlin-Friedrichshain | 23 | 224 | 523 | |
| Berlin-Tiergarten | 26 | 361 | 766 | |
| Berlin-Mitte-Center | 36 | 398 | 871 | |
| Anaheim | 38 | 416 | 914 | |
| Berlin-Mitte-Prenzlauerberg-Friedrichshain-Center | 98 | 975 | 2184 | |
| Large-scale Network (§4.3) | GoldCoast | 1068 | 4807 | 11140 |
| Terminology | Program Var. | Brief Explanation | |
|---|---|---|---|
| Input Components | nodeNames | Nodes Names | M×1 cell matrix structure. A unique set of node identifiers, where M is the total number of nodes. Facilities and customers share the same node set. The first N nodes serve as both customers and candidates for facilities. Facilities and customers can be co-located. |
| Nodes Coordinates | nodeCoords | M×2 double matrix structure. Each row represents the (x, y) coordinates (longitude/latitude) of a node, where M is the number of nodes. | |
| Distance Matrix/DistTable | DistTable | L×3 double matrix structure. Columns represent: start node, end node, and edge distance. Distances are symmetric (distance(i,j)=distance(j,i)) and finite (0<distance<∞). For non-adjacent nodes, shortest-path distances are computed. Non-connected nodes are linked using nearest-neighbor operations. | |
| Customer Demands | tripAttraction | N×1 double matrix structure. Represents demand (trip attraction) for the first N nodes in M. The first N nodes of DistanceMatrix represent demand. Referred to as the "demand" variable. | |
| Transportation Cost Rates | transport_rates | Scalar value. Multiplier for transportation cost: transport_rates×distance×demand | |
| Facility Setup Costs | facility_rates | Scalar value. Unitary setup cost for facilities, uniform across all nodes. | |
| DistanceMatrix | M×M matrix storing shortest-path distances between all node pairs. Derived from DistTable, it is symmetric, with diagonal elements as 0. Non-adjacent nodes have infinite distances. | ||
| Output Components | Number of Open Facilities | p | Scalar value representing the number of open facilities in the final solution. Used to validate the solution against constraints |
| Selected Facility Indices | selectedFacilities | p×1 vector storing the indices of open facility nodes in the current solution. Helps locate facilities for further optimization. | |
| Total Cost | totalCost | Scalar value of the objective function. Represents the minimum total cost, including fixed facility setup costs and transportation costs for customer assignments. | |
| LB GAP (%) | facilitate comparative analysis across different methods within the same scenario. | ||
| Customer-Facility Assignment | assignments | 1×N matrix. Each element corresponds to the index of the facility assigned to a customer. Indicates the customer-to-facility mapping. | |
| Silhouett | evaluate clustering quality in customer-to-facility assignments | ||
| Iteration Count | iteration step | Number of iterations during the optimization process. | |
| Computation Time | iteration time | Total computation time (in seconds) to solve the problem. | |
| Result Statistics | Optimal Number of Open Facilities | Best_p | Scalar value. Number of open facilities corresponding to the lowest total cost across 4 runs. |
| Optimal Total Cost | Best_totalCost | Scalar value. Minimum total cost, including fixed and transportation costs. | |
| Iteration Count for Best Solution | Best_iterCount | Scalar value. Number of iterations required to achieve the best configuration. | |
| Computation Time for Best Solution | Best_iterTime | Scalar value. Total computation time (in seconds) to obtain the best solution. | |
| Cost Interquartile Range (IQR) | Cost_IQR | Scalar value. Interquartile range (IQR) of total costs, calculated as Q3−Q1. |
4.2. Small-Scale Illustrative Scenario
| InitTech | Init.SelectedFac | FinalSelectedFac | Assignments | Total Cost | |
| NS1964 | RI | {A,C,E} | {A,D,E} | {A,D,D,D,E,D} | 2375 |
| NS2025 | DWRWI | {B,D,E} | {B,D,E} | {D,B,D,D,E,D} | 2275 |
| NSG | GI | {B,D,E} | {B,D,E} | {D,B,D,D,E,D} | 2275 |
4.3. Ultra-Large-Scale Network Analysis
| p | Selected Facilities Indies | Total Cost |
Iteration Time |
Optimal Toal Cost | Silhouette | Time for Best Solution (s) | |
|---|---|---|---|---|---|---|---|
| NS1964 | 6 | {1204,1347,1556,1940,3191,4603} | 9189353 | 7.98 | 9189353 | 0.3833 | 7.98 |
| 7 | {561,1141,1299,1425,1647,2864,2918} | 9355370 | 9.69 | ||||
| 8 | {561,1133,1299,1347,1499,1630,1736,1940} | 9557402 | 7.52 | ||||
| 8 | {1204,1425,1711,2358,2833,2918,2943,4803} | 9661646 | 7.69 | ||||
| NS2025 | 5 | {2845,2916,3872,3916,4074} | 9373934 | 18.98 | 9372502 | 0.3859 | 14.84 |
| 6 | {1425,1879,2833,3872,4074,4102} | 9402195 | 15.12 | ||||
| 6 | {1425,2918,3872,4074,4102,4550} | 9403390 | 13.75 | ||||
| 5 | {1347,1940,3872,3916,4074} | 9372502 | 14.84 | ||||
| NSG | 7 | {561,1133,1299,1556,1940,2845,4602} | 9206563 | 129.58~139.32 | 9206563 | 0.3752 | 132.39 |
| VNS1997 | 8 | {1499,112,817,2916,3700,3930,4603,1275} | 10552437 | 3628.89 | 10066266 | 0.3626 | 3562.11 |
| 8 | {4007,3219,2155,1567,3085,1724,1969,4413} | 10066266 | 3562.11 | ||||
| VNS2025 | 5 | {622,81,1529,2155,1940} | 9736593 | 2661.41 | 9736593 | 0.3776 | 2666.41 |
| 7 | {1940,202,684,430,3632,785,958} | 10257816 | 2642.73 | ||||
| 7 | {1,458,1940,561,725,3231,975} | 10342441 | 2615.61 | ||||
| 4 | {4484,3926,3695,2155} | 10113549 | 2583.04 | ||||
| VNSG | 7 | {1647,3559,3496,1299,561,2864,1556} | 9497958 | 3285~3457 | 9497958 | 0.3197 | 3361.59 |
4.4. Comparative Analysis Across Case Cohorts: Multi-Method and Multi-Run Evaluation
| Stances | Statistics | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| SiouxFalls | Berlin-Frd. | Berlin-Tgr. | Berlin-M-C | Anaheim | Berlin-M-P-F-C | CRP | Rank | ||
| Optimal Total Cost | MIP | 4.848E+06 | 2.543E+07 | 3.408E+07 | 2.957E+07 | 1.047E+10 | 8.278E+07 | 1.000 | 1.0 |
| Greedy | 1.494E+07 | 2.754E+07 | 3.466E+07 | 3.234E+07 | 1.178E+10 | 8.826E+07 | 1.073 | 6.2 | |
| GA | 1.462E+07 | 2.543E+07 | 3.429E+07 | 3.226E+07 | 1.354E+10 | 1.001E+08 | 1.094 | 5.2 | |
| Lagrangian | 1.883E+07 | 4.628E+07 | 7.365E+07 | 4.700E+07 | 2.398E+10 | 1.216E+08 | 1.743 | 10.0 | |
| NS1964 | 1.530E+07 | 2.543E+07 | 4.061E+07 | 3.063E+07 | 1.109E+10 | 8.549E+07 | 1.059 | 4.7 | |
| NS2025 | 1.786E+07 | 2.769E+07 | 3.654E+07 | 2.965E+07 | 1.119E+10 | 9.298E+07 | 1.058 | 5.2 | |
| VNS1997 | 5.753E+06 | 2.762E+07 | 3.779E+07 | 3.596E+07 | 1.308E+10 | 9.847E+07 | 1.158 | 8.0 | |
| VNS2025 | 5.050E+06 | 2.543E+07 | 3.815E+07 | 3.603E+07 | 1.242E+10 | 9.731E+07 | 1.121 | 7.0 | |
| NSG | 5.076E+06 | 2.647E+07 | 3.408E+07 | 3.039E+07 | 1.077E+10 | 8.560E+07 | 1.028 | 3.3 | |
| VNSG | 5.110E+06 | 2.669E+07 | 3.486E+07 | 3.039E+07 | 1.170E+10 | 8.744E+07 | 1.045 | 4.5 | |
| Total Cost LB GAP ((%) | MIP | 74.26% | 45.05% | 53.72% | 37.08% | 56.34% | 31.93% | - | - |
| Greedy | 20.66% | 40.50% | 52.94% | 31.18% | 50.86% | 27.43% | - | - | |
| GA | 22.35% | 45.05% | 53.44% | 31.37% | 43.53% | 17.68% | - | - | |
| Lagrangian | 0.00% | 0.00% | 0.00% | 0.00% | 0.00% | 0.00% | - | - | |
| NS1964 | 18.74% | 45.05% | 44.86% | 34.82% | 53.76% | 29.71% | - | - | |
| NS2025 | 5.16% | 40.16% | 50.38% | 36.91% | 53.32% | 23.55% | - | - | |
| VNS1997 | 69.45% | 40.32% | 48.69% | 23.49% | 45.45% | 19.04% | - | - | |
| VNS2025 | 73.18% | 45.05% | 48.20% | 23.33% | 48.22% | 19.99% | - | - | |
| NSG | 73.05% | 42.81% | 53.72% | 35.34% | 55.10% | 29.62% | - | - | |
| VNSG | 72.87% | 42.33% | 52.67% | 35.34% | 51.20% | 28.11% | - | - | |
| CostIQR(%) | MIP | 0.00% | 0.00% | 0.00% | 0.00% | 0.00% | 0.00% | - | 1.0 |
| Greedy | 0.00% | 0.00% | 0.00% | 0.00% | 0.00% | 0.00% | - | 2.0 | |
| GA | 11.01% | 8.12% | 11.49% | 2.87% | 1.19% | 7.20% | - | 6.8 | |
| Lagrangian | 0.00% | 0.00% | 0.00% | 0.00% | 0.00% | 0.00% | - | 3.0 | |
| NS1964 | 24.77% | 4.09% | 23.87% | 11.84% | 6.68% | 9.20% | - | 8.2 | |
| NS2025 | 18.57% | 11.00% | 11.06% | 13.13% | 3.08% | 6.24% | - | 7.3 | |
| VNS1997 | 23.53% | 20.82% | 4.49% | 17.33% | 8.38% | 12.36% | - | 8.8 | |
| VNS2025 | 29.70% | 12.24% | 11.94% | 14.10% | 10.03% | 4.52% | - | 8.8 | |
| NSG | 0.00% | 0.00% | 0.00% | 0.00% | 0.00% | 0.00% | - | 4.0 | |
| VNSG | 2.82% | 0.00% | 0.00% | 0.00% | 0.00% | 4.48% | - | 5.0 | |
| Time for Best Solution (s) | MIP | 0.16 | 0.10 | 0.07 | 0.12 | 0.15 | 1.34 | 50.330 | 5.0 |
| Greedy | 0.03 | 0.69 | 2.24 | 3.79 | 4.51 | 112.79 | 606.742 | 8.0 | |
| GA | 0.02 | 0.03 | 0.04 | 0.08 | 0.05 | 0.27 | 15.780 | 3.0 | |
| Lagrangian | 0.04 | 0.44 | 1.31 | 2.71 | 3.25 | 86.00 | 464.278 | 7.0 | |
| NS1964 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.03 | 1.000 | 1.0 | |
| NS2025 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.05 | 2.151 | 2.0 | |
| VNS1997 | 0.35 | 0.87 | 1.66 | 2.30 | 2.43 | 12.26 | 528.073 | 7.7 | |
| VNS2025 | 0.44 | 0.84 | 1.40 | 2.04 | 2.42 | 9.85 | 500.429 | 7.0 | |
| NSG | 0.07 | 0.06 | 0.23 | 0.15 | 0.10 | 0.50 | 41.096 | 4.7 | |
| VNSG | 12.59 | 12.77 | 16.92 | 25.09 | 24.98 | 49.33 | 6107.573 | 9.7 | |
| Silhouette | MIP | 0.3131 | 0.4505 | -0.2623 | -0.2657 | -0.3434 | -0.1849 | 1.420 | 4.0 |
| Greedy | 0.2739 | 0.4505 | -0.2761 | -0.2379 | -0.2838 | -0.1983 | 1.348 | 5.8 | |
| GA | 0.3131 | 0.4505 | -0.3260 | -0.2197 | -0.3412 | -0.2198 | 1.466 | 5.8 | |
| Lagrangian | 0.1884 | 0.2518 | -0.1235 | -0.2790 | -0.3714 | -0.1339 | 1.011 | 6.7 | |
| NS1964 | 0.2979 | 0.4505 | -0.0807 | -0.1764 | -0.3343 | -0.1951 | 1.085 | 4.2 | |
| NS2025 | 0.3073 | 0.4720 | -0.1541 | -0.2837 | -0.3592 | -0.2184 | 1.367 | 6.0 | |
| VNS1997 | 0.3013 | 0.4038 | -0.1445 | -0.1711 | -0.3338 | -0.2038 | 1.179 | 5.0 | |
| VNS2025 | 0.2979 | 0.4505 | -0.1500 | -0.1853 | -0.2788 | -0.1897 | 1.172 | 4.3 | |
| NSG | 0.3067 | 0.4332 | -0.2623 | -0.3257 | -0.3517 | -0.1913 | 1.468 | 6.8 | |
| VNSG | 0.3131 | 0.4332 | -0.2623 | -0.3257 | -0.3431 | -0.1891 | 1.465 | 6.3 | |
4.5. Warehouse Location Analysis
Discussion
- Total Cost: Firstly, when considering total cost outcomes, results from the ultra-large-scale Gold Coast dataset (Section 4.3) illustrate that the VNSGreedy (VNSG) method occasionally attains lower total costs than the DWRWI-based VNS2025. Specifically, VNSG achieved a total cost of approximately 9,497,958, surpassing VNS2025’s best of 9,736,593 by a margin of around 2.5%. This effect can be explained by the inherently myopic nature of greedy initialization: the greedy heuristic aggressively eliminates immediate cost inefficiencies by selecting candidate facilities that yield large local transportation savings early in the process. Such a mechanism ensures rapid cost reduction in the initial phases but risks convergence to inferior global optima due to insufficient exploration of the wider solution space. Consequently, while VNSG may deliver marginally better costs in specific instances, its lack of global search diversity may limit general applicability across more heterogeneous or complex networks.
- Clustering Quality: DWRWI-based methods strongly outperform their greedy and random initialization counterparts. Across several datasets and multi-run experiments summarized in Section 4.4, the Silhouette metric, which evaluates the spatial cohesion and separation of customer-facility clusters, consistently favored DWRWI-based algorithms. NS2025 attained an average Silhouette score of 0.3859—markedly higher than NSG’s 0.3752 and RI’s 0.3833—indicating that DWRWI initialization effectively promotes balanced cluster formation that aligns with demand distribution and network topology.
- Construction Solution: The impact of initialization strategies is further corroborated by the small-scale illustrative scenario from Section 4.2, where the RI-based NS1964 initially chose suboptimal facilities {A, C, E} resulting in inflated total costs (2375) due to poor alignment with high-demand nodes D and E. Though iterative improvements moved the solution towards {A, D, E}, suboptimal placement of the peripheral low-demand node A constrained overall effectiveness. In contrast, DWRWI iteratively selected facilities {D, E, B} with high demand–distance weights, converging rapidly to the global optimum at a total cost of 2275. This example highlights the crucial role of demand- and topology-aware initialization in guiding neighborhood search algorithms towards high-quality solutions efficiently.
- Solution Stability: It is also important to address the inherent trade-offs introduced by DWRWI’s probabilistic sampling. While enabling greater exploration, this stochastic process results in some degree of solution variability, as reflected by nonzero IQR values in cost distributions over multiple runs (Section 4.4). This contrasts with the zero-variability outcomes from exact methods like MIP or deterministic greedy algorithms. However, the extent of cost fluctuations remains modest and well within acceptable margins for practical applications, particularly when balanced against gains in Clustering Quality, solution quality and computational efficiency.
- On the computational front: DWRWI consistently reduces initialization and overall algorithmic complexity relative to the greedy approach. The asymptotic complexity analysis in Section 3.2.4 anticipates a reduction to approximately O(Mp2) for initialization alone, whereas greedy methods incur O(MNp) costs. This efficiency translates concretely in large-scale scenarios. For instance, in Gold Coast experiments (Section 4.3), NS2025 completed optimization in roughly 8 to 19 seconds—nearly a tenfold speedup relative to NSG’s ~130 seconds, while still producing comparable solution quality. Even within the VNS class, VNS2025 attained a 27% faster runtime (~2600 seconds) than traditional VNS1997 (~3562 seconds), highlighting DWRWI’s capacity to accelerate convergence without sacrificing solution quality.
6. Conclusions
Author Contributions
Funding
Abbreviations
| Abbrev. | Terminology |
| DLP | Discrete Location Problem |
| pMP | p-median problem |
| UFLP | Uncapacitated Facility Location Problem |
| N-UFLP | Network-based Uncapacitated Facility Location Problem |
| MIP | Mixed Integer Programming |
| with Branch-and-Price Algorithm | |
| Greedy | Greedy Algorithm with Drop Strategy |
| GA | Genetic Algorithm |
| Lagrangian | Lagrangian Relaxation Algorithm |
| NS1964 | Neighborhood Search Algorithm with RI |
| NS2025 | Neighborhood Search with DWRWI |
| VNS1997 | Variable Neighborhood Search with RI |
| VNS2025 | Variable Neighborhood Search with DWRWI |
| NSG/ | Neighborhood Search with GI |
| NSGreedy | |
| VNSG/ | Variable Neighborhood Search with GI |
| VNSGreedy | |
| DWRWI | Demand-Weighted Roulette Wheel Initialization |
| GI | Greedy Initialization |
| RI | Random Initialization |
| DWRWI | Demand-Weighted Roulette Wheel Initialization |
| GI | Greedy Initialization |
| RI | Random Initialization |
| CRP | Cross-case Relative Performance |
References
- Verter, V. Uncapacitated and Capacitated Facility Location Problems. In: Eiselt, H.; Marianov, V. (eds) Foundations of Location Analysis. Int. Ser. Oper. Res. Manag. Sci. 2011, 155, 1–24.
- Daskin, M. S. Network and Discrete Location: Models, Algorithms, and Applications. Wiley, 1995.
- Saldanha-da-Gama, F.; Wang, S. Discrete facility location problems. In: Facility Location under Uncertainty. Springer, 2024, 356, 25–45.
- Weber, A. On the Location of Industries. Univ. Chicago Press, 1909.
- Hakimi, S. L. Optimum locations of switching centers and the absolute centers and medians of a graph. Oper. Res. 1964, 12, 450–459. [CrossRef]
- Tansel, B. C. A review and annotated bibliography of location problems in the public sector. Eur. J. Oper. Res. 1983, 12, 42–56.
- Beasley, J. E. Lagrangean heuristics for location problems. Eur. J. Oper. Res. 1993, 65, 383–399. [CrossRef]
- Dantrakul, S.; Likasiri, C.; Pongvuthithum, R. Applied p-median and p-center algorithms for facility location problems. Expert Syst. Appl. 2014, 41, 3596–3604. [CrossRef]
- Kariv, O.; Hakimi, S. L. An algorithmic approach to network location problems. II: The p-medians. SIAM J. Appl. Math. 1979, 37, 539–560. [CrossRef]
- Corberán, Á.; Landete, M.; Peiró, J.; Saldanha-da-Gama, F. Improved polyhedral descriptions and exact procedures for a broad class of uncapacitated p-hub median problems. Transp. Res. Part B Methodol. 2019, 123, 38–63. [CrossRef]
- Arslan, O. The location-or-routing problem. Transp. Res. Part B Methodol. 2021, 147, 1–21.
- Barbato, M.; Gouveia, L. The Hamiltonian p-median problem: Polyhedral results and branch-and-cut algorithms. Eur. J. Oper. Res. 2024, 316, 473–487.
- Beltran, C.; Tadonki, C.; Vial, J. Ph. Solving the p-median problem with a semi-Lagrangian relaxation. Comput. Optim. Appl. 2006, 35, 239–260. [CrossRef]
- Beltran-Royo, C.; Vial, J.-P.; Alonso-Ayuso, A. Semi-Lagrangian relaxation applied to the uncapacitated facility location problem. Comput. Optim. Appl. 2012, 51, 387–409.
- Nezhad, A. M.; Manzour, H.; Salhi, S. Lagrangian relaxation heuristics for the uncapacitated single-source multi-product facility location problem. Int. J. Prod. Econ. 2013, 145, 713–723.
- Mokhtar, H.; Krishnamoorthy, M.; Ernst, A. T. The 2-allocation p-hub median problem and a modified Benders decomposition method. Comput. Oper. Res. 2019, 104, 375–393. [CrossRef]
- Ghaffarinasab, N.; Çavuş, Ö.; Kara, B. Y. A mean-CVaR approach to the risk-averse single allocation hub location problem. Transp. Res. Part B Methodol. 2023, 167, 32–53. [CrossRef]
- Lim, G. J.; Ma, L. GPU-based parallel vertex substitution algorithm for the p-median problem. Comput. Ind. Eng. 2013, 64, 381–388.
- Teitz, M. B.; Bart, P. Heuristic methods for estimating the generalized vertex median of a weighted graph. Oper. Res. 1968, 16, 955–961.
- Kuehn, A. A.; Hamburger, M. J. A heuristic program for locating warehouses. Manag. Sci. 1963, 9, 643–666.
- Gokalp, O. An iterated greedy algorithm for the obnoxious p-median problem. Eng. Appl. Artif. Intell. 2020, 92, 103674. [CrossRef]
- Atta, S.; Mahapatra, P. R.; Mukhopadhyay, A. Solving uncapacitated facility location problem using heuristic algorithms. Int. J. Nat. Comput. Res. 2019, 8, 18–50.
- Ghosh, D. Neighborhood search heuristics for the uncapacitated facility location problem. Eur. J. Oper. Res. 2003, 150, 150–162.
- Moreno-Pérez, J. A.; Roda Garcia, J. L.; Moreno-Vega, J. M. A parallel genetic algorithm for the discrete p-median problem. Stud. Location Anal. 1994, 7, 131–141.
- Kratica, J.; Tošic, D.; Filipović, V.; Ljubić, I. Solving the simple plant location problem by genetic algorithm. RAIRO-Oper. Res. 2001, 35, 127–142.
- Alp, O.; Erkut, E.; Drezner, Z. An efficient genetic algorithm for the p-median problem. Ann. Oper. Res. 2003, 122, 21–42.
- Fathali, J. A genetic algorithm for the p-median problem with pos/neg weights. Appl. Math. Comput. 2006, 183, 1071–1083. [CrossRef]
- Afify, B.; Ray, S.; Soeanu, A.; Awasthi, A.; Debbabi, M.; Allouche, M. Evolutionary learning algorithm for reliable facility location under disruption. Expert Syst. Appl. 2019, 115, 223–244.
- Sonuç, E.; Özcan, E. An adaptive parallel evolutionary algorithm for solving the uncapacitated facility location problem. Expert Syst. Appl. 2023, 224, 119956.
- Özgün-Kibiroğlu, Ç.; Serarslan, M. N.; Topcu, Y. İ. Particle swarm optimization for uncapacitated multiple allocation hub location problem under congestion. Expert Syst. Appl. 2019, 119, 1–19.
- Chang, J.; Wang, L.; Hao, J.-K.; Wang, Y. Parallel iterative solution-based tabu search for the obnoxious p-median problem. Comput. Oper. Res. 2021, 127, 105155.
- Mladenović, N.; Brimberg, J.; Hansen, P.; Moreno-Pérez, J. A. The p-median problem: A survey of metaheuristic approaches. Eur. J. Oper. Res. 2007, 179, 927–939.
- Maranzana, F. E. On the location of supply points to minimize transport costs. J. Oper. Res. Soc. 1964, 15, 261–270.
- Hansen, P.; Mladenović, N. Variable neighborhood search for the p-median. Location Sci. 1997, 5, 207–226. [CrossRef]
- Amrani, H.; Martel, A.; Zufferey, N.; Makeeva, P. A variable neighborhood search heuristic for multicommodity production-distribution networks with alternative facility configurations. CIRRELT-2008-35, 2008, 1–29.
- Hansen, P.; Brimberg, J.; Urošević, D.; Mladenović, N. Solving large p-median clustering problems by primal–dual variable neighborhood search. Data Min. Knowl. Discov. 2009, 19, 1–23.
- Irawan, C. A.; Salhi, S. Solving large p-median problems by a multistage hybrid approach using demand points aggregation and variable neighbourhood search. J. Glob. Optim. 2015, 63, 537–554.
- Herrán, A.; Colmenar, J. M.; Duarte, A. A variable neighborhood search approach for the Hamiltonian p-median problem. Appl. Soft Comput. 2019, 80, 603–616.
- Croci, D.; Jabali, O.; Malucelli, F. The balanced p-median problem with unitary demand. Comput. Oper. Res. 2023, 155, 106242.
- Chagas, G. O.; Lorena, L. A. N.; dos Santos, R. D. C.; Renaud, J.; Coelho, L. C. A parallel variable neighborhood search for α-neighbor facility location problems. Comput. Oper. Res. 2024, 165, 106589.
- Gwalani, H.; Tiwari, C.; Mikler, A. R. Evaluation of heuristics for the p-median problem: Scale and spatial demand distribution. Comput. Environ. Urban Syst. 2021, 88, 101656.
- Gendron, B.; Semet, F. Formulations and relaxations for two-level uncapacitated facility location problems. INFORMS J. Comput. 2009, 21, 490–506.
- Adeleke, O. J.; Oladele, D. O. Facility location problems: Models, techniques, and applications in waste management. Recycling 2020, 5, 1–23.
- Tsuya, K.; Takaya, M.; Yamamura, A. Application of the firefly algorithm to the uncapacitated facility location problem. J. Intell. Fuzzy Syst. 2017, 32, 3201–3208.
- Cui, T.; Ouyang, Y.; Shen, Z.-J. M. Reliable facility location design under the risk of disruptions. Oper. Res. 2010, 58, 998–1011.
- Celebi, M. E.; Kingravi, H. A.; Vela, P. A. A comparative study of efficient initialization methods for the k-means clustering algorithm. Expert Syst. Appl. 2013, 40, 200–210.
- Transportation Networks for Research Core Team. Transportation networks for research. GitHub Repository, 2016.
- Balk, B. M.; De Koster, M. B. M.; Kaps, C.; Zofío, J. L. An evaluation of cross-efficiency methods: With an application to warehouse performance. Appl. Math. Comput. 2021, 406, 126261.
- Dolan, E. D.; Moré, J. J. Benchmarking optimization software with performance profiles. Math. Program. 2002, 91, 201–213. [CrossRef]



Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).