BiLSTM Guided LPA Planning, Re-Planning, and Backtracking for Effective and Efficient Emergency Evacuation

Ramzi Djemai; Hamza Kheddar; Mohamed Chahine Ghanem; Karim Ouazzane; Erivelton Nepomuceno

doi:10.20944/preprints202602.0856.v1

Submitted:

10 February 2026

Posted:

11 February 2026

You are already at the latest version

Abstract

Emergency evacuation in complex and dynamic building environments requires robust and adaptive routing strategies capable of responding to evolving hazards, blocked passages, and changing crowd behaviour. Most existing evacuation planners rely on static geometric representations and lack semantic awareness of the environment, limiting their ability to perform informed re-planning and backtracking when routes become unsafe. This paper proposes a neuro-symbolic evacuation planning framework that integrates Lifelong Planning A* (LPA*) with ontology-driven semantic reasoning and a Bidirectional Long Short-Term Memory (BiLSTM) prediction model. The building’s spatial and semantic knowledge is represented using the Web Ontology Language (OWL) and Resource Description Framework (RDF), enabling automated inference of implicit connections and enforcement of safety policies. The BiLSTM model learns temporal patterns from ontology-consistent evacuation trajectories and provides guidance for remaining-cost estimation and early prediction of routes likely to require backtracking, which is combined with a bounded semantic heuristic to preserve admissibility and optimality guarantees. Simulation results in a multi-floor academic building show that the proposed BiLSTM-guided semantic LPA* framework reduces average evacuation time by up to 9.6%, decreases node expansions by up to 32%, and increases evacuation success rates to 96.2% compared with a purely semantic baseline. The BiLSTM model also achieves strong predictive performance, with a test AUC of 0.92 for backtracking prediction and a next-state accuracy of 87.1%. The proposed framework provides a scalable, explainable, and real-time solution for personalised and policy-compliant evacuation guidance under rapidly evolving emergency conditions.

Keywords:

indoor emergency evacuation management

;

ontological modelling

;

planning heuristics

;

event-driven re-planning

;

reinforcement learning

;

LSTM

;

lifelong planning A*

Subject:

Computer Science and Mathematics - Artificial Intelligence and Machine Learning

1. Introduction

Indoor emergency evacuation presents a complex and highly dynamic decision-making problem. During a crisis, occupants must be guided from arbitrary locations to safe exits while hazards evolve rapidly and unpredictably, leveraging systems such as smart-building digital twins, real-time indoor positioning platforms, and intelligent sensor networks [1]. As illustrated in Figure 1, four major categories of challenges shape this problem: environmental hazards, information and sensing uncertainties, human and crowd dynamics, and algorithmic limitations. Environmental conditions can degrade within seconds—fires spread, smoke reduces visibility, debris blocks corridors, and structural failures render predefined routes unusable. Traditional evacuation approaches that rely on static floor plans or fixed procedures often become ineffective under such circumstances, resulting in costly re-planning or unsafe routing. A second challenge arises from incomplete and unreliable situational information. Indoor sensors may be sparse, malfunction during a disaster, or lack the communication bandwidth needed to provide timely updates on hazard evolution or exit accessibility. Evacuation algorithms must therefore operate under partial knowledge and adapt as new information becomes available—for instance, when a corridor becomes blocked or when a fire alarm signals a newly hazardous zone. Evacuation complexity is further amplified by human and crowd behaviour. Large groups tend to move unpredictably when stressed, forming congestion at bottlenecks, reversing direction abruptly, or deviating from suggested paths. These behavioural dynamics introduce significant uncertainty, complicating route prediction and degrading the effectiveness of rigid evacuation plans.

Existing computational methods address these challenges only partially. Rule-based systems encode fixed procedures but rarely adapt once conditions change. Simulation-based models such as computational fluid dynamics or agent-based simulations offer rich predictive power but are too computationally intensive for real-time use. Graph-based algorithms and linear programming methods compute optimal paths efficiently but typically assume static environments and lack semantic understanding of building spaces. More recently, ML! (ML!) approaches have been explored, yet they require large-scale annotated data and often generalize poorly to new or rapidly evolving scenarios unless continuously retrained.

A fundamental limitation across these methods is the absence of explicit semantic knowledge describing the structure and constraints of the building environment. Treating the space as a simple graph of nodes and edges ignores essential contextual information—for example, whether a location is a room or corridor, whether an area is restricted, or whether a region is likely to become hazardous. Without semantic context, planners may inadvertently choose routes that are unsafe or infeasible, such as directing evacuees into elevator lobbies misidentified as exits.

Building on recent insights in intelligent evacuation planning, this paper proposes a hybrid framework that integrates the computational efficiency of heuristic search with the adaptive intelligence of ontology-based reasoning. At its core, the proposed system employs LPA*! (LPA*!), an incremental variant of A*, to achieve fast and dynamic route planning by reusing prior search results as hazards evolve. A semantic reasoning module then continuously evaluates the ontology-informed state of the environment, detects when hazards invalidate the current plan, and triggers real-time adjustments or semantic backtracking to maintain safety and policy compliance. To support this adaptive behaviour, the proposed system leverages advances in semantic technologies, particularly those enabled by formal knowledge representation. Ontology engineering provides a structured, machine-interpretable model of the building environment—including rooms, corridors, stairwells, exits, and hazards—together with the relationships and constraints that govern their safe use during emergencies. By encoding this domain knowledge in both the OWL! (OWL!) and the RDF! (RDF!), the system benefits from automated reasoning that can infer implicit spatial connections, detect policy violations, and enhance the planner’s situational awareness. Unlike black-box ML! approaches, the ontology-driven layer in the proposed system offers transparency, semantic richness, and explainability. It can justify its decisions—such as excluding a route because it passes through a restricted or unsafe area—thereby improving trustworthiness and supporting accountable, safety-compliant evacuation management. In summary, this paper makes the following original contributions:

Ontology-Driven Building Model: We propose a formal OWL!/RDF!-based ontology to represent both the spatial structure and semantic constraints of a multi-floor building environment, including rooms, corridors, staircases, exits, hazards, and evacuation policies. This knowledge layer enables automated reasoning over implicit connections and safety rules during evacuation planning.
Reasoning-Enhanced Heuristic for LPA*: We introduce a semantic heuristic function that exploits ontology-derived information to guide the LPA*! algorithm. By integrating inferred paths, vertical navigation constraints, and policy-compliant transitions, the planner achieves efficient incremental re-planning without full recomputation.
BiLSTM-Guided Neuro-Symbolic Planning: We develop a neuro-symbolic extension of semantic LPA*! by incorporating a BiLSTM! (BiLSTM!) model trained on ontology-consistent evacuation trajectories. The BiLSTM! predicts remaining evacuation cost and the likelihood of backtracking, providing anticipatory guidance while preserving admissibility through a bounded hybrid heuristic.
Dynamic Re-planning and Semantic Backtracking Mechanism: We design an event-driven re-planning and backtracking strategy that allows the planner to revise unsafe routes in response to evolving hazards (e.g., blocked corridors or structural failures) and to safely return to the last feasible semantic situation when necessary.
Explainable and Policy-Compliant Decision Making: Unlike black-box learning approaches, the proposed framework provides explainable evacuation decisions grounded in ontology reasoning and SWRL rules, ensuring that generated routes comply with safety policies and access constraints.
Realistic Multi-Floor Case Study: We provide a detailed case study conducted in a realistic multi-floor academic building to demonstrate the applicability of the proposed framework under dynamic evacuation scenarios.

The remainder of this paper is organised as follows. Section 2 reviews related work and identifies the research gaps addressed in this study. Section 3 presents the theoretical background, including LSTM! (LSTM!) and BiLSTM! models for temporal prediction and backtracking in evacuation planning. Section 4 details the proposed neuro-symbolic framework, including the ontology-driven building model, semantic reasoning layer, reasoning-enhanced LPA*! heuristic, BiLSTM!-guided hybrid heuristic, and dynamic re-planning and backtracking mechanisms. Section 5 reports the experimental setup and quantitative results, including ablation studies comparing semantic LPA*! and BiLSTM!-guided variants. Finally, Section 6 concludes the paper and outlines limitations and future research directions.

2. Related work

Several studies have been proposed in the literature to address emergency evacuation planning, spanning classical optimisation models, heuristic algorithms, and simulation-based approaches. Existing contributions differ significantly in terms of spatial scale, uncertainty modelling, and the extent to which risk, routing, and resource coordination are jointly considered. To contextualise the proposed approach, prior works are reviewed below according to the scale of the evacuation environment.

Outdoor and City-Scale Evacuation Planning: Large-scale evacuation planning has traditionally focused on network-level optimisation under uncertainty. Goerigk and Grün [2] formulated a robust bus evacuation problem that explicitly accounts for delayed information on evacuee demand, demonstrating how robust optimisation can mitigate uncertainty in urban-scale evacuations involving transit-dependent populations. While effective at the scheduling level, this model does not explicitly incorporate spatially heterogeneous risk or route-level exposure.

To address congestion and route diversity, Chang et al. [3] studied the problem of identifying k discriminative evacuation paths on road networks, aiming to minimise both travel distance and path overlap. Their ant colony optimisation-based heuristic enables congestion mitigation through path diversification, which is particularly relevant for emergency evacuation and rescue logistics. However, environmental hazards and spatial risk intensity are not explicitly modelled [4].

More recent studies have integrated environmental dynamics into city-scale evacuation planning. Li et al. [5] proposed a coupled hydrodynamic and CA! (CA!)-based framework for pedestrian evacuation under flooding scenarios, explicitly accounting for flood depth, flow velocity, and human instability. Similarly, Shao et al. [6] emphasised exposure-based evacuation modelling, highlighting the importance of spatially varying hazard intensity in flood-prone urban regions.

Uncertainty-aware urban evacuation routing has also been investigated using heuristic optimisation techniques. Mao et al. [7] developed a fuzzy integer programming model combined with an improved ant colony optimisation algorithm to minimise expected evacuation time in uncertain traffic environments, addressing both symmetric and asymmetric evacuation demands. Although computationally efficient, the approach remains primarily route-centric and does not consider coordinated decisions on shelters or relief distribution. More recently, Jafarian et al. [8] and Li et al. [9] investigated large-scale evacuation from complementary perspectives. Jafarian et al. [8] focused on the impact of behavioural factors on evacuation efficiency, analysing how individual compliance, response delays, and behavioural heterogeneity influence route choice and overall evacuation performance under emergency conditions. Their work enhances behavioural realism but assumes a fixed evacuation network and does not explicitly optimise risk distribution across geographical zones. In contrast, Li et al. [9] adopted a collaborative evacuation framework in which multiple stakeholders or decision-making entities coordinate evacuation actions to improve system-level efficiency. While this approach captures inter-agent coordination and information sharing, geographical risk is still embedded indirectly through network parameters rather than being modelled as an explicit optimisation criterion. Consequently, neither approach directly addresses the problem of balancing spatially heterogeneous risk across the evacuation network.
Single-Floor Building Evacuations and Simulations: At the building scale, evacuation research has largely relied on simulation-based and heuristic path-planning approaches. Wang et al. [10] proposed a cellular ant optimisation model for passenger ship evacuation, combining CA! with ant colony optimisation on hexagonal grids to accelerate search and reduce the likelihood of local optima. While effective in confined environments, the method assumes static hazards and does not explicitly account for dynamically evolving risk.

Infrastructure-assisted evacuation has also been investigated. Zhang et al. [11] addressed evacuation efficiency through the optimal placement of signage systems in public spaces, modelling pedestrian–signage interactions using a cooperative location framework. Although this improves wayfinding and guidance, the approach focuses on behavioural support rather than adaptive evacuation routing under emergency conditions.

More recently, Baglioni and Jamshidnejad [12] introduced a model predictive control framework for indoor search-and-rescue robots, integrating target-oriented and coverage-oriented objectives under uncertainty. While their approach demonstrates strong performance in dynamic indoor environments, it is primarily designed for robotic exploration and does not directly address large-scale human evacuation or network-level risk balancing.
Multi-Floor and Complex Building Evacuation: Evacuation in complex and multi-level structures introduces additional challenges related to vertical movement, smoke propagation, and inter-floor connectivity. Li and Huang [13] proposed a 3D GIS-based evacuation framework for large public buildings that integrates numerical fire simulation, smoke diffusion, and individual behaviour modelling with A* path planning. Their approach enables risk-aware routing across multiple floors, but relies heavily on simulation outputs and lacks an explicit optimisation mechanism for balancing risk across the evacuation network.

In specialised underground environments, Zheng et al. [14] investigated coordinated rescue-path planning for multiple mine rescue teams using a hybrid FA-MDPSO algorithm and force-directed graph layouts. Their work highlights the importance of multi-team coordination and network visualisation in constrained environments. However, the proposed framework is domain-specific and does not readily generalise to urban evacuation networks involving heterogeneous geographical risk and multiple decision layers.

Across the surveyed literature, most evacuation planners rely either on optimisation (e.g., mixed-integer linear programming (MILP), fuzzy integer programming (IP)), meta-heuristics (ACO! (ACO!), PSO! (PSO!) variants), or agent-based and CA!-based simulation. As Table 1 shows, these systems typically operate on raw grid or graph abstractions, with limited or no semantic understanding of the built environment; almost none incorporate ontological reasoning. Likewise, key indoor-evacuation requirements—such as multi-floor navigation, dynamic re-planning, backtracking, vertical movement, and real-time event adaptation—are only partially supported and often absent. Explainability is also consistently missing, with all prior studies lacking XAI capabilities.

Our work positions itself at this intersection of unresolved issues. Instead of treating buildings as unstructured spaces, this paper introduces a formal building ontology that encodes the semantics of rooms, corridors, exits, constraints, and access policies. This knowledge layer fills the interpretability gap noted in prior work and enables the planner to enforce high-level rules (e.g., avoid hazardous spaces or restricted “Staff Only’’ areas). Combined with a dynamic heuristic graph search method (LPA*!), our system supports continuous re-planning, multi-floor movement, backtracking, mass evacuation, and real-time adaptation—capabilities that prior studies either lack or offer only partially. The result is a hybrid, knowledge-aware planner that delivers explainable decisions while maintaining the efficiency of algorithmic search. To the best of our knowledge, this semantic–algorithmic integration is novel in the context of indoor evacuation planning.

3. Preliminaries

Effective emergency evacuation requires dynamic planning capabilities that can account for evolving conditions such as congestion, blocked paths, and time-dependent hazards. Sequence learning models, particularly LSTM! networks and their bidirectional variant (BiLSTM!), provide a powerful framework for capturing temporal dependencies in evacuation-related data, enabling both forward planning and informed backtracking decisions.

3.1. LSTM for Evacuation Planning

In evacuation planning, the objective is to predict future system states (situations)—such as crowd density, travel time, or route feasibility—based on historical and real-time observations. LSTM! networks are well suited for this task due to their gated memory structure, which allows the model to retain critical past information while discarding irrelevant dynamics.

At time step t, given an input vector

x_{t}

representing evacuation features (e.g., node occupancy, edge capacity, hazard level), the LSTM! updates its internal states as Equation 1:

\{\begin{matrix} f_{t} & = σ (W_{f} [h_{t - 1}, x_{t}] + b_{f}), \\ i_{t} & = σ (W_{i} [h_{t - 1}, x_{t}] + b_{i}), \\ {\tilde{c}}_{t} & = tanh (W_{c} [h_{t - 1}, x_{t}] + b_{c}), \\ c_{t} & = f_{t} ⊙ c_{t - 1} + i_{t} ⊙ {\tilde{c}}_{t}, \\ o_{t} & = σ (W_{o} [h_{t - 1}, x_{t}] + b_{o}), \\ h_{t} & = o_{t} ⊙ tanh (c_{t}), \end{matrix}

(1)

where

h_{t}

encodes a predictive representation of the evolving evacuation state. This forward temporal modeling supports proactive planning by anticipating congestion propagation and estimating future route viability.

3.2. BiLSTM for Backtracking and Route Revision

Emergency evacuation often requires revisiting prior decisions when unexpected events occur, such as sudden bottlenecks or hazard escalation. BiLSTM! networks enhance decision-making by integrating both past observations and future contextual information within a planning horizon.

A BiLSTM! layer consists of a forward LSTM! that processes the evacuation sequence in chronological order and a backward LSTM! that processes it in reverse order. The forward hidden state

{\vec{h}}_{t}

captures planning-oriented dynamics, while the backward hidden state

{\overset{\leftarrow}{h}}_{t}

supports backtracking by evaluating how future constraints affect earlier routing choices. The combined representation is given by:

h_{t} = [{\vec{h}}_{t} | | {\overset{\leftarrow}{h}}_{t}] .

(2)

By jointly modelling forward planning and backwards feasibility, BiLSTM! layers provide a unified representation that enables effective and efficient evacuation strategies, supporting adaptive rerouting, decision revision, and resilience under uncertainty.

4. Research Methodology

This section describes the methodology of the proposed heuristic and knowledge-based evacuation planning system supported by a BiLSTM! prediction module. As shown in Figure 2, the architecture combines symbolic semantic reasoning with data-driven learning to support adaptive evacuation planning and is organised around micro-models capturing topology, events, situations (states), and actions, a planning layer encoding prescriptive, descriptive, and policy rules, and a controller responsible for rule interpretation, heuristic evaluation, and event processing. The detailed structure and composition of each component are discussed in subsequent sections.

To make the neuro-symbolic stack explicit, Table 2 summarises what each layer consumes and produces, and how the responsibilities compose at run time. In execution, events are asserted into the ontology, reasoning and SWRL constraints refresh the set of admissible transitions, and the planner searches over semantic situations using an admissible semantic heuristic. If an event changes costs or invalidates a transition, the replanner incrementally repairs the current plan rather than recomputing from scratch; if a transition fails at action execution time, the backtracker rolls back to the most recent feasible semantic situation and triggers replanning from that point. The BiLSTM module provides guidance signals that bias search towards stable trajectories and more importantly, it does not introduce new actions and does not override SWRL admissibility: it only biases the ordering of expansions through the bounded heuristic in Eq. (9), while SWRL rules and constraints govern which transitions are permitted.

Because evacuation unfolds under randomly triggered events, the planner cannot rely on a static, offline distance map that assumes a single fixed best route. Grid-based distance heuristics (often instantiated as Manhattan distance on rectilinear floor discretisations) are commonly used to approximate corridor traversal in indoor route computation [15,16], but they remain purely geometric and do not encode evacuation policies, restricted areas, or event-driven hazards that can invalidate a route at run time. In our approach, the building state is represented in the ontology, and evacuation behaviour is formulated as policy-driven transitions expressed as (state, action, next-state) triplets. When an event occurs (for example, a fire outbreak [16], congestion build-up [17], or a blocked corridor [16]), the re-planner queries the updated knowledge base to revise the admissible actions and regenerate a feasible action sequence to the goal state. The BiLSTM prediction module complements this symbolic layer by providing temporal guidance on likely congestion evolution from prior trajectories, enabling earlier route revision and backtracking where needed; however, all candidate moves are still filtered by the ontology-defined (state, action, next-state) transitions and the current event assertions, so only semantically admissible actions are expanded. Moreover, admissible moves are instantiated and validated via SWRL! (SWRL!) triplets, ensuring that each expansion corresponds to a policy-compliant (state, action, next-state) transition.

Figure 3 summarises three representative evacuation sequences (state–action sequences); corrective actions (CA) indicate rerouting, and backtracking actions (BTA) indicate deliberate returns to earlier decision points, while each step is an admissible (state, action, next-state) transition instantiated from the SWRL triplets in Table 3 under the current event assertions.

4.1. Evacuation Micro-Models

Evacuation micro-models provide the atomic semantic representations required by the proposed planning and learning algorithms. These models define how evacuees, environments, and actions are represented at a fine-grained level, forming the basis for ontology reasoning, heuristic evaluation, and state-space search.

Topological Modelling: The evacuation environment is represented as a directed semantic graph $G = (V, E)$ , where each node $v \in V$ corresponds to a physical space (e.g., room, corridor, staircase, exit), and each edge $e \in E$ represents a traversable connexion. This topological structure enables the retrieval of explicit neighbours during planning, as follows (Equation 3):

$QueryNeighbors (current, O) .$

(3)
Event Modeling: Events represent discrete occurrences that may alter evacuation dynamics, such as door closures, congestion, alarms, or accessibility changes. Events are represented as assertions that update the ontology’s state, thereby altering the availability of transitions or the cost during planning. Formally, an event $e_{t}$ induces a change in the transition function (Equation 4):

$(s_{t}, a_{t}) \to s_{t + 1},$

(4)

which is dynamically reflected in the inferred neighbour set queried by the planner.
Situation Representation: A situation corresponds to a semantic state describing the current evacuation context of an individual. Each situation $s \in S$ encapsulates: (i) the evacuee’s location, (ii) the structural properties of the space, and (iii) safety and accessibility constraints. Situations serve as nodes in the state-space search graph explored by the LPA*! algorithm.
Action Space Definition: Actions define admissible movements between situations, such as move_to_corridor, enter_staircase, or exit_building. The action space $A$ is constrained by ontology rules and encoded using SWRL! triplets. Each action induces a cost $c (s, a, s^{'})$ used during planning (Equation 5):

$c = C o s t (current, neighbor),$

(5)

These costs may reflect distance, vertical movement penalties, or safety-related preferences.

4.2. Planning Rules

Planning rules encode domain knowledge that governs admissible evacuation behaviour. These rules are formalised using SWRL! and operate over the ontology to define valid transitions, constraints, and preferences during evacuation planning.

Prescriptive Rules: Prescriptive rules define mandatory evacuation actions that must be taken under specific conditions, such as enforcing movement toward exits once an alarm is triggered or preventing access to restricted areas. These rules ensure compliance with evacuation policies and safety regulations.

Formally, prescriptive rules constrain the action space by allowing only transitions that satisfy predefined safety conditions:

$(s_{t}, a_{t}) \to s_{t + 1} if and only if R_{pres} (s_{t}) = true .$
Descriptive Rules: Descriptive rules capture factual relationships within the environment, such as adjacency between spaces, floor connectivity, and accessibility relations. These rules enable the ontology reasoner to infer implicit connections that are not explicitly encoded in the topological graph. Such inferred relations are queried during planning through ontology reasoning, as reflected in Algorithm 3 by the function:

$QueryInferredConnections (current, O) .$
Policy Rules: Policy rules express evacuation preferences rather than strict constraints. Examples include prioritising safer routes, avoiding congested areas, or favouring ramps over stairs for mobility-impaired evacuees. These rules influence the planning process by affecting heuristic evaluation and cost assignment rather than enforcing hard exclusions.
Constraint Encoding: Constraint rules encode physical and operational limitations, such as blocked passages, closed doors, or capacity restrictions. These constraints dynamically prune the state–transition graph by removing infeasible neighbours during node expansion.

Together, planning rules ensure that all paths explored by the planner are semantically valid, policy-compliant, and context-aware while remaining flexible to dynamic environmental changes.

4.3. Controller

The controller constitutes the operational core of the proposed evacuation framework and integrates the functionalities previously described in the system architecture, namely the ontology and reasoning engine, heuristic path planner, event generator, and action interpreter, into a unified execution loop.

At its foundation, the controller maintains an OWL 2 ontology representing the building layout and evacuation knowledge, including spatial entities (e.g., Room, Corridor, Exit), dynamic entities (e.g., Person, Hazard), and action concepts (e.g., EvacuationAction, BlockedPath). Safety policies and access constraints are encoded using axioms and SWRL! rules so that inadmissible transitions are filtered during planning.

Dynamic events generated by the simulator (such as smoke propagation or blocked corridors) are translated into ontology assertions. These updates trigger the reasoning process, which refreshes inferred relations (e.g., admissible connectivity or safety classifications) and signals the planner that replanning is required.

Rule Engine: Applies SWRL!-based planning rules to validate actions and infer implicit state transitions. Routes violating evacuation policies (e.g., restricted areas or unsafe corridors) are rejected at expansion time.
Semantic Interpreter: Maps ontology assertions and inferred relations into executable planning constructs. Semantic situations correspond to graph nodes, actions to transitions, and inferred relations to neighbour sets used by the planner.
Heuristic Engine: Implements the heuristic path planner using semantic LPA*!. At each expansion step, both explicit neighbours (physical adjacency) and inferred neighbours (reasoner-derived connections) are evaluated using the semantic heuristic $h_{o} (n)$ and, when enabled, the learned guidance $h_{l} (n)$ :

$f (n) = g (n) + h (n) .$
Event Processing Mechanism: Produces and processes time-stamped dynamic events (e.g., hazards, blocked corridors, door constraints) and updates the ontology accordingly. These updates modify edge costs and admissibility, which triggers incremental replanning.
Action Interpreter: Translates the planner’s sequence of situations and actions into executable evacuation guidance. Each transition is validated against the current ontology state. If a transition becomes invalid due to a new event, semantic backtracking is invoked and replanning resumes from the last feasible situation.

Because semantic events usually affect only a limited subset of edges and constraints, the controller performs incremental repair rather than full recomputation. Only affected vertices and their local neighbourhood are updated and propagated through the priority queue, preserving consistency with updated semantic constraints while maintaining efficiency.

Algorithm 1 formalises this closed-loop control process, including event handling, replanning triggers, and action execution.

Algorithm 1Dynamic Path Planning with Re-planning and Backtracking

Require:: InitialSituation, OccupantID, BuildingTopology, EventDefinitions, ActionRules
Ensure:: Safe exit notification or reroute guidance
1:: procedure InitializeSystem
2:: $c u r r e n t \leftarrow F E T C H I N I T I A L S I T U A T I O N$
3:: BuildTopologyModel(BuildingTopology)
4:: $e v e n t D e f s \leftarrow LOADEVENTDEFINITIONS$
5:: $a c t i o n R u l e s \leftarrow LOADACTIONRULES$
6:: InitializeTester
7:: end procedure
8:: procedure MainLoop
9:: InitializeSystem
10:: while not ExitReached(OccupantID) do
11:: $e v e n t \leftarrow FETCHNEXTEVENT$
12:: $t y p e \leftarrow CLASSIFYEVENT (e v e n t, e v e n t D e f s)$
13:: if $t y p e = “ Impact ”$ then
14:: HandleImpactEvent(current, event)
15:: else
16:: ContinueNormalPlan(current)
17:: end if
18:: if TesterCheckFails(current) then
19:: $t r i g g e r \leftarrow IDENTIFYTRIGGER (c u r r e n t)$
20:: $p l a n^{'} \leftarrow REPLAN (t r i g g e r, a c t i o n R u l e s)$
21:: UpdateGuidance(OccupantID, plan’)
22:: $c u r r e n t \leftarrow UPDATESITUATION (O c c u p a n t I D, p l a n^{'})$
23:: end if
24:: $c u r r e n t \leftarrow FETCHNEXTSITUATION$
25:: end while
26:: ConfirmExit(OccupantID)
27:: end procedure
28:: procedure HandleImpactEvent( $s i t u a t i o n, e v e n t$ )
29:: $r o u t e \leftarrow FINDSAFEPATH (s i t u a t i o n, e v e n t, a c t i o n R u l e s)$
30:: RelayRoute(OccupantID, route)
31:: LogEventAndRoute(event, route)
32:: end procedure
33:: procedure ContinueNormalPlan( $s i t u a t i o n$ )
34:: $a c t i o n \leftarrow SELECTACTION (s i t u a t i o n, a c t i o n R u l e s)$
35:: ExecuteAction(action)
36:: LogExecution(action, situation)
37:: end procedure
38:: procedure TesterCheckFails( $s i t u a t i o n$ )
39:: $e x p e c t e d \leftarrow PREDICTOUTCOME (s i t u a t i o n)$
40:: $a c t u a l \leftarrow FETCHACTUALSITUATION$
41:: return $a c t u a l \neq e x p e c t e d$
42:: end procedure
43:: procedure IdentifyTrigger( $s i t u a t i o n$ )
44:: $l a s t E v e n t \leftarrow FETCHLASTEVENT$
45:: return AnalyzeImpact(lastEvent, situation)
46:: end procedure

4.4. Neuro-Symbolic Evacuation Planning Framework

This subsection formalises the neuro-symbolic planning methodology that combines ontology-driven semantic reasoning, heuristic graph search, and BiLSTM-based temporal guidance. It operationalises the components described in the controller through three interacting layers: semantic heuristic estimation, incremental planning, and learning-guided refinement.

•

Reasoning-Enhanced Semantic Heuristic: The semantic heuristic

h_{o} (n)

exploits ontology-derived structure that cannot be captured by purely geometric heuristics. It queries inferred evacuation paths, vertical navigation requirements, and policy-compliant transitions to estimate remaining cost:

h_{o} (n) = \{\begin{matrix} GeometricHeuristic (n, g), & if no inferred path exists, \\ (horizontalCost + verticalCost) \times qualityFactor, & otherwise . \end{matrix}

(6)

When a plausible route can be inferred by the reasoner, the heuristic converts semantic evidence into an optimistic estimate combining horizontal traversal and vertical transitions. The factor

qualityFactor (n, g) \in (0, 1]

acts as a confidence discount so that uncertain inferred routes are down-weighted without violating admissibility. Algorithm 2 specifies this computation.

Algorithm 2 Heuristic Estimation Using Inferred Paths

Require:: current node n, goal node g
Ensure:: heuristic estimate $h_{o} (n)$ is returned
1:: $paths \leftarrow QUERYINFERREDPATHS (n, g)$ ▹ Candidate inferred route sequences from n to g via ontology reasoning.
2:: $h_geo \leftarrow GEOMETRICHEURISTIC (n, g)$ ▹ Admissible geometric fallback (e.g., Manhattan/Euclidean in the discretisation).
3:: if $paths = \emptyset$ then
4:: return $h_geo$
5:: end if
6:: $minHops \leftarrow min {| p | : p \in paths}$ ▹ Length of shortest inferred path in number of hops.
7:: $avgCost \leftarrow ESTIMATEAVERAGEEDGECOST (n)$ ▹ Optimistic average traversal cost from this node per edge.
8:: $currentFloor \leftarrow GETFLOOR (n)$
9:: $goalFloor \leftarrow GETFLOOR (g)$
10:: $floorDiff \leftarrow |currentFloor - goalFloor|$
11:: $verticalCost \leftarrow floorDiff \times (\frac{FloorHeight}{ClimbSpeed})$ ▹ Optimistic time to change floorDiff floors via stairs.
12:: $horizontalCost \leftarrow minHops \times avgCost$
13:: $quality \leftarrow ASSESSPATHQUALITY (paths [0])$ ▹ Assess inferred path plausibility under current hazards/policies.
14:: $qFactor \leftarrow QUALITYFACTOR (quality)$ ▹ $qFactor \in (0, 1]$ is a confidence discount: 1 if high-quality, smaller if uncertain.
15:: $h_inf \leftarrow (horizontalCost + verticalCost) \times qFactor$
16:: return $max (h_geo, h_inf)$ ▹ Max of admissible heuristics is admissible and more informative.

•

Semantic Lifelong Planning A*: Semantic LPA*! integrates the heuristic

h_{o} (n)

into an incremental planning framework. The search space is defined by semantic states derived from the ontology, and neighbour expansion combines explicit topological connections with inferred semantic transitions. For each node n, the planner evaluates:

f (n) = g (n) + h_{o} (n) .

(7)

When the ontology is updated due to dynamic events (e.g., blocked corridors or modified traversal costs), only the affected nodes and edges are updated, and vertex consistency is restored using the standard LPA*! bookkeeping (g- and

rhs

-values). This enables rapid local repair instead of full recomputation.

•

Neuro-Symbolic Semantic LPA* (BiLSTM-Guided): To incorporate temporal experience, a BiLSTM model is trained offline on SWRL!-consistent state-action sequences expressed in the same symbolic vocabulary as the planner. At run time, given the movement history

H

and a candidate node n, the model predicts a guidance term:

h_{l} (n) = BiLSTM (H \cup {n}) .

(8)

BiLSTM-guided bounded heuristic.

Let

h_{o} (n)

denote the ontology-derived admissible heuristic estimate of the remaining cost to the goal. The BiLSTM outputs a learned remaining-cost estimate

h_{l} (n) \geq 0

from the recent state–action history. We combine both signals using a bounded convex blend:

h^{'} (n) = min (h_{o} (n), (1 - λ) h_{o} (n) + λ h_{l} (n)), 0 \leq λ \leq 1 .

(9)

By construction

h^{'} (n) \leq h_{o} (n)

for all n. Since

h_{o}

is admissible,

h^{'}

remains admissible, hence the optimality guarantees of semantic LPA*! are preserved. The parameter

λ

controls the strength of the learned guidance whenever

h_{l} (n) < h_{o} (n)

; otherwise the heuristic falls back to

h_{o} (n)

.

Setting

λ = 0

recovers pure semantic LPA*!, while larger values increase the influence of learned guidance without violating optimality or policy constraints. The evaluation function becomes:

f (n) = g (n) + h^{'} (n) .

(10)

Because

h^{'} (n) \leq h_{o} (n) \leq h^{*} (n)

, admissibility and optimality guarantees of LPA*! are preserved. The planner therefore benefits from temporal awareness—such as early detection of routes prone to backtracking—while maintaining explainability and semantic compliance. Algorithm 3 implements semantic LPA*! planning process along with incremental neuro-symbolic extension (BiLSTM-guided).

5. Experiments

5.1. Evaluation Metrics

The performance of the proposed evacuation planning framework was assessed using three complementary metrics that capture evacuation efficiency, robustness, and computational cost.

Mean Evacuation Time ( $T_{avg}$ ): This metric measures the average time required for an occupant to reach a safe exit across all simulated scenarios. It reflects the overall efficiency of the evacuation planner under dynamic conditions.

$T_{avg} = \frac{1}{N} \sum_{i = 1}^{N} T_{i}$

(11)

where $T_{i}$ denotes the evacuation time of the i-th scenario and N is the total number of evaluated scenarios. Percentile evacuation times (e.g., 90th percentile) were also computed to assess worst-case performance under adverse conditions.
Evacuation Success Rate ( $S_{r}$ ): The success rate quantifies the proportion of evacuation trials in which occupants successfully reached a safe exit within a predefined time limit. This metric evaluates the robustness and reliability of the planner in the presence of hazards, re-planning, and backtracking events.

$S_{r} = \frac{N_{success}}{N_{total}} \times 100 %$

(12)

where $N_{success}$ is the number of scenarios that resulted in a successful evacuation and $N_{total}$ is the total number of tested scenarios.

Search Effort (Expanded Nodes $N_{nodes}$ ): Computational efficiency was evaluated by counting the number of nodes expanded by the planner during the search process. This metric directly reflects the complexity of the planning task and the effectiveness of heuristic guidance in reducing unnecessary exploration.

N_{nodes} = \sum_{k = 1}^{K} n_{k}

(13)

where

n_{k}

represents the number of nodes expanded during the k-th evacuation run and K is the number of simulated scenarios.

Algorithm 3 Neuro-Symbolic Semantic LPA* (BiLSTM-Guided)

Require:: start node s, goal node g, ontology $O$ , movement history $H$
Ensure:: returns an optimal evacuation path from s to g, or null if none exists
1:: $openSet \leftarrow {s}$ ; $closedSet \leftarrow \emptyset$
2:: $g S c o r e [s] \leftarrow 0$
3:: $f S c o r e [s] \leftarrow h^{'} (s)$ ▹ Hybrid heuristic per Eq. (9)
4:: while $openSet \neq \emptyset$ do
5:: $current \leftarrow arg {min}_{n \in openSet} f S c o r e [n]$ ▹ Node with lowest estimated total cost
6:: if $current = g$ then
7:: return $RECONSTRUCTPATH (cameFrom, current)$
8:: end if
9:: $openSet \leftarrow openSet ∖ {current}$
10:: $closedSet \leftarrow closedSet \cup {current}$
11:: $neighbors_explicit \leftarrow QUERYNEIGHBORS (current, O)$
12:: $neighbors_inferred \leftarrow QUERYINFERREDCONNECTIONS (current, O)$
13:: $neighbors \leftarrow neighbors_explicit \cup neighbors_inferred$
14:: for all $neighbor \in neighbors$ do
15:: if $neighbor \in closedSet$ then
16:: continue
17:: end if
18:: $c \leftarrow COST (current, neighbor)$
19:: $tentative_g \leftarrow g S c o r e [current] + c$
20:: if $neighbor \notin openSet$ then
21:: $openSet \leftarrow openSet \cup {neighbor}$
22:: else if $tentative_g \geq g S c o r e [neighbor]$ then
23:: continue
24:: end if
25:: $cameFrom [neighbor] \leftarrow current$
26:: $g S c o r e [neighbor] \leftarrow bad hbox$ ▹ End of Semantic LPA* update
27:: $h_{o} \leftarrow SEMANTICHEURISTIC (neighbor, g, O)$
28:: $h_{l} \leftarrow B i L S T M_PREDICT (H \cup {neighbor})$
29:: $h_{l} \leftarrow max (0, h_{l})$ ▹ Ensure non-negative learned estimate
30:: $h^{'} (neighbor) \leftarrow min (h_{o}, (1 - λ) h_{o} + λ h_{l})$ ▹ Bounded convex blend (Eq. (9)); preserves admissibility
31:: $f S c o r e [neighbor] \leftarrow g S c o r e [neighbor] + h^{'} (neighbor)$
32:: end for
33:: end while
34:: returnnull

Re-planning and Backtracking Frequency: The number of re-planning events and backtracking actions was recorded to quantify how often the system needed to revise previously selected routes due to environmental changes (e.g., blocked corridors or emerging hazards). Lower values indicate more stable and anticipatory planning behaviour.
Prediction Performance Metrics (BiLSTM): For the BiLSTM! prediction module, the following supervised learning metrics were employed:

- RMSE! (RMSE!) for remaining cost prediction:

$RMSE = \sqrt{\frac{1}{M} \sum_{j = 1}^{M} {({\hat{c}}_{j} - c_{j})}^{2}}$

(14)

where ${\hat{c}}_{j}$ and $c_{j}$ denote the predicted and true remaining costs for sample j, respectively.

- Area Under the ROC Curve (AUC) for backtracking prediction, measuring the discriminative ability of the classifier between backtracking and non-backtracking trajectories.

- Classification Accuracy ( $A c c$ ) for both backtracking prediction and next-state prediction:

$Acc = \frac{T_{P} + T_{N}}{T_{P} + T_{N} + F_{P} + F_{N}}$

(15)

where $T_{P}$ , $T_{N}$ , $F_{P}$ , and $F_{N}$ denote true positives, true negatives, false positives, and false negatives, respectively.

These metrics jointly provide a comprehensive evaluation of the proposed framework in terms of evacuation efficiency (

T_{avg}

), robustness (

S_{r}

), computational effort (

N_{nodes}

), and predictive accuracy of the learning-based guidance module.

5.2. Trained data

The training data were generated from ontology-consistent evacuation trajectories derived from SWRL! transition triplets. These triplets encode policy-compliant transitions between semantic situations and actions in the Tower Building, following the form (current situation, action, resultant situation). Each trajectory therefore represents a semantically valid evacuation sequence that respects the building topology, safety constraints, and access rules, rather than a purely geometric shortest path.

The transition encoding captures both horizontal and vertical navigation patterns. Horizontal movement is represented through rules that move occupants from inner rooms to rooms and from rooms to corridors, then onward to vestibules and exit interfaces; vertical movement is represented through staircase descent transitions across floors (for example, head_downstairs_to_floor_2 and head_downstairs_to_ground_floor). This structure provides a symbolic abstraction of evacuation behaviour that the BiLSTM! can learn from as sequences of semantically typed movements (e.g., lab → corridor → vestibule → staircase), while remaining consistent with the admissible action space enforced by the ontology.

A comprehensive list of SWRL triplets used to generate trajectories is provided via the repository link in the Data Availability Statement. For readability, a representative in-paper view of the Tower Building transition encoding is reported in Table 4.

5.3. BiLSTM Training and Backtracking Prediction Performance

Table 5 reports the predictive performance of the BiLSTM! model trained on SWRL!-derived evacuation trajectories. The low Cost RMSE! across training (0.058), validation (0.071), and test (0.076) splits indicates that the model learns a stable mapping between historical movement sequences and remaining evacuation cost. The limited gap between training and test error demonstrates good generalisation and the absence of significant overfitting.

Backtracking prediction performance further confirms the effectiveness of the BiLSTM! model. The Backtracking AUC reaches 0.92 on the test set, showing strong discriminative ability between trajectories that will require backtracking and those that will not. Similarly, backtracking accuracy remains close to 88%, which is particularly important for early detection of unsafe or fragile routes. The next-state prediction accuracy of 87.1% on the test set indicates that the model successfully captures sequential dependencies between semantic situations, enabling informed guidance of the planner during node expansion. These results confirm that the BiLSTM, trained on SWRL-guided trajectories, can accurately predict backtracking needs, validating the benefit of integrating learning with planning.

The confusion matrix in Figure 4 provides additional insight into the prediction behaviour. The model correctly identifies the majority of actual backtracking cases (85%) while maintaining a high true-negative rate (90%) for no-backtracking situations. Misclassifications remain limited and relatively balanced, suggesting that the learned representation does not suffer from systematic bias toward either conservative or optimistic predictions. This balance is essential in evacuation contexts, where excessive false alarms could cause unnecessary detours, while missed backtracking cases could lead to dead ends and unsafe routing.

Overall, these results demonstrate that the BiLSTM! is capable of learning meaningful temporal patterns from symbolic evacuation data and can reliably anticipate problematic trajectories before they fully materialise. This predictive capability is leveraged in the hybrid heuristic to guide the semantic LPA* planner away from routes that are likely to fail under evolving hazards.

5.4. Planner Performance and Ablation Results

Table 6 and Figure 5 compare the baseline semantic LPA*! planner with BiLSTM!-guided variants across 50 dynamic evacuation scenarios. The semantic LPA*! baseline achieves an average evacuation time of 124.6 s with a success rate of 91.8%. While this demonstrates the effectiveness of ontology-driven reasoning alone, it also reveals room for improvement in both efficiency and robustness.

Introducing cost-only guidance (

{\hat{c}}_{rem}

) reduces the average evacuation time by 5.1% and decreases the number of explored nodes by 15.2%. This shows that learning-based cost estimation already provides useful directional bias to the planner, enabling faster convergence without compromising success rate. Similarly, backtracking-only guidance (

{\hat{p}}_{bt}

) reduces unnecessary reversals and lowers the number of backtracks from 2.0 to 1.1, confirming that predicting fragile paths helps avoid dead-end exploration.

The full guidance model that combines both remaining cost and backtracking probability achieves the best overall performance. With

λ = 0.3

, the average evacuation time is reduced by 7.5%, and node expansions drop by more than 23%. Increasing the guidance weight to

λ = 0.7

further improves performance, yielding a 9.6% reduction in evacuation time and a 32% reduction in explored nodes. At the same time, the success rate increases to 96.2%, indicating that the planner not only becomes faster but also more reliable in reaching safe exits.

The reduction in replans and backtracks observed in the full guidance variants highlights the complementary nature of semantic reasoning and learning-based prediction. Semantic LPA* ensures policy compliance and explainability, while the BiLSTM! anticipates temporal failure patterns that are not explicitly encoded in the ontology. Their integration therefore leads to fewer corrective actions, smoother trajectories, and more stable evacuation behaviour.

Figure 5 visually confirms these trends, showing a monotonic decrease in evacuation time as learning guidance is introduced and strengthened. Importantly, the bounded hybrid heuristic preserves admissibility, ensuring that optimality guarantees of LPA*! are not violated despite the inclusion of neural predictions.

Overall, the experimental results demonstrate that the proposed neuro-symbolic planner achieves significant improvements in evacuation efficiency, robustness, and computational cost compared to a purely semantic baseline. The ablation study further confirms that both cost estimation and backtracking prediction contribute independently and synergistically to performance gains. These findings validate the central hypothesis that combining ontology-driven reasoning with temporal learning yields a more adaptive and effective evacuation planning framework under dynamic and uncertain conditions.

Figure 6 provide a visual validation of the two BiLSTM guidance signals used by the planner. The first shows the receiver operating characteristic (ROC) curve for the backtracking classifier, plotting the true-positive rate against the false-positive rate as the decision threshold is varied. The curve remains well above the random baseline (diagonal), indicating strong separability between trajectory states that lead to backtracking and those that do not; this behaviour is consistent with the reported AUC = 0.92 on the test set. The second presents a parity plot for the remaining-cost (cost-to-go) regressor, where each point compares the predicted normalised remaining cost with the corresponding ground-truth value. The concentration of samples around the y=x reference line indicates that predictions track the true cost-to-go with limited bias across the full range, supporting the reported RMSE = 0.076. Together, these figures demonstrate that the learned components provide both a reliable binary risk signal (backtracking likelihood) and a quantitatively accurate continuous estimate (remaining cost), motivating their use to guide frontier selection and trigger semantic backtracking during dynamic replanning.

5.5. Discussion

The experimental results indicate that the framework’s gains come from complementary strengths of symbolic reasoning and sequence learning. The OWL/RDF ontology and SWRL rules define a typed, policy-aware state space in which each node is a semantic situation (location plus contextual constraints), rather than an unlabelled graph vertex. During expansion, the planner can reject transitions that violate access policies or traverse areas asserted or inferred as hazardous, while still exploiting ontology-inferred connectivity and constraints that are hard to encode in a purely geometric graph. This is particularly valuable in multi-floor buildings, where safe navigation depends on vertical movement rules, restricted zones, and the rapid invalidation of corridors and stairwells. The case study further supports this point by showing explanation logs that explicitly connect hazard updates to planning decisions (e.g., rerouting when a staircase becomes blocked, then backtracking when a corridor collapses), which improves operational transparency for human supervisors overseeing an evacuation.

The ablation study clarifies how the BiLSTM improves performance beyond semantic LPA* alone. Starting from the semantic baseline (Tavg = 124.6 s, 1,610 expanded nodes, 91.8% success), cost-only guidance (

λ

= 0.3) reduces mean evacuation time to 118.3 s and node expansions to 1,365, indicating that learned remaining-cost estimates provide useful directional bias during frontier selection. Backtracking-only guidance (

λ

= 0.3) reduces backtracks from 2.0 to 1.1 and replans from 3.4 to 2.7, suggesting the model captures temporal patterns that precede fragile route segments under evolving hazards. Combining both signals yields the best outcomes: at

λ

= 0.7, the framework achieves Tavg = 112.7 s, 1,095 expanded nodes, 96.2% success, and only 1.0 backtracks on average. The predictor’s test performance (RMSE 0.076 for remaining-cost prediction and AUC 0.92 for backtracking discrimination) supports the claim that these signals are informative and generalise to held-out trajectories.

A key technical safeguard is the bounded hybrid heuristic, in which the learned component is clipped by the admissible semantic heuristic. This design preserves LPA*’s optimality guarantees while still reducing search effort and limiting exposure to distribution shift. The reported runtimes (typically 50–150 ms per replanning change, and around 50 ms for incremental reasoning after an initial 300 ms pass) are compatible with real-time decision support. Remaining limitations are chiefly experimental: the evaluation is scenario-driven and does not fully model multi-occupant congestion coupling, sensor uncertainty, or failures/latency in ontology updates—factors that should be stressed in future validation.

6. Conclusion and Future Work

This paper advances indoor emergency evacuation planning by integrating ontology-driven semantic reasoning with incremental search and BiLSTM-based temporal guidance. The OWL!/RDF building ontology and SWRL policy rules elevate planning from a purely geometric shortest-path problem to a semantically constrained decision process: situations encode both location and context (space type, access restrictions, hazard status), and only policy-compliant actions are considered. On top of this knowledge layer, LPA* provides efficient event-driven replanning by reusing prior search information instead of recomputing from scratch, while semantic backtracking enables recovery to the last feasible safe situation when hazards invalidate the current route.

Across the reported dynamic scenarios, the BiLSTM-guided variants consistently improved efficiency and robustness over the semantic baseline. The strongest configuration (

λ = 0.7

) reduces mean evacuation time from 124.6s to 112.7s, decreases node expansions from 1,610 to 1,095, lowers replanning and backtracking frequency, and increases evacuation success from 91.8% to 96.2%. These improvements align with the predictor’s quality (test RMSE 0.076 for remaining-cost estimation; AUC 0.92 and 88% accuracy for backtracking prediction; 87.1% accuracy for next-state prediction), indicating that neural guidance is contributing meaningful anticipatory information rather than merely adding variance. In addition, the framework’s reasoning logs demonstrate explainability by linking hazard assertions to route choices, addressing a key limitation of black-box evacuation planners.

Future work should prioritise three main directions: (i) coupling the framework with live sensing and digital-twin updates, including explicit uncertainty modelling for hazards, congestion, and sensor failure; (ii) extending from individual routing to multi-occupant optimisation with congestion coupling, compliance variability, and equitable support for mobility-impaired evacuees; and (iii) improving cross-building transfer through ontology alignment and online adaptation so that both semantic rules and BiLSTM guidance remain reliable under new layouts and event regimes.

Author Contributions

Conceptualization, R.D., H.K., M.C.G., K.O. and E.N.; Methodology, R.D. and H.K.; Data curation, H.K. and M.C.G.; Resources, R.D., H.K. and K.O.; Investigation, R.D., H.K., M.C.G., K.O. and E.N.; Visualization, H.K., M.C.G. and K.O.; Writing—original draft, R.D., H.K. and M.C.G.; Writing—review and editing, R.D., H.K., M.C.G., K.O. and E.N.; Supervision, E.N. and K.O.

Funding

Not Applicable

Institutional Review Board Statement

Not Applicable

Informed Consent Statement

Not Applicable

Data Availability Statement

The data created in this research is made publicly available at the following GitHub repository: https://github.com/rosatron202/RD/

Acknowledgments

During the preparation of this manuscript, the authors used Grammarly (Grammarly Inc.) for language editing (grammar and readability). The authors reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare that they have no known competing interests or personal relationships that could have appeared to influence the work reported in this paper.

References

Oulefki, A.; Kheddar, H.; Amira, A.; Kurugollu, F.; Himeur, Y.; Bounceur, A. Innovative AI strategies for enhancing smart building operations through digital twins: A survey. Energy and Buildings 2025, 115567. [Google Scholar] [CrossRef]
Goerigk, M.; Grün, B. A robust bus evacuation model with delayed scenario information. Or Spectrum 2014, 36, 923–948. [Google Scholar] [CrossRef]
Chang, C.W.; Chen, C.D.; Chuang, K.T. Queries of K-discriminative paths on road networks. Knowledge and Information Systems 2020, 62, 1751–1780. [Google Scholar] [CrossRef]
Djemai, R.; Vassilev, V.; Ouazzane, K.; Dey, M. Knowledge-based reactive planning and re-planning–a case-study approach. In Proceedings of the 2024 IEEE Conference on Artificial Intelligence (CAI), 2024; IEEE; pp. 770–775. [Google Scholar]
Li, B.; Hou, J.; Ma, Y.; Bai, G.; Wang, T.; Xu, G.; Wu, B.; Jiao, Y. A coupled high-resolution hydrodynamic and cellular automata-based evacuation route planning model for pedestrians in flooding scenarios. Natural hazards 2022, 110, 607–628. [Google Scholar] [CrossRef]
Shao, X.; Zhao, Z.; Liu, Z.; Yang, H.; Hu, C. Exposure risk and emergency evacuation modeling of toxic gas leakage in urban areas under the influence of multiple meteorological factors. Environmental pollution 2023, 333, 122044. [Google Scholar] [CrossRef] [PubMed]
Mao, J.; Zhou, Y.; Zhou, Y.; Wang, X. Urban Emergency Evacuation Path Optimization Based on Uncertain Environments to Enhance Response for Symmetric and Asymmetric Evacuation Problems. Symmetry (20738994) 2024, 16. [Google Scholar] [CrossRef]
Jafarian, A.; Granberg, T.A.; Farahani, R.Z. The effect of geographic risk factors on disaster mass evacuation strategies: A smart hybrid optimization. Transportation Research Part E: Logistics and Transportation Review 2025, 193, 103825. [Google Scholar] [CrossRef]
Li, R.; Zhou, Z.; Chen, E.; Luo, X.; Xing, L. Collaborative operation planning for post-disaster victim evacuation and relief distribution on considering heterogeneous rescue teams. Expert Systems with Applications 2026, 297, 129361. [Google Scholar] [CrossRef]
Wang, P.; Zhang, T.; Xiao, Y. Emergency evacuation path planning of passenger ship based on cellular ant optimization model. Journal of Shanghai Jiaotong University (Science) 2020, 25, 721–726. [Google Scholar] [CrossRef]
Zhang, Z.; Jia, L.; Qin, Y. Optimal number and location planning of evacuation signage in public space. Safety science 2017, 91, 132–147. [Google Scholar] [CrossRef]
Baglioni, M.; Jamshidnejad, A. A novel MPC formulation for dynamic target tracking with increased area coverage for search-and-rescue robots. Journal of Intelligent & Robotic Systems 2024, 110, 140. [Google Scholar]
Li, H.; Huang, X. Fire Emergency Evacuation for Large-Scale Public Buildings In 3d Gis. In Proceedings of the IOP Conference Series: Earth and Environmental Science; IOP Publishing, 2021; Vol. 772, p. 012003. [Google Scholar]
Zheng, Q.; Ding, P.; Qin, Z.; Yan, Z. An Investigation into the Rescue-Path Planning Algorithm for Multiple Mine Rescue Teams Based on FA-MDPSO and an Improved Force-Directed Layout. Fire 2025, 8, 188. [Google Scholar] [CrossRef]
Yang, B.; Yan, J.; Cai, Z.; Ding, Z.; Li, D.; Cao, Y.; Guo, L. A novel heuristic emergency path planning method based on vector grid map. ISPRS International Journal of Geo-Information 2021, 10, 370. [Google Scholar] [CrossRef]
Wang, C.; Luo, J.; Zhang, C.; Liu, X. A Dynamic Escape Route Planning Method for Indoor Multi-floor Buildings Based on Real-time Fire Situation Awareness. In Proceedings of the 2020 IEEE 26th International Conference on Parallel and Distributed Systems (ICPADS), 2020; IEEE; pp. 222–229. [Google Scholar]
Nishinari, K.; Kirchner, A.; Namazi, A.; Schadschneider, A. Extended floor field CA model for evacuation dynamics. IEICE Transactions on information and systems 2004, 87, 726–732. [Google Scholar]

Figure 1. Key challenges in indoor emergency evacuation, including dynamic hazards, incomplete or unreliable sensing, human and crowd behavioural uncertainty, and limitations of existing planning algorithms.

Figure 2. Proposed evacuation process of the BiLSTM-guided neuro-symbolic semantic A* planning framework.

Figure 3. Ontology-based scenarios representation for Tower Building over multiple floors.

Figure 4. Confusion matrix for the BiLSTM! backtracking prediction head on the test set (counts with row-normalised percentages).

Figure 5. Average evacuation time

T_{avg}

across semantic LPA*! and BiLSTM-guided LPA* variants (values and relative improvement over the semantic baseline).

Figure 5. Average evacuation time

T_{avg}

across semantic LPA*! and BiLSTM-guided LPA* variants (values and relative improvement over the semantic baseline).

Figure 6. Backtracking classifier ROC curve and Remaining-Cost Prediction Parity.

Table 1. Summary of prior related emergency evacuation studies and the open research gaps they leave. Symbols ✓ , ✗ , and ≈ mean Yes, No, and Partial, respectively.

Ref.	Data source	NoF	PP technique	RP	Indoor	DR	BT	Mass	VN	OR	XAI	RTEA
[2]	Synthetic + real city case	NA	MILP + robust optimisation	✗	✗	≈	✗	✓	✗	✗	✗	≈
[3]	Real road networks	NA	ACO-based multi-objective routing	✗	✗	✓	✗	✓	✗	✗	✗	✗
[5]	Hydrodynamic simulation + GIS	NA	CA + heuristic shortest path	✗	✗	✓	✗	✓	✗	✗	✗	≈
[7]	Synthetic + benchmark networks	NA	Fuzzy IP + improved ACO	✗	✗	✓	✗	✓	✗	✗	✗	≈
[8]	Behavioural simulation data	NA	Behaviour-aware routing model	✗	✗	≈	✗	✓	✗	✗	✗	✗
[9]	Multi-agent simulated networks	NA	Collaborative optimisation framework	≈	✗	✓	✗	✓	✗	✗	✗	≈
[10]	Ship layout (grid-based)	1	Cellular automata + ACO	✗	✓	✗	✗	✓	✗	✗	✗	✗
[11]	Public hall experiments	1	Signage location optimisation	✗	✓	✗	✗	✓	✗	✗	✗	✗
[12]	Indoor robotic simulations	1	MPC-based planning	✓	✓	✓	≈	✗	✗	✗	✗	✓
[13]	3D GIS + fire simulation	8	A* + fire dynamics	✗	✓	✓	✗	✓	✓	✗	✗	≈
[14]	Mine network simulation	NA	FA-MDPSO + graph layout	✗	✓	✓	✗	✗	✓	✗	✗	✗
Our	Real building ontology	11	Neuro-Symbolic Semantic LPA* (BiLSTM-Guided)	✓	✓	✓	✓	✓	✓	✓	✓	✓

Abbreviations: Number of floors (NoF); Path planning (PP); Re-planning (RP); Dynamic routing (DR); Backtracking (BT); Mass evacuation (Mass); Vertical navigation (VN); Ontological reasoning (OR); Explainability (XAI); Real-time event adaptation (RTEA).

Table 2. Roles and interactions of the main components in the neuro-symbolic evacuation stack.

Component	Consumes → Produces	Responsibility and constraint
OWL/RDF ontology (state model)	Building entities, occupants, hazards, events → semantic situations and facts	Defines the semantic state space (situations) and typed entities (Room, Corridor, Exit, Hazard). All planning operates over these semantic representations.
SWRL rules + constraints (planning rules)	Ontology facts + event assertions → admissible (state, action, next-state) transitions and pruned neighbours	Acts as the admissibility gate: transitions that violate safety or access constraints are rejected at expansion and at execution time. Also supports inferred connections and policy-driven constraints.
Semantic heuristic $h_{o} (n)$	Current node + goal + ontology inferences → admissible remaining-cost estimate	Guides search using ontology-derived structure (including inferred connectivity and vertical navigation). Must remain admissible to preserve optimality guarantees.
Planner (Semantic LPA*)	Admissible neighbours + costs + heuristic → candidate evacuation path	Performs incremental heuristic search on semantic situations. Uses $f (n) = g (n) + h (n)$ to prioritise expansions while respecting admissibility and ontology constraints.
Replanner (event-driven repair)	New event assertions + updated ontology → repaired plan	Triggered when events change costs or invalidate transitions. Updates only affected portions and repairs the prior search results (incremental replanning rather than full recomputation).
Backtracker (semantic rollback)	Execution failure or invalidated transition → last feasible situation + replanning trigger	When the next action becomes invalid at run time, returns to the most recent safe, semantically valid situation, then restarts planning from that point.
BiLSTM guidance module	History of semantic state-action sequence → predicted remaining cost and backtracking likelihood	Provides learned guidance to bias the planner away from fragile trajectories and towards stable ones. It does not create actions and does not bypass SWRL admissibility. Its influence must be bounded so that semantic admissibility and policy compliance are preserved.
Controller (orchestration loop)	Events + reasoning + planner outputs → executed guidance and logs	Coordinates event processing, reasoning refresh, planning, action execution, replanning triggers, and backtracking. Acts as the runtime glue between symbolic and learned modules.

Table 3. Summary of representative ontology-driven evacuation scenarios illustrated in Figure 3.

Scenario	Key behaviour	Path (state-action sequence)
1	Single trigger event; no obstruction; no rerouting or backtracking.	`S0, A01, S1, A02, S2, A13, S12, A14, S15, A15, S18, A16, S5, A8, S11`
2	Obstruction (E02) invalidates a previously admissible continuation; rerouting via corrective actions (CA1, CA2); no backtracking. BiLSTM guidance biases branch-off selection while feasibility remains enforced by ontology constraints and SWRL rules.	`S0, A01, S1, A02, S2, A3, S3, CA1, S13, CA2, S4, A5, S5, A8, S11`
3	Multiple events; rerouting followed by backtracking due to persistent hazards; explicit return to earlier decision points before recovering a feasible route to the exit.	`S0, A01, S1, A02, S2, A3, S3, CA1, S13, CA2, S4, A12, S4, BTA1, S13, BTA2, S2, A13, S12, A14, S15, A15, S18, A16, S5, A8, S11`

Table 4. Policy-compliant evacuation transitions encoded as SWRL (current situation, action, resultant situation) triplets for the Tower Building.

Current Situation	Action	Resultant Situation
person_in_inner_room(?x)	exit_inner_room(?x, ?y)	person_in_room(?x)
person_on_level_3(?x)	head_downstairs_to_floor_2(?x, ?y)	person_on_level_2(?x)
person_in_lab(?x)	exit_room(?x, ?y)	person_in_corridor(?x)
person_in_lecture_hall(?x)	exit_room(?x, ?y)	person_in_corridor(?x)
person_in_room(?x)	exit_room(?x, ?y)	person_in_corridor(?x)
person_in_corridor(?x)	exit_to_vestibule(?x, ?y)	person_in_vestibule(?x)
person_in_vestibule(?x)	exit_to_staircase_left(?x, ?y)	person_on_staircase(?x)
person_at_ebarriers(?x)	exit_through_ebarriers(?x, ?y)	person_next_to_reception(?x)
person_in_inner_lecture_hall(?x)	exit_inner_room(?x, ?y)	person_in_lecture_hall(?x)
person_next_to_reception(?x)	walk_out_of_building(?x, ?y)	person_outside_building(?x)
person_in_inner_lab(?x)	exit_inner_room(?x, ?y)	person_in_lab(?x)
person_in_office(?x)	exit_room(?x, ?y)	person_in_corridor(?x)
person_on_level_2(?x)	head_downstairs_to_piazza_floor(?x, ?y)	person_on_piazza_floor(?x)
person_on_piazza_floor(?x)	head_downstairs_to_ground_floor(?x, ?y)	person_on_ground_floor(?x)
person_in_classroom(?x)	exit_room(?x, ?y)	person_in_corridor(?x)
person_in_cafeteria(?x)	exit_room(?x, ?y)	person_in_corridor(?x)
person_in_bathroom(?x)	exit_room(?x, ?y)	person_in_corridor(?x)
person_in_corridor(?x)	enter_vestibule(?x, ?y)	person_in_vestibule(?x)
person_in_vestibule(?x)	exit_to_staircase_right(?x, ?y)	person_on_staircase(?x)
person_on_staircase(?x)	head_downstairs_to_ground_floor(?x, ?y)	person_on_ground_floor(?x)
person_on_ground_floor(?x)	exit_building_main_door(?x, ?y)	person_outside_building(?x)
person_outside_building(?x)	proceed_to_assembly_point(?x, ?y)	person_at_assembly_point(?x)
person_on_piazza_floor(?x)	exit_to_piazza_walkway(?x, ?y)	person_outside_building(?x)
person_on_ground_floor(?x)	head_downstairs_to_basement(?x, ?y)	person_in_basement(?x)
person_in_basement(?x)	exit_via_basement_emergency(?x, ?y)	person_outside_building(?x)
person_on_level_4(?x)	head_downstairs_to_floor_3(?x, ?y)	person_on_level_3(?x)
person_on_level_5(?x)	head_downstairs_to_floor_4(?x, ?y)	person_on_level_4(?x)
person_on_level_6(?x)	head_downstairs_to_floor_5(?x, ?y)	person_on_level_5(?x)
person_on_level_7(?x)	head_downstairs_to_floor_6(?x, ?y)	person_on_level_6(?x)
person_on_level_8(?x)	head_downstairs_to_floor_7(?x, ?y)	person_on_level_7(?x)
person_on_level_9(?x)	head_downstairs_to_floor_8(?x, ?y)	person_on_level_8(?x)
person_on_level_10(?x)	head_downstairs_to_floor_9(?x, ?y)	person_on_level_9(?x)

Table 5. BiLSTM performance on SWRL-derived evacuation trajectories.

Split	Cost RMSE	Backtracking AUC	Backtracking accuracy (%)	Next-state accuracy (%)
Training	0.058	0.96	90.4	91.2
Validation	0.071	0.93	88.1	88.4
Test	0.076	0.92	88.0	87.1

Table 6. Semantic LPA* versus BiLSTM-guided LPA* (including ablation variants) over 50 dynamic evacuation scenarios.

Variant	$λ$	Guidance	$T_{avg}$ (s)	$Δ T_{avg}$ (%)	Nodes	$Δ$ Nodes (%)	Replans	Backtracks	Success rate (%)
Semantic LPA* (no learning)	0.0	–	124.6	0.0	1610	0.0	3.4	2.0	91.8
Cost-only guidance ( ${\hat{c}}_{rem}$ )	0.3	cost	118.3	5.1	1365	15.2	3.1	1.7	93.4
Backtracking-only ( ${\hat{p}}_{bt}$ )	0.3	backtracking prob.	120.1	3.6	1440	10.6	2.7	1.1	94.6
Full guidance ( ${\hat{c}}_{rem} + {\hat{p}}_{bt}$ )	0.3	cost + backtracking	115.2	7.5	1235	23.3	2.8	1.2	95.6
Full guidance ( ${\hat{c}}_{rem} + {\hat{p}}_{bt}$ )	0.7	cost + backtracking	112.7	9.6	1095	32.0	2.6	1.0	96.2

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

BiLSTM Guided LPA Planning, Re-Planning, and Backtracking for Effective and Efficient Emergency Evacuation

Abstract

Keywords:

Subject:

1. Introduction

2. Related work

3. Preliminaries

3.1. LSTM for Evacuation Planning

3.2. BiLSTM for Backtracking and Route Revision

4. Research Methodology

4.1. Evacuation Micro-Models

4.2. Planning Rules

4.3. Controller

4.4. Neuro-Symbolic Evacuation Planning Framework

5. Experiments

5.1. Evaluation Metrics

5.2. Trained data

5.3. BiLSTM Training and Backtracking Prediction Performance

5.4. Planner Performance and Ablation Results

5.5. Discussion

6. Conclusion and Future Work

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

MDPI Initiatives

Important Links

Subscribe