Submitted:
05 February 2026
Posted:
06 February 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Works
2.1. Large Language Model Empowerment of Knowledge Graphs
2.2. Application Research of Knowledge Graphs in Water Conservancy Facility Safety
3. Methods
3.1. Overall Research Framework
3.2. Model Construction Process
3.2.1. Multi-Source Heterogeneous Data Processing
- It is necessary to remove formatting errors and garbled characters from the text, eliminate content that fails to meet quality and relevance requirements, while preserving the original format and expression forms of domain-specific materials [29]. This ensures the purity and consistency of the textual data.
- This step is performed using Python programs for regex-based cleansing.
- This phase involves manual curation, segmentation of long sentences, and removal of low-value information to provide robust data support for subsequent construction of the water conservancy facility safety knowledge graph.
3.2.2. Domain Ontology Modeling
- denotes the Comprehensive Water Conservancy Facility Safety Ontology;
- represents the Agency and Personnel Ontology;
- refers to the Engineering Equipment Ontology;
- indicates the Risk and Hidden Danger Ontology;
- signifies the System and Process Ontology;
- captures the Inter-Relationships among these ontologies.
3.2.3. Retrieval-Augmented Knowledge Extraction with Large Language Models
- Text Segmentation
- Prompt Design And Entity-Relationship Extraction.
- Based on the partitioned TextUnits and leveraging the optimization strategy of contextual prompts, the large language model (LLM) is invoked for sequential processing to automatically identify and extract involved entities and their semantic relationships, generating preliminary triple structures.
- Knowledge Graph Construction And Community Generation.
3.2.4. Graph Database Storage and Visualization
4. Construction of Water Conservancy Facility Safety Knowledge Graph
4.1. Data Sources
4.2. Knowledge Graph Construction
4.2.1. Construction of Water Conservancy Facility Safety Ontology
- (1)
- Regarding the construction of the Agency and Personnel Ontology:At the institutional level, entities are classified into five categories based on hierarchical relationships, business scenarios, and emergency roles: government regulatory departments, project management units, construction and operation enterprises, technical support agencies, and emergency coordination agencies.At the personnel level, individuals are categorized into five types according to qualification constraints, duty associations, and cross-industry mappings: unit responsible persons, technical management personnel, operation and maintenance staff, administrative support personnel, and emergency response personnel.Given the complex and dynamic nature of organizational arrangements for institutions and personnel, this paper constructs separate models for institutions and personnel based on this classification to enhance the targeting and efficiency of institutional and personnel arrangements in water conservancy projects. Table 2 demonstrates the construction of the institution ontology using government regulatory departments as an example, while Table 3 illustrates the personnel ontology construction with technical management personnel as an example.
- (2)
- Regarding the construction of the Engineering Equipment Ontology:Based on ISO 55000 (Asset Management Standards) and water conservancy engineering systems theory, core concepts mentioned in various standard documents are refined. Engineering equipment is classified into five categories: water-retaining engineering equipment, water-discharging engineering equipment, water-diversion engineering equipment, monitoring and control engineering equipment, and auxiliary engineering equipment. Table 4 demonstrates the construction of the engineering equipment ontology.
- (3)
- Regarding the construction of the Risk and Hidden Danger Ontology:Based on disaster chain theory, the evolution of hidden dangers exhibits temporality, requiring distinction between latent, trigger, and outbreak phases. It is worth noting that these three phases of hidden danger evolution occur sequentially: the latent phase accumulates risks, the trigger phase is initiated by external conditions, and ultimately leads to disasters in the outbreak phase. Table 5 demonstrates the construction of the risk and hidden danger ontology.
- (4)
- Regarding the construction of the system and process ontology, based on legal hierarchy, effectiveness, and management process stages, system processes are categorized into three types according to legal hierarchy: national laws, administrative regulations and departmental rules, and local regulations. It is important to note that logical consistency in classification must be maintained, avoiding overlaps or omissions. Table 6 demonstrates the construction of the system and process ontology.
4.2.2. LLM-Integrated Prompt Engineering and Ontology-Constrained Entity-Relationship Extraction
- Text Segmentation
- Prompt Design And Entity-Relationship Extraction
- Knowledge Graph Construction And Community Generation
4.3. Model Performance
| Prediction Actual | Actual Positive | Actual Negative |
|---|---|---|
| Predicted Positive | TP | TN |
| Predicted Negative | FP | FN |
5. Knowledge Graph Visualization and Application
6. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Wang, Y.; Hu, A. China’s Water Conservancy: Review and Outlook (1949–2050). J. Tsinghua Univ. (Philos. Soc. Sci.) 2011, *26*, 99–112. [CrossRef]
- Ge, W. Editorial: Risk assessment and management of water conservancy projects. Front. Earth Sci. 2023, *11*, 1330621. [CrossRef]
- Liu, Y.; Tang, Y.; Jing, L.; Chen, F.; Wang, P. Remote Sensing-Based Dynamic Monitoring of Immovable Cultural Relics, from Environmental Factors to the Protected Cultural Site: A Case Study of the Shunji Bridge. Sustainability 2021, *13*, 6042. [CrossRef]
- Lu, J.; Feng, J.; Tang, Z.; Zhang, P. Research on Key Technologies of Water Conservancy Big Data Directory Service and Resource Sharing. Water Resour. Inform. 2017, *4*, 17–20+27. [CrossRef]
- Qiu, L.; Zhang, A.; Li, S.; Zhang, Y.; Shen, M.; Zhou, P. A Review on Knowledge Graph Construction in Aviation Manufacturing. Appl. Res. Comput. 2022, *39*, 968–977. [CrossRef]
- Huang, Y.; Yu, S.; Luo, B.; Li, R.; Li, C.; Huang, W. Exploring the Digital Twin Yangtze River for Joint Intelligent Scheduling of Basin Water Engineering Disaster Prevention. J. Hydraul. Eng. 2022, *53*, 253–269. [CrossRef]
- Chen, Y.; Zhang, T.; Niu, W.; Qin, H. Research on Key Technologies for Digital Twin Construction of the Three Gorges Reservoir Area. Yangtze River 2023, *54*, 19–24. [CrossRef]
- Xie, A.; Wu, Q.; Liu, F. Exploring Intelligent Operation and Maintenance Approaches for Pump Station Projects Based on Voiceprint Recognition and Knowledge Graph Technology. Yangtze River Technol. Econ. 2021, *5*, 88–92. [CrossRef]
- Huang, H.; Yu, J.; Liao, X.; Xi, Y. A Survey of Knowledge Graph Research. Comput. Syst. Appl. 2019, *28*, 1–12. [CrossRef]
- Zhou, Y.; Liu, Z.; Su, X.; Jin, T. Construction of a Q&A Knowledge Graph Ontology Model Integrating Multi-level Data. Lib. Inf. Serv. 2022, *66*, 125–132. [CrossRef]
- Ibrahim, N.; Aboulela, S.; Ibrahim, A.; Kashef, R. A survey on augmenting knowledge graphs (KGs) with large language models (LLMs): models, evaluation metrics, benchmarks, and challenges. Discov. Artif. Intell. 2024, *4*, 76. [CrossRef]
- Lv, W.; Liao, Z.; Liu, S.; Zhang, Y. MEIM: A Multi-source Software Knowledge Entity Extraction Integration Model. Comput. Mater. Contin. 2020, *66*, 1027–1042. [CrossRef]
- Wu, Z.; Pan, S.; Chen, F.; Long, G.; Zhang, C.; Yu, P.S. A Comprehensive Survey on Graph Neural Networks. IEEE Trans. Neural Netw. Learn. Syst. 2021, *32*, 4–24. [CrossRef]
- Chen, W.; Tian, J.; Xiao, L.; He, H.; Jin, Y. Exploring Logically Dependent Multi-task Learning with Causal Inference. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), Online, 16–20 November 2020; pp. 2213–2225. [CrossRef]
- Zhao, W.X.; et al. A Survey of Large Language Models. arXiv 2023, arXiv:2303.18223. [CrossRef]
- Pan, S.; Luo, L.; Wang, Y.; Chen, C.; Wang, J.; Wu, X. Unifying Large Language Models andKnowledge Graphs: A Roadmap. IEEE Trans. Knowl. Data Eng. 2024, *36*, 3580–3599. [CrossRef]
- Chen, H.; Xie, R.; Cui, X.; Yan, Z.; Wang, X.; Xuan, Z.; Zhang, K. LKPNR: Large Language Models and Knowledge Graph for Personalized News Recommendation Framework. Comput.Mater. Contin. 2024, *79*. [CrossRef]
- Liu, X.; Lu, H.; Li, H. Intelligent generation method of emergency plan for hydraulic engineering based on knowledge graph---take the South-to-North Water Diversion Project as an example. LHB 2022, *108*, 2153629. [CrossRef]
- Duan, H.; Han, K.; Zhao, H.; Jiang, Y.; Li, H.; Mao, W. Research on the Construction of Comprehensive Water Conservancy Knowledge Graph. J. Hydraul. Eng. 2021, *52*, 948–958. [CrossRef]
- Abdullah, M.H.A.; Aziz, N.; Abdulkadir, S.J.; Alhussian, H.S.A.; Talpur, N. Systematic Literature Review of Information Extraction From Textual Data: Recent Methods, Applications, Trends, and Challenges. IEEE Access 2023, *11*, 10535–10562. [CrossRef]
- Zhang, J.; Zhang, X.; Wu, C.; Zhao, Z. A Survey of Knowledge Graph Construction Technology. Comput. Eng. 2022, *48*, 23–37. [CrossRef]
- Liu, S.; Yang, H.; Li, J.; Kolmanič, S. Preliminary Study on the Knowledge Graph Construction of Chinese Ancient History and Culture. Information 2020, *11*, 186. [CrossRef]
- Wang, W.; Xu, Y.; Du, C.; Chen, Y.; Wang, Y.; Wen, H. Data Set and Evaluation of Automated Construction of Financial Knowledge Graph. Data Intell. 2021, *3*, 418–443. [CrossRef]
- Abu-Salih, B.; AL-Qurishi, M.; Alweshah, M.; AL-Smadi, M.; Alfayez, R.; Saadeh, H. Healthcare knowledge graph construction: A systematic review of the state-of-the-art, open issues, and opportunities. J. Big Data 2023, *10*, 81. [CrossRef]
- Dang, F.-R.; Tang, J.-T.; Pang, K.-Y.; Wang, T.; Li, S.-S.; Li, X. Constructing an Educational Knowledge Graph with Concepts Linked to Wikipedia. J. Comput. Sci. Technol. 2021, *36*, 1200–1211. [CrossRef]
- Cheng, Q.; Wang, J.; Lu, W.; Huang, Y.; Bu, Y. Keyword-citation-keyword network: a new perspective of discipline knowledge structure analysis. Scientometrics 2020, *124*, 1923–1943. [CrossRef]
- Lin, J.; Zhao, Y.; Huang, W.; Liu, C.; Pu, H. Domain knowledge graph-based research progress of knowledge representation. Neural Comput. Appl. 2021, *33*, 681–690. [CrossRef]
- Tariq, A.; et al. Domain-specific LLM Development and Evaluation---A Case-study for Prostate Cancer. J. Biomed. Inform. 2024, *154*, 104650. [CrossRef]
- SENTiVENT: enabling supervised information extraction of company-specific events in economic and financial news. Lang. Resour. Eval. 2020, *54*, 1077–1106. https://link.springer.com/article/10.1007/s10579-021-09562-4.
- Shu, X.; Yang, H. Ontology-driven intelligent assessment system for dam structural safety based on spatiotemporal anomaly detection framework. Comput.-Aided Civ. Infrastruct. Eng. 2025, *40*, 1–20. [CrossRef]
- Ong, Q.C.; et al. Advancing health coaching: A comparative study of large language model and health coaches. Artif. Intell. Med. 2024, *157*, 103004. [CrossRef]
- Sawarkar, K.; Mangal, A.; Solanki, S.R. Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers. In Proceedings of the 2024 IEEE 7th International Conference on Multimedia Information Processing and Retrieval (MIPR), Orlando, FL, USA, 15–17 August 2024; pp. 155–161. [CrossRef]
- Li, J.; Hu, J.; Zhang, G. Enhancing Relational Triple Extraction in Specific Domains: Semantic Enhancement and Synergy of Large Language Models and Small Pre-Trained Language Models. CMC-Comput. Mater. Contin. 2024, *79*, 2481–2503. [CrossRef]
- Amit, A.; Chakraborty, R.; Patra, M.R.; Jadhav, A. Data profiling in property graph databases. J. Supercomput. 2023, *79*, 20056–20073. [CrossRef]
- Jiang, L.; Shi, J.; Wang, C. Multi-ontology fusion and rule development to facilitate automated code compliance checking using BIM and rule-based reasoning. Adv. Eng. Inform. 2022, *51*,101449. [CrossRef]
- Benchmarking Large Language Models: Opportunities and Challenges. Artif. Intell. Rev. 2024, *57*, 72. https://link.springer.com/chapter/10.1007/978-3-031-68031-1_6.
- Yang, F.; Meng, B. Design of Computer-Aided Instruction Model Based on Knowledge GraphConstruction and Learning Path Recommendation. Int. J. Web-Based Learn. Teach. Technol. [CrossRef]
- Dudáš, A.; Kleinedler, A. Effective Visualization of Data Structures in Graph Databases. J. Image Graph. 2024, *12*, 283–291. [CrossRef]







| No. | Name | Category | Level/Source |
|---|---|---|---|
| S01 | Water Law of the People’s Republic of China | National Law | National People’s Congress |
| S02 | Flood Control Law of the People’s Republic of China | National Law | National People’s Congress |
| S03 | Regulations on the Safety Management of Reservoir Dams | Administrative Regulation | State Council |
| S04 | Provisions on Work Safety Management of Water Conservancy Projects | Departmental Rule | Ministry of Water Resources |
| S05 | Provisions on Quality Management of Water Conservancy Projects | Departmental Rule | Ministry of Water Resources |
| S06 | Sichuan Province Water Conservancy Engineering Management Regulations | Local Regulation | Sichuan People’s Congress |
| S07 | Chongqing City Water Conservancy Engineering Management Regulations | Local Regulation | Chongqing People’s Congress |
| S08 | GB/T 40582-2021 Basic Terminology for Hydropower Stations | National Standard | Standardization Administration |
| S09 | DB11/T 2193-2023 Specification for Investigation and Management of Flood Prevention Hidden Dangers—Water Conservancy Projects | Local Standard | Beijing Municipality |
| S10 | Guide to the List of Major Hidden Dangers for Production Safety in Water Conservancy Projects (2021 Edition) | Departmental Normative Document | Ministry of Water Resources |
| S11 | Standard for Post Setting of Water Conservancy Project Management Units (Pilot) and Quota Standard for Water Conservancy Project Maintenance (Pilot) | Departmental Normative Document | Ministry of Water Resources |
| S12 | Measures for the Assessment and Management of Work Safety for Principal Responsible Persons, Project Responsible Persons, and Full-time Work Safety Management Personnel of Water Conservancy and Hydropower Construction Enterprises | Departmental Normative Document | Ministry of Water Resources |
| S13 | Guide for Identification and Risk Assessment of Operational Hazard Sources for Water Conservancy and Hydropower Projects (Reservoirs, Sluices) (Trial) | Departmental Normative Document | Ministry of Water Resources |
| S14 | Guide to the List of Major Hidden Dangers for Production Safety in Water Conservancy Projects (2023 Edition) | Departmental Normative Document | Ministry of Water Resources |
| S15 | Guidelines for Risk Assessment of Dams (ICOLD) | International Organization Guide | International Commission on Large Dams (ICOLD) |
| S16 | Hebei Province Water Conservancy Engineering Management Regulations | Local Regulation | Hebei People’s Congress |
| Level 1 Concept | Level 2 Concept | Instances |
|---|---|---|
| Government Regulatory Agencies | National Regulatory Agencies | Ministry of Water Resources, Ministry of Emergency Management, Ministry of Finance |
| Provincial Regulatory Agencies | Provincial Water Resources Department, Provincial Emergency Management Department, Provincial Finance Department | |
| Municipal Regulatory Agencies | Municipal Water Resources Bureau, Municipal Emergency Management Bureau, Municipal Finance Bureau | |
| County Regulatory Agencies | County Water Resources Bureau, County Emergency Management Bureau, County Finance Bureau | |
| Inter-basin Management Agencies | River Basin Management Agencies, Regional Coordination Agencies | |
| Specialized Regulatory Agencies | Hydrology Bureau, Water Conservancy Project Quality Supervision Station, Water Administration Supervision Detachment |
| Level 1 Concept | Level 2 Concept | Instances |
|---|---|---|
| Technical Management Personnel | Safety Engineer | Beiyun River Levee and Gate Safety, Yangzhuang Reservoir Water Quality Collaborative Management |
| Quality Supervisor | Chaobai River Levee Project Quality Control,Pipe Material Quality Dispute Handling, Ad-hoc Quality Inspection During Flood Season Construction | |
| Hydrological Monitor | Jiyun River Salt-Fresh Water Interaction Monitoring, Storm Surge Red Warning Response |
| Concept Classification | Instances |
|---|---|
| Water-Retaining Engineering Equipment | Dam, Levee, Gate |
| Water-Discharging Engineering Equipment | Spillway, Flood Discharge Tunnel,Drainage Valve |
| Water-Diversion Engineering Equipment | Diversion Channel, Pipeline, Pump Station |
| Monitoring and Control Engineering Equipment | Water Level Sensor, Stress Monitor, SCADA System |
| Auxiliary Engineering Equipment | Hoist, Trash Rake, Emergency Power Supply |
| Phase | Characteristics | Instances |
|---|---|---|
| Latent | Hidden danger exists but not triggered | Concrete Carbonation, Metal Fatigue |
| Trigger | External conditions exceed critical threshold | Water Level Exceeds Warning Line, Peak Ground Acceleration Exceeds Limit |
| Outbreak | System instability leads to disaster | Dam Breach, Pipeline Burst |
| Concept Classification | Instances |
|---|---|
| National Laws | Water Law of the People’s Republic of China, Flood Control Law of the People’s Republic of China |
| Administrative Regulations and Departmental Rules | Regulations on the Safety Management of Reservoir Dams, Provisions on Work Safety Management of Water Conservancy Projects, Provisions on Quality Management of Water Conservancy Projects |
| Local Regulations | Sichuan Province Water Conservancy Engineering Management Regulations, Chongqing City Water Conservancy Engineering Management Regulations |
| Top-Level Semantic Relation | Integrated Similar Expressions |
|---|---|
| Operates / Is Operated By |
Uses / Is Used By, Controls / Is Controlled By, Manages / Is Managed By, Manipulates / Is Manipulated By, Runs / Is Run By, Operates / Is Controlled By |
| Executes / Is Executed By |
Implements / Is Implemented By, Carries Out / Is Carried Out By, Fulfills / Is Fulfilled By, Performs / Is Performed By, Executes / Is Commanded By, Responsible For / Is Responsibility Of |
| Identifies / Is Identified By |
Discovers / Is Discovered By, Detects / Is Detected By, Monitors / Is Monitored By, Diagnoses / Is Diagnosed By, Determines / Is Determined By, Assesses / Is Assessed By |
| Complies With / Regulates |
Obeys / Is Obeyed By, Based On / Is Basis For, Conforms To / Is Conformed To, Follows / Is Followed By, Regulates / Is Regulated By, Constrains / Is Constrained By |
| Triggers / Affects | Causes / Is Caused By, Induces / Is Induced By, Activates / Is Activated By, Results In / Is Resulted In By, Affects / Is Affected By, Exacerbates / Is Exacerbated By |
| Prevents / Exposes | Prevents / Is Prevented By, Avoids / Is Avoided By, Mitigates / Is Mitigated By, Controls / Is Controlled By, Exposes / Is Exposed By, Reveals / Is Revealed By |
| Type | Precision (P) | Recall (R) | F1 |
|---|---|---|---|
| Direct Extraction | 0.435 | 0.560 | 0.490 |
| Template Extraction | 0.643 | 0.728 | 0.683 |
| Prompt + Ontology | 0.840 | 0.948 | 0.891 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).