Submitted:
18 September 2024
Posted:
20 September 2024
Read the latest preprint version here
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Related Works
2.2. Process Overview
- Attributes or units of the analysis, which we consider to be the information used as the input data to the predictive model.
- Summary of approaches implemented in past research, including framing of the problem, sources of data, analytical techniques, and strengths and weaknesses as they apply to scientific validity and reliability.
- Their capacity to deal with large-dimension problems, which is necessary when endeavoring to identify relevant variables among many potential factors.
- Their flexibility in reproducing the data-generation structure, irrespective of complexity, thanks to a non-linear structure that is adaptable to the data (non-parametric philosophy).
- Their great predictive and, in some cases, interpretative, potential.
2.3. Data in Use
2.4. Synergy Possibilities
- Key Performance Indicators (KPI) And Training Datasets: A Synergy of Long-Term Predictive Methods:
- Safety Observation Reports (SOR) And Preliminary Event Notifications (PEN): A Synergy of Situational Predictive Methods:
2.5. Synergy and Cross-Validation among Situational and Long-Term Methods
2.6. Research Hypothesis

2.6.1. Risk Mitigation Statistics
| Injury Type | % of Data related to UA | % of Data related to UC | % of Data related to PF | % of Data related to EF | % of Data related to MA | % of Data related to PSA |
|---|---|---|---|---|---|---|
| FAC | 0.3521 | 0.2735 | 0.1234 | 0.0732 | 0.0987 | 0.0791 |
| MTC | 0.3056 | 0.2845 | 0.1467 | 0.1123 | 0.0874 | 0.0635 |
| RWC | 0.2245 | 0.2987 | 0.1342 | 0.1748 | 0.0981 | 0.0697 |
| LTI | 0.1678 | 0.2512 | 0.1543 | 0.2187 | 0.1426 | 0.0654 |
| AD | 0.1034 | 0.2113 | 0.1321 | 0.2314 | 0.1987 | 0.1231 |

2.6.2. Theoretical Approach
- Their ability to solve large-scale issues is essential when trying to determine important variables from a wide range of variables.
- Their ability to replicate the data generation process, no matter how complex, is due to their non-linear data structure (non-procedural approach).
- Their predictive and sometimes interpretative capabilities.
3. Results
3.1. Unified Model Approach
- Safety Observation Reports (SOR) & Accidentology Historical: Safety Observation Reports (SORs) and accidentology histories serve as situational methods in this context, aiming to forecast safety outcomes for individual events based on specific environmental data. We hypothesize that combining these situational methods can yield predictions that are more reliable than those obtained using a single method in isolation. For instance, Random Forest and Decision Tree algorithms excel in predicting the type of injury and identifying its direct, indirect, and root causes. However, they do not inherently predict injury severity. In contrast, Safety Observation Reports demonstrate proficiency in forecasting injury severity and have proven effective in differentiating between successful and failed safety outcomes. Thus, while both methods focus on situational predictions, they are grounded in distinct aspects of the safety system.
- Safety Key Performance Indicators Datasets: Both safety prediction families, namely safety leading indicators, and the training dataset are time-dependent, as well as safety activities and operations, typically measured over weeks or months. Consequently, they are not suited for situational predictions but rather for forecasting injury rates over extended periods, spanning months to years. While training and safety leading indicators assess distinct facets of the safety system, they may not be entirely independent. Safety leading indicators gauge the efficacy of safety management, as documented by Hinze J, Hallowell M, & Baud K. (2013), whereas training evaluates overall safety perceptions. This encompasses perceptions regarding management's safety commitment, the role of supervisors, and the adequacy and effectiveness of training, encapsulated as "safety climate dimensions". These perceptions may be influenced by various factors, such as the quality and quantity of training programs, audit frequency, and incentive structures. Therefore, we postulate that integrating training data with safety-leading indicators could yield synergistic effects and opportunities for cross-validation.
- Contractors Safety Performance Datasets: This section aims to predict safety performance in construction sites using contractors' safety performance data. By analyzing the comprehensive dataset, including factors such as accident types, severity, and frequency, as well as contractor characteristics and historical safety records, examination of past safety performance, including incident rates, corrective actions taken, and adherence to safety regulations, to gauge the overall safety culture and performance trajectory. We employ various machine learning algorithms to identify patterns and predict potential safety incidents related to the accidentology historic dataset.
3.2. Correlation of Causal Factors
| Injury Type | 1 | 2 | 3 | 4 | 5 | 6 | SD | Mean |
|---|---|---|---|---|---|---|---|---|
| UA | ------ | 0.65* | 0.71* | 0.47* | 0.53* | 0.49* | 0.13 | 0.56 |
| UC | 0.65* | ------ | 0.78* | 0.60* | 0.58* | 0.61* | 0.15 | 0.70 |
| PF | 0.71* | 0.78* | ------ | 0.63* | 0.59* | 0.62* | 0.13 | 0.72 |
| EF | 0.47* | 0.60* | 0.63* | ----- | 0.71* | 0.68* | 0.14 | 0.68 |
| MA | 0.53* | 0.58* | 0.59* | 0.71* | ---- | 0.75* | 0.14 | 0.69 |
| PSA | 0.49* | 0.61* | 0.62* | 0.68* | 0.75* | ---- | 0.14 | 0.69 |
3.3. Impact on Accident Occurrence
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | SD | Mean | |
|---|---|---|---|---|---|---|---|---|---|
| AO | ------ | 0.75* | 0.70* | 0.77* | 0.65* | 0.68* | 0.74* | 0.12 | 0.72 |
| UA | 0.75* | ------ | 0.65* | 0.71* | 0.47* | 0.53* | 0.49* | 0.13 | 0.56 |
| UC | 0.70* | 0.65* | ------ | 0.78* | 0.60* | 0.58* | 0.61* | 0.15 | 0.70 |
| PF | 0.77* | 0.71* | 0.78* | ------ | 0.63* | 0.59* | 0.62* | 0.13 | 0.72 |
| EF | 0.65* | 0.47* | 0.60* | 0.63* | ----- | 0.71* | 0.68* | 0.14 | 0.68 |
| MA | 0.68* | 0.53* | 0.58* | 0.59* | 0.71* | ----- | 0.75* | 0.14 | 0.69 |
| PSA | 0.74* | 0.49* | 0.61* | 0.62* | 0.68* | 0.75* | ---- | 0.14 | 0.69 |
3.4. Predictive Analysis of Accident Occurrence
4. Discussion
5. Conclusions
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Abbasianjahromi, H., and M. & Aghakarimi. 2021. "Safety performance prediction and modification strategies for construction projects via machine learning techniques." Engineering, Construction and Architectural Management. [CrossRef]
- Alexander, D., Hallowell, M., and Gambatese, J. 2017. "Precursors of construction fatalities. II: predictive modeling and empirical validation." Journal of construction engineering and management 143(7).
- APC, Chan, Guan J, Choi TNY, Yang Y, Wu G, and Lam E. 2023. "Improving Safety Performance of Construction Workers through Learning from Incidents." Int J Environ Res Public Health 5 (4570): 4-20.
- Augustine, T., and S. & Shukla. 2022. "Road accident prediction using machine learning approaches." 2nd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE). 808–811.
- Baker, Henrietta, Matthew R. Hallowell, and Antoine J.-P. Tixier. 2020. "AI-based prediction of independent construction safety outcomes from universal attributes." Automation in Construction 118. [CrossRef]
- Baradan, S., and Usmen, M. A. 2006. "Comparative injury and fatality risk analysis of building trades." J. Constr. Eng. Manage. 533-539. [CrossRef]
- Cavalcanti, M., L. Lessa, and B. & Vasconcelos. 2023. "Construction accident prevention: A systematic review of machine learning approaches.. ." Work. [CrossRef]
- Chen, J. R., and Yang, Y. T. 2004. "A predictive risk index for safety performance in process industries." J. Loss Prev. Process Ind. 17(3): 233-242. [CrossRef]
- Chen, W., and et al. 2020. "Artificial Intelligence Marvelous Approach for Occupational Health and Safety Applications in an Industrial Ventilation Field: A Short-systematic Review." Electronics 9. [CrossRef]
- Choi, J., B. Gu, and S. et al. Chin. 2020. "Machine Learning Predictive Model Based on National Data for Fatal Accidents of Construction Workers." Automation in Construction (102974): 110. [CrossRef]
- Chua, D. K. H., and Goh, Y. M. 2005. "A Poisson model of construction incident occurence." J. Constr. Eng. Manage. 715-722.
- Cooper, M. D., and Phillips, R. A. 2004. "Exploratory analysis of the safety climate and safety behavior relationship." J. Saf. Res. 35(5): 497-512. [CrossRef]
- Eetvelde, H., L. Mendonça, C., Seil, R. Ley, and T. & Tischer. 2021. "Machine learning methods in sport injury prediction and prevention: a systematic review. ." Journal of Experimental Orthopaedics, 8 (10.1186).
- Fang, D. P., Chen, Y., and Louisa, W. . 2006. "Safety climate in construction industry: A case study in Hong Kong." J. Constr. Eng. Manage. 573–584. [CrossRef]
- Fargnoli, Mario, and Mara Lombardi. 2020. "Building Information Modelling (BIM) to Enhance Occupational Safety in Construction Activities: Research Trends Emerging from One Decade of Studies." Buildings 10(6):98.
- Gao, Yifan, Vicente Gonzalez, Kenneth Tak Wing Yiu, and Guillermo Cabrera-Guerrero. 2019. "The Use of Machine Learning and Big Five Personality Taxonomy to Predict Construction Workers' Safety Behaviour." Computer Science.
- Gillen, M., Baltz, D., Gassel, M., Kirch, L., and Vaccaro, D. 2002. "Perceived safety climate, job demands, and coworker support among union and nonunion injured construction workers." J. Saf. Res. 33(1): 33-51. [CrossRef]
- Glendon, A. I., and Litherland, D. K. 2001. "Safety climate factors, group differences and safety behavior in road construction." J. Saf. Sci, 39(3): 157-188.
- Hallowell, M. R., and Gambatese, J. A. 2009. "Activity-based safety and health risk quantification for formwork construction." J. Constr. Eng. Manage. 990-998.
- Heinrich, H. W. 1941. Industrial Accident Prevention: A Scientific Approach. McGraw-Hill. [CrossRef]
- Hinze, J, Hallowell, M., and Baud, K. 2013. "Construction-safety best practices and relationships to safety performance." J. Constr. Eng. Man. (04013006): 1943-7862. [CrossRef]
- Johnson, S. E. 2007. "The predictive validity of safety climate." J. Saf, Res. 511-521. [CrossRef]
- Kakhki, Fatemeh Davoudi, Steven A. Freeman, and Gretchen A. Mosher. 2019. "Evaluating machine learning performance in predicting injury severity in agribusiness industries." Safety Science 117: 257-262. [CrossRef]
- Khan, Rafi Ullah, Jingbo Yin, Faluk Shair Mustafa, and Wenming Shi. 2023. "Factor assessment of hazardous cargo ship berthing accidents using an ordered logit regression model." Ocean Engineering 284 (115211). [CrossRef]
- Kim, Y., and S. & Chi. 2021. "Hazardous material releases in construction: Analysis with the decision tree approach. ." Journal of Construction Engineering and Management 2 (04020150): 147.
- Koc, K., and A. & Gurgun. 2021. "MACHINE LEARNING APPLICATIONS IN CONSTRUCTION SAFETY LITERATURE." Proceedings of International Structural Engineering and Construction.
- Lee, J., Y. Yoon, T. Oh, S. Park, and S. & Ryu. 2020. "A Study on Data Pre-Processing and Accident Prediction Modelling for Occupational Accident Analysis in the Construction Industry." Journal of Safety Research 73 (10.1016): 285-297. [CrossRef]
- Lee, S., and Halpin, D. W. 2003. "Predictive tool for estimating accident risk." J. Constr. Eng. Manage. 4(431): 431-436. [CrossRef]
- Mahamulkar, S, V H Lad, and K A Patel. 5-7 September 2022. "Development of a Framework for Selection of a Tunnel Lining Formwork System." Proceedings 38th Annual ARCOM Conference. Glasgow, UK: Association of Researchers in Construction Management. 359-368.
- Rozenfeld, O., Sacks, R., Rosenfeld, Y., and Baum, H. 2010. "Construction Job Safety Analysis." J. Saf. Sci. 48(4): 491-498.
- Sarkar, S., and J. & Maiti. 2020. "Machine learning in occupational accident analysis: A review using science mapping approach with citation network analysis. ." Safety Science (104900): 131. [CrossRef]
- Sarkar, S., and J. & Maiti. 2020. "Machine learning in occupational accident analysis: A review using science mapping approach with citation network analysis." Safety Science (104900): 131. [CrossRef]
- Shrestha, S. 2020. "Occupational Hazards in Building Construction." SCITECH Nepal (10.3126). [CrossRef]
- Shuang, Q., and Z. & Zhang. 2023. "Determining Critical Cause Combination of Fatality Accidents on Construction Sites with Machine Learning Techniques." Buildings.
- Tam, C. M., and Fung, I. W. H. 1998. Effectiveness of safety management strategies on safety performance in Hong Kong. 16(1) vols. J. Construction Management Economy.
- TD, Smith, Mullins-Jaime C, Dyal MA, and DeJoy DM. 2020. "Stress, burnout and diminished safety behaviors: An argument for Total Worker Health® approaches in the fire service." J Safety Res. 75:189-195. [CrossRef]
- Yedla, Anurag, Fatemeh Davoudi Kakhki, and Ali Jannesari. 2020. "Predictive Modeling for Occupational Safety Outcomes and Days Away from Work Analysis in Mining Operations." Int. J. Environ. Res. Public Health 17(19).
- Zarei, E., A. Karimi, E. Habibi, Barkhordari, and & Reniers, G. A. 2021. "Dynamic occupational accidents modeling using dynamic hybrid Bayesian confirmatory factor analysis: An in-depth psychometrics study." Safety Science (105146): 131. [CrossRef]
- Zhang, Shuguang, Afaq Khattak, Caroline Mongina Matara, Arshad Hussain, and Asim Farooq. 2022. "Hybrid feature selection-based machine learning Classification system for the prediction of injury severity in single and multiple-vehicle accidents." PLoS One (10.1371).
- Zhu, K., K. Du, and Y. & Tang. 2020. "Integrating machine learning with human factors for analyzing construction safety risk." Automation in Construction (103366): 119.
- Zhu, R., X. Hu, J. Hou, and X. & Li. 2021. "Application of machine learning techniques for predicting the consequences of construction accidents in China. ." Process Safety and Environmental Protection. [CrossRef]
- Zohar, D. 1998. "Safety climate in industrial organizations: Theoretical and Applied Implications." J. Appl. Psychol. 78-85.









| Study | Main findings | Authors, Year | DOI |
|---|---|---|---|
| Machine Learning Predictive Model Based on National Data for Fatal Accidents of Construction Workers | Machine learning can effectively predict fatal accidents at construction sites, with month, employment size, age, weekday, and service length being the most influential factors. | Jongko Choi, Bonsung Gu, Sangyoon Chin, Jong-seok Lee (2020) | 10.1016/j.autcon.2019.102974 |
| Application of Machine Learning Techniques for Predicting the Consequences of Construction Accidents in China | Naive Bayes and Logistics regression are the best machine learning algorithms for predicting the severity of construction accidents, with accident type, reporting, and handling being the most critical factors. | Rongchen Zhu, Xiaofeng Hu, Jiaqi Hou, Xin Li (2021) | 10.1016/j.psep.2020.08.006 |
| Predictive Modeling for Occupational Safety Outcomes and Days Away from Work Analysis in Mining Operations | Machine learning techniques, such as decision trees and random forests, can improve mining safety by predicting accident outcomes and days away from work. | Anurag Yedla, Fatemeh Davoudi Kakhki, A. Jannesari (2020) | 10.3390/ijerph17197054 |
| Customized AutoML: An Automated Machine Learning System for Predicting Severity of Construction Accidents | Customized AutoML is an automated machine learning system that accurately predicts construction accident severity for professionals with limited data science knowledge, offering higher scalability, accuracy, and result-oriented insight. | V. Toğan, F. Mostofi, Y. Ayözen, Onur Behzat Tokdemir (2022) | 10.3390/buildings12111933 |
| Safety Performance Prediction and Modification Strategies for Construction Projects Via Machine Learning Techniques | The decision tree algorithm effectively predicts safety performance in construction projects, with safety employees, training, rule adherence, and management commitment being key criteria. | H. Abbasianjahromi, Mehdi Aghakarimi (2021) | 10.1108/ecam-04-2021-0303 |
| Component-Based Machine Learning for Performance Prediction in Building Design | This paper presents a component-based machine learning approach for predicting building performance, enabling high prediction quality with errors as low as 3.7% for cooling and 3.9% for heating. | P. Geyer, Sundaravelpandian Singaravel (2018) | 10.1016/J.APENERGY.2018.07.011 |
| Machine Learning Applications in Construction Safety Literature | Machine learning methods, particularly support vector machine and decision tree, are widely used in construction safety literature to predict accident outcomes and identify potential safety risks. | K. Koc, A. Gurgun (2021) | 10.14455/isec.2021.8(1).csa-05 |
| Evaluating Machine Learning Performance in Predicting Injury Severity in Agribusiness Industries | Machine learning techniques can accurately predict injury severity in agribusiness industries using workers' compensation claims, with a 92-98% accuracy rate. | Fatemeh Davoudi Kakhki, S. Freeman, G. Mosher (2019) | 10.1016/j.autcon.2019.102974 |
| Injury Categories | Direct Causes Categories | Indirect Cuses Categories | Root Causes Categories |
|---|---|---|---|
| - First Aid Case -Medical Treatment Case -Restricted Work Case - Lost Time Injury - Asset Damage |
Unsafe Act (UA) | People Factor (PF) | Management Aspect (MA) |
| - Individual behavior/ attitude - Tools or Equipment Use - Procedures implementation |
- Physical Capabilities - Mental Capabilities - Physiological |
- Resource Management - Leadership - Contractors & Subcontractor Mgt. |
|
| Unsafe Condition (UC) | Execution Factor (EF) | Program System Aspect (PSA) | |
| -Workplace Hazards - Process Hazards - Tools & Equipment Condition - Protective Defenses - Weather conditions |
- Engineering / Design - Project level execution - Communication - Skill & Knowledge - Tools & Equipment Provision |
- Work Standards / Procedures - Risk Evaluation - Task Planning - Training - Inspection and Audit program |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).