5. Discussion
The PSC high-deficiency-risk ship prediction model developed in this study demonstrated strong performance and interpretability in the empirical case of the Port of Singapore. This section discusses the implications of model performance, recommendations for policy and practice, and potential limitations and areas for improvement.
5.1. Implications of Model Performance
The outstanding predictive performance of ensemble learning models such as Random Forest underscores the significant potential of machine learning approaches in PSC risk warning applications. Compared to traditional ship selection mechanisms that rely on human judgment or fixed-weight scoring rules, the model proposed in this study dynamically learns patterns from historical data, making it more adaptable to the complexity and variability of real-world scenarios. The model’s high recall (>83%) is particularly significant—it means that the vast majority of genuinely high-risk ships can be identified in time, reducing the number of “slipping through the cracks.” For port state authorities, this translates into a higher detection rate, allowing limited inspection resources to be more effectively allocated. At the same time, a precision of around 80% indicates that most ships predicted as high risk do indeed have a higher number of deficiencies, with a relatively low false positive rate. While a “better safe than sorry” approach is common in regulatory practice, excessive false positives can waste manpower and potentially alienate compliant operators. Thus, maintaining an acceptable precision level supports regulatory cost-efficiency and industry perception. The balance our model achieves between precision and recall represents a major improvement over traditional methods. Previously, port officers relying on subjective judgment may have missed high-risk ships or made inaccurate decisions due to incomplete information. The findings of this study demonstrate that big data analytics can simultaneously enhance detection accuracy and identification coverage, thereby improving the overall effectiveness of PSC inspections. Moreover, the model’s quantification of key risk factors and their relative importance provides a novel perspective for understanding PSC inspection results. For example, the high rankings of vessel age and tonnage reaffirm that older, larger ships require more attention—an insight valuable to port states and the IMO when developing inspection guidelines. The high weight assigned to company performance also suggests that regulators should strengthen oversight at the company level. Measures such as increasing inspection frequency for companies with repeated high-deficiency vessels or issuing early warnings can help improve safety management at the source.
5.2. Policy Recommendations
Based on our findings, we propose the following recommendations for PSC regulatory policy:
(1) Integrate Intelligent Decision Support Systems: Port state authorities are encouraged to incorporate machine learning prediction models into existing PSC information systems as a powerful supplement to current risk scoring methods. This could involve developing a risk-warning platform that automatically retrieves up-to-date ship data (e.g., vessel status, recent inspections) and computes risk scores in real time for incoming ships, providing actionable insights to inspectors. Practically, the system could interface with the Tokyo MoU’s central database to offer pre-arrival risk rankings for vessels scheduled to call at ports.
(2) Optimize Inspection Resource Allocation: Using model predictions, port authorities can better allocate personnel and time. For example, senior surveyors and extended inspection windows can be assigned to predicted high-risk vessels, while low-risk ships could be fast-tracked or exempted to improve overall efficiency. This stratified risk management approach aligns with IMO regulatory trends and enhances port operational effectiveness. Importantly, implementation should emphasize transparency and fairness to prevent perceptions of discrimination. Authorities should clearly communicate key model factors and provide incentives for good performance.
(3) Strengthen International Coordination and Data Sharing: Improving model performance depends on data quality and scale. PSC MoUs and the IMO should consider building stronger data-sharing mechanisms, such as integrating PSC inspection data globally to form a comprehensive maritime safety big data platform. This would enable better tracking of vessel performance and early warnings for ships operating across regions. International coordination should also aim to standardize risk prediction metrics so that AI-assisted regulation can follow consistent frameworks across countries.
(4) Promote Industry Self-Regulation: Beyond government oversight, regulators should provide feedback to shipping companies based on model results to encourage voluntary improvement. Authorities could issue periodic reports showing a company’s fleet risk rankings, motivating them to improve performance for lower-ranked vessels. Companies could also use similar models internally to conduct pre-departure risk assessments, identifying potential issues before inspection. In the long run, as operators recognize that such models can accurately identify safety gaps, a peer pressure and incentive system may emerge—driving investments in safety and fostering a virtuous cycle of improvement.
5.3. Potential Application Scenarios
The results of this study are not only applicable to the Port of Singapore but also hold broader application potential:
(1) Regional Risk Warning Centers: Regional PSC organizations such as the Tokyo MoU could use our method to establish regional early warning centers. These centers could consolidate inspection data from member countries and generate pre-arrival risk rankings for all vessels entering the region. This would enhance regional coordination and enable tighter monitoring of high-risk ships. For instance, if a vessel with multiple deficiencies is flagged in Japan, Singapore could receive early warnings before the vessel’s arrival.
(2) Port Operations and Berthing Management: Apart from regulatory purposes, port operators could use risk predictions for operational planning. If a vessel is predicted to be high risk and likely to be detained, the port could anticipate potential delays and adjust berth allocations accordingly. Conversely, low-risk vessels could be fast-tracked to improve port throughput and service efficiency.
(3) Insurance and Finance: Ship insurers and financial institutions could incorporate such models into their risk assessment frameworks. Vessels with high predicted risk scores may indicate greater safety concerns, prompting insurers to adjust premiums or require additional surveys. Lenders could also reference safety scores when evaluating ship asset or loan risk. With quantifiable risk indicators, market mechanisms may accelerate the phasing out of high-risk assets, indirectly improving overall industry safety.
5.4. Model Limitations and Future Improvements
While this study achieved meaningful results, several limitations remain for future research to address:
(1) Data Scope Limitation: Our model is based on historical data from the Port of Singapore, which may contain region-specific or sample-specific biases. The applicability of the model to other ports or PSC systems requires further validation, as fleet composition and violation characteristics may vary. Future work could expand the dataset to include multiple countries, enhancing the model’s generalizability.
(2) Feature Completeness: Although several relevant factors were considered, some potentially important variables were not included due to data unavailability—such as crew competence (certification and training) or voyage characteristics (e.g., degradation of vessel condition after long-distance sailing). Future access to detailed records (e.g., ISM audit reports, maintenance logs) could further enhance prediction accuracy.
(3) Model Complexity and Interpretability: While ensemble models offer high performance, they are relatively “black box” in nature. We have used feature importance and SHAP analysis to improve transparency, but complex nonlinear interactions are still challenging to interpret intuitively. Future studies may explore more interpretable approaches such as rule-based models or hybrid methods that retain performance while improving explainability. Techniques such as local interpretability models or designed feature interaction indicators could also aid understanding.
(4) Real-Time Deployment: In practical applications, models need frequent updates to reflect the latest data. Our study used offline batch training and did not address real-time updating mechanisms. Future work could explore online learning techniques to dynamically adjust parameters as new PSC data become available. Moreover, deploying AI models in port IT environments requires attention to system integration, computing efficiency, and cybersecurity—areas that may benefit from collaboration with IT and systems engineering experts.
In conclusion, this study highlights the practical significance of developing a predictive model for high-risk ships under PSC in improving maritime regulatory effectiveness. It also offers actionable policy and operational recommendations. With appropriate policy support and technological investment, machine learning models could become powerful tools for port state control, helping countries effectively identify substandard ships amidst growing international traffic—ultimately safeguarding lives, assets, and the marine environment.