Submitted:
17 November 2025
Posted:
18 November 2025
You are already at the latest version
Abstract
Keywords:
Introduction
The Challenge of Generative AI in Healthcare Diagnostics
The Need for a Hybrid Approach in Healthcare
Bias and Fairness in Public Welfare AI Systems
Toward Fair and Transparent AI in Social Services
Safety and Reliability in Autonomous Systems
Neurosymbolic Guardrails for Autonomy
Neurosymbolic AI: Combining Neural Networks and Symbolic Reasoning
- Built-in Safety Guardrails: The symbolic layer can enforce hard constraints derived from domain knowledge (e.g., physics, regulations, expert guidelines). As a result, a neurosymbolic system can avoid many failure modes by design – for example, never proposing an action that breaks a known safety rule, or filtering out a diagnostic hypothesis that contradicts medical knowledge. These guardrails act as a check on the neural network’s outputs, greatly reducing the chance of catastrophic error. In practice, this could have prevented some of the earlier examples (the AI would know not to mix bleach and vinegar, or not to prescribe a contraindicated drug).
- Reasoning and Contextual Understanding: Symbolic AI represents knowledge in structures like ontologies, graphs, or if-then rules, which allows explicit reasoning about relationships and causality. By incorporating this, neurosymbolic systems can exhibit a degree of commonsense and reasoning that pure neural nets lack. For instance, an AI lawyer assistant with a symbolic legal knowledge base can reason through the steps of applying a law to a case, rather than just pattern-matching previous cases. In autonomous driving, reasoning about cause and effect (if road is slippery, then increase following distance) leads to more robust performance in novel situations. Neurosymbolic AI often draws inspiration from the dual-process models of human cognition (System 1 intuitive perception vs. System 2 deliberative reasoning)[20,21], aiming to replicate this balance for more human-like decision-making.
- Explainability and Transparency: Because symbolic representations are inherently interpretable (a rule or a logic step can be understood by humans), neurosymbolic systems can provide explanations for their decisions. Instead of just outputting a classification or an action, the system can output a trace of symbolic reasoning: for example, “Diagnosis = Pneumonia because (Chest X-ray indicates fluid) AND (Patient has fever and cough) AND (Rule: X-ray+symptoms -> Pneumonia)”. In an algorithm deciding social service eligibility, it might log which criteria were met or not met. This transparency is invaluable for users and regulators – it turns AI from a black box into a glass box that can be inspected. Sheth et al. note that symbolic knowledge structures enable “traceability and auditing of the AI system’s decisions,” useful for ensuring regulatory compliance and explainability by tracking inputs, outputs, and intermediate decision steps[22]. Such audit trails directly address the accountability deficits of current AI.
- Mitigating Bias and Supporting Fairness: A neurosymbolic approach allows designers to inject fairness constraints or ethical principles explicitly. Whereas debiasing a neural net alone is difficult (one must retrain on less biased data or add complex regularization), a symbolic rule can, say, ignore certain attributes or apply affirmative constraints (e.g., “ensure equal acceptance rates across groups unless justified by need”). Additionally, because the reasoning is transparent, stakeholders can detect and correct biases. If an undesirable pattern is noticed (e.g., a rule that unintentionally causes disparity), it can be adjusted in the symbolic knowledge base. This modularity and clarity contrasts with trying to tweak millions of opaque neural weights to fix a bias. The result is AI that aligns better with societal values and legal standards of fairness.
- Improved Performance with Domain Knowledge: Contrary to a misconception that adding rules might make AI rigid, neurosymbolic systems have shown success in improving accuracy by integrating domain knowledge. For example, in a medical diagnosis task for predicting diabetes, researchers combined neural networks with logical rules (using a neurosymbolic framework called Logical Neural Networks) and achieved higher accuracy and AUC scores than pure machine learning models, all while providing interpretability into which factors contributed to the diagnosis[23]. The addition of expert knowledge helped guide the learning and reduced overfitting to spurious patterns. In general, neural and symbolic components can have a synergistic effect – the neural side covers the nuances of data, and the symbolic side covers the known generalizations, together yielding a more powerful system than either alone.
Conclusions
References
- A. Sheth, K. Roy, and M. Gaur, “Neurosymbolic Artificial Intelligence (Why, What, and How),” IEEE Intelligent Systems, vol. 38, no. 3, pp. 56–62, 2023.[19][27]. [CrossRef]
- R. Booth, “Revealed: bias found in AI system used to detect UK benefits fraud,” The Guardian, 6 Dec 2024.[10][11].
- J. Pierre, “AI Hallucinations in Medicine and Mental Health,” Psychology Today, 24 Jul 2025.[3][2].
- M. Adams, “ChatGPT May Be Enabling Unhealthy Teen Behaviors: Report,” AboutLawsuits.com, 15 Aug 2025.[7].
- S. Levin and N. Woolf, “Tesla driver killed while using autopilot was watching Harry Potter, witness says,” The Guardian, 1 Jul 2016.[17][28].
- C. Ross and I. Swetlitz, “IBM’s Watson recommended ‘unsafe and incorrect’ cancer treatments, internal documents show,” STAT News, 25 Jul 2018.[4].
- A. Gutiérrez et al., “How Policy Hidden in an Algorithm is Threatening Families in This Pennsylvania County,” ACLU Report, 14 Mar 2023.[14].
- R. Orakzai, “Neurosymbolic AI Explained,” Baeldung on Computer Science, 27 Mar 2025.[8][9].
- Q. Lu et al., “Explainable Diagnosis Prediction through Neuro-Symbolic Integration,” arXiv:2410.01855 [cs.AI], Oct 2024.[23]. [CrossRef]
- Figure 1: Adapted from R. Orakzai, “Neurosymbolic AI Explained,” Baeldung 2025 – Illustration of symbolic logic constraining a neural network’s outputs for safety.
- [1] [2] [3] [6] AI Hallucinations in Medicine and Mental Health | Psychology Today.
- https://www.psychologytoday.com/us/blog/psych-unseen/202506/ai-hallucinations-in-medicine-and-mental-health.
- [4] IBM's Watson recommended 'unsafe and incorrect' cancer treatments | STAT.
- https://www.statnews.com/2018/07/25/ibm-watson-recommended-unsafe-incorrect-treatments/.
- [5] Case Study 20: The $4 Billion AI Failure of IBM Watson for Oncology.
- https://www.henricodolfing.com/2024/12/case-study-ibm-watson-for-oncology-failure.html.
- [7] ChatGPT May Be Enabling Unhealthy Teen Behaviors: Report - AboutLawsuits.com.
- https://www.aboutlawsuits.com/chatgpt-enabling-unhealthy-teen-behaviors/.
- [8] [9] Neurosymbolic AI Explained | Baeldung on Computer Science.
- https://www.baeldung.com/cs/neurosymbolic-artificial-intelligence.
- [10] [11] [12] Revealed: bias found in AI system used to detect UK benefits fraud | Universal credit | The Guardian.
- https://www.theguardian.com/society/2024/dec/06/revealed-bias-found-in-ai-system-used-to-detect-uk-benefits.
- [13] Dutch childcare benefit scandal an urgent wake-up call to ban racist ...
- https://www.amnesty.org/en/latest/news/2021/10/xenophobic-machines-dutch-child-benefit-scandal/.
- [14] [15] [16] How Policy Hidden in an Algorithm is Threatening Families in This Pennsylvania County | American Civil Liberties Union.
- https://www.aclu.org/news/womens-rights/how-policy-hidden-in-an-algorithm-is-threatening-families-in-this-pennsylvania-county.
- [17] [18] [28] Tesla driver killed while using autopilot was watching Harry Potter, witness says | Tesla | The Guardian.
- https://www.theguardian.com/technology/2016/jul/01/tesla-driver-killed-autopilot-self-driving-car-harry-potter.
- [19] [22] [27] "Neurosymbolic Artificial Intelligence (Why, What, and How)" by Amit Sheth, Kaushik Roy et al.
- https://scholarcommons.sc.edu/aii_fac_pub/572/.
- [20] [21] [24] [25] Neurosymbolic Value-Inspired AI (Why, What, and How).
- https://arxiv.org/html/2312.09928v1.
- [23] Explainable Diagnosis Prediction through Neuro-Symbolic Integration.
- https://arxiv.org/html/2410.01855v1.
- [26] Neurosymbolic AI emerges as a potential way to fix AI's reliability ...
- https://fortune.com/2024/12/09/neurosymbolic-ai-deep-learning-symbolic-reasoning-reliability/.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).