Submitted:
15 June 2026
Posted:
17 June 2026
You are already at the latest version
Abstract

Keywords:
1. Introduction
1.1. Contribution of This Review
2. Review Method and Scope
3. Conceptual Background
3.1. AI Classification
3.2. Critical Public Services
3.3. Decision Support, De Facto Automation, and Full Automation
3.4. Supervised Learning, Risk Scoring, and Anomaly Detection
3.5. Bias, Explainability, and Accountability
3.6. Accountability as a Sociotechnical Property
4. State of the Art in Public-Sector AI Classification
| Approach | Typical Strength | Main Governance Implication |
|---|---|---|
| Scoring models, logistic regression, and small decision trees | Relatively interpretable and easier to document | Suitable where reasons, appeals, and threshold justification are central, but still require label and subgroup-error review. |
| Ensembles, support vector machines, and gradient boosting | Often strong performance on structured administrative data | Require stronger calibration, subgroup validation, explanation, monitoring, and procurement documentation. |
| Neural networks and NLP classifiers | Useful for images, text, and complex signals | Harder to justify in adverse decisions unless necessity, reviewability, and contestability are explicit. |
| Anomaly detection | Useful where labelled examples are limited | Should trigger further inquiry rather than serve as evidence of liability, fraud, or danger. |
4.1. Current Governance Gaps
5. Critical Analysis of AI Classification Risks
5.1. Accuracy, Thresholds, and Drift
5.2. Fairness and Discrimination
5.3. Transparency, Automation Bias, and Human Oversight
5.4. Security and Adversarial Risk
6. Key Ethical and Accountability Issues
6.1. Error Impact and Harm Analysis
6.2. Contestability and Procedural Justice
| Case | Governance Failure Illustrated | Accountability Lesson |
|---|---|---|
| SyRI welfare-risk system | Opaque risk indication and limited public scrutiny of data-linking logic | Risk classification requires a clear public purpose, proportionality analysis, transparency about data sources, and a route for affected persons to understand and contest suspicion. |
| Robodebt | Weak evidential basis and administrative burden shifted onto citizens | Automated or data-driven debt claims require reliable evidence, preserved records, human review, and correction mechanisms before adverse action is taken. |
| UK visa streaming tool | Triage affected scrutiny while remaining difficult for applicants to inspect | Routing systems can materially affect burden and delay even when they do not formally decide the application; they therefore require auditability and non-discrimination assessment. |
| Ofqual 2020 grading | Model-mediated allocation produced perceived unfairness and constrained individual correction | Aggregate standardisation can conflict with individual justice; affected persons need meaningful reasons, evidence, and review routes when model outputs affect education outcomes. |
6.3. Proportionality, Necessity, and Democratic Control
7. Implications for Auditing and Information Systems
7.1. Technological Risk and Internal Control
7.2. Compliance and Governance Instruments
| Instrument Type | Examples | Role in Classifier Governance |
|---|---|---|
| Binding law and regulation | EU AI Act; GDPR; sector-specific legal duties | Establish legal obligations for high-risk systems, personal-data processing, documentation, oversight, rights, and compliance. |
| Public-law principles | Legality, proportionality, reason-giving, due process | Require justification of purpose, procedure, intervention, and burdens imposed on affected persons. |
| Technical and management standards | ISO/IEC 42001; ISO/IEC 23894; NIST AI RMF | Structure risk management, organisational controls, monitoring, and audit evidence. |
| Ethical and human-rights guidance | OECD, UNESCO, WHO, Council of Europe | Clarify normative expectations for transparency, human rights, safety, equity, and public benefit. |
| Documentation and audit artefacts | Model cards, datasheets, logs, impact assessments, data protection impact assessments | Preserve evidence about design choices, limitations, testing, deployment, data flows, and decisions. |
7.3. Data Governance and Automated Decision-Making
8. Accountability Models and Governance Mechanisms
8.1. Human-in-the-Loop Governance
8.2. Theoretical Foundation of the Accountability Chain
8.3. Public Classification Accountability Chain
| Chain Element | Core Question | Minimum Evidence |
|---|---|---|
| Public purpose | What legitimate public function does classification serve? | Legal basis, policy rationale, necessity and proportionality assessment. |
| Data and label legitimacy | Do the data and label represent the concept being acted upon? | Data provenance, label rationale, bias assessment, missingness and representativeness review. |
| Model and threshold choice | Why this model and operating point? | Validation results, calibration, subgroup errors, threshold rationale, comparison with simpler alternatives. |
| Human review | Can officials challenge the output in practice? | Review procedure, training, override authority, escalation route, review-time evidence. |
| Audit trail | Can a decision be reconstructed? | Model version, input record, score, threshold, explanation, official action, override and appeal logs. |
| Citizen contestability | Can an affected person identify and challenge error? | Notice, reasons, data-correction route, human appeal, evidential preservation. |
| Post-deployment authority | Who can modify or suspend the system? | Monitoring reports, incident process, named owner, suspension and decommissioning criteria. |
8.4. Impact Assessments, Audits, Procurement, and Traceability
9. Practical Implications
10. Domain-Specific Considerations
10.1. Healthcare
10.2. Criminal Justice, Policing, and Public Security
10.3. Border Control and Migration
10.4. Welfare and Social Benefits
10.5. Education
10.6. Fraud Detection, Taxation, and Public Procurement
11. Challenges and Limitations
12. Recommendations
| Audit Question | Evidence Required |
|---|---|
| What is the legal basis and public purpose? | Legal assessment, policy rationale, necessity and proportionality analysis. |
| What target label is being predicted? | Label definition, rationale, proxy-risk analysis, domain-expert review. |
| How were data selected and assessed? | Provenance records, data-quality tests, missingness analysis, representativeness review. |
| Why were the model and threshold chosen? | Validation results, calibration, subgroup error analysis, threshold report, comparison with alternatives. |
| Can humans meaningfully override the output? | Review procedures, training records, override logs, escalation policy. |
| Can a decision be reconstructed and contested? | Logs, model version, input record, score, explanation, notice, appeal route, correction records. |
| Who can suspend or redesign the system? | Named governance owner, monitoring reports, incident process, suspension criteria. |
13. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Barocas, S., Hardt, M., & Narayanan, A. (2023). Fairness and machine learning: Limitations and opportunities. MIT Press.
- Busuioc, M. (2021). Accountable artificial intelligence: Holding algorithms to account. Public Administration Review, 81(5), 825–836.
- Levy, K. E. C., Chasalow, K. E., & Riley, S. (2021). Algorithms and decision-making in the public sector. Annual Review of Law and Social Science, 17, 309–334.
- Meijer, A., & Wessels, M. (2019). Predictive policing: Review of benefits and drawbacks. International Journal of Public Administration, 42(12), 1031–1039.
- European Parliament and Council of the European Union. (2024). Regulation (EU) 2024/1689 of 13 June 2024 laying down harmonised rules on artificial intelligence (Artificial Intelligence Act). Official Journal of the European Union, L 2024/1689, 12 July 2024.
- Gebru, T., Morgenstern, J., Vecchione, B., Vaughan, J. W., Wallach, H., Daumé III, H., & Crawford, K. (2021). Datasheets for datasets. Communications of the ACM, 64(12), 86–92.
- Margetts, H., & Dorobantu, C. (2019). Rethink government with AI. Nature, 568(7751), 163–165.
- Mehrabi, N., Morstatter, F., Saxena, N., Lerman, K., & Galstyan, A. (2021). A survey on bias and fairness in machine learning. ACM Computing Surveys, 54(6), Article 115, 1–35.
- Mitchell, M., Wu, S., Zaldivar, A., Barnes, P., Vasserman, L., Hutchinson, B., Spitzer, E., Raji, I. D., & Gebru, T. (2019). Model cards for model reporting. Proceedings of the Conference on Fairness, Accountability, and Transparency, 220–229.
- National Institute of Standards and Technology. (2023). Artificial Intelligence Risk Management Framework (AI RMF 1.0). NIST AI 100-1. NIST.
- OECD. (2019). Recommendation of the Council on Artificial Intelligence. OECD/LEGAL/0449.
- Obermeyer, Z., Powers, B., Vogeli, C., & Mullainathan, S. (2019). Dissecting racial bias in an algorithm used to manage the health of populations. Science, 366(6464), 447–453.
- Raji, I. D., Smart, A., White, R. N., Mitchell, M., Gebru, T., Hutchinson, B., Smith-Loud, J., Theron, D., & Barnes, P. (2020). Closing the AI accountability gap: Defining an end-to-end framework for internal algorithmic auditing. Proceedings of the ACM Conference on Fairness, Accountability, and Transparency, 33–44.
- Rudin, C. (2019). Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature Machine Intelligence, 1, 206–215.
- Selbst, A. D., Boyd, D., Friedler, S. A., Venkatasubramanian, S., & Vertesi, J. (2019). Fairness and abstraction in sociotechnical systems. Proceedings of the ACM Conference on Fairness, Accountability, and Transparency, 59–68.
- Wirtz, B. W., Weyerer, J. C., & Geyer, C. (2019). Artificial intelligence and the public sector: Applications and challenges. International Journal of Public Administration, 42(7), 596–615.
- Council of Europe. (2024). Council of Europe Framework Convention on Artificial Intelligence and Human Rights, Democracy and the Rule of Law. Council of Europe.
- Electronic Immigration Network. (2020). Home Office suspends use of digital streaming tool for visa applications after legal action by JCWI and Foxglove. Electronic Immigration Network, 9 August 2020.
- International Organization for Standardization. (2023a). ISO/IEC 42001:2023 Information technology—Artificial intelligence—Management system. ISO.
- International Organization for Standardization. (2023b). ISO/IEC 23894:2023 Information technology—Artificial intelligence—Guidance on risk management. ISO.
- Office of Qualifications and Examinations Regulation. (2020a). Awarding GCSE, AS & A levels in summer 2020: Interim report. Ofqual.
- Office of Qualifications and Examinations Regulation. (2020b). Summer 2020 results analysis: GCSE, AS and A level—Update to the interim report. Ofqual.
- Rechtbank Den Haag. (2020). Judgment of 5 February 2020, ECLI:NL:RBDHA:2020:1878. De Rechtspraak.
- Royal Commission into the Robodebt Scheme. (2023). Report of the Royal Commission into the Robodebt Scheme. Commonwealth of Australia.
- UK Parliament. (2019). Visa processing algorithms. Hansard, HC Deb, 19 June 2019.
- UNESCO. (2021). Recommendation on the Ethics of Artificial Intelligence. UNESCO.
- World Economic Forum. (2020). AI Procurement in a Box: AI Government Procurement Guidelines. World Economic Forum.
- World Health Organization. (2021). Ethics and governance of artificial intelligence for health: WHO guidance. World Health Organization.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).