Submitted:
15 January 2026
Posted:
16 January 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and methods
2.1. Study Design
2.2. Test Case
2.3. Test-case Images
| Original higher quality (jpeg) | Reduced quality (jpeg) | |||
| Resolution (pixels) | Size (KB) | Resolution (pixels) | Size (KB) | |
| Reference Image | ||||
| Identity A | 640x800 | 188 | 200x250 | 32.4 |
| Questioned Images | ||||
| Identity B (Image 1) | 950x1188 | 557 | 200x250 | 27.5 |
| Identity B (Image 2) | 801x1001 | 522 | 200x250 | 26.9 |
| Identity C (Image 1) | 1184x1481 | 468 | 300x375 | 31.6 |
| Identity C (Image 2) | 1154x1443 | 448 | 180x225 | 25.8 |
2.4. Participation
2.5. Case processing
2.6. Comparison of Images
3. Results
3.1. Opinion Result versus Ground Truth
| Comparison 1 Opinion | Comparison 2 Opinion | Accuracy | ||
| Agency A | Examiner 1 | +2 | -2 | 100 |
| Examiner 2 | +2 | -2 | 100 | |
| Agency B | Examiner 1 | +2 | -2 | 100 |
| Examiner 2 | +2 | -2 | 100 | |
| Agency C | Examiner 1 | +2 | -2 | 100 |
| Examiner 2 | +2 | -2 | 100 |
3.2. Strength of Opinion
| Opinion Scale | Agency 1 | Agency 2 | Agency 3 |
| +3 |
Strong Support for Same Source. The observed similar characteristics far outweigh the observed dissimilar characteristics. |
The analysis conducted strongly supports that the submitted images depict the same person. |
Very strong support for the same source. Other possibilities are considered to be practically ruled out. |
| +2 |
Moderate Support for Same Source. The observed similar characteristics outweigh the observed dissimilar characteristics. |
The analysis conducted moderately supports that the submitted images depict the same person. |
Strong support for same source. Other possibilities are considered to be very small. |
| +1 |
Weak Support for Same Source. The observed similar characteristics slightly outweigh the observed dissimilar characteristics. |
The analysis conducted supports to a certain extent that the submitted images depict the same person. |
Support for same source. Other possibilities are considered to be small. |
| 0 |
Inconclusive. The findings do not differentiate the same source/different source propositions. |
The analysis conducted supports no conclusion regarding whether the submitted images depict the same person or not. |
Inconclusive. Sufficient support for a conclusion for or against same source cannot be found. |
| -1 |
Weak Support for Common Source. The observed dissimilar characteristics slightly outweigh the observed similar characteristics. |
The analysis conducted supports to a certain extent that the submitted images depict different persons. |
Support for different source. Other possibilities are considered to be small. |
| -2 |
Moderate Support for Common Source. The observed dissimilar characteristics outweigh the observed similar characteristics. |
The analysis conducted moderately supports that the submitted images depict different persons. |
Strong support for different source. Other possibilities are considered to be very small. |
| -3 |
Strong Support for Different Source. The observed dissimilar characteristics far outweigh the observed similar characteristics. |
The analysis conducted strongly supports that the submitted images depict different persons. |
Very strong support for different source. Other possibilities are considered to be practically ruled out. |
3.3. Qualitative Results
3.3.1. Case handling
3.3.2. Interpretation and evaluation of observations
Summary of Examiner Observations
- 1)
- Inconclusive was used to suggest uncertainty. Other terms originally recorded were unreliable, insufficiently resolved, competing observations.
- 2)
- Not visible or not able to compare was used for features that were either occluded or not visible because of head angle, etc., but would normally be expected on a face (i.e. ears, forehead, neck).
- 3)
- Not observed was used for features that were not present on either image, and may not always be on a face (i.e. scars, alterations, facial hair).
Trends between Examiners and Agencies

- ▪ Forehead: There were differences in the observations of the forehead where Agency 2 provided an opinion of similar where Agency 1 and 3 determined the feature was not visible. In the examiner notes it showed that both A2E1 and A2E2 felt that there was enough visibility of the brow ridge to compare the feature.
- ▪ Eyes: A1E2 concluded that the eyes were dissimilar whereas all other examiners determined this feature to be similar. Review of the notes found that A1E2 observed both dissimilarities and similarities in the visible feature and sub-feature detail of the eyes, however the overall observation of eyes was determined to be dissimilar. During the evaluation phase, the feature was given little to no weight because of the imaging factors and expression.
- ▪ Neck: There was variation across examiners/agencies, specifically three examiners (A1E1, A1E2, A2E1) noted the feature as dissimilar, noting that the neck was broader/wider in the questioned images, but attributed it to age and assigned little weight to the feature. Three examiners (A2E2, A3E1, A3E2) noted the feature as similar, specifically that the overall neck and Adam’s apple/muscular shape appeared similar, however little weight was attributed in the evaluation.
- ▪ Facial Hair: The observations were broadly the same, with a variation in the feature being identified as not observed versus inconclusive.
- ▪ Facial marks: Three examiners (A1E1, A2E1, A2E2) noted that the comparison of the facial marks was inconclusive, while three examiners (A1E2, A3E1, A3E2) found that the facial marks were dissimilar.

- ▪ Face/Head Outline: One examiner found the overall observation to be similar, however no weight was given to this observation in their evaluation.
- ▪ Hair/Baldness pattern: The observation for this feature varied significantly across examiners. Agency 2 noted that the hairline was not visible as a result of the hair length and style. A1E1 and A3E2 found the texture and shape of the hair to be different. A1E1found the hair length and the angle of the right-side hairline to be dissimilar. A1E2 had competing observations but overall labelled the feature as similar with high uncertainty due to pose and potential grooming. In general, this feature was given little weight in the overall evaluation by those that compared the feature.
- ▪ Forehead: Agency 1 and 3 determined the feature was not visible (occluded by hair), whereas Agency 2 examiners provided an opinion on the feature. A2E1had an inconclusive comparison result, and A2E2 found the general shape of the brow ridge area to be similar.
- ▪ Facial Hair: The observations were broadly the same, with a variation in the feature being identified as not observed versus inconclusive.
- ▪ Facial Marks: The observations for this feature were the same as in Comparison 1, three examiners (A1E1, A2E1, A2E2) noted that the comparison of the facial marks was inconclusive, while three examiners (A1E2, A3E1, A3E2) found that the facial marks were dissimilar.
De-brief with Higher Quality Images
4. Discussion
4.1. Considerations
4.2. Possible sources of error and suggestions for improvement of performance
4.2.1. Case Intake
4.2.2. Case Strategy
5. Conclusions
Authors’ contributions
Declaration of interest statement
Acknowledgments
References
- ISO. ISO/IEC 17025 Testing and calibration laboratories. International Organization for Standardization. 2005. Available online: https://www.iso.org/ISO-IEC-17025-testing-and-calibration-laboratories.html (accessed on 2025 Feb 26).
- ENFSI. Best Practice Manual for Facial Image Comparison. In ENFSI; European Network of Forensic Science Institutes, 2018; Available online: https://enfsi.eu/about-enfsi/structure/working-groups/documents-page/documents/best-practice-manuals/.
- Collaborative Testing Services Inc. Facial Identification Comparison report. Available online: https://cts-forensics.com/program-9.php (accessed on 2025 Feb 26).
- ENFSI. ENFSI, European Network of Forensic Science Institutes Digital Imaging Working Group. Overview. Available online: https://enfsi.eu/about-enfsi/structure/working-groups/digital-imaging/.
- ENFSI. Framework for the Conduct of Proficiency Tests and Collaborative Exercises within ENFSI. European Network of Forensic Science Institutes. 2023. Available online: https://enfsi.eu/wp-content/uploads/2023/08/CQQ-FWK-004-PT-CE.pdf.
- Obertová, Z; Siebke, I; Schüler, G. Challenges of accreditation in forensic fields concerned with human identification: a survey of European facial examiners. Forensic Sci Res. 2024, 9, owae047. Available online: https://academic.oup.com/fsr/article/9/3/owae047/7731316. [CrossRef] [PubMed]
- Vanderkolk, JR. Examination Process. In Fingerprint Sourcebook; Justice, NI, Ed.; National Institute of Justice, 2011; p. p. Chapter 9. Available online: https://nij.ojp.gov/library/publications/fingerprint-sourcebook.
- ASTM. Standard Guide for Facial Image Comparison Feature List for Morphological Analysis; ASTM International, 2022; Available online: https://www.astm.org/e3149-18.html.
- OSAC. 2022-S-0001 Standard Guide for Image Comparison Opinions. The Organization of Scientific Area Committees for Forensic Science; National Institute of Standards and Technology, 2022. Available online: https://www.nist.gov/document/osac-2022-s-0001-standard-guide-image-comparison-opinions-3.
- OSAC. Standard Guide for Developing Discipline Specific Methodology for ACE-V. The Organization of Scientific Area Committees for Forensic Science, National Institute of Standards and Technology. 2020. Available online: https://www.nist.gov/document/standard-guide-developing-discipline-specific-methodology-ace-v.
- Bacci, N.; Briers, N.; Steyn, M. Prioritising quality: investigating the influence of image quality on forensic facial comparison. Int J Legal Med 2024, 138, 1713–1726. [Google Scholar] [CrossRef] [PubMed]
- Towler, A; White, D; Kemp, RI. Evaluating the Feature Comparison Strategy for Forensic Face Identification. 2017, 23(1), 47–58. [Google Scholar] [CrossRef] [PubMed]
- Taylor, M; Bird, C; Bishop, B; Burkes, T; Caligiuri, MP; Found, B; et al. Forensic Handwriting Examination and Human Factors: Improving the Practice Through a Systems Approach; U.S. Department of Commerce, National Institute of Standards and Technology, 2021. [Google Scholar] [CrossRef]
- Regulator, Forensic Science. Forensic Science Regulator Codes of Practice and Conduct Development of Evaluative Opinions. 2021. Available online: www.gov.uk/government/organisations/forensic-science-regulator.
- Quigley-McBride, A; Dror, IE; Roy, T; Garrett, BL; Kukucka, J. A practical tool for information management in forensic decisions: Using Linear Sequential Unmasking-Expanded (LSU-E) in casework. Forensic Sci Int Synergy 2022, 4, 100216. [Google Scholar] [CrossRef] [PubMed]
| 1 | |
| 2 | |
| 3 | |
| 4 | |
| 5 | |
| 5 | The order of the facial feature list has been modified to better illustrate the observations. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).