Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

AI-based Detection of Aspiration for Video-endoscopy with Visual Aids in Meaningful Frames to Interpret the Model Outcome

Version 1 : Received: 29 November 2022 / Approved: 2 December 2022 / Online: 2 December 2022 (12:22:54 CET)

How to cite: Konradi, J.; Zajber, M.; Betz, U.; Drees, P.; Gerken, A.; Meine, H. AI-based Detection of Aspiration for Video-endoscopy with Visual Aids in Meaningful Frames to Interpret the Model Outcome. Preprints 2022, 2022120051. https://doi.org/10.20944/preprints202212.0051.v1 Konradi, J.; Zajber, M.; Betz, U.; Drees, P.; Gerken, A.; Meine, H. AI-based Detection of Aspiration for Video-endoscopy with Visual Aids in Meaningful Frames to Interpret the Model Outcome. Preprints 2022, 2022120051. https://doi.org/10.20944/preprints202212.0051.v1

Abstract

Disorders of swallowing often lead to pneumonia when material enters the airways (aspiration). Flexible Endoscopic Evaluation of Swallowing (FEES) plays a key role in the diagnostics of aspiration but is prone to human errors. An AI-based tool could facilitate this process. Recent non-endoscopic/non-radiologic attempts to detect aspiration using machine-learning approaches have led to unsatisfying accuracy and show black box characteristics. Hence, for clinical users it is hard to trust in these model decisions. Our aim is to introduce an explainable artificial intelligence (XAI) approach to detect aspiration in FEES. Our approach is to teach the AI about the relevant anatomical structures like the vocal cords and the glottis based on 92 annotated FEES videos. Simultaneously, it is trained to detect bolus that passes the glottis and becomes aspirated. During testing, the AI successfully recognized glottis and vocal cords, but could not yet achieve satisfying aspiration detection quality. Albeit detection performance has to be optimized, our architecture results in a final model that explains its assessment by locating meaningful frames with relevant aspiration events and by highlighting the suspected bolus. In contrast to comparable AI tools, our framework is verifiable, interpretable and therefor accountable for clinical users.

Keywords

XAI; segmentation; detection; aspiration; glottis; vocal cords; endoscopy; FEES; interpretability; meaningful sequences; key frames

Subject

Medicine and Pharmacology, Neuroscience and Neurology

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.