Sensor Modality-Aware Human Activity Recognition with the Convolutional Tsetlin Machine: Interpretable and Resource-Efficient Neuro-Symbolic Learning

Olga Tarasyuk; Anatoliy Gorbenko; Oleksandr Gordieiev; Artem Akulynichev; Rishad Shafik; Alex Yakovlev

doi:10.20944/preprints202605.1931.v1

Submitted:

27 May 2026

Posted:

28 May 2026

You are already at the latest version

Abstract

Human activity recognition (HAR) based on smartphone and wearable sensor data is commonly addressed using statistical learning methods and deep neural networks that often provide strong predictive performance, but at the expense of limited interpretability and substantial computational and energy requirements. Such limitations reduce their suitability for deployment in practical sensing environments where model decisions must be transparent, verifiable and executable on resource-constrained devices. In this work, we investigate the Convolutional Tsetlin Machine (CTM) for multimodal HAR using the UCI-HAR dataset. The Tsetlin Machine is a novel neuro-symbolic machine learning approach that offers two important advantages over many conventional machine learning methods: (i) it learns logic-based decision rules that are human-readable and formally verifiable, and (ii) it operates with comparatively low computational complexity, making it well suited to efficient and low-power on-device learning. The proposed study systematically analyses the contribution of different feature modalities by decomposing the inertial signals space into semantically defined subsets according to: (i) sensor source: accelerometer or gyroscope; (ii) physical component: body or gravity; (iii) coordinate: x, y or z. A separate CTM classifier was trained for each modality and their combination in order to determine the relative discriminative value of each modality group for activity classification. In addition to predictive performance the study emphasizes the interpretability of the CTM model ensured by expressing each decision in the form of propositional clauses, thereby enabling visualization and direct inspection of the modality-specific patterns supporting each activity class. Owing to its symbolic structure and modest computational demands, the CTM provides a principled framework for the design of explainable, resource-efficient and deployable HAR systems. The proposed work therefore contributes toward trustworthy multimodal sensing by jointly addressing predictive performance, interpretability and suitability for embedded and mobile platforms.

Keywords:

human activity recognition

;

inertial signal modalities

;

neuro-symbolic machine learning

;

Tsetlin Machine

;

explainability

;

visualization

;

pattern recognition

Subject:

Computer Science and Mathematics - Computer Science

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Sensor Modality-Aware Human Activity Recognition with the Convolutional Tsetlin Machine: Interpretable and Resource-Efficient Neuro-Symbolic Learning

Abstract

Keywords:

Subject:

MDPI Initiatives

Important Links

Subscribe