Submitted:
31 December 2024
Posted:
31 December 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
- We propose an attention-based model that optimally leverages both phase and amplitude components to improve HAR performance. Both features are input into temporal and channel attention mechanisms, allowing comprehensive data utilization across temporal and spatial domains.
- We introduce a Gated Recurrent Network (GRN) architecture that integrates these attention mechanisms for both phase and amplitude signals, yielding more robust and accurate classification results.
- Our model is rigorously tested on three datasets, including two publicly available datasets and our own, demonstrating superior accuracy and performance relative to existing state-of-the-art (SOTA) models.
2. Literature Review
2.1. Wi-Fi CSI Based on CNN and LSTM Approaches
2.2. Transformer-Based Approaches
2. Materials and Methods
3.1. Channel State Information (CSI)
3.2. Datasets
3.2.1. StanWiFi
3.2.2. Multiple Environment (MultiEnv)
3.2.3. Our Research Team Dataset (MINE Lab Dataset)
3.3. Methods
3.3.1. Preprocessing: Kalman Filter
3.3.2. Preprocessing: Sliding Windows
3.3.3. Preprocessing: Time Alignment
3.3.4. Preprocessing: Feature Extraction
3.3.5. Normalization
3.3.6. Phase Unwrapping
3.3.7. Gaussian Range Encoding
3.3.8. Multi-Scale Convolution Augmented Transformer (MCAT) Layer
3.3.9. Gate Residual Networks (GRNs)
4. Experimental Evaluation
4.1. Hyperprameters
4.2. Experimental Results on StanWiFi and MINE Lab Datasets
4.3. Experimental Results on MultiEnv Dataset
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Kaseris, M.; Kostavelis, I.; Malassiotis, S. A Comprehensive Survey on Deep Learning Methods in Human Activity Recognition. Mach Learn Knowl Extr 2024, 6, 842–876. [Google Scholar] [CrossRef]
- Aggarwal, J.K.; Ryoo, M.S. Human Activity Analysis: A Review. ACM Comput Surv 2011, 43. [Google Scholar] [CrossRef]
- Li, X.; He, Y.; Jing, X. A Survey of Deep Learning-Based Human Activity Recognition in Radar. Remote Sens (Basel) 2019, 11. [Google Scholar] [CrossRef]
- Zhuravchak, A.; Kapshii, O.; Pournaras, E. Human Activity Recognition Based on Wi-Fi CSI Data -A Deep Neural Network Approach. In Proceedings of the Procedia Computer Science; Elsevier B.V., 2021; Vol. 198; pp. 59–66. [Google Scholar]
- Chen, C.; Zhou, G.; Lin, Y. Cross-Domain WiFi Sensing with Channel State Information: A Survey. ACM Comput Surv 2023, 55. [Google Scholar] [CrossRef]
- Chih-Yang Lin; Chia-Yu Lin; Yu-Tso Liu; Yi-Wei Chen; Timothy K. Shih WiFi-TCN: Temporal Convolution for Human Interaction Recognition Based on WiFi Signal. IEEE Access 2024, 12, 126970–126982. [Google Scholar] [CrossRef]
- Yang, Z.; Qian, K.; Wu, C.; Zhang, Y. Smart Wireless Sensing: From IoT to AIoT; Springer Nature, 2021; ISBN 9789811656583.
- Jannat, M.K.A.; Islam, M.S.; Yang, S.H.; Liu, H. Efficient Wi-Fi-Based Human Activity Recognition Using Adaptive Antenna Elimination. IEEE Access 2023, 11, 105440–105454. [Google Scholar] [CrossRef]
- Stephan Sigg; Ulf Blanke; Gerhard Troster The Telepathic Phone: Frictionless Activity Recognition from WiFi-RSSI.; IEEE, 2014.
- Gu, Y.; Ren, F.; Li, J. PAWS: Passive Human Activity Recognition Based on WiFi Ambient Signals. IEEE Internet Things J 2016, 3, 796–805. [Google Scholar] [CrossRef]
- Yu Gu; Lianghu Quan; Fuji Ren WiFi-Assisted Human Activity Recognition. In Proceedings of the Wireless and Mobile, 2014 IEEE Asia Pacific Conference; IEEE, 14; pp. 60–65. 20 August.
- Stephan Sigg; Shuyu Shi; Felix Buesching Leveraging RF-Channel Fluctuation for Activity Recognition: Active and Passive Systems, Continuous and RSSI-Based Signal Features. In Proceedings of the MoMM 2013 : the 11th International Conference on Advances in Mobile Computing and Multimedia; Association for Computing Machinery, 13; p. 599. 20 December.
- Moustafa Youssef; Matthew Mah; Ashok Agrawala Challenges: Device-Free Passive Localization for Wireless Environments. In Proceedings of the Proceedings of the 13th annual ACM international conference on Mobile computing and networking; ACM Digital Library, 07; pp. 222–229. 20 September.
- Rathnayake, R.M.M.R.; Maduranga, M.W.P.; Tilwari, V.; Dissanayake, M.B. RSSI and Machine Learning-Based Indoor Localization Systems for Smart Cities. Eng 2023, 4, 1468–1494. [Google Scholar] [CrossRef]
- Alsaify, B.A.; Almazari, M.M.; Alazrai, R.; Daoud, M.I. A Dataset for Wi-Fi-Based Human Activity Recognition in Line-of-Sight and Non-Line-of-Sight Indoor Environments. Data Brief 2020, 33. [Google Scholar] [CrossRef]
- Yousefi, S.; Narui, H.; Dayal, S.; Ermon, S.; Valaee, S. A Survey on Behavior Recognition Using WiFi Channel State Information. IEEE Communications Magazine 2017, 55, 98–104. [Google Scholar] [CrossRef]
- Zhang, Y.; Zheng, Y.; Qian, K.; Zhang, G.; Liu, Y.; Wu, C.; Yang, Z. Widar3.0: Zero-Effort Cross-Domain Gesture Recognition With Wi-Fi. IEEE Trans Pattern Anal Mach Intell 2022, 44, 8671–8688. [Google Scholar] [CrossRef]
- Guo, L.; Guo, S.; Wang, L.; Lin, C.; Liu, J.; Lu, B.; Fang, J.; Liu, Z.; Shan, Z.; Yang, J. Wiar: A Public Dataset for Wifi-Based Activity Recognition. IEEE Access 2019, 7, 154935–154945. [Google Scholar] [CrossRef]
- Yang, J.; Chen, X.; Zou, H.; Lu, C.X.; Wang, D.; Sun, S.; Xie, L. SenseFi: A Library and Benchmark on Deep-Learning-Empowered WiFi Human Sensing. Patterns 2023, 4. [Google Scholar] [CrossRef]
- Meneghello, F.; Fabbro, N.D.; Garlisi, D.; Tinnirello, I.; Rossi, M. A CSI Dataset for Wireless Human Sensing on 80 MHz Wi-Fi Channels. IEEE Communications Magazine 2023. [Google Scholar] [CrossRef]
- Wang, W.; Liu, A.X.; Shahzad, M.; Ling, K.; Lu, S. Device-Free Human Activity Recognition Using Commercial WiFi Devices. In Proceedings of the IEEE Journal on Selected Areas in Communications; Institute of Electrical and Electronics Engineers Inc., May 1 2017; Vol. 35; pp. 1118–1131. [Google Scholar]
- Alsaify, B.A.; Almazari, M.; Alazrai, R.; Alouneh, S.; Daoud, M.I. A CSI-Based Multi-Environment Human Activity Recognition Framework. Applied Sciences (Switzerland) 2022, 12. [Google Scholar] [CrossRef]
- Chen, Z.; Zhang, L.; Jiang, C.; Cao, Z.; Cui, W. WiFi CSI Based Passive Human Activity Recognition Using Attention Based BLSTM. IEEE Trans Mob Comput 2019, 18, 2714–2724. [Google Scholar] [CrossRef]
- Yadav, S.K.; Sai, S.; Gundewar, A.; Rathore, H.; Tiwari, K.; Pandey, H.M.; Mathur, M. CSITime: Privacy-Preserving Human Activity Recognition Using WiFi Channel State Information. Neural Networks 2022, 146, 11–21. [Google Scholar] [CrossRef] [PubMed]
- Fard Moshiri, P.; Shahbazian, R.; Nabati, M.; Ghorashi, S.A. A CSI-Based Human Activity Recognition Using Deep Learning. Sensors (Basel) 2021, 21. [Google Scholar] [CrossRef] [PubMed]
- Salehinejad, H.; Valaee, S. LiteHAR: Lightweight Human Activity Recognition from WiFi Signals with Random Convolution Kernels. 2022.
- Shalaby, E.; ElShennawy, N.; Sarhan, A. Utilizing Deep Learning Models in CSI-Based Human Activity Recognition. Neural Comput Appl 2022, 34, 5993–6010. [Google Scholar] [CrossRef] [PubMed]
- Islam, M.S.; Jannat, M.K.A.; Hossain, M.N.; Kim, W.S.; Lee, S.W.; Yang, S.H. STC-NLSTMNet: An Improved Human Activity Recognition Method Using Convolutional Neural Network with NLSTM from WiFi CSI. Sensors 2023, 23. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention Is All You Need. 2017.
- Ding, X.; Jiang, T.; Zhong, Y.; Wu, S.; Yang, J.; Zeng, J. Wi-Fi-Based Location-Independent Human Activity Recognition with Attention Mechanism Enhanced Method. Electronics (Switzerland) 2022, 11. [Google Scholar] [CrossRef]
- Yang, M.; Zhu, H.; Zhu, R.; Wu, F.; Yin, L.; Yang, Y. WiTransformer: A Novel Robust Gesture Recognition Sensing Model with WiFi. Sensors 2023, 23. [Google Scholar] [CrossRef]
- Li, B.; Cui, W.; Wang, W.; Zhang, L.; Chen, Z.; Wu, M. 2021.
- Zeng, Y.; Wu, D.; Gao, R.; Gu, T.; Zhang, D. FullBreathe. Proc ACM Interact Mob Wearable Ubiquitous Technol 2018, 2, 1–19. [Google Scholar] [CrossRef]
- Li, C.; Brauer, J.; Sezgin, A.; Zenger, C. Kalman Filter Based MIMO CSI Phase Recovery for COTS WiFi Devices. In Proceedings of the ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; Institute of Electrical and Electronics Engineers Inc., 2021; Vol. 2021-June; pp. 4820–4824. [Google Scholar]
- Savarese, P.; Figueiredo, D. A: Gates, 2017.
- Lim, B.; Arik, S.O.; Loeff, N.; Pfister, T. Temporal Fusion Transformers for Interpretable Multi-Horizon Time Series Forecasting. 2019.
- Stragapede, G.; Delgado-Santos, P.; Tolosana, R.; Vera-Rodriguez, R.; Guest, R.; Morales, A. Mobile Keystroke Biometrics Using Transformers. 2022.
- Ba, J.L.; Kiros, J.R.; Hinton, G.E. Layer Normalization. 2016.
- Qin, Z.; Yang, S.; Zhong, Y. Hierarchically Gated Recurrent Neural Network for Sequence Modeling. 2023.











| Source | Dataset name | Description | |
|---|---|---|---|
| Alsaify et al. (2017) [15] | MultiEnv | This dataset was collected in three scenarios: line-of-sight (LOS) in both the office and hall, and non-line-of-sight (NLOS). | |
| Yousefi et al. (2017) [16] | StanWiFi | This dataset contains continuous CSI data for six activities without precise segmentation timestamps for each sample. | |
| Yang et al. (2019)[17] | Widar 3.0 | This large dataset, collected using Intel 5300 NIC with 30 subcarriers and containing 258K Wi-Fi-based hand gesture instances spanning 8,620 minutes across 75 domains. | |
| Linlin et al. (2019)[18] | WiAR | This dataset includes 16 activities, comprising coarse-grained activities and gestures, performed 30 times each, by ten volunteers. | |
| Yang et al. (2023) [19] | NTU-Fi | Collected using the Atheros CSI tool, this dataset features 114 subcarriers per antenna pair, and it includes six human activities and 14 gait patterns. | |
| Francesca et al. (2023) [20] | WiFi-80MHz | Collected using two Netgear X4S AC2600 IEEE 802.11ac routers with 256 subcarriers (242 usable), this dataset features ten subjects and three applications | |
| Class | Activity | Description |
|---|---|---|
| 0 | No movement | Sitting, standing, or lying on the ground |
| 1 | Falling | Falling from a standing position or from a chair |
| 2 | Sitting down or Standing up |
Sitting down on a chair or standing up from a chair |
| 3 | Walking | Walking between the transmitter and receiver |
| 4 | Turning | Turning at the transmitter's or receiver's location |
| 5 | Picking up | Picking up an object such as a pen from the ground |
| Hyperparameter | Values | |
|---|---|---|
| Window size | StanWifi dataset: 2000; and our own dataset: 1000 | |
| Stride size | StanWifi dataset: 200; our own dataset: 100 | |
| K-Gaussian encoding | 10 | |
| ) | StanWifi dataset: (2000,90); MultiEnv dataset: (850,90); MINE lab dataset: (1000,90) | |
| Filter size in Multi-scale CNN (horizontal, vertical) | Horizontal: {10, 40}, Vertical: {2, 4} | |
| The number of heads in the multi-head self-attention mechanism | h-head: 9, v-head: 50 | |
| Dropout rate | 0.1 | |
| Number of dense layers in GRN | 256 | |
| Optimizer | Adam (learning rate = 0.001, decay rate = 0.9) | |
| Batch size | 8 | |
| Epochs | 200 | |
| Training environment | NVIDIA GeForce RTX 3060 with CUDA v. 12.4, Python 3.11, TensorFlow 2.16 |
| Source | Model | Acc | Pre | Recall | F1-score | |
|---|---|---|---|---|---|---|
| Bing et al.[32] | THAT (2021) | 98.20 | - | - | - | |
| Yadav et al. [24] | CSITime (2022) | 98.00 | - | - | - | |
| Salehinejad et al. [26] | LiteHAR (2022) | 98.00 | 99.16 | 98.87 | 99.01 | |
| Salaby et al. [27] | CNN-GRU (2022) | 99.31 | 99.5 | 99.43 | - | |
| Islam et al. [28] | STC-NLSTMNet (2023) | 99.88 | 99.72 | 99.73 | - | |
| Jannat et al. [8] | AAE+RF (2023) | 99.84 | 99.82 | 99.83 | 99.81 | |
| Ours | PA-CSI (2024) | 99.93 | 99.86 | 99.95 | 99.95 |
| Source | Model | Acc | Pre | Recall | F1-score | |
|---|---|---|---|---|---|---|
| Bing et al. [32] | THAT (2021) | 97.00 | 97.00 | 97.00 | 97.00 | |
| Ours | PA-CSI (2024) | 99.24 | 99.24 | 99.24 | 99.24 |
| Environment | Source | Model | Acc | Pre | Recall | F1-score | |
|---|---|---|---|---|---|---|---|
| E1: Office (LOS) |
Alsaify et al. [16] | SVM (2021) | 94.03 | - | - | - | |
| Bing et al. [32] | THAT (2021) | 98.95 | 98.28 | 98.26 | 98.26 | ||
| Alsaify et al. [22] | SVM (2022) | 91.27 | - | - | - | ||
| Islam et al. [28] | STC-NLSTMNet (2023) | 98.20 | 98.10 | 98.08 | 98.09 | ||
| Jannat et al. [8] | AAE+RF (2023) | 97.65 | 96.42 | 96.41 | 94.40 | ||
| Ours | PA-CSI (2024) | 99.47 | 99.48 | 99.47 | 99.47 | ||
| E2: Hall (LOS) |
Alsaify et al. [16] | SVM (2021) | 94.03 | - | - | - | |
| Bing et al. [32] | THAT (2021) | 97.39 | 97.24 | 97.22 | 97.22 | ||
| Alsaify et al. [22] | SVM (2022) | 91.27 | - | - | - | ||
| Islam et al. [28] | STC-NLSTMNet (2023) | 96.65 | 96.54 | 96.41 | 96.48 | ||
| Ours | PA-CSI (2024) | 98.43 | 98.01 | 97.90 | 97.90 | ||
| E3: Room and hall (NLOS) |
Bing et al. [32] | THAT (2021) | 97.56 | 97.04 | 97.04 | 97.03 | |
| Islam et al. [28] | STC-NLSTMNet (2023) | 94.68 | 94.57 | 94.55 | 94.56 | ||
| Jannat et al. [8] | AAE+RF (2023) | 93.33 | 93.12 | 93.07 | 93.14 | ||
| Ours | PA-CSI | 98.78 | 98.79 | 98.78 | 98.78 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).