Submitted:
17 July 2025
Posted:
17 July 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. System Architecture

2.1. Video Processing Module
2.1.1. Face Processing
| Algorithm/ Library |
Number of Landmarks |
Detection Time (second) |
Detection Rate1 | Facial Expression | Iris Tracking |
|---|---|---|---|---|---|
| Dlib [27] | 68 | 0.036 | 0.78 | Some facial expressions can be computed manually2 | No |
| Mediapipe [30] | 478 | 0.0037 | 1 | 52 built-in blendshapes | Yes |
| Haar Cascade [28] | 0 | 0.0056 | 0.65 | No | No |
| MTCNN [31] | 5 | 0.215 | 1 | No | No |
| YuNet [32] | 5 | 0.026 | 0.93 | No | No |

2.1.2. Mapping Feature to Mouse Coordinate

2.1.3. Adaptive Movement Signal Filtering and Acceleration

2.1.4. Mapping Facial Expressions to Mouse and Keyboard Actions
| System | #Facial Expression |
Mouse Control | Keyboard Control | System Control | Triggering Mechanism |
|---|---|---|---|---|---|
| EMKEY [19] | 1 | - | - | x | Predefined Threshold |
| CameraMouseAI [22] | 2 | x | - | - | User-Defined Threshold |
| Project Gameface [23] | 8 | x | x | - | User-Defined Threshold |
| Zelinskyi et al. [33] | 8 | x | - | - | Predefined Threshold |
| 3M-HCI (Ours) | 13 | x | x | x | User-Defined Threshold with Priority |
2.2. Voice Processing Module

3. Materials and Methods
3.1. Development Platform

3.2. Testing Methodology

| Question | Description |
|---|---|
| Q1 | Does it take a lot of time to master the application? |
| Q2 | Is the response of left/right mouse click fast? |
| Q3 | Is the cursor movement responsive? |
| Q4 | Is it difficult to click the left/right mouse button? |
| Q5 | Is it difficult to move the cursor precisely? |
| Q6 | Is it difficult to move the cursor vertically? |
| Q7 | Is it difficult to move the cursor horizontally? |
| Q8 | Does moving the cursor cause fatigue? |
| Q9 | Do you think this mouse system can be applied for people with disabilities? |
4. Results and Discussion
4.1. Robustness Testing Under Environmental and Hardware Variations
4.1.1. Lighting Condition Test
- Bright Environment (Figure 8a, 8b): Whether in a brightly lit room or a dim room with high screen brightness, the system performed flawlessly. Facial landmarks were immediate and accurate. Mouse control operated smoothly without any jitter or delay. This represents the optimal environment for system usage.
- Dim Room with Medium Screen Brightness (Figure 8c): Under significantly darker conditions, where only moderate screen brightness was present, the system remained functional. Facial landmarks still worked, but occasional instability in mouse movement was observed. The system was still usable with minor degradation.
- Near-total Darkness with Low Screen Brightness (Figure 8d): In the most extreme case, with no external light and very low screen brightness, the system struggled. Although the Mediapipe framework could still detect the facial landmarks. However, the detection was inconsistent and unreliable. Landmarks often flickered or were lost entirely, making interaction with the system ineffective in this condition

4.1.2. Multiple Faces in a Frame
4.1.3. Background Noise
4.1.4. Different Hardware and Software Features of the Computer
| Laptop | CPU | RAM | OS | Overall Performance |
Computational Cost (3M-HCI) |
Computational Cost [6] |
Computational Cost [7] |
|---|---|---|---|---|---|---|---|
| Dell Inspiron 15 3530 | Intel Core i7-1355U | 16GB | Windows 11 | Excellent | 12.7% | 15.1% | 26.7% |
| Dell XPS 13 9360 | Intel Core i7-7660U | 16GB | Windows 10 | OK. The microphone takes time to boot | 49.9% | 60.3% | Unable to run |
| Dell G15 5530 | Intel Core i7-13650HX | 16GB | Windows 11 | Excellent | 10.7% | 23% | 8.5% |
| Lenovo ThinkPad T480 | Intel Core i5-8350U | 8GB | Windows 11 | OK. The program is a bit laggy | 52.8% | 57.7% | 31.2% |
| Dell Precision 7510 | Intel Core i7-6820HQ | 16GB | Windows 10 | Excellent | 37.9% | 41.5% | 18.8% |
| MSI GF63 Thin 11UD | Intel Core i7-11800H | 16GB | Windows 11 |
OK. The microphone takes time to boot | 19.7% | 28.3% | Unable to run |
| Dell Inspiron 16 5620 | Intel Core i5-1240P | 16GB | Windows 11 | Excellent | 6.1% | 20.56% | 16.2% |
| HP Laptop 16-d0xxx | Intel Core i5-11400H | 8GB | Windows 11 | Excellent | 24.7% | 19.1% | 11.05% |
| ASUS TUF Gaming F15 | Intel Core i5-10300H | 16GB | Windows 11 | Some commands cannot be recognized. | 41.88% | 47.37% | 23.4% |
| MSI Modern 15 | AMD Ryzen 5 5500U | 12GB | Windows 11 | Voice command runs badly. | 34.6% | 47.4% | 17.2% |
4.2. Empirical Task-Based Test
4.2.1. System Accuracy


4.2.2. System Responsiveness.

4.2.3. System Jitterness

4.3. Survey Results
| Question | Description | CameraMouseAI | Project GameFace | 3M-HCI (Ours) | Mouse |
|---|---|---|---|---|---|
| Q1 | Does it take a lot of time to master the application? | 3.5 ± 1.87 | 6.75 ± 1.98 | 7.25 ± 1.71 | 10.0 ± 0.0 |
| Q2 | Is the response of left/right mouse click fast? | 3.5 ± 2.35 | 7.5 ± 1.58 | 8.25 ± 1.09 | 10.0 ± 0.0 |
| Q3 | Is the cursor movement responsive? | 5.38 ± 2.06 | 6.62 ± 1.11 | 8.88 ± 0.6 | 10.0 ± 0.0 |
| Q4 | Is it difficult to click the left/right mouse button? | 4 ± 2.55 | 6.25 ± 1.56 | 7 ± 1.87 | 10.0 ± 0.0 |
| Q5 | Is it difficult to move the cursor precisely? | 3.5 ± 2.45 | 6.87 ± 1.17 | 8.37 ± 0.7 | 10.0 ± 0.0 |
| Q6 | Is it difficult to move the cursor vertically? | 4.5 ± 2.18 | 7.5 ± 1.41 | 8.37 ± 0.86 | 10.0 ± 0.0 |
| Q7 | Is it difficult to move the cursor horizontally? | 4.5 ± 2.18 | 7.5 ± 1.41 | 8.37 ± 0.86 | 10.0 ± 0.0 |
| Q8 | Does moving the cursor cause fatigue? | 2.62 ± 1.93 | 7 ± 1.87 | 7.25 ± 1.79 | 9.87 ± 0.33 |
| Q9 | Do you think this mouse system can be applied for people with disabilities? | 4 ± 3.53 | 7.12 ± 2.52 | 7.85 ± 2.71 | 7.75 ± 3.9 |
4.4. Limitations
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| 3M-HCI | 3-Modal Human-Computer Interaction |
| ALS | Amyotrophic Lateral Sclerosis (ALS) |
| AI | Artificial Intelligence |
| CNN | Convolution Neural Network |
| CPU | Central Processing Unit |
| GPU | Graphics Processing Unit |
| RAM | Random Access Memory |
| RGB | Red Green Blue |
| OS | Operating System |
References
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You only look once: Unified, real-time object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2016), Las Vegas, NV, USA, 27–30 June 2016; pp. 779–788. [CrossRef]
- Ramachandra, C.K.; Joseph, A. IEyeGASE: An Intelligent Eye Gaze-Based Assessment System for Deeper Insights into Learner Performance. Sensors 2021, 21(20), 6783. [CrossRef]
- Walle, H.; De Runz, C.; Serres, B.; Venturini, G. A Survey on Recent Advances in AI and Vision-Based Methods for Helping and Guiding Visually Impaired People. Appl. Sci. 2022, 12(5), 2308. [CrossRef]
- Ramos, P.; Zapata, M.; Valencia, K.; Vargas, V.; Ramos-Galarza, C. Low-Cost Human–Machine Interface for Computer Control with Facial Landmark Detection and Voice Commands. Sensors 2022, 22(23), 9279. [CrossRef]
- Zapata, M.; Valencia-Aragón, K.; Ramos-Galarza, C. Experimental Evaluation of EMKEY: An Assistive Technology for People with Upper Limb Disabilities. Sensors 2023, 23(8), 4049. [CrossRef]
- Project Gameface. Available online: https://blog.google/technology/ai/google-project-gameface/ (accessed on 14 July 2025).
- MacLellan, L.E.; Stepp, C.E.; Fager, S.K.; Mentis, M.; Boucher, A.R.; Abur, D.; Cler, G.J. Evaluating Camera Mouse as a computer access system for augmentative and alternative communication in cerebral palsy: a case study. Assist. Technol. 2024, 36(3), 217–223. [CrossRef]
- Karimli, F.; Yu, H.; Jain, S.; Akosah, E.S.; Betke, M.; Feng, W. Demonstration of CameraMouseAI: A Head-Based Mouse-Control System for People with Severe Motor Disabilities. In Proceedings of the 26th ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2024), Atlanta, GA, USA, 7–10 October 2024; pp. 124:1–124:6. [CrossRef]
- Yuan, B.; Hu, D.; Gu, S.; Xiao, S.; Song, F. The global burden of traumatic amputation in 204 countries and territories. Front. Public Health 2023, 11, 1258853. [CrossRef]
- Mehta, P.; Raymond, J.; Nair, T.; Han, M.; Berry, J.; Punjani, R.; Larson, T.; Mohidul, S.; Horton, D.K. Amyotrophic lateral sclerosis estimated prevalence cases from 2022 to 2030, data from the National ALS Registry. Amyotroph. Lateral Scler. Frontotemporal Degener. 2025, 26(3-4), 290-295. [CrossRef]
- Chen, Y.-L. Application of tilt sensors in human–computer mouse interface for people with disabilities. IEEE Trans. Neural Syst. Rehabil. Eng. 2001, 9(3), 289–294. [CrossRef]
- Mishra, M.; Bhalla, A.; Kharad, S.; Yadav, D. Int. J. Recent Innov. Trends Comput. Commun. 2017, 5(5), 576–583.
- Ribas-Xirgo, L.; López-Varquiel, F. Accelerometer-Based Computer Mouse for People with Special Needs. J. Access. Des. All. 2017, 7, 1–20. [CrossRef]
- Kim, S.; Park, M.; Anumas, S.; Yoo, J. Head Mouse System Based on Gyro- and Opto-Sensors. In Proceedings of the International Conference on Biomedical Engineering and Informatics, Yantai, China, 16–18 October 2010. [CrossRef]
- Pereira, C.A.M.; Bolliger Neto, R.; Reynaldo, A.C.; Luzo, M.C.M.; Oliveira, R.P. Development and evaluation of a head-controlled human-computer interface with mouse-like functions for physically disabled users. Clinics 2009, 64(10), 975–981. [CrossRef]
- Lin, C.-S.; Ho, C.-W.; Chan, C.-N.; Chau, C.-R.; Wu, Y.-C.; Yeh, M.-S. An Eye-Tracking and Head-Control System Using Movement Increment-Coordinate Method. Opt. Laser Technol. 2007, 39(6), 1218–1225. [CrossRef]
- Betke, M.; Gips, J.; Fleming, P. The Camera Mouse: Visual Tracking of Body Features to Provide Computer Access for People with Severe Disabilities. IEEE Trans. Neural Syst. Rehabil. Eng. 2002, 10(1), 1–10. [CrossRef]
- Su, M.C.; Su, S.Y.; Chen, G.D. A low-cost vision-based human-computer interface for people with severe disabilities. Biomed. Eng. Appl. Basis Commun. 2005, 17, 284–292. [CrossRef]
- Naizhong, Z.; Jing, W.; Jun, W. Hand-free head mouse control based on mouth tracking. In Proceedings of the IEEE International Conference on Computational Science and Education (ICCSE 2015), Cambridge, UK, 22–24 July 2015. [CrossRef]
- Arai, K.; Mardiyanto, R. Camera as Mouse and Keyboard for Handicap Person with Troubleshooting Ability, Recovery, and Complete Mouse Events. Int. J. Hum. Comput. Interact. 2010, 1(3), 46–56.
- Ismail, A.; Al Hajjar, A.E.S.; Hajjar, M. A prototype system for controlling a computer by head movements and voice commands. arXiv 2011, arXiv:1109.1454. [CrossRef]
- Sawicki, D.; Kowalczyk, P. Head Movement Based Interaction in Mobility. Int. J. Hum.-Comput. Interact. 2017, 34, 653–665. [CrossRef]
- Abiyev, R.H.; Arslan, M. Head mouse control system for people with disabilities. Expert Syst. 2019, 37(1), e12398. [CrossRef]
- Rahmaniar, W.; Ma’Arif, A.; Lin, T.-L. Touchless Head-Control (THC): Head Gesture Recognition for Cursor and Orientation Control. IEEE Trans. Neural Syst. Rehabil. Eng. 2022, 30, 1817–1828. [CrossRef]
- Zelinskyi, S.; Boyko, Y. Using Facial Expressions for Custom Actions: Development and Evaluation of a Hands-Free Interaction Method. Computer Syst. Inf. Technol. 2024, 4, 116-125. [CrossRef]
- Zhang, H.; Yin, L.; Zhang, H. A real-time camera-based gaze-tracking system involving dual interactive modes and its application in gaming. Multimed. Syst. 2024, 30, 15. [CrossRef]
- Dlib. Available online: https://dlib.net/python/ (accessed on 17 July 2025).
- Viola, P.A.; Jones, M. Rapid Object Detection using a Boosted Cascade of Simple Features. In Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), Kauai, HI, USA, 8–14 December 2001; pp. 511–518. [CrossRef]
- Singh, J.; Modi, N. A robust, real-time camera-based eye gaze tracking system to analyze users’ visual attention using deep learning. Interact. Learn. Environ. 2022, 30, 409–430. [CrossRef]
- Mediapipe. Available online: https://ai.google.dev/edge/mediapipe/solutions/guide (accessed on 15 July 2025).
- Zhang, K.; Zhang, Z.; Li, Z.; Qiao, Y. Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks. IEEE Signal Process. Lett. 2016, 23, 1499–1503. [CrossRef]
- Wu, W.; Peng, H.; Yu, S. YuNet: A Tiny Millisecond-level Face Detector. Mach. Intell. Res. 2023, 20, 656–665. [CrossRef]
- Casiez, G.; Roussel, N.; Vogel, D. 1 € Filter: A Simple Speed-based Low-pass Filter for Noisy Input in Interactive Systems. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI ’12), Austin, TX, USA, 5–10 May 2012; pp. 2527–2530. [CrossRef]
- Wang, H.; Sidenmark, L.; Weidner, F.; Newn, J.; Gellersen, H. HeadShift: Head Pointing with Dynamic Control-Display Gain. ACM Trans. Comput.-Hum. Interact. 2025, 32(1), Article 2. [CrossRef]
- Voelker, S.; Hueber, S.; Corsten, C.; Remy, C. HeadReach: Using Head Tracking to Increase Reachability on Mobile Touch Devices. In Proceedings of the 2020 ACM SIGCHI Conference on Human Factors in Computing Systems (CHI ’20), Honolulu, HI, USA, 25–30 April 2020; pp. 739:1–739:12. [CrossRef]
- Nancel, M.; Chapuis, O.; Pietriga, E.; Yang, X.-D.; Irani, P.P.; Beaudouin-Lafon, M. High-Precision Pointing on Large Wall Displays Using Small Handheld Devices. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '13), Paris, France, 27 April–2 May 2013; pp. 831–840. [CrossRef]
- SAPI5 (Dragonfly). Available online: https://dragonfly2.readthedocs.io/en/latest/index.html (accessed on 15 July 2025).
- Radford, A.; Kim, J.W.; Xu, T.; Brockman, G.; McLeavey, C.; Sutskever, I. Robust Speech Recognition via Large-Scale Weak Supervision. In Proceedings of the 40th International Conference on Machine Learning (ICML 2023), Baltimore, MD, USA, 23–29 July 2023, pp. 28492–28518.
- Harada, S.; Wobbrock, J.O.; Landay, J.A. Voice Games: Investigation Into the Use of Non-speech Voice Input for Making Computer Games More Accessible. In Proceedings of the 13th IFIP TC 13 International Conference on Human-Computer Interaction (INTERACT 2011), Lisbon, Portugal, 5–9 September 2011; Lecture Notes in Computer Science, Vol. 6946, pp. 11–29. [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).