Submitted:
25 March 2026
Posted:
26 March 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Facial Emotion Recognition: From Classic to Deep and Mobile Models
2.1. Early FER Using Handcrafted Features
2.2. Deep Learning for FER
2.3. Mobile and Real-Time FER Systems
3. Federated Learning: Foundations and Vision Applications
3.1. Basic Federated Learning Paradigm
3.2. FL in Computer Vision and Medical Imaging
3.3. FL for Affective Computing and Emotion Recognition
4. Label Noise and Robust Federated Learning
4.1. Impact of Label Noise in FL

4.2. Robust FL Frameworks for Noisy Labels
4.3. Client-Side Data Quality Signals for FL
5. Crowdsourcing, Data Labeling, and Quality Assurance
5.1. Crowdsourced Annotation Pipelines

5.2. Multi-Stage Quality Control Frameworks
5.3. AI-Assisted Labeling and Just-in-Time Interventions
6. Mobile and Edge AI: On-Device Inference and Learning
6.1. On-Device Inference Frameworks
6.2. Edge FL Architectures
6.3. Security, Privacy, and Leakage Risks
7. Comparative Analysis: Past, Present, and Future
7.1. Methodological Evolution in FER
7.2. Deployment and System Architecture Evolution

7.3. Data Quality and Label-Noise Handling
7.4. Human-in-the-Loop and User Experience
7.5. Cross-Domain Comparison
8. Future Directions and Open Challenges
8.1. Formal Models of Client-Side Validation in FL
8.2. Human–AI Interaction and UX for FL Data Collection
8.3. Multimodal and Context-Aware Validation
8.4. Privacy, Security, and Fairness Considerations
8.5. From Simulation to Real-World Deployments
9. Federated Learning Frameworks and Deployment Choices
9.1. Categories of Federated Learning Tooling
- On-device ML runtimes, which execute models (and sometimes lightweight updates) locally on mobile or embedded hardware (e.g., TensorFlow Lite and companion libraries).[19]
9.2. Flower: Framework-Agnostic Orchestration
- It is easy to integrate with common deep-learning stacks (e.g., PyTorch-based FER models), allowing rapid prototyping of FL variants without rewriting models.[20]
- It scales from small simulations on a single machine to large-scale, distributed experiments with thousands of clients, which is useful when emulating mobile populations.[20]
- It exposes strategy hooks where researchers can plug in noise-aware aggregation, client selection, or custom validation logic, aligning well with robust FL and client-side validation experiments.
9.3. TensorFlow Federated: TensorFlow-Native FL
- The FER models and preprocessing are already implemented in TensorFlow, and one wants to minimize stack fragmentation.[22]
- The focus is on algorithm research and simulation rather than immediate deployment, since TFF currently targets simulation and specialized runtimes more than full end-user mobile apps.
- Formal reasoning about federated computations is important, as TFF’s strongly-typed federated core makes communication patterns explicit.[21]
9.4. FedML and PySyft: Research Ecosystems and Privacy Enhancements
- One needs standardized benchmarks and datasets to compare novel robust-FL or validation algorithms against published baselines.[23]
- Experiments span multiple domains (e.g., combining image-based FER with speech or physiological signals) and require consistent tooling.
- There is interest in moving from small-scale prototypes to more realistic, distributed evaluations with varied network and device conditions.[23]
- The primary research question concerns strong privacy guarantees (e.g., exploring combinations of FL, differential privacy, and secure aggregation for sensitive emotion data).[24]
- One wants to prototype secure computation patterns (such as additive secret sharing) on top of FL, not just standard FedAvg-like training.[25]
9.5. TensorFlow Lite: On-Device Inference and Lightweight Training
- It is primarily an on-device runtime for inference (and limited on-device training), not an orchestration framework.[19]
- It is well suited for deploying FER and validation models inside a mobile app, supporting real-time analysis of camera frames and just-in-time feedback to the user.[26]
- Experimental on-device training support makes it possible to perform small local adaptation steps (e.g., personalization of FER models) before sending updates back to an FL server.
9.6. Qualitative Comparison
9.7. Which Framework for Which Situation?
- Course projects and fast experimentation: Flower is often the most convenient choice when FER models are implemented in PyTorch or mixed frameworks, because its APIs are simple and it does not impose a fixed model representation.[20] It is therefore ideal for demonstrating robust FL under label noise or for prototyping validation strategies using simulated clients.
- TensorFlow-centric research: When the end-to-end pipeline (training, preprocessing, and deployment) is already in TensorFlow, TFF provides a coherent way to express and analyze federated computations without leaving the ecosystem.[21,22] This is attractive for groups that already use TensorFlow for FER and plan to deploy with TFLite.
- Large-scale algorithm benchmarks: FedML is well suited for systematic comparisons across many FL algorithms, datasets, and tasks.[23] It is a good fit when the goal is to situate a new robust-FL or validation method in a broader research landscape rather than to build a single integrated application.
- Production mobile apps: TFLite is the standard choice for running FER and validation models on-device in Android/iOS applications.[19,26] In this case, the FL orchestration layer (Flower, TFF, or FedML) runs on the server side, while TFLite handles local inference and any lightweight adaptation steps.
10. Conclusions
References
- S. Li and W. Deng, “Deep facial expression recognition: A survey,” IEEE Transactions on Affective Computing, 2020.
- M. Sajjad et al., “A comprehensive survey on deep facial expression recognition: challenges, applications, and future guidelines,” Alexandria Engineering Journal, vol. 64, 2023.
- M. Suk and B. Prabhakaran, “Real-time mobile facial expression recognition system,” in Proc. IEEE/CVF Conf. on Computer Vision and Pattern Recognition Workshops (CVPRW), 2014.
- H. Li et al., “Real-time facial affective computing on mobile devices,” Sensors, vol. 20, no. 3, 2020.
- H. Guan et al., “Federated learning for medical image analysis: A survey,” Medical Image Analysis, 2024.
- S. Latif et al., “Federated learning for speech emotion recognition,” in Proc. ACM/IEEE CPS-IoT Week Workshop, 2020.
- Y. Authors, “AFLEMP: Attention-based federated learning for emotion recognition using multi-modal physiological data,” Biomedical Signal Processing and Control, 2024.
- A. N. Salman and C. Busso, “Privacy preserving personalization for video facial expression recognition using federated learning,” in Proc. ACM Conf. on Multimodal Interaction, 2022.
- K. Authors, “Development of emotion recognition via federated learning,” in Proc. IEEE Conf., 2024, IEEE Xplore Doc. 10829216.
- N. Simic et al., “Enhancing emotion recognition through federated learning: A multimodal approach with convolutional neural networks,” 2024.
- A. Authors, “Micro expression detection using federated learning,” International Journal of Computer Science Trends and Technology, vol. 12, no. 2, 2024.
- X. Fang and M. Ye, “Robust federated learning with noisy and heterogeneous clients,” in Proc. IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2022.
- N. Wu et al., “FedNoRo: Towards noise-robust federated learning by addressing class imbalance and label noise heterogeneity,” in Proc. Int. Joint Conf. on Artificial Intelligence (IJCAI), 2023.
- B. Frénay and M. Verleysen, “Classification in the presence of label noise: A survey,” IEEE Transactions on Neural Networks and Learning Systems, vol. 25, no. 5, pp. 845–869, 2014.
- M. Authors, “Imbalanced classification with label noise: A systematic review and comparative analysis,” ICT Express, 2025.
- Y. Zhang et al., “Learning from crowdsourced labeled data: A survey,” Artificial Intelligence Review, vol. 46, no. 4, pp. 543–576, 2016.
- J. Zhang et al., “Improving label quality in crowdsourcing using noise correction,” in Proc. ACM Int. Conf. on Information and Knowledge Management (CIKM), 2015.
- Z. Zhu et al., “LabelAId: Just-in-time AI interventions for improving human labeling quality and domain knowledge in crowdsourcing systems,” in Proc. CHI Conf. on Human Factors in Computing Systems, ACM, 2024.
- Google AI Edge, “Image classification with TensorFlow Lite Model Maker,” 2024. [Online]. Available: https://ai.google.dev/edge/lite/libraries/modify/image_classification.
- Flower, “What is Federated Learning? Flower framework tutorial series,” 2024. [Online]. Available: https://flower.ai/docs/framework/tutorial-series-what-is-federated-learning.html.
- TensorFlow, “TensorFlow Federated: an open-source framework for machine learning and other computations on decentralized data,” 2021. [Online]. Available: https://www.tensorflow.org/federated.
- TensorFlow, “Federated learning with TensorFlow Federated (TFF),” tutorial, 2024. [Online]. Available: https://www.geeksforgeeks.org/deep-learning/federated-learning-with-tensorflow-federated/.
- C. He et al., “FedML: A research library and benchmark for federated learning,” arXiv preprint arXiv:2007.13518, 2020.
- OpenMined, “Federated learning with PySyft: privacy-preserving AI models,” 2026. [Online]. Available: https://www.nivalabs.ai/blogs/federated-learning-with-pysyft-privacy-preserving-ai-models.
- A. Rieke, “Federated learning in 10 lines of code with PySyft,” OpenMined blog, 2025. [Online]. Available: https://openmined.org/blog/fl-in-10-lines-of-code-with-pysyft/.
- Available online: https://www.topsinfosolutions.com/blog/tensorflow-lite-on-device-ml-use-cases/.
- GeeksforGeeks, “Introduction to TensorFlow Lite,” 2021. [Online]. Available: https://www.geeksforgeeks.org/deep-learning/introduction-to-tensorflow-lite/.
- V. Fomenko, “Young AffectNet HQ,” Kaggle, 2022. [Online]. Available: https://www.kaggle.com/datasets/vfomenko/young-affectnet-hq.
- L.B. Krithika, S. Kshitish, and M.R. Kishore, “Automated facial attendance logger for students," IOP Conference Series: Materials Science and Engineering, vol. 263, no. 4, p. 042022, 2017. [Online]. Available: https://iopscience.iop.org/article/10.1088/1757-899X/263/4/042022.
- L.B. Krithika, K. Venkatesh, S. Rathore, and M. Harish Kumar, “Facial recognition in education system," IOP Conference Series: Materials Science and Engineering, vol. 263, no. 4, p. 042021, 2017. [Online]. Available: https://iopscience.iop.org/article/10.1088/1757-899X/263/4/042021.
- L.B. Krithika and G.G. Lakshmi Priya, “Student emotion recognition system (SERS) for e-learning improvement based on learner concentration metric," Procedia Computer Science, vol. 85, pp. 767–776, 2016. [Online]. Available. [CrossRef]

| Axis | Past (pre-deep / centralized) | Present (deep FER + early FL) | Future (client-side validation in FL) |
|---|---|---|---|
| FER methodology | Handcrafted features + SVM/HMM on small, controlled datasets; limited robustness.[1] | Deep CNNs, attention and hybrid models on large in-the-wild datasets; edge-optimized backbones.[1,2,4] | Multimodal, uncertainty-aware models tightly coupled with on-device validators and active learning. |
| Deployment architecture | Centralized training and inference on servers; raw images often uploaded.[1] | On-device inference with mobile CNNs; FL for training in select domains (medical imaging, SER, FER prototypes).[3,5,6,9] | Hierarchical edge/cloud FL with local validation pipelines and selective, privacy-preserving reporting of quality signals. |
| Data-quality handling | Assumed expert-labeled, mostly clean datasets; ad-hoc manual cleaning.[1] | Robust centralized training; robust FL algorithms (RHFL, FedNoRo) mitigate noisy clients during training.[12,13,14] | Client-side validation filters and corrects labels pre-training, combined with robust FL and secure aggregation of quality metrics.[15,16,18] |
| Human-in-the-loop | Limited to offline dataset creation by experts or crowd workers; end users rarely involved.[16] | Crowdsourcing platforms with multi-stage quality-control and occasional AI assistance.[17,18] | End users participate in ongoing labeling/validation via unobtrusive prompts; UX and fairness central to system design. |
| Tool | Primary role | Framework support | Strengths for FER/validation | Typical usage scenario |
|---|---|---|---|---|
| Flower[20] | Server-side FL orchestration | PyTorch, TensorFlow, JAX (framework-agnostic) | Easy to wrap existing training loops; flexible strategy hooks for robust FL and client selection | Rapid FL prototyping and research with existing Python FER models; small to large-scale simulations. |
| TensorFlow Federated[21,22] | Server-side FL orchestration and simulation | TensorFlow/Keras-native | Strong integration with TF; expressive federated core for custom algorithms | Algorithm research when models are in TensorFlow and experiments run in simulated environments. |
| FedML[23] | Research library and benchmarks for FL | Multiple backends; focus on CV, NLP, graphs, IoT | Standardized datasets and baselines; support for cross-device and cross-silo FL | Comparative studies of FL algorithms across tasks, including FER-related vision workloads. |
| PySyft[24,25] | Privacy-preserving FL and secure computation | Extends PyTorch/TF with privacy primitives | Differential privacy, secure aggregation, and remote execution for sensitive data | Experiments where strong privacy guarantees for emotion data outweigh ease of deployment. |
| TensorFlow Lite[19,26] | On-device inference (and light training) runtime | Runs converted TensorFlow models on mobile/edge | Low-latency FER inference and validation on phones; offline operation; supports UI integration | Production mobile apps performing FER and client-side validation; combined with a server-side FL framework. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.