Submitted:
22 May 2024
Posted:
23 May 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Works
3. FedOps Mobile Overview
- TensorFlow Lite
- CoreML
- Firebase
- Flutter
- Flower mobile Software Development Kit(SDK)
- Cross-Platform On-Device Training: By integrating TensorFlow Lite and CoreML, FedOps facilitates real-time model training directly on devices, thereby harnessing the native computational capabilities without compromising privacy.
- Real-Time Operational and User-Device Identification Control: FedOps utilizes a web platform to provide comprehensive management and monitoring capabilities, allowing for dynamic adjustments and optimizations across the federated network.
- Remote Client Selection and Training: The incorporation of Firebase enables effective background task management and user notifications, which are critical for maintaining engagement and operational efficiency in FL environments.
- Adaptive Client Selection: FedOps employs intelligent client-selection algorithms that evaluate the current state and capabilities of devices, optimize network resources, and ensure uniform model training contributions. Custom client selection methods or changes can be deployed on web platforms.
- Personalization and Privacy: The framework’s capability to personalize learning models using aggregated data from a user’s device, coupled with stringent privacy protocols, ensures that FedOps is not only effective but also trustworthy.

3.1. Platform Manager
- Manages device-specific functionalities and ensures seamless integration and operation of various system components.
- Facilitates communication with native platform capabilities through MethodChannel and EventChannel.
3.2. Network Manager
- Handles all network-related functionalities, such as registering the client with the FL server and managing network communication.
- Responsible for essential tasks such as registerClient() and getServerIp(), ensuring efficient connectivity with the server.
3.3. Native Platforms
- Implements machine-learning functions such as Fit(), Evaluate(), and Get weights() that are crucial for training and evaluating models on the client device.
- Utilizes TensorFlow Lite and CoreML to optimize machine learning computations on mobile devices.
- Employs gRPC communication for robust and efficient data exchange with the server.
3.4. Server Side (Microk8s Environment)
3.5. FL Server
- Central to the FL process is the aggregation of models and data analysis.
- Stores evaluation data and maintains the global model, updating it with insights derived from client contributions.
3.6. Server Manager
- Manages the overall server operations, including client performance monitoring and data management.
- Coordinates the exchange of data and model updates between the server and clients, thereby ensuring system integrity and performance.
4. Web Service
- Web service represents a component designed to facilitate additional interactions with web-based services or external systems.

4.1. Device Resource Reporting
4.2. Firebase Realtime Database
4.3. Firebase Cloud Functions and OnWrite Trigger
4.4. Training Notification
5. Experiments
5.1. Experiment 1: Cross-Device FL Efficiency and Scalability
- MNIST: This dataset was used to assess basic FL functionalities and initial model personalization.
- CIFAR-10: Provided complexity and diversity, testing the framework’s scalability and data handling capabilities.
- FEMNIST: Ideal for detailed cross-device FL experiments owing to the partitioning by a digit writer.
- Each dataset was distributed across a simulated network of devices to ensure a non-IID (independent and identically distributed) distribution that mimics real-world scenarios.
- FedOps was deployed to orchestrate model training by incorporating a client selection algorithm that dynamically chooses devices based on their current state (e.g., battery level, available memory, and network bandwidth).
- Training sessions were conducted to iteratively refine client selection based on ongoing device performance metrics and resource availability.
- The performance was compared against baseline FL models and centralized approaches using metrics such as accuracy, loss, and training time efficiency.
5.2. Experiment 2: Model Personalization and Heterogeneity Handling
- FEMNIST: Assesses personalization capabilities.
- SHL (Sussex-Huawei Locomotion): Tests the framework in wearable contexts using diverse human activity data.
- Personalized models were created for devices using the FEMNIST and SHL datasets, reflecting diverse real-world data conditions.
- Client selection was implemented to prioritize devices that provide diverse and representative data samples, thereby enhancing the robustness of the global model.
- The effectiveness of personalization was measured through local and global model accuracy comparisons, and FedOps was evaluated for adaptability to data source diversity through device-specific performance assessments.
5.3. Experiment 3: System Heterogeneity and Resource Optimization
- The utilization of the CIFAR-10 and SHL datasets caters to the testing of FedOps under different computational demands and real-world mobility scenarios.
- We simulated a range of devices with varying computational power and network conditions to reflect the broad spectrum of user equipment.
- Client selection was integrated to optimize resource allocation and prioritize devices based on computational efficiency and network stability.
- We applied model compression techniques and adaptive communication protocols tailored to the capabilities of the selected devices.
- The impact of these optimizations on the training duration, model accuracy, and communication overhead was evaluated and compared with traditional FL setups.
5.4. Data Privacy and Security Evaluation
- The setup of each experiment, including device distribution, FL parameters, and specific FedOps configurations, was meticulously documented.
- Multiple iterations were conducted to ascertain the statistical significance, employing metrics suitable for each objective, such as model accuracy, precision, and recall, along with efficiency indicators, such as training duration and data consumption.
- Ablation studies were conducted to determine the impact of distinct FedOps components on the overall performance of the framework.
5.5. Device Configuration for Experiments Overview
- Lab Devices: Two real mobile devices available in our laboratory, specifically tablets, were used because of their enhanced screen size and processing capabilities suitable for intensive FL tasks.
- Emulators: Two emulators were configured to represent average consumer smartphones, creating a controlled environment for testing and debugging specific scenarios.
- AWS Device Farm: Twenty-one devices were accessed remotely via the AWS Device Farm, which provides a broad range of mobile devices with various operating systems, screen sizes, and hardware capabilities. This selection included both high-end smartphones and mid-range models to cover a wide spectrum of device capabilities.
|
Experiment |
Device Numbers |
Training Rounds, Epoch | Training Time (Overall, Avg Round) |
Energy Consump- tion (% of Max Battery) |
Accuracy of Global Model | Best Accuracy of Devices |
|---|---|---|---|---|---|---|
| Efficiency, Scalability, and Client Selection were Objectives. Model Personalization and Heterogeneity Handling System. Heterogeneity and Resource Optimization. |
25 25 25 |
30, 5 30, 5 30, 5 |
2 h, 5 min 1.5 h, 6 min 2.5 h, 7 min |
20% 16% 22% |
60% 54% 67% |
85% 61% 58% |
6. Conclusion
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Moon, J. , Yang, S., & Lee, K. (2024). FedOps: A Platform of Federated Learning Operations with Heterogeneity Management. IEEE Access. [CrossRef]
- S. Yang, J. Moon, J. Kim, K. Lee, and K. Lee, "FLScalize: Federated Learning Lifecycle Management Platform," IEEE Access, vol. 11, pp. 47212-47222, 2023. [CrossRef]
- Chen, Y.; Qin, X.; Wang, J.; Yu, C.; Gao, W. Fedhealth: A federated transfer learning framework for wearable healthcare. IEEE Intell. Syst. 2020, 35, 83–93. [Google Scholar] [CrossRef]
- Kairouz, P.; McMahan, H.B.; Avent, B.; Bellet, A.; Bennis, M. , Bhagoji, A.N.; Bonawitz, K.; Charles, Z.; Cormode, G.; Cummings, R.; et al. Advances and open problems in federated learning. Foundations and trends® in machine learning 2021, 14(1–2), 1–210.
- McMahan, B.; Moore, E.; Ramage, D.; Hampson, S.; y Arcas, B.A. Communication-efficient learning of deep networks from decentralized data. Artificial intelligence and statistics. In PMLR, 2017, 1273–1282.
- Koneˇcny`, J.; McMahan, H.B.; Ramage, D.; Richtárik, P. Federated optimization: Distributed machine learning for on-device intelligence. arXiv preprint . arXiv:1610.02527, 2016.
- Moriai, S. Privacy-preserving deep learning via additively homomorphic encryption. In 26th Symposium on Computer Arithmetic (ARITH); IEEE Publications; IEEE Publications, 2019; pp. 198–198. [CrossRef]
- Shayan, M.; Fung, C.; Yoon, C.J.M.; Beschastnikh, I. Biscotti: A ledger for private and secure peer-to-peer machine learning. arXiv Preprint ArXiv:1811.09904, 2018.
- Hard, A.; Rao, K.; Mathews, R.; Ramaswamy, S.; Beaufays, F.; Augenstein, S.; Eichner, H.; Kiddon, C.; Ramage, D. Federated learning for mobile keyboard prediction. arXiv preprint. arXiv:1811.03604, 2018.
- Abhishek, A.; Binny, S.; Johan, R.; Nithin, R.; Vishal, T. Federated learning: Collaborative machine learning without centralized training data. Int. J. Eng. Technol. Manag. Sci. 2022, 6, 355–359. [Google Scholar] [CrossRef]
- Blanchard, P.; Mahdi El Mhamdi, E.; Guerraoui, R.; Stainer, J. Machine learning with adversaries: Byzantine tolerant gradient descent. Adv. Neural Form. Process. Syst. 2017, 30. [Google Scholar]
- Dayan, I.; Roth, H.R.; Zhong, A.; Harouni, A.; Gentili, A.; Abidin, A.Z.; Liu, A.; Costa, A.B.; Wood, B.J.; Tsai, C.-S.; et al. Federated learning for predicting clinical outcomes in patients with Covid-19. Nat. Med. 2021, 27, 1735–1743. [Google Scholar] [CrossRef] [PubMed]
- Reddi, S.; Charles, Z.; Zaheer, M.; Garrett, Z.; Rush, K.; Koneˇcny`, J.; Kumar, S.; McMahan, H.B. Adaptive federated optimization. arXiv Preprint ArXiv:2003.00295, 2020.
- Cho, H.; Mathur, A.; Kawsar, F. Flame: Federated learning across multi-device environments. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 2022, 6, 1–29. [Google Scholar] [CrossRef]
- Bonawitz, K.; Eichner, H.; Grieskamp, W.; Huba, D.; Ingerman, A.; Ivanov, V.; Kiddon, C. Jakub Koneˇcny`, Stefano Mazzocchi, Brendan McMahan, et al. Towards federated learning at scale: System design. Proceedings of the Machine Learning and Systems 2019, 1, 374–388. [Google Scholar]
| Feature | FedOpsPS Mobile | FedML | Flower | PySyft |
|---|---|---|---|---|
| Cross-platform | ✓ | ✓ | ✓ | ✓ |
| Remote Device Management | ✓ | x | x | x |
| Energy optimization | ✓ | ◯ | x | ◯ |
| Supported Algorithms | FedAvg, FedProx, FedYogi | FedAvg, FedProx, personalized FL | FedAvg, FedProx, FedYogi | FedAvg, Secure Aggregation |
| Client Selection | ✓ | ✓ | x | x |
| Background task managment | ✓ | ✓ | x | x |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).