Prepare Learner Workers
Mandatory
FLOps’ main goal is to perform federated learning, i.e., to train a machine learning model on local data in a distributed and privacy-preserving way. Training an ML model requires compatible data. A worker node is required to aggregate matching data to be able to participate in training. By default, orchestrated nodes do not aggregate training data. Only worker nodes that have been prepared as described on this page can collect such data and become FL learners.
For FLOps to work as intended, you are required to prepare at least one of your orchestrated nodes, as described in this guide.
Optimize Learning
Training is the centerpiece of FLOps, not only conceptually but also computationally and runtime-wise. Select your learner nodes wisely. Prefer more powerful, resource-rich machines to speed up training times.
How do learners handle and store data for training in FLOps?
Explore how FLOps manages ML data for local training here
On the worker nodes where you wish to perform ML model training, do the following:
- Ensure the NodeEngine is running
sudo NodeEngine -a <cluster-address> -d && sudo NodeEngine status
- Activate the
FLOps-learner
addon for the NodeEngine:sudo NodeEngine config addon FLOps-learner on
- Restart the NodeEngine
- Either run
sudo NodeEngine stop
and then start it up again - Or run
sudo systemctl restart nodeengine.service
- Either run
- Verify that the addon is active:
> sudo NodeEngine config addon Configured Addons: - FLOps-learner: 🟢 Active