Releases: JMGaljaard/fltk-testbed
v0.3.1
In short
This release updates the code base such that:
- Documentation is updated to show how Docker images can be built and pushed.
- Memory issues are resolved/mitigated for FederatedLearning experiments.
- BatchOrchestrator has been tested and updated according to the findings from debugging.
- Supports waiting for `HistoricalArrivalTask`s.
- Supports stop-and-go deployment through parallelism configuration parameters.
What's Changed
- Demo by @JMGaljaard in #46
- 43 experiment replication batch by @JMGaljaard in #47
- Demo by @JMGaljaard in #48
- Develop by @JMGaljaard in #56
Full Changelog: v0.3.0...v0.3.1
Terraform and repetitions
In short
This release revamps the deployment on GKE with Terraform, making deployment a breeze. Furthermore, the dependency list is slimmed down from full Kubeflow to only the Kubeflow-training-operators. This alleviates the overhead on your cluster, as, for example, Istio is no longer required for deployment.
For experiments, the orchestrator now allows running repetitions of experiments directly. This lets you describe an experiment file once (e.g. a distributed learning configuration) and run it multiple times in a single deployment.
What's Changed
- Terraform deployment by @JMGaljaard in #44
- 43 experiment replication by @JMGaljaard in #45
Full Changelog: v0.2.2...v0.3.0
Federated num_epochs_per_round
Important information
In the configuration files (e.g. `configs/federated/*.yaml`) for Federated Learning, make sure that you use the following parameters as follows:
- `totalEpochs`: you MUSTN'T use this parameter; it will be removed after this year's course has finished. It was also not used before this release, but keep this in mind. A warning will be logged during execution if you do access it.
- `rounds`: the number of communication rounds, i.e. how many times the `Federator` node will sample and contact `Client` nodes.
- `epochsPerRound`: the number of epochs the `Client`s perform within a communication round.
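The parameters above might be combined as in the following illustrative fragment (values and surrounding keys are made up for this sketch; refer to the actual files under `configs/federated/` for a complete configuration):

```yaml
# Illustrative fragment only -- not a complete experiment configuration.
rounds: 50          # communication rounds driven by the Federator
epochsPerRound: 1   # local epochs each Client performs per round
# totalEpochs: ...  # deprecated -- do not set; accessing it logs a warning
```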
What's Changed
- Resolve federated learning rounds per epoch by @JMGaljaard in #41. Thanks @AbeleMM for spotting this error.
Full Changelog: v0.2.1...v0.2.2
Experiment parsing and testing
This release introduces the following changes:
- All `federated` and `data_parallel` experiment parsers now load all their respective parameters.
- The loss function can now be any of the functions that inherit from the `torch.nn.modules.loss._Loss` base class, referenced by their CamelCase name.
- A series of test cases was added to test the parsing of default values against configured values.
Issues closed
This version resolves the following issues. For more information read the changelog above.
Kubernetes FLTK
This release introduces the following changes:
- FederatedLearning datasets other than `FashionMNIST` failed to load without raising an `Exception`, thereby resulting in a failed execution of experiments. This required a revision of the experiment configuration objects that were used by the learners; see also point 4.
  - To help keep issues like these from being introduced undetected, a series of smoke tests was introduced that can be run locally.
- Distributed (`DistributedDataParallel`) experiments are now compatible again with KFLTK 🎉. See also #26.
- Configuration files with small floating-point numbers without a decimal (such as `10e-5`) will no longer be parsed as a string by `FedLearningConfig` and `DistLearningConfig` objects. This was mainly an issue for the `min_lr` configuration parameter, which would result in an exception being raised after 50 epochs.
- The (flat) configuration objects for `DistributedDataParallel` and `Federated` learning experiments have been partially unified. These objects are now renamed to `DistLearningConfig` and `FedLearningConfig` respectively. ⚠️ Make sure to update your imports.
  - Moreover, both classes can now be parsed directly from a `json` file. Note that the `FromYamlFile` function has been renamed to the more Pythonic name `from_yaml`.
  - Both objects now make use of the `Definitions` `Enum` classes, allowing for more readable parsing errors when an incorrect type is provided.
- A typo has been resolved in the Jinja templates such that the `aggregation` is now properly set.
- Datasets for Federated experiments now make use of the `test_batch_size` parameter, rather than a default of `16`. This parameter has also been introduced in the Jinja template.
- The `fltk.util.env` module has been renamed to `fltk.util.environment` to allow it to be added to the repository. This also resolves #30.
- A series of linter issues has been resolved (mostly warnings and some typos).
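The `10e-5` pitfall mentioned above can be reproduced in isolation. YAML 1.1 (which PyYAML follows) requires a decimal point in a float literal, so `10e-5` is loaded as a string; a loader must coerce it explicitly. The sketch below illustrates the problem and one possible coercion; the helper `coerce_float` is illustrative, not the actual fltk code.

```python
import yaml

# YAML 1.1 float syntax requires a '.', so `10e-5` resolves as a string.
doc = yaml.safe_load("min_lr: 10e-5")
assert isinstance(doc["min_lr"], str)


def coerce_float(value):
    """Illustrative fix: coerce string-typed numerics back to float."""
    return float(value) if isinstance(value, str) else value


min_lr = coerce_float(doc["min_lr"])
assert isinstance(min_lr, float)
```

Writing `1.0e-4` (with a decimal point) in the configuration file sidesteps the issue entirely.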
Issues closed
This version resolves the following issues. For more information read the changelog above.