This is an implementation of the proposed algorithms in Piecewise-Stationary Multi-Objective Multi-Armed Bandit with Application to Joint Communications and Sensing and Adaptive Energy-Efficient Waveform Design For Joint Communication and Sensing using Multiobjective Multiarmed Bandits. These algorithms are UCB-based Multi-Objective Multi-Armed Bandit and can work in non-stationary environments.
# In the project root folder
pip install -r requirements.txt
The environment can be configured in the "config_synthetic.py" file of the root folder. You then can run the experiment and save the result by commands below:
# In the project root folder
python ./run_synthetic.py
python ./plot_performances.py
A plot of mean rewards over time.
A plot of cumulative pareto regrets over time (averaged over 100 runs each).