Skip to content

This repository contains the code for Diversity Control (DiCo), a novel method to constrain behavioral diversity in multi-agent reinforcement learning.

Notifications You must be signed in to change notification settings

proroklab/ControllingBehavioralDiversity

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Controlling Behavioral Diversity

drawing

This is the code accompanying the paper: "Controlling Behavioral Diversity in Multi-Agent Reinforcement Learning".

Video

Watch the presentation video of DiCo.

DiCo Video

Installing

  1. Create a virtual environment (e.g., conda) with python 3.9
  2. Install our versions of VMAS, TensorDict, TorchRL, and BenchMARL.
git clone -b het_control https://github.com/proroklab/VectorizedMultiAgentSimulator.git
pip install -e VectorizedMultiAgentSimulator

git clone -b het_control https://github.com/matteobettini/tensordict.git
cd tensordict
python setup.py develop
cd ..

git clone -b het_control https://github.com/matteobettini/rl.git
cd rl
python setup.py develop
cd ..

git clone -b het_control https://github.com/matteobettini/BenchMARL.git
pip install -e BenchMARL
  1. Install optional dependencies for logging
pip installl wandb moviepy
  1. Install this project via
git clone https://github.com/proroklab/ControllingBehavioralDiversity.git
pip install -e ControllingBehavioralDiversity
  1. Try running a script (it will ask for cuda and wandb, you can change these values in ControllingBehavioralDiversity/het_control/conf/experiment/het_control_experiment.yaml)
python ControllingBehavioralDiversity/het_control/run_scripts/run_navigation_ippo.py model.desired_snd=0.1

Running

The het_control/run_scripts folder contains the files to run the various experiments included in the paper.

To run an experiment just do

python run_navigation_ippo.py model.desired_snd=0.3

You can run the same experiment over multiple config values, such as desired diversity values and seeds

python run_navigation_ippo.py -m model.desired_snd=-1,0,0.3 seed=0,1,2

model.desired_snd=-1 instructs to run unconstrained heterogeneous policies

For more information on the running syntax and options, check out the BenchMARL guide.

Configuring

The configuration for the codebase is available in the het_control/conf folder.

This configuration overrides the default BenchMARL configuration, contained in benchmarl/conf. So, if a value is not present in the configuration of this project, it will take the default value from that folder.

At the top level of the het_control/conf folder, you can find a config file for each experiment. For example: navigation_ippo_config.yaml.

These files define:

  • an algorithm
  • a policy model (Which, in this repository, is always our proposed method)
  • a critic model
  • a task
  • an experiment hyperparameter configuration

The values of these fields determine which configuration to load for each of these components. The configurations are loaded from the sub-folders: het_control/conf/algorithm, het_control/conf/model, het_control/conf/task, het_control/conf/experiment. The top level experiment file sometimes overrides certain values of these components.

All the configurations attributes in these sub-folders are documented to explain their meaning. Docs on these values are also available in BenchMARL.

You can override any of these values from the command-line using the basic Hydra syntax. For example:

python run_navigation_ippo.py model.desired_snd=0.3 seed=1 experiment.max_n_frames=1_000_000 algorithm.lmbda=0.8

Citation

@inproceedings{bettini2024controlling,
    title={Controlling Behavioral Diversity in Multi-Agent Reinforcement Learning},
    author={Bettini, Matteo and Kortvelesy, Ryan and Prorok, Amanda},
    booktitle={Forty-first International Conference on Machine Learning},
    year={2024},
    url={https://openreview.net/forum?id=qQjUgItPq4}
}

About

This repository contains the code for Diversity Control (DiCo), a novel method to constrain behavioral diversity in multi-agent reinforcement learning.

Topics

Resources

Stars

Watchers

Forks

Languages