v0.1.1
Better training, Bug fixes, and more 🚀
- Better automatic training with DDP #87
- Bug Fix
RL4COTrainer
- Avoid broadcasting error warning in critic baselines
- Fix rollout baseline bug
- New experiment config structure: interpolate with environment name (we won't need anymore to have separate folders for each environment name such as TSP, CVRP etc, simply use one config to rule them all!