-
Experiment with the simple naïve REINFORCE algorithm
-
Run and compare with industry gold-standard OpenAI's Stable-Baseline-3 implementations of DQN, A2C and PPO
-
4 main phases of development and experiments
- Simple environment with a single env. variable > the Tool wear (mm)
- A simulated Dasic (2006) milling tool wear model
- PHM 2010 prognostics data
- Complex environment with features from the PHM 2010 data-set
-
Metrics
- Test on another laptop
- Clone git repo
- Run test on models and publish results