- These results were obtained with only 4 threads, so training is unstable.
- TensorFlow implementation
- A3C-style training, with each thread running its own environment
- PongDeterministic-v4 environment
- CPU-only training method
- Network protocol method
- Training on GPU, Inference on CPU
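
The A3C-style thread layout listed above can be sketched roughly as follows. This is a minimal illustration, not this repository's code: `SharedParams`, `ToyEnv`, `worker`, `run_training`, and `STEPS_PER_WORKER` are hypothetical names, the toy environment stands in for a per-thread `PongDeterministic-v4` instance, and random numbers stand in for real gradients. It also uses a lock around updates for simplicity, whereas A3C as usually described applies updates without locking.

```python
import random
import threading

NUM_WORKERS = 4          # matches the 4 threads mentioned in the list above
STEPS_PER_WORKER = 100   # hypothetical value, chosen only for illustration


class SharedParams:
    """Stand-in for the global network weights shared by all workers."""

    def __init__(self, size):
        self.weights = [0.0] * size
        self.updates = 0
        self.lock = threading.Lock()

    def apply_gradient(self, grad, lr=0.01):
        # Assumption: updates are serialized with a lock to keep the sketch simple.
        with self.lock:
            self.weights = [w - lr * g for w, g in zip(self.weights, grad)]
            self.updates += 1

    def snapshot(self):
        # Return a copy so a worker can act on a consistent local view.
        with self.lock:
            return list(self.weights)


class ToyEnv:
    """Hypothetical placeholder for one worker's own environment instance."""

    def __init__(self, seed):
        self.rng = random.Random(seed)

    def rollout_gradient(self, weights):
        # A real worker would run the policy and differentiate the A3C loss.
        return [self.rng.uniform(-1.0, 1.0) for _ in weights]


def worker(worker_id, shared):
    env = ToyEnv(seed=worker_id)            # each thread owns its environment
    for _ in range(STEPS_PER_WORKER):
        local_weights = shared.snapshot()   # sync local copy from global
        grad = env.rollout_gradient(local_weights)
        shared.apply_gradient(grad)         # push the update to global params


def run_training():
    shared = SharedParams(size=8)
    threads = [
        threading.Thread(target=worker, args=(i, shared))
        for i in range(NUM_WORKERS)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return shared


if __name__ == "__main__":
    result = run_training()
    print("total updates:", result.updates)
```

With few workers, the global parameters see updates from only a handful of decorrelated trajectories, which is one plausible reason a 4-thread run is less stable than a run with more threads.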