Skip to content
This repository has been archived by the owner on Jul 16, 2024. It is now read-only.

All workers' actions are "idle". Does it make sense? #159

Open
XinWang0219 opened this issue Feb 8, 2023 · 2 comments
Open

All workers' actions are "idle". Does it make sense? #159

XinWang0219 opened this issue Feb 8, 2023 · 2 comments
Assignees
Labels
bug Something isn't working

Comments

@XinWang0219
Copy link

Describe the bug
During training, all workers' actions are "idle". Does it make sense? Or do I do something wrong in the steps before?

To Reproduce
Steps to reproduce the behavior:

  1. After run "poker_ai cluster", get two files "card_info_lut.joblib" and "centroids.joblib".
  2. I create directory "./research/blueprint_algo" and put these files in it.
  3. Then run "poker_ai train start --lut_path ./research/blueprint_algo --sync_cfr --sync_discount --sync_serialise"

Expected behavior
I don't know whether these 'Woker-i': 'idle' are what they should be. But while I run "poker_ai play" and get "UnboundLocalError: local variable 'action' referenced before assignment", I think it may be because 'idle' is not in the expected choices of actions.

Screenshots
[23:12:01] INFO Sending synchronised cfr to workers server.py:196
INFO {'Worker-3': 'idle', 'Worker-4': 'idle', 'Worker-5': 'idle', 'Worker-6': 'idle', server.py:197
'Worker-7': 'idle', 'Worker-8': 'idle', 'Worker-9': 'idle'}
INFO Worker-7 updating status to cfr server.py:97
INFO Worker-7 updating status to idle server.py:97
INFO Worker-8 updating status to cfr server.py:97
INFO Worker-8 updating status to idle server.py:97
INFO Worker-9 updating status to cfr server.py:97
INFO Worker-9 updating status to idle server.py:97
INFO Sending synchronised cfr to workers server.py:196
INFO {'Worker-3': 'idle', 'Worker-4': 'idle', 'Worker-5': 'idle', 'Worker-6': 'idle', server.py:197
'Worker-7': 'idle', 'Worker-8': 'idle', 'Worker-9': 'idle'}
[23:12:03] INFO Sending synchronised cfr to workers server.py:196
INFO {'Worker-3': 'idle', 'Worker-4': 'idle', 'Worker-5': 'idle', 'Worker-6': 'idle', server.py:197
'Worker-7': 'idle', 'Worker-8': 'idle', 'Worker-9': 'idle'}
[23:12:04] INFO Sending synchronised cfr to workers server.py:196
INFO {'Worker-3': 'idle', 'Worker-4': 'idle', 'Worker-5': 'idle', 'Worker-6': 'idle', server.py:197
'Worker-7': 'idle', 'Worker-8': 'idle', 'Worker-9': 'idle'}
[23:12:05] INFO Worker-3 updating status to cfr server.py:97
INFO Worker-3 updating status to idle server.py:97
INFO Worker-4 updating status to cfr server.py:97
INFO Worker-4 updating status to idle server.py:97
INFO Worker-5 updating status to cfr server.py:97
INFO Worker-5 updating status to idle server.py:97
INFO {'Worker-6': 'update_strategy'} server.py:251
INFO Sending synchronised cfr to workers server.py:196
INFO {'Worker-3': 'idle', 'Worker-4': 'idle', 'Worker-5': 'idle', 'Worker-6': 'idle', server.py:197
'Worker-7': 'idle', 'Worker-8': 'idle', 'Worker-9': 'idle'}
[23:12:08] INFO {'Worker-8': 'update_strategy'} server.py:251
INFO Sending synchronised cfr to workers server.py:196
INFO {'Worker-3': 'idle', 'Worker-4': 'idle', 'Worker-5': 'idle', 'Worker-6': 'idle', server.py:197
'Worker-7': 'idle', 'Worker-8': 'idle', 'Worker-9': 'idle'}
[23:12:10] INFO {'Worker-3': 'update_strategy'} server.py:251
INFO Sending synchronised cfr to workers server.py:196
INFO {'Worker-3': 'idle', 'Worker-4': 'idle', 'Worker-5': 'idle', 'Worker-6': 'idle', server.py:197
'Worker-7': 'idle', 'Worker-8': 'idle', 'Worker-9': 'idle'}
INFO Sending synchronised serialise to workers server.py:196
INFO {'Worker-3': 'idle', 'Worker-4': 'idle', 'Worker-5': 'idle', 'Worker-6': 'idle', server.py:197
'Worker-7': 'idle', 'Worker-8': 'idle', 'Worker-9': 'idle'}

Optimisation iterations 83%|████████████████████████████████████████████▋ | 1239/1500 [1h 23:36<17:37, 0.25 iter/s]

Desktop (please complete the following information):

  • Windows
  • wsl2
  • conda version: 4.12.0
  • Python version: 3.7
@XinWang0219 XinWang0219 added the bug Something isn't working label Feb 8, 2023
@c976237222
Copy link

请问这个问题您解决了吗 我也是都是idle
Have you solved this problem? I am also idle

@380799631
Copy link

这个空闲应该不是不做事情,而是为了同步多个进程的资源。
用确保所有的工作进程(workers)在开始同步任务之前都处于闲置状态。这样做是为了避免在同步任务执行期间发生资源竞争或数据冲突。

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

4 participants