Rollout Problem #5

baitingzbt · 2023-06-30T08:26:02Z

When rolling out the policy here, the nested function policy(index) ALWAYS sees dreamer as None (i.e. it never reaches the else branch).

famishedrover · 2023-07-02T20:49:16Z

I think it's correct. You can set a breakpoint to be sure, but consider this minimal example:

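def fun1():
  dreamer = None  # state captured by the closure

  def fun2():
    nonlocal dreamer  # rebind the enclosing variable, not a new local
    if dreamer is None:
      print("dreamer was none")
      dreamer = 1
    else:
      print("dreamer working!")
  return fun2

myfunc = fun1()  # fun1 runs once, so every call shares one dreamer
myfunc()
myfunc()
myfunc()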

prints the following:

dreamer was none
dreamer working!
dreamer working!

which is as expected.

baitingzbt · 2023-07-03T08:48:45Z

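Hi, I think I see the problem now after more testing. The default config results in train_every = 0.76 < 1, which causes the trainer to create a new function object for rollout every step. Each new function object starts with its own dreamer = None, so the else branch is never reached.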
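To make the difference concrete, here is a minimal sketch that contrasts reusing one closure with re-creating it on every step. The names make_policy and run are hypothetical and only mirror the fun1/fun2 example above; this is not the repository's trainer code, and it only assumes that the rollout function is rebuilt each step as described.

```python
def make_policy():
    """Hypothetical stand-in for the rollout factory (not the repo's code)."""
    dreamer = None  # fresh state for every closure this factory creates

    def policy(index):
        nonlocal dreamer
        if dreamer is None:
            dreamer = index  # "initialise" on the first call of this closure
            return "init"
        return "reuse"

    return policy


def run(steps, recreate_every_step):
    """Call the policy `steps` times and record which branch each call took."""
    branches = []
    policy = make_policy()
    for step in range(steps):
        if recreate_every_step:
            # Assumed behaviour when the rollout function is rebuilt each step.
            policy = make_policy()
        branches.append(policy(step))
    return branches


recreated = run(3, recreate_every_step=True)
reused = run(3, recreate_every_step=False)
print(recreated)  # ['init', 'init', 'init']
print(reused)     # ['init', 'reuse', 'reuse']
```

If this matches what happens in the trainer, keeping a single rollout function alive across steps (or storing dreamer outside the closure) should let the else branch run.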
