Flax #41

m-wojnar · 2024-02-05T11:40:40Z

Migrate from haiku to flax
Fix a period of incorrect updates between the start of training and the buffer filling
Code refactor
Rename agents.deep.QLearning to DQN and agents.deep.DQN to DDQN
Move particle filter from agents to utils (it is not an agent, rather a core for some algorithms)

Wotaker

Okey, czyli widzę że rozwiązałęś sprawę z cloudpickle przez notkę w dokumentacji. Spoko, popieram.

Druga sprawa to widziałem że jednak się zdecydowałeś na aktywację tangensem hip. i zaznaczyłeś komentarzem że to się różni od oryginału. Myżlę że tak jest najlepiej. A jak to wpłynęło na wyniki?

m-wojnar · 2024-02-09T19:25:59Z

Druga sprawa to widziałem że jednak się zdecydowałeś na aktywację tangensem hip. i zaznaczyłeś komentarzem że to się różni od oryginału. Myżlę że tak jest najlepiej. A jak to wpłynęło na wyniki?

Tak, zdecydowałem się na tanh, bo dzięki temu DDPG działa za każdym razem (nie występuje ten problem z "uciekaniem" wyjścia sieci poza zakres [0, 6]). Wyniki są dobre (prawie takie same, jak poprzednio), więc chyba ostatecznie jest sukces i dlatego zostawiłem tak 😉

…mple

m-wojnar added 4 commits February 3, 2024 17:54

Update agent names to match literature

9b9915e

Code refactor and fix update with empty buffer

980e63c

Move particle filter to utils

0af3e62

Migrate from haiku to flax

2e744ad

m-wojnar requested a review from Wotaker February 5, 2024 11:40

m-wojnar added 9 commits February 5, 2024 12:47

Fix docs build

6417c55

Remove reference to actor and critic from DDPG

99f4196

Update neural networks definition in CCOD example

30af5c9

Fix checkpoint creation in CCOD example

79a2231

Fix a testing script in CCOD example

af12e77

Add custom PRNG key stream

d0063ca

Workaround for serialization of complex modules

585cf91

Fix DDPG behavior in CCOD example

ae0e9eb

Add scripts to preprocess output files in CCOD example

8f4a629

Wotaker approved these changes Feb 9, 2024

View reviewed changes

Wotaker and others added 2 commits February 9, 2024 20:16

Merge branch 'main' into flax

30bae91

fix comma in pyproject.toml

2858cbe

Add a script to run training and evaluation of all agents in CCOD exa…

6b4e8cf

…mple

m-wojnar merged commit 4e33f09 into main Feb 9, 2024
5 checks passed

m-wojnar deleted the flax branch February 9, 2024 19:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Flax #41

Flax #41

m-wojnar commented Feb 5, 2024 •

edited

Loading

Wotaker left a comment

m-wojnar commented Feb 9, 2024

Flax #41

Flax #41

Conversation

m-wojnar commented Feb 5, 2024 • edited Loading

Wotaker left a comment

Choose a reason for hiding this comment

m-wojnar commented Feb 9, 2024

m-wojnar commented Feb 5, 2024 •

edited

Loading