Replies: 1 comment
-
Hi @zoctipus Your proposal makes perfect sense. If you feel like it, you can open a PR. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi thanks for such a great repository,
I have a proposal for interfacing algorithms with a generic
sample()
methodsome algorithms like ppo, rpo, a2c, trpo, uses
memory.sample_all()
withmini_batch
as argumentssome other algorithms like sac, td3. dqn, ddpg, uses
memory.sample()
withbatch_size
as argumentsif I want to incorporates some new loss, new procedure that requires calculating memory samples in my training application, and wants to test it against all available algorithms (this very reason why I like skrl), it will be very flexible if I can just write a patch that modifies
post_interation()
to Agent class, so that all algorithms all inherits this new features. While solution like #171 will work, but that requires adding new loss term in each possible algorithms.but that requires a uniform sample method across all specific algorithms, if some require
sample_all
withmini_batch
variable and others requires sample with batch variable it is not as so easy to work with.I think this suggestion can greatly enhance workflow that requires making edit your code base with creative algorithm designing, testing efficiently with existing algorithm.
Let me know what is your thought on this! If you like the idea, I can take extra caution when writing the code and open a pull request to help!
Thanks for your effort!
Beta Was this translation helpful? Give feedback.
All reactions