Releases: google-deepmind/acme
v.0.4.0
v0.3.0
0.2.4
0.2.3
0.2.2
Highlights
- Minor release that updates the dependencies to new stable versions of TensorFlow and Reverb (14cb62f).
- New SVG-0 agent (1ecabdb).
Minor changes and fixes
- Fixes to docs and testing.
- Minor updates to tf-DQN.
- Updates to LP tear down.
- Update to how signals are handled.
- Update to setup/requirements.
- Reorganize Builder and integrate jax-specific functionality.
- Improvements to typing.
0.2.1
Highlights
- Released distributed versions of some agents (3bc0426).
- Adders use Reverb's TrajectoryWriter (instead of Writer).
- Introduce the Builder API (329aae5).
- Add JAX behavior cloning agent (eb7073f).
- Add Python 3.8 support.
Minor changes and fixes
- Use nightly versions of TensorFlow, Reverb, and TensorFlow Probability.
- Remove pandas dependency.
- Skip tests that depend on Atari ROMs (atari_py no longer provides them as part of its PyPI package).
- Fix links to ipynb pages.
0.2.0
Highlights
- Using stable releases for TensorFlow (>=2.3.0), Reverb, and TensorFlow Probability.
- Added Critic Regularized Regression (code, paper)
- Added Discrete Batch-Constrained Deep Q-learning (code, paper)
- Added EnvironmentLoop.run_episode() for running a single episode.
- Updated EnvironmentLoop.run() to take num_steps, allowing control of the step count rather than just the episode count.
- Added more distribution types (e.g. GaussianMixture) which can be used by policies.
- Added an environment wrapper for action repeats.
- Improvements/tuning to datasets exposed by make_dataset.
- Added support for nested / multidimensional rewards and discounts.
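The difference between bounding a run by episodes and bounding it by steps can be sketched with a toy loop. This is a minimal illustration and not Acme's implementation: ToyEnvironment and ToyLoop are hypothetical stand-ins, and the choice to check the step budget only between whole episodes (so the total may overshoot num_steps) is an assumption of this sketch.

```python
from typing import Optional, Tuple


class ToyEnvironment:
    """Hypothetical environment whose episodes always last `episode_length` steps."""

    def __init__(self, episode_length: int = 5):
        self.episode_length = episode_length
        self._t = 0

    def reset(self) -> None:
        self._t = 0

    def step(self) -> bool:
        """Advances one step and returns True once the episode has ended."""
        self._t += 1
        return self._t >= self.episode_length


class ToyLoop:
    """Stand-in for an environment loop (assumption: not Acme's EnvironmentLoop)."""

    def __init__(self, environment: ToyEnvironment):
        self._environment = environment

    def run_episode(self) -> int:
        """Runs exactly one episode and returns its length in steps."""
        self._environment.reset()
        steps, done = 0, False
        while not done:
            done = self._environment.step()
            steps += 1
        return steps

    def run(self,
            num_episodes: Optional[int] = None,
            num_steps: Optional[int] = None) -> Tuple[int, int]:
        """Runs whole episodes until the episode or step budget is reached.

        With both arguments left as None this sketch loops forever.
        """
        if num_episodes is not None and num_steps is not None:
            raise ValueError("Pass num_episodes or num_steps, not both.")
        episodes, steps = 0, 0
        while True:
            if num_episodes is not None and episodes >= num_episodes:
                break
            if num_steps is not None and steps >= num_steps:
                break
            steps += self.run_episode()
            episodes += 1
        return episodes, steps


loop = ToyLoop(ToyEnvironment(episode_length=5))
single_episode_length = loop.run_episode()
by_episodes = loop.run(num_episodes=2)
by_steps = loop.run(num_steps=12)
print(single_episode_length, by_episodes, by_steps)
```

In this sketch a step budget of 12 finishes the third 5-step episode before stopping, so the step count it reports is the first multiple of the episode length at or above the budget.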
Minor changes and fixes
- Added a ConstantInfo logger for logging constant information.
- Added a should_update parameter to the EnvironmentLoop.
- Various modifications and optimizations to the make_reverb_dataset() function.
- Improvements to typing and pytype usage.
- Other minor bug and documentation fixes.
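As a rough illustration of what a should_update switch on a loop can do, the sketch below gates the per-step call to an actor's update method. CountingActor and run_steps are hypothetical names invented for this example, and the assumption that the flag controls only the update call is a reading of the note above rather than a description of Acme's code.

```python
class CountingActor:
    """Hypothetical actor that counts how often it was asked to update."""

    def __init__(self):
        self.updates = 0

    def select_action(self, observation: int) -> int:
        # Trivial policy: always return action 0.
        return 0

    def update(self) -> None:
        self.updates += 1


def run_steps(actor: CountingActor, num_steps: int, should_update: bool = True) -> None:
    """Steps the actor `num_steps` times, updating it only if `should_update` is set."""
    for observation in range(num_steps):
        actor.select_action(observation)
        if should_update:
            actor.update()


training_actor = CountingActor()
run_steps(training_actor, num_steps=4, should_update=True)

evaluation_actor = CountingActor()
run_steps(evaluation_actor, num_steps=4, should_update=False)

print(training_actor.updates, evaluation_actor.updates)
```

A switch like this is convenient for evaluation loops, where the actor should act in the environment without pulling new parameters or learning from the experience.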