Skip to content

Navigation Menu

Explore
By size
By industry
By use case
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

m-wojnar / reinforced-lib Public

Notifications You must be signed in to change notification settings
Fork 4
Star 62

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Releases: m-wojnar/reinforced-lib

Releases · m-wojnar/reinforced-lib

Reinforced-lib 1.1.2

10 Aug 12:55

Compare

Choose a tag to compare

Loading

Reinforced-lib 1.1.2 Latest

Update dependencies.
Fix error with action space size of one.

Assets 2

Loading

All reactions

Reinforced-lib 1.1.1

13 Apr 17:30

Compare

Choose a tag to compare

Loading

Reinforced-lib 1.1.1

Improvements:

Update documentation.
Add reference to the SoftwareX paper.

Fix:

Normal Thompson sampling allows the lam parameter to be zero.
Bernoulli Thompson sampling is stationary by default.
Update the default value of the decay parameter the in ra-sim example.

Assets 2

Loading

All reactions

Reinforced-lib 1.1.0

11 Feb 14:54

Compare

Choose a tag to compare

Loading

Reinforced-lib 1.1.0

Major API changes:

Migrate from haiku (deprecated) to flax as the base naural network library.
Update agent names to match literature:
- QLearning (deep Q-learning) -> DQN,
- DQN (deep double Q-learning) -> DDQN.
Move particle filter from agents to utils.
New behavior of loggers - all declared loggers get values from all sources.

New functionalities:

Add Weights & Biases logger.

Other important changes:

Fix updates with empty replay buffer.
Fix logging of arrays to TensorBoard.
Minor improvements in documentation.
Rewrite Gymnasium integration example in documentation.
Improve the CCOD example to better reflect the original implementation.

Assets 2

Loading

Wotaker reacted with hooray emoji

All reactions

🎉 1 reaction

1 person reacted

Reinforced-lib 1.0.4

19 Dec 15:59

Compare

Choose a tag to compare

Loading

Reinforced-lib 1.0.4

Improvements:

Update documentation.
Enable the use of 64-bit JAX.

Assets 2

Loading

All reactions

Reinforced-lib 1.0.3

15 Dec 19:16

Compare

Choose a tag to compare

Loading

Reinforced-lib 1.0.3

New functionalities:

Add the normal-gamma Thompson sampling agent.
Add the log-normal Thompson sampling agent.

Assets 2

Loading

All reactions

Reinforced-lib 1.0.2

11 Dec 23:02

Compare

Choose a tag to compare

Loading

Reinforced-lib 1.0.2

Fix:

Make it easier to import the BasicMab extension.

Assets 2

Loading

All reactions

Reinforced-lib 1.0.1

11 Dec 22:48

Compare

Choose a tag to compare

Loading

Reinforced-lib 1.0.1

Important changes:

Move to pyproject.toml configuration file.
Add basic extension for MABs.
Update dependencies.
Fix bug modifying user values passed to library functions.
Fix agents behavior with multiple optimal actions - now agents draw one of the optimal actions instead of selecting the first one.

Assets 2

Loading

All reactions

Reinforced-lib 1.0.0

23 Jul 11:36

Compare

Choose a tag to compare

Loading

Reinforced-lib 1.0.0

Major API changes:

Added a support for deep reinforcement learning agents.
Relaxation of the requirements for the implementation of custom agents.
Major changes in the logging module (e.g., custom logging, synchronization).
Removed ability of the sample method to change state.
Introduced an inference only mode.

New functionalities:

Added new deep learning agents: deep Q-learning, deep expected SARSA, DQN, DDPG.
Added the Exp3 algorithm.
Added the Gymnasium extension.
Added the TensorBoard logger.
Added an easy export to TensorFlow Lite.
Added an automatic checkpointing.

Other important changes:

Upgraded the library to Python 3.9.
Updated and polished the documentation.
Added several new examples.
Moved Wi-Fi specific classes to examples.
Fixed known bugs.

Assets 2

Loading

All reactions

Reinforced-lib 0.2.0

20 Dec 21:01

Compare

Choose a tag to compare

Loading

Reinforced-lib 0.2.0

Add discounted UCB
Rename Gradient bandit to Softmax
Update IEEE 802.11ax extension
Improve agent descriptions
Extended getting started
Add Gym integration docs
Editorial updates in docs
Fix major and minor bugs

Assets 2

Loading

All reactions

Reinforced-lib 0.1.2

09 Nov 17:51

Compare

Choose a tag to compare

Loading

Reinforced-lib 0.1.2 Pre-release

Pre-release

Fix missing file with requirements

Assets 2

Loading

All reactions

Previous 1 2 Next

Previous Next

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.