Desired behavior
Currently, doing Reinforcement Learning in Gazebo is really hard. It is not impossible (we already have reset methods), but as RL gains traction in the robotics community we are seeing real-world use cases where walking models are being developed on other simulators. It would be nice to give users a supported way of doing RL: in particular, an example that exposes a Gym API for one of the major RL libraries (Stable Baselines, RLlib, etc.). This work would also require some improvements to the Python API and some work on the Python packaging side. Concretely, I propose we do the following:
- `ISystemReset` in test fixture #2647, OR `PostUpdate`, `Update`, and `PreUpdate`.
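To make the reset/update hooks above concrete, here is a rough sketch of how per-step callbacks plus a reset hook could drive an RL episode loop. `StubServer` and all of its methods are invented placeholders for illustration, not the actual gz-sim fixture API from #2647:

```python
class StubServer:
    """Invented placeholder for a simulation server. Real gz-sim systems
    receive PreUpdate/Update/PostUpdate calls with entity/component access;
    here we only model the call ordering and an iteration counter."""

    def __init__(self):
        self.iteration = 0
        self._pre, self._update, self._post = [], [], []

    def on_pre_update(self, cb):
        self._pre.append(cb)

    def on_update(self, cb):
        self._update.append(cb)

    def on_post_update(self, cb):
        self._post.append(cb)

    def run(self, iterations):
        # Each simulation step fires the three phases in order.
        for _ in range(iterations):
            for cb in self._pre:
                cb(self.iteration)
            for cb in self._update:
                cb(self.iteration)
            for cb in self._post:
                cb(self.iteration)
            self.iteration += 1

    def reset(self):
        # Analogue of an ISystemReset hook: rewind simulation state
        # without tearing down and rebuilding the server.
        self.iteration = 0


phases = []
server = StubServer()
server.on_pre_update(lambda it: phases.append(("pre", it)))
server.on_post_update(lambda it: phases.append(("post", it)))
server.run(2)   # two steps: pre/post fire once per step
server.reset()  # episode boundary for an RL training loop
```

The key property for RL is exactly what the list item asks for: the caller can observe and intervene at every phase of a step, and cheaply rewind to the start of an episode.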
Stretch goals
Some relevant projects
A Few Other Design Considerations
The work done on the Python bindings actually exposes a really nice API. It is also nice that we have a separation between the client and server APIs.
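As one illustration of what a Gym-style wrapper over a client-side Python API could look like, here is a minimal sketch. Everything in it is hypothetical: `SimClient` is a stand-in for a real Gazebo client connection, and the environment mirrors the `gymnasium.Env` `reset`/`step` signature without depending on the real bindings or on `gymnasium` itself:

```python
import random


class SimClient:
    """Hypothetical stand-in for a Gazebo client connection; the real
    Python bindings expose a different (much richer) interface."""

    def __init__(self):
        self.pose = [0.0, 0.0]

    def reset_world(self):
        self.pose = [0.0, 0.0]

    def apply_action_and_step(self, action):
        # Pretend the simulator integrates the commanded velocity one step.
        self.pose = [p + 0.01 * a for p, a in zip(self.pose, action)]

    def observe(self):
        return list(self.pose)


class GazeboEnv:
    """Gym-style environment: reset() -> (obs, info) and
    step(action) -> (obs, reward, terminated, truncated, info)."""

    def __init__(self):
        self.client = SimClient()
        self.steps = 0

    def reset(self, seed=None):
        if seed is not None:
            random.seed(seed)
        self.client.reset_world()
        self.steps = 0
        return self.client.observe(), {}

    def step(self, action):
        self.client.apply_action_and_step(action)
        self.steps += 1
        obs = self.client.observe()
        reward = -sum(x * x for x in obs)  # toy cost: stay near the origin
        terminated = False                 # no failure condition in this sketch
        truncated = self.steps >= 200      # time-limit truncation
        return obs, reward, terminated, truncated, {}
```

An environment shaped like this is what Stable Baselines or RLlib would consume; the real work proposed in this issue is everything hidden inside `SimClient` (transport, reset semantics, stepping).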