Continuation of: Add babyai bot and test #381

thesofakillers · 2023-07-19T09:19:35Z

Description

Continues #356:

filename change
class name change
change docstrings to google style
pytest.mark.parameterize over all BabyAI rather than choose one at random
check whether env render is necessary in test
answer about what is the goal of the bot

Fixes #308

Type of change

Please delete options that are not relevant.

New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)

Checklist:

I have run the pre-commit checks with pre-commit run --all-files (see CONTRIBUTING.md instructions to set it up)
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

thesofakillers · 2023-07-19T09:20:58Z

Can't find a contributing.md

plus some minor changes to pass pre-commit hooks

thesofakillers · 2023-07-19T11:02:20Z

The bot fails to complete 4/96 possible environments, with "nothing left to explore"
The bot code is identical to the original babyai repo, so I am not sure we should cover these cases.

Other than that, I've integrated all the changes requested by @pseudo-rnd-thoughts on #356. Opening for review.

see: Farama-Foundation/Gymnasium#535 (comment)

pseudo-rnd-thoughts · 2023-07-22T15:57:00Z

Sorry, I have no idea. @BolunDai0216 any ideas otherwise we could ask the original babyai authors

BolunDai0216 · 2023-07-22T17:16:51Z

Sorry, I'm not very familiar with the bot's implementation, I think we should ask the ask the original babyai authors.

thesofakillers · 2023-07-23T10:12:41Z

@saleml @maximecb @rizar could any of you comment on this (#381 (comment))?

The environments where the bot fails are:

BabyAI-PutNextS5N2Carrying-v0: nothing left to explore
BabyAI-PutNextS6N3Carrying-v0: nothing left to explore
BabyAI-PutNextS7N4Carrying-v0: nothing left to explore
BabyAI-KeyInBox-v0: with the following

def _find_obj_pos(self, obj_desc, adjacent=False):                                                                                                                                        
        """Find the position of the closest visible object matching a given description."""                                                                                                   
        assert len(obj_desc.obj_set) > 0     

AssertionError

For reference, these environments are registered here.

Thanks!

pseudo-rnd-thoughts · 2023-07-24T08:47:54Z

I'm the meantime, can we check that the environment is solve. I.e, with a known initial seed and list of actions always terminate the environment

thesofakillers · 2023-07-24T08:49:31Z

I'm not sure what you mean

pseudo-rnd-thoughts · 2023-07-24T10:42:25Z

My thinking is, if the solver is not able to solve the environment then there are two reasons

The solver has a bug
The environment has a bug

As the solver is more complex, is easier if we see if the first one is the issue

env = gym.make("env-id")

actions = []

env.reset()
for action in actions:
     _, reward, terminated, truncated, _ = env.step(action)

assert terminated is False 
assert reward > 0  # I'm guessing this is correct

We just need to find a list of actions that pass the bottom two tests

If this works, we can use it test the solver and at what point it fails

thesofakillers · 2023-07-24T11:06:56Z

I think its the environment that has a "bug", although I don't know if you would class it as much. The missions generated simply seem impossible for the environment considered. See the following examples for each of the first three failing envs

BabyAI-PutNextS5N2Carrying-v0

BabyAI-PutNextS6N3Carrying-v0

BabyAI-PutNextS7N4Carrying-v0

BabyAI-KeyInBox-v0

I am not sure what's wrong with the final env:

thesofakillers · 2023-07-26T08:53:41Z

See also mila-iqia/babyai#121 (comment) mentioning the same envs as #381 (comment) as having issues

pseudo-rnd-thoughts · 2023-07-28T10:58:22Z

Could you check the related paper to BabyAI and see what results they have. I.e., there may have been a time when this was solvable but just not now.

I'm a bit worried that there is a larger issue behind this one which is the cause.

thesofakillers · 2023-07-31T06:01:57Z

Looked at the original paper (link).

There seems to be no mention of the carrying variants of the PutNext task, and most of the discussion revolves around PutNextLocal, which works here.
There is also no mention of the KeyInBox level.

Upon further inspection, these are in fact from the "Bonus Levels", listed here with the following blurb:

The levels described in this file were created prior to the ICLR19 publication.
We've chosen to keep these because they may be useful for curriculum learning
or for specific research projects.

Please note that these levels are not as widely tested as the ICLR19 levels.
If you run into problems, please open an issue on this repository.

I am not sure these levels were ever covered by the Bot.

Perhaps we can provide a warning (or NotImplementedError) when instantiating the Bot with one of these 4 levels stating that they are not covered.

pseudo-rnd-thoughts · 2023-08-01T11:26:22Z

@thesofakillers I think that is fair solution to disable their testing with a note in the docstring

Is part of the issue that the environments are not solvable in the first place?
At least with the visualisations and missions, it doesn't seem possible

thesofakillers · 2023-08-01T13:45:34Z

Is part of the issue that the environments are not solvable in the first place?

I think this may be the case for the PutNextCarrying envs, at least visually. Perhaps the authors originally had an additional proprioceptive state dimension specifying what object the agent is carrying.

I'm not sure exactly why KeyInBox is breaking.

disable their testing with a note in the docstring

I can go ahead and disable their testing. What docstring exactly did you have in mind? The Bot's?

pseudo-rnd-thoughts · 2023-08-01T13:48:33Z

What docstring exactly did you have in mind? The Bot's?

Yes I think so and if possible in the environment docstring as well

thesofakillers · 2023-08-01T14:00:30Z

Ok, done.

thesofakillers · 2023-08-01T14:17:14Z

Seems like for certain seeds, the Bot gets stuck (or just takes a very long time) on some of the envs. I think this is why the tests are taking so long for certain python versions.

I can find a seed that works, or we can set a max number of steps, although this may cause the Bot to fail occasionally and not pass the tests.

Could also keep trying a different seed with a max number of steps until the bot is succesful

thesofakillers · 2023-08-02T15:44:28Z

@pseudo-rnd-thoughts let me know what you think about my previous comment, seems it leads to OOM errors eventually based on the results of the checks.

I've implemented the third option (keep trying a different seed with a max number of steps until the bot is succesful), which is what most repos using the Bot do from what I can tell.

I can commit and push if you think this is sensible

pseudo-rnd-thoughts · 2023-08-03T09:27:28Z

@thesofakillers Sorry, I have been ill for the last few days, yes, I think that plan would be great

thesofakillers · 2023-08-03T10:02:02Z

Done. No worries! Hope you feel better!

GilgameshD and others added 3 commits May 31, 2023 14:27

add babyai bot and test

9b82c09

add babyai bot and test

92b4571

Merge branch 'master' of github.com:GilgameshD/Minigrid into babyaibot

e81d546

thesofakillers added 6 commits July 19, 2023 11:27

change baby Bot classname to BabyAIBot

0392847

rename files to clarify they are for BabyAI only

c0a5859

change to google style docs

1d475d9

plus some minor changes to pass pre-commit hooks

pytest.mark.parameterize over all BabyAI envs

8f2bb32

remove redundant env.render and human mode to speed up testing

642aff0

answer: what is the goal of the baby ai bot?

b3692b8

thesofakillers marked this pull request as ready for review July 19, 2023 11:02

thesofakillers mentioned this pull request Jul 19, 2023

add babyai bot and test #356

Closed

7 tasks

use unwrapped to remove deprecation warnings

7165126

see: Farama-Foundation/Gymnasium#535 (comment)

thesofakillers added 2 commits August 1, 2023 15:55

dont test specific bonus levels not covered by bot

18e688d

add note to docstrings noting babyai bot limitations

b7f3f07

pseudo-rnd-thoughts approved these changes Aug 1, 2023

View reviewed changes

test envs with a max steps, try diff seed otherwise

c1b53db

pseudo-rnd-thoughts merged commit 9dbdf61 into Farama-Foundation:master Aug 5, 2023

dyth mentioned this pull request Feb 29, 2024

Get demonstration for 'MiniGrid-MultiRoom-N4-S5-v0' mila-iqia/babyai#123

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Continuation of: Add babyai bot and test #381

Continuation of: Add babyai bot and test #381

thesofakillers commented Jul 19, 2023 •

edited

Loading

thesofakillers commented Jul 19, 2023

thesofakillers commented Jul 19, 2023

pseudo-rnd-thoughts commented Jul 22, 2023

BolunDai0216 commented Jul 22, 2023

thesofakillers commented Jul 23, 2023

pseudo-rnd-thoughts commented Jul 24, 2023

thesofakillers commented Jul 24, 2023

pseudo-rnd-thoughts commented Jul 24, 2023

thesofakillers commented Jul 24, 2023

thesofakillers commented Jul 26, 2023

pseudo-rnd-thoughts commented Jul 28, 2023

thesofakillers commented Jul 31, 2023 •

edited

Loading

pseudo-rnd-thoughts commented Aug 1, 2023

thesofakillers commented Aug 1, 2023

pseudo-rnd-thoughts commented Aug 1, 2023

thesofakillers commented Aug 1, 2023

thesofakillers commented Aug 1, 2023

thesofakillers commented Aug 2, 2023 •

edited

Loading

pseudo-rnd-thoughts commented Aug 3, 2023

thesofakillers commented Aug 3, 2023

Continuation of: Add babyai bot and test #381

Continuation of: Add babyai bot and test #381

Conversation

thesofakillers commented Jul 19, 2023 • edited Loading

Description

Type of change

Checklist:

thesofakillers commented Jul 19, 2023

thesofakillers commented Jul 19, 2023

pseudo-rnd-thoughts commented Jul 22, 2023

BolunDai0216 commented Jul 22, 2023

thesofakillers commented Jul 23, 2023

pseudo-rnd-thoughts commented Jul 24, 2023

thesofakillers commented Jul 24, 2023

pseudo-rnd-thoughts commented Jul 24, 2023

thesofakillers commented Jul 24, 2023

thesofakillers commented Jul 26, 2023

pseudo-rnd-thoughts commented Jul 28, 2023

thesofakillers commented Jul 31, 2023 • edited Loading

pseudo-rnd-thoughts commented Aug 1, 2023

thesofakillers commented Aug 1, 2023

pseudo-rnd-thoughts commented Aug 1, 2023

thesofakillers commented Aug 1, 2023

thesofakillers commented Aug 1, 2023

thesofakillers commented Aug 2, 2023 • edited Loading

pseudo-rnd-thoughts commented Aug 3, 2023

thesofakillers commented Aug 3, 2023

thesofakillers commented Jul 19, 2023 •

edited

Loading

thesofakillers commented Jul 31, 2023 •

edited

Loading

thesofakillers commented Aug 2, 2023 •

edited

Loading